Overview

Dataset statistics

Number of variables6
Number of observations34
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory52.9 B

Variable types

Text3
Categorical1
Numeric1
DateTime1

Dataset

Description춘천시에서 운영중인 양식장에 대한 업체명, 지번주소, 양식방법, 수조면적, 주생산품목, 데이터기준일에 대한 자료
Author강원특별자치도 춘천시
URLhttps://www.data.go.kr/data/15113429/fileData.do

Alerts

데이터기준일 has constant value ""Constant
양식방법 is highly imbalanced (53.0%)Imbalance
업체명 has unique valuesUnique
지번주소 has unique valuesUnique
수조면적 has unique valuesUnique

Reproduction

Analysis started2024-04-29 23:07:47.855124
Analysis finished2024-04-29 23:07:50.000630
Duration2.15 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체명
Text

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size404.0 B
2024-04-30T08:07:50.139913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length4
Mean length5.1470588
Min length4

Characters and Unicode

Total characters175
Distinct characters70
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row광판수산
2nd row강원수산
3rd row양지수산
4th row와담농장
5th row한국수산기술연구원㈜
ValueCountFrequency (%)
광판수산 1
 
2.8%
오월수산 1
 
2.8%
다운수산 1
 
2.8%
광판양어장 1
 
2.8%
소양강양어장 1
 
2.8%
㈜농업회사법인 1
 
2.8%
팜에프 1
 
2.8%
푸른자라 1
 
2.8%
발산양어장 1
 
2.8%
강원수산 1
 
2.8%
Other values (26) 26
72.2%
2024-04-30T08:07:50.469304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
12.6%
20
 
11.4%
13
 
7.4%
13
 
7.4%
10
 
5.7%
4
 
2.3%
4
 
2.3%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (60) 80
45.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 171
97.7%
Other Symbol 2
 
1.1%
Space Separator 2
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
12.9%
20
 
11.7%
13
 
7.6%
13
 
7.6%
10
 
5.8%
4
 
2.3%
4
 
2.3%
3
 
1.8%
3
 
1.8%
3
 
1.8%
Other values (58) 76
44.4%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 173
98.9%
Common 2
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
12.7%
20
 
11.6%
13
 
7.5%
13
 
7.5%
10
 
5.8%
4
 
2.3%
4
 
2.3%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (59) 78
45.1%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 171
97.7%
None 2
 
1.1%
ASCII 2
 
1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
 
12.9%
20
 
11.7%
13
 
7.6%
13
 
7.6%
10
 
5.8%
4
 
2.3%
4
 
2.3%
3
 
1.8%
3
 
1.8%
3
 
1.8%
Other values (58) 76
44.4%
None
ValueCountFrequency (%)
2
100.0%
ASCII
ValueCountFrequency (%)
2
100.0%

지번주소
Text

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size404.0 B
2024-04-30T08:07:50.636498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length66
Median length38.5
Mean length21.735294
Min length9

Characters and Unicode

Total characters739
Distinct characters54
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row남산면 광판리 919 외 4필지
2nd row신북읍 천전리 692-4번지
3rd row동면 지내리 359-1번지
4th row동면 월곡리 399번지
5th row남산면 노일길 158
ValueCountFrequency (%)
16
 
11.3%
동면 11
 
7.8%
신북읍 8
 
5.7%
서면 7
 
5.0%
지내리 5
 
3.5%
신매리 3
 
2.1%
사농동 3
 
2.1%
남산면 3
 
2.1%
산천리 2
 
1.4%
장학리 2
 
1.4%
Other values (76) 81
57.4%
2024-04-30T08:07:50.929153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
108
 
14.6%
1 62
 
8.4%
50
 
6.8%
- 39
 
5.3%
9 32
 
4.3%
3 29
 
3.9%
29
 
3.9%
29
 
3.9%
4 29
 
3.9%
7 26
 
3.5%
Other values (44) 306
41.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 278
37.6%
Decimal Number 267
36.1%
Space Separator 108
 
14.6%
Dash Punctuation 39
 
5.3%
Other Punctuation 17
 
2.3%
Open Punctuation 15
 
2.0%
Close Punctuation 15
 
2.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
18.0%
29
 
10.4%
29
 
10.4%
22
 
7.9%
16
 
5.8%
16
 
5.8%
15
 
5.4%
12
 
4.3%
8
 
2.9%
8
 
2.9%
Other values (29) 73
26.3%
Decimal Number
ValueCountFrequency (%)
1 62
23.2%
9 32
12.0%
3 29
10.9%
4 29
10.9%
7 26
9.7%
6 23
 
8.6%
5 20
 
7.5%
2 16
 
6.0%
0 15
 
5.6%
8 15
 
5.6%
Space Separator
ValueCountFrequency (%)
108
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 39
100.0%
Other Punctuation
ValueCountFrequency (%)
, 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 461
62.4%
Hangul 278
37.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
18.0%
29
 
10.4%
29
 
10.4%
22
 
7.9%
16
 
5.8%
16
 
5.8%
15
 
5.4%
12
 
4.3%
8
 
2.9%
8
 
2.9%
Other values (29) 73
26.3%
Common
ValueCountFrequency (%)
108
23.4%
1 62
13.4%
- 39
 
8.5%
9 32
 
6.9%
3 29
 
6.3%
4 29
 
6.3%
7 26
 
5.6%
6 23
 
5.0%
5 20
 
4.3%
, 17
 
3.7%
Other values (5) 76
16.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 461
62.4%
Hangul 278
37.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
108
23.4%
1 62
13.4%
- 39
 
8.5%
9 32
 
6.9%
3 29
 
6.3%
4 29
 
6.3%
7 26
 
5.6%
6 23
 
5.0%
5 20
 
4.3%
, 17
 
3.7%
Other values (5) 76
16.5%
Hangul
ValueCountFrequency (%)
50
18.0%
29
 
10.4%
29
 
10.4%
22
 
7.9%
16
 
5.8%
16
 
5.8%
15
 
5.4%
12
 
4.3%
8
 
2.9%
8
 
2.9%
Other values (29) 73
26.3%

양식방법
Categorical

IMBALANCE 

Distinct3
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size404.0 B
유수식
29 
순환여과식
지수식
 
2

Length

Max length5
Median length3
Mean length3.1764706
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유수식
2nd row유수식
3rd row유수식
4th row지수식
5th row순환여과식

Common Values

ValueCountFrequency (%)
유수식 29
85.3%
순환여과식 3
 
8.8%
지수식 2
 
5.9%

Length

2024-04-30T08:07:51.066761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T08:07:51.186369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유수식 29
85.3%
순환여과식 3
 
8.8%
지수식 2
 
5.9%

수조면적
Real number (ℝ)

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1386.3947
Minimum32.15
Maximum8973.6
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2024-04-30T08:07:51.285307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum32.15
5-th percentile87.27
Q1407.525
median686
Q31875
95-th percentile4178.082
Maximum8973.6
Range8941.45
Interquartile range (IQR)1467.475

Descriptive statistics

Standard deviation1766.9719
Coefficient of variation (CV)1.2745085
Kurtosis9.8714868
Mean1386.3947
Median Absolute Deviation (MAD)470.785
Skewness2.8129536
Sum47137.42
Variance3122189.6
MonotonicityNot monotonic
2024-04-30T08:07:51.401429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
2610.2 1
 
2.9%
2286.0 1
 
2.9%
967.2 1
 
2.9%
653.12 1
 
2.9%
392.0 1
 
2.9%
592.93 1
 
2.9%
664.0 1
 
2.9%
3816.28 1
 
2.9%
321.0 1
 
2.9%
714.0 1
 
2.9%
Other values (24) 24
70.6%
ValueCountFrequency (%)
32.15 1
2.9%
82.2 1
2.9%
90.0 1
2.9%
130.0 1
2.9%
208.43 1
2.9%
222.0 1
2.9%
321.0 1
2.9%
392.0 1
2.9%
392.7 1
2.9%
452.0 1
2.9%
ValueCountFrequency (%)
8973.6 1
2.9%
4850.0 1
2.9%
3816.28 1
2.9%
3062.0 1
2.9%
2753.42 1
2.9%
2610.2 1
2.9%
2286.0 1
2.9%
2000.48 1
2.9%
1950.0 1
2.9%
1650.0 1
2.9%
Distinct29
Distinct (%)85.3%
Missing0
Missing (%)0.0%
Memory size404.0 B
2024-04-30T08:07:51.597883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length20
Mean length12.147059
Min length2

Characters and Unicode

Total characters413
Distinct characters50
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)73.5%

Sample

1st row송어+산천어+다슬기
2nd row송어+산천어+동자개+뱀장어+붕어+메기
3rd row송어+향어+산천어
4th row북방산개구리
5th row장어+새우+다슬기+송어+연어+산천어+은대구+쏘가리+메기+붕어+잉어+게
ValueCountFrequency (%)
송어 3
 
8.8%
송어+산천어 2
 
5.9%
송어+향어+산천어 2
 
5.9%
북방산개구리 2
 
5.9%
붕어+동자개+송어+뱀장어 1
 
2.9%
송어+산천어+다슬기 1
 
2.9%
뱀장어+동자개 1
 
2.9%
송어+향어+산천어+메기 1
 
2.9%
송어+산천어+기타담수어류 1
 
2.9%
송어+산천어+암어 1
 
2.9%
Other values (19) 19
55.9%
2024-04-30T08:07:51.939231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
+ 92
22.3%
86
20.8%
26
 
6.3%
21
 
5.1%
19
 
4.6%
15
 
3.6%
12
 
2.9%
12
 
2.9%
11
 
2.7%
11
 
2.7%
Other values (40) 108
26.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 320
77.5%
Math Symbol 92
 
22.3%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
86
26.9%
26
 
8.1%
21
 
6.6%
19
 
5.9%
15
 
4.7%
12
 
3.8%
12
 
3.8%
11
 
3.4%
11
 
3.4%
10
 
3.1%
Other values (38) 97
30.3%
Math Symbol
ValueCountFrequency (%)
+ 92
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 320
77.5%
Common 93
 
22.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
86
26.9%
26
 
8.1%
21
 
6.6%
19
 
5.9%
15
 
4.7%
12
 
3.8%
12
 
3.8%
11
 
3.4%
11
 
3.4%
10
 
3.1%
Other values (38) 97
30.3%
Common
ValueCountFrequency (%)
+ 92
98.9%
. 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 320
77.5%
ASCII 93
 
22.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
+ 92
98.9%
. 1
 
1.1%
Hangul
ValueCountFrequency (%)
86
26.9%
26
 
8.1%
21
 
6.6%
19
 
5.9%
15
 
4.7%
12
 
3.8%
12
 
3.8%
11
 
3.4%
11
 
3.4%
10
 
3.1%
Other values (38) 97
30.3%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
Minimum2024-04-22 00:00:00
Maximum2024-04-22 00:00:00
2024-04-30T08:07:52.053100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:07:52.143878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-04-30T08:07:49.715914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T08:07:52.218010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명지번주소양식방법수조면적주생산품목
업체명1.0001.0001.0001.0001.000
지번주소1.0001.0001.0001.0001.000
양식방법1.0001.0001.0000.0001.000
수조면적1.0001.0000.0001.0000.000
주생산품목1.0001.0001.0000.0001.000
2024-04-30T08:07:52.308505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수조면적양식방법
수조면적1.0000.000
양식방법0.0001.000

Missing values

2024-04-30T08:07:49.859746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T08:07:49.954512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명지번주소양식방법수조면적주생산품목데이터기준일
0광판수산남산면 광판리 919 외 4필지유수식2610.2송어+산천어+다슬기2024-04-22
1강원수산신북읍 천전리 692-4번지유수식457.34송어+산천어+동자개+뱀장어+붕어+메기2024-04-22
2양지수산동면 지내리 359-1번지유수식648.55송어+향어+산천어2024-04-22
3와담농장동면 월곡리 399번지지수식82.2북방산개구리2024-04-22
4한국수산기술연구원㈜남산면 노일길 158순환여과식32.15장어+새우+다슬기+송어+연어+산천어+은대구+쏘가리+메기+붕어+잉어+게2024-04-22
5우성양식장신북읍 율문리 659유수식452.0송어+산천어+쏘가리2024-04-22
6호반수산신동 978-1번지유수식1424.0송어+붕어+잉어+향어2024-04-22
7봉의수산동면 지내리 963번지 외 3필지(963-1, 965, 395)유수식2753.42송어+향어+붕어+메기+산천어+쏘가리+뱀장어+동자개+대농갱이+참마자+미꾸라지+묵납자루2024-04-22
8무지개양어장신북읍 산천리 1173번지 외 1필지(1174)유수식1230.88송어+산천어2024-04-22
9오월수산서면 오월리 232-13번지 외 1필지(231)유수식392.7송어+산천어+메기+동자개2024-04-22
업체명지번주소양식방법수조면적주생산품목데이터기준일
24한우리수산신북읍 율문리856번지유수식664.0송어+향어+메기+산천어2024-04-22
25우리송어양어장서면 신매리 108-7번지유수식3816.28송어2024-04-22
26정일수산동면 지내리 950번지 외 3필지(951, 950-4, 951-4)유수식2286.0붕어+잉어+메기+동자개+대농갱이+뱀장어등2024-04-22
27발산양어장남면 발산리 649-1번지유수식321.0메기+동자개+대농갱이+뱀장어+붕어.잉어2024-04-22
28푸른자라서면 월송리 727-6번지유수식480.0자라2024-04-22
29㈜농업회사법인 팜에프서면 월송리 748번지순환여과식208.43역돔+뱀장어2024-04-22
30소양강양어장동면 지내리 2-9번지 외 1필지(2-10번지)유수식8973.6송어+산천어+암어2024-04-22
31광판양어장남산면 광판리 1139-45번지 외 6필지(1139,1139-43,1139-44,1139-52,1139-171,573)유수식4850.0송어+산천어+기타담수어류2024-04-22
32다운수산신북읍 율문리 372번지유수식708.0송어+향어+산천어+메기2024-04-22
33금정수산동면 품걸리 4-1번지 외 1필지(4-2)유수식1650.0붕어+잉어2024-04-22