Overview

Dataset statistics

Number of variables6
Number of observations274
Missing cells166
Missing cells (%)10.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.5 KiB
Average record size in memory50.5 B

Variable types

Text3
Categorical1
Numeric2

Dataset

Description강원도 양양군 관내 축산농가에 대한 데이터로 농장의 명칭, 가축의 종류, 농장 주소, 사육 두수, 농장의 위치정보(위도 및 경도) 등을 제공합니다.
Author강원도 양양군
URLhttps://www.data.go.kr/data/15092053/fileData.do

Alerts

사업장 위도 is highly overall correlated with 사업장 경도High correlation
사업장 경도 is highly overall correlated with 사업장 위도High correlation
축종 is highly imbalanced (73.4%)Imbalance
사업장 위도 has 83 (30.3%) missing valuesMissing
사업장 경도 has 83 (30.3%) missing valuesMissing

Reproduction

Analysis started2023-12-12 05:40:20.078641
Analysis finished2023-12-12 05:40:21.253726
Duration1.18 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct255
Distinct (%)93.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-12T14:40:21.537028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length4
Mean length4.770073
Min length3

Characters and Unicode

Total characters1307
Distinct characters219
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique238 ?
Unique (%)86.9%

Sample

1st row입압리 농원
2nd row수리농장
3rd row황금농장
4th row화일리목장
5th row서인농장
ValueCountFrequency (%)
농장 5
 
1.7%
용천농장 3
 
1.0%
상복농장 3
 
1.0%
미래농장 3
 
1.0%
금풍농장 2
 
0.7%
한우농장 2
 
0.7%
벽실농장 2
 
0.7%
우리농장 2
 
0.7%
목우원 2
 
0.7%
해성농장 2
 
0.7%
Other values (254) 265
91.1%
2023-12-12T14:40:22.053984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
246
 
18.8%
208
 
15.9%
46
 
3.5%
34
 
2.6%
21
 
1.6%
20
 
1.5%
19
 
1.5%
18
 
1.4%
17
 
1.3%
17
 
1.3%
Other values (209) 661
50.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1257
96.2%
Decimal Number 23
 
1.8%
Space Separator 17
 
1.3%
Close Punctuation 4
 
0.3%
Open Punctuation 4
 
0.3%
Other Punctuation 1
 
0.1%
Letter Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
246
19.6%
208
 
16.5%
46
 
3.7%
34
 
2.7%
21
 
1.7%
20
 
1.6%
19
 
1.5%
18
 
1.4%
17
 
1.4%
17
 
1.4%
Other values (197) 611
48.6%
Decimal Number
ValueCountFrequency (%)
2 13
56.5%
1 5
 
21.7%
0 1
 
4.3%
6 1
 
4.3%
4 1
 
4.3%
5 1
 
4.3%
3 1
 
4.3%
Space Separator
ValueCountFrequency (%)
17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1257
96.2%
Common 49
 
3.7%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
246
19.6%
208
 
16.5%
46
 
3.7%
34
 
2.7%
21
 
1.7%
20
 
1.6%
19
 
1.5%
18
 
1.4%
17
 
1.4%
17
 
1.4%
Other values (197) 611
48.6%
Common
ValueCountFrequency (%)
17
34.7%
2 13
26.5%
1 5
 
10.2%
) 4
 
8.2%
( 4
 
8.2%
0 1
 
2.0%
6 1
 
2.0%
. 1
 
2.0%
4 1
 
2.0%
5 1
 
2.0%
Latin
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1257
96.2%
ASCII 49
 
3.7%
Number Forms 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
246
19.6%
208
 
16.5%
46
 
3.7%
34
 
2.7%
21
 
1.7%
20
 
1.6%
19
 
1.5%
18
 
1.4%
17
 
1.4%
17
 
1.4%
Other values (197) 611
48.6%
ASCII
ValueCountFrequency (%)
17
34.7%
2 13
26.5%
1 5
 
10.2%
) 4
 
8.2%
( 4
 
8.2%
0 1
 
2.0%
6 1
 
2.0%
. 1
 
2.0%
4 1
 
2.0%
5 1
 
2.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

축종
Categorical

IMBALANCE 

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
한우
248 
돼지
 
10
염소
 
8
산양
 
7
사슴
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row한우
2nd row한우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 248
90.5%
돼지 10
 
3.6%
염소 8
 
2.9%
산양 7
 
2.6%
사슴 1
 
0.4%

Length

2023-12-12T14:40:22.231354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:40:22.339877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한우 248
90.5%
돼지 10
 
3.6%
염소 8
 
2.9%
산양 7
 
2.6%
사슴 1
 
0.4%
Distinct264
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-12T14:40:22.731113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length25
Mean length20.281022
Min length17

Characters and Unicode

Total characters5557
Distinct characters109
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique254 ?
Unique (%)92.7%

Sample

1st row강원도 양양군 현남면 주리 산 93-2
2nd row강원도 양양군 서면 수리 336-3
3rd row강원도 양양군 서면 논화리 220
4th row강원도 양양군 양양읍 화일리 466
5th row강원도 양양군 현북면 말곡리 17-3
ValueCountFrequency (%)
강원도 274
19.9%
양양군 274
19.9%
강현면 70
 
5.1%
손양면 54
 
3.9%
서면 45
 
3.3%
현남면 40
 
2.9%
양양읍 40
 
2.9%
현북면 25
 
1.8%
죽리 12
 
0.9%
삽존리 12
 
0.9%
Other values (317) 530
38.5%
2023-12-12T14:40:23.323720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1248
22.5%
685
12.3%
355
 
6.4%
286
 
5.1%
276
 
5.0%
274
 
4.9%
274
 
4.9%
234
 
4.2%
1 192
 
3.5%
- 137
 
2.5%
Other values (99) 1596
28.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3246
58.4%
Space Separator 1248
 
22.5%
Decimal Number 919
 
16.5%
Dash Punctuation 137
 
2.5%
Other Punctuation 3
 
0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
685
21.1%
355
10.9%
286
8.8%
276
8.5%
274
 
8.4%
274
 
8.4%
234
 
7.2%
137
 
4.2%
57
 
1.8%
49
 
1.5%
Other values (84) 619
19.1%
Decimal Number
ValueCountFrequency (%)
1 192
20.9%
2 130
14.1%
3 111
12.1%
6 94
10.2%
5 90
9.8%
4 86
9.4%
0 57
 
6.2%
8 55
 
6.0%
7 53
 
5.8%
9 51
 
5.5%
Space Separator
ValueCountFrequency (%)
1248
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 137
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3246
58.4%
Common 2311
41.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
685
21.1%
355
10.9%
286
8.8%
276
8.5%
274
 
8.4%
274
 
8.4%
234
 
7.2%
137
 
4.2%
57
 
1.8%
49
 
1.5%
Other values (84) 619
19.1%
Common
ValueCountFrequency (%)
1248
54.0%
1 192
 
8.3%
- 137
 
5.9%
2 130
 
5.6%
3 111
 
4.8%
6 94
 
4.1%
5 90
 
3.9%
4 86
 
3.7%
0 57
 
2.5%
8 55
 
2.4%
Other values (5) 111
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3246
58.4%
ASCII 2311
41.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1248
54.0%
1 192
 
8.3%
- 137
 
5.9%
2 130
 
5.6%
3 111
 
4.8%
6 94
 
4.1%
5 90
 
3.9%
4 86
 
3.7%
0 57
 
2.5%
8 55
 
2.4%
Other values (5) 111
 
4.8%
Hangul
ValueCountFrequency (%)
685
21.1%
355
10.9%
286
8.8%
276
8.5%
274
 
8.4%
274
 
8.4%
234
 
7.2%
137
 
4.2%
57
 
1.8%
49
 
1.5%
Other values (84) 619
19.1%
Distinct72
Distinct (%)26.3%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-12T14:40:23.573559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length4
Mean length3.7518248
Min length3

Characters and Unicode

Total characters1028
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)12.4%

Sample

1st row 80
2nd row 47
3rd row 19
4th row 23
5th row 30
ValueCountFrequency (%)
2 21
 
7.7%
3 18
 
6.6%
4 17
 
6.2%
5 16
 
5.8%
6 12
 
4.4%
10 10
 
3.6%
20 9
 
3.3%
12 9
 
3.3%
7 9
 
3.3%
19 8
 
2.9%
Other values (62) 145
52.9%
2023-12-12T14:40:24.021537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
548
53.3%
1 85
 
8.3%
0 83
 
8.1%
2 79
 
7.7%
3 51
 
5.0%
5 43
 
4.2%
4 38
 
3.7%
9 29
 
2.8%
6 28
 
2.7%
8 19
 
1.8%
Other values (2) 25
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Space Separator 548
53.3%
Decimal Number 473
46.0%
Other Punctuation 7
 
0.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 85
18.0%
0 83
17.5%
2 79
16.7%
3 51
10.8%
5 43
9.1%
4 38
8.0%
9 29
 
6.1%
6 28
 
5.9%
8 19
 
4.0%
7 18
 
3.8%
Space Separator
ValueCountFrequency (%)
548
100.0%
Other Punctuation
ValueCountFrequency (%)
, 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1028
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
548
53.3%
1 85
 
8.3%
0 83
 
8.1%
2 79
 
7.7%
3 51
 
5.0%
5 43
 
4.2%
4 38
 
3.7%
9 29
 
2.8%
6 28
 
2.7%
8 19
 
1.8%
Other values (2) 25
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1028
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
548
53.3%
1 85
 
8.3%
0 83
 
8.1%
2 79
 
7.7%
3 51
 
5.0%
5 43
 
4.2%
4 38
 
3.7%
9 29
 
2.8%
6 28
 
2.7%
8 19
 
1.8%
Other values (2) 25
 
2.4%

사업장 위도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct173
Distinct (%)90.6%
Missing83
Missing (%)30.3%
Infinite0
Infinite (%)0.0%
Mean38.060257
Minimum37.908866
Maximum38.157032
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2023-12-12T14:40:24.213535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.908866
5-th percentile37.949085
Q138.023185
median38.058488
Q338.111207
95-th percentile38.148366
Maximum38.157032
Range0.24816597
Interquartile range (IQR)0.088022086

Descriptive statistics

Standard deviation0.062682483
Coefficient of variation (CV)0.0016469275
Kurtosis-0.73382182
Mean38.060257
Median Absolute Deviation (MAD)0.043095973
Skewness-0.30159244
Sum7269.5091
Variance0.0039290936
MonotonicityNot monotonic
2023-12-12T14:40:24.395956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.9753156322 3
 
1.1%
38.0920528712 3
 
1.1%
38.131441388 3
 
1.1%
38.0222793947 2
 
0.7%
38.0744799914 2
 
0.7%
38.0405752011 2
 
0.7%
38.1552086885 2
 
0.7%
38.0376362042 2
 
0.7%
38.1360650565 2
 
0.7%
37.9740904708 2
 
0.7%
Other values (163) 168
61.3%
(Missing) 83
30.3%
ValueCountFrequency (%)
37.9088660701 1
0.4%
37.9148035784 1
0.4%
37.919834395 1
0.4%
37.925817573 1
0.4%
37.9260322314 1
0.4%
37.9365748407 1
0.4%
37.9445901773 1
0.4%
37.9456405246 1
0.4%
37.9471450351 1
0.4%
37.9484597843 1
0.4%
ValueCountFrequency (%)
38.1570320423 1
0.4%
38.1562652009 1
0.4%
38.1552086885 2
0.7%
38.1543357668 1
0.4%
38.1540254056 1
0.4%
38.1536263138 1
0.4%
38.1528799747 1
0.4%
38.1527495014 1
0.4%
38.1484429297 1
0.4%
38.1482894092 1
0.4%

사업장 경도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct173
Distinct (%)90.6%
Missing83
Missing (%)30.3%
Infinite0
Infinite (%)0.0%
Mean128.62842
Minimum128.49966
Maximum128.80333
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2023-12-12T14:40:24.597173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum128.49966
5-th percentile128.5605
Q1128.58829
median128.6162
Q3128.64549
95-th percentile128.74987
Maximum128.80333
Range0.30366852
Interquartile range (IQR)0.05719732

Descriptive statistics

Standard deviation0.059384039
Coefficient of variation (CV)0.00046167121
Kurtosis0.40187517
Mean128.62842
Median Absolute Deviation (MAD)0.02803344
Skewness0.82822224
Sum24568.029
Variance0.0035264641
MonotonicityNot monotonic
2023-12-12T14:40:24.782212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
128.7392408438 3
 
1.1%
128.6340036952 3
 
1.1%
128.5676515961 3
 
1.1%
128.6408583106 2
 
0.7%
128.6496870044 2
 
0.7%
128.6417462467 2
 
0.7%
128.5876733573 2
 
0.7%
128.6520989291 2
 
0.7%
128.5782993705 2
 
0.7%
128.7368963187 2
 
0.7%
Other values (163) 168
61.3%
(Missing) 83
30.3%
ValueCountFrequency (%)
128.4996604176 1
0.4%
128.5109993104 1
0.4%
128.5143152829 1
0.4%
128.5159793877 1
0.4%
128.5256676981 1
0.4%
128.5406431198 2
0.7%
128.5572273479 1
0.4%
128.5590947172 1
0.4%
128.5603800421 1
0.4%
128.5606240908 1
0.4%
ValueCountFrequency (%)
128.8033289424 1
0.4%
128.7800735898 1
0.4%
128.778607728 1
0.4%
128.7749299203 1
0.4%
128.7656184122 1
0.4%
128.7605011593 1
0.4%
128.75683911 1
0.4%
128.7566331501 1
0.4%
128.7553418146 1
0.4%
128.7503900622 1
0.4%

Interactions

2023-12-12T14:40:20.687424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:40:20.469486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:40:20.821112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:40:20.575683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:40:24.906330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
축종사육두수사업장 위도사업장 경도
축종1.0000.8710.3850.262
사육두수0.8711.0000.0000.000
사업장 위도0.3850.0001.0000.869
사업장 경도0.2620.0000.8691.000
2023-12-12T14:40:25.037200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업장 위도사업장 경도축종
사업장 위도1.000-0.6200.235
사업장 경도-0.6201.0000.148
축종0.2350.1481.000

Missing values

2023-12-12T14:40:20.986437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:40:21.101310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T14:40:21.206357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

농장명축종농장주소사육두수사업장 위도사업장 경도
0입압리 농원한우강원도 양양군 현남면 주리 산 93-28037.919834128.756839
1수리농장한우강원도 양양군 서면 수리 336-34738.026069128.605485
2황금농장한우강원도 양양군 서면 논화리 2201938.064176128.565779
3화일리목장한우강원도 양양군 양양읍 화일리 4662338.096887128.572081
4서인농장한우강원도 양양군 현북면 말곡리 17-33038.027603128.693474
5청산목장한우강원도 양양군 양양읍 감곡리 5902038.092849128.614513
6삽존농장한우강원도 양양군 손양면 삽존리 55-22038.029812128.637254
7방우재목장한우강원도 양양군 손양면 상왕도리 3806038.053788128.631844
8고노골농장한우강원도 양양군 손양면 상왕도리 6213538.055597128.627068
9강선리한우농장한우강원도 양양군 강현면 강선리 1356538.153626128.597602
농장명축종농장주소사육두수사업장 위도사업장 경도
264골짜구니농장염소강원도 양양군 서면 서림리 368-6150<NA><NA>
265김종문농장염소강원도 양양군 손양면 상양혈리 169-520<NA><NA>
266설악농장2염소강원도 양양군 강현면 하복리 1136038.137674128.578315
267농업회사법인양떼구름(주)산양강원도 양양군 강현면 적은리 84438.119031128.596586
268염소농장염소강원도 양양군 현남면 주리 산 133-150<NA><NA>
269서가네염소농장염소강원도 양양군 현남면 남애리 306-330<NA><NA>
270솔밭농장산양강원도 양양군 손양면 수여리 29010038.074425128.651211
271준농장염소강원도 양양군 양양읍 포월리 38-121<NA><NA>
272상월천농장염소강원도 양양군 현남면 상월천리 16635<NA><NA>
273광산목장염소강원도 양양군 손양면 삽존리 53-12538.029702128.638471