Overview

Dataset statistics

Number of variables: 4
Number of observations: 1536
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 49.6 KiB
Average record size in memory: 33.1 B
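
The average record size follows from the two figures above it: total size in memory divided by the number of observations. A minimal check, assuming 1 KiB means 1024 bytes:

```python
# Values copied from the dataset statistics above.
total_size_kib = 49.6
n_observations = 1536

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```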

Variable types

Text: 2
Categorical: 1
Numeric: 1

Dataset

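Description: 충청남도 예산군_축산현황 및 가금류 농가 현황_20211104 (Chungcheongnam-do Yesan-gun, Livestock Status and Poultry Farm Status, 20211104). Provides part of the data on livestock and poultry farms in Yesan-gun, Chungcheongnam-do: farm name, livestock type, number of animals raised, and location.
Author: 충청남도 예산군 (Yesan-gun, Chungcheongnam-do)
URL: https://www.data.go.kr/data/15034259/fileData.do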

Alerts

Imbalance: 주사육업종 is highly imbalanced (67.9%)
Zeros: 사육두수 has 19 (1.2%) zeros
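
Both alert figures can be recomputed from numbers elsewhere in this report. The sketch below assumes the imbalance score is one minus the Shannon entropy divided by log2 of the number of categories. That formula is an assumption that happens to reproduce 67.9% from the 주사육업종 counts in the Common Values table; the report itself does not state it.

```python
import math

# Category counts for 주사육업종 from the Common Values table.
# "Other values (5)" totals 5, so each of those five categories occurs once.
counts = [1217, 106, 104, 42, 17, 16, 12, 8, 7, 2] + [1] * 5


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits."""
    total = sum(counts)
    k = len(counts)
    if k <= 1:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


score = imbalance_score(counts)
zeros_share = 19 / sum(counts)  # 19 zero values in 사육두수

print(f"imbalance: {score:.1%}")
print(f"zeros: {zeros_share:.1%}")
```

A score of 0 would mean perfectly even categories and 1 would mean a single category; the dominance of 한우 (1217 of 1536 rows) is what pushes the score up.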

Reproduction

Analysis started: 2023-12-11 22:55:02.367730
Analysis finished: 2023-12-11 22:55:02.843941
Duration: 0.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
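
As a rough illustration of the reproduction details, the sketch below recomputes the duration from the two timestamps and outlines how a comparable report might be generated. `build_report`, its file names, the report title, and the `cp949` encoding are hypothetical; the function is only defined here, not called, and needs pandas and ydata-profiling installed.

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-11 22:55:02.367730", FMT)
finished = datetime.strptime("2023-12-11 22:55:02.843941", FMT)
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")


def build_report(csv_path, output_html="report.html", encoding="cp949"):
    """Hypothetical helper: profile a CSV file and write an HTML report.

    Assumptions: pandas and ydata-profiling are installed, and the CSV
    uses the given encoding (cp949 is a guess for this dataset).
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="Livestock and poultry farm status")
    profile.to_file(output_html)
    return profile
```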

Variables

사업장명칭
Text

Distinct: 1253
Distinct (%): 81.6%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB

Length

Max length: 20
Median length: 4
Mean length: 4.2630208
Min length: 2

Characters and Unicode

Total characters: 6548
Distinct characters: 374
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
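
A minimal sketch of how such per-category counts can be derived with Python's standard `unicodedata` module. The mapping from two-letter category codes to the long names used in this report is written out by hand and covers only the categories that appear here. The standard library does not expose script or block properties, so only the category breakdown is shown.

```python
import unicodedata
from collections import Counter

# Long names for the general categories that occur in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Nl": "Letter Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
}


def category_counts(values):
    """Count Unicode general categories over every character in `values`."""
    counts = Counter()
    for text in values:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Two farm names that appear in this report's sample rows.
counts = category_counts(["동방축산", "해우2농장"])
print(counts)
```

Hangul syllables fall under Other Letter, which is why that category dominates the text columns in this dataset.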

Unique

Unique: 1065
Unique (%): 69.3%
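
The two counts measure different things. The sketch below assumes "Distinct" is the number of different values and "Unique" is the number of values that occur exactly once, which is how the figures in this report behave. The toy list is illustrative and is not the real column.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Toy data, not the real column: one name repeats, two appear once.
toy = ["태신목장", "태신목장", "우리농장", "동방축산"]
print(distinct_and_unique(toy))

# Percentages for 사업장명칭, recomputed from the counts in this report.
n_observations = 1536
distinct_pct = 100 * 1253 / n_observations
unique_pct = 100 * 1065 / n_observations
print(f"{distinct_pct:.1f}% distinct, {unique_pct:.1f}% unique")
```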

Sample

1st row: 동방축산
2nd row: 응봉농장
3rd row: 은현농장
4th row: 은곡농장
5th row: 자영농장

Value | Count | Frequency (%)
태신목장 | 9 | 0.6%
우리농장 | 8 | 0.5%
하나농장 | 8 | 0.5%
이티농장 | 6 | 0.4%
구만농장 | 6 | 0.4%
농장 | 6 | 0.4%
한우농장 | 6 | 0.4%
가나안농장 | 5 | 0.3%
하늘농장 | 5 | 0.3%
신흥농장 | 5 | 0.3%
Other values (1253) | 1498 | 95.9%

Most occurring characters

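Value | Count | Frequency (%)
(not preserved) | 1426 | 21.8%
(not preserved) | 1146 | 17.5%
(not preserved) | 294 | 4.5%
(not preserved) | 95 | 1.5%
(not preserved) | 89 | 1.4%
(not preserved) | 88 | 1.3%
(not preserved) | 72 | 1.1%
(not preserved) | 72 | 1.1%
(not preserved) | 67 | 1.0%
(not preserved) | 62 | 0.9%
Other values (364) | 3137 | 47.9%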

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6419 | 98.0%
Decimal Number | 43 | 0.7%
Space Separator | 26 | 0.4%
Close Punctuation | 20 | 0.3%
Open Punctuation | 20 | 0.3%
Uppercase Letter | 19 | 0.3%
Letter Number | 1 | < 0.1%

Most frequent character per category

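Other Letter
Value | Count | Frequency (%)
(not preserved) | 1426 | 22.2%
(not preserved) | 1146 | 17.9%
(not preserved) | 294 | 4.6%
(not preserved) | 95 | 1.5%
(not preserved) | 89 | 1.4%
(not preserved) | 88 | 1.4%
(not preserved) | 72 | 1.1%
(not preserved) | 72 | 1.1%
(not preserved) | 67 | 1.0%
(not preserved) | 62 | 1.0%
Other values (345) | 3008 | 46.9%

Uppercase Letter
Value | Count | Frequency (%)
H | 3 | 15.8%
B | 3 | 15.8%
S | 2 | 10.5%
K | 2 | 10.5%
A | 2 | 10.5%
E | 1 | 5.3%
R | 1 | 5.3%
P | 1 | 5.3%
J | 1 | 5.3%
O | 1 | 5.3%
Other values (2) | 2 | 10.5%

Decimal Number
Value | Count | Frequency (%)
2 | 40 | 93.0%
1 | 2 | 4.7%
3 | 1 | 2.3%

Space Separator
Value | Count | Frequency (%)
(space) | 26 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 20 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 20 | 100.0%

Letter Number
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%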

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6419 | 98.0%
Common | 109 | 1.7%
Latin | 20 | 0.3%

Most frequent character per script

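Hangul
Value | Count | Frequency (%)
(not preserved) | 1426 | 22.2%
(not preserved) | 1146 | 17.9%
(not preserved) | 294 | 4.6%
(not preserved) | 95 | 1.5%
(not preserved) | 89 | 1.4%
(not preserved) | 88 | 1.4%
(not preserved) | 72 | 1.1%
(not preserved) | 72 | 1.1%
(not preserved) | 67 | 1.0%
(not preserved) | 62 | 1.0%
Other values (345) | 3008 | 46.9%

Latin
Value | Count | Frequency (%)
H | 3 | 15.0%
B | 3 | 15.0%
S | 2 | 10.0%
K | 2 | 10.0%
A | 2 | 10.0%
(not preserved) | 1 | 5.0%
E | 1 | 5.0%
R | 1 | 5.0%
P | 1 | 5.0%
J | 1 | 5.0%
Other values (3) | 3 | 15.0%

Common
Value | Count | Frequency (%)
2 | 40 | 36.7%
(space) | 26 | 23.9%
) | 20 | 18.3%
( | 20 | 18.3%
1 | 2 | 1.8%
3 | 1 | 0.9%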

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6419 | 98.0%
ASCII | 128 | 2.0%
Number Forms | 1 | < 0.1%

Most frequent character per block

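Hangul
Value | Count | Frequency (%)
(not preserved) | 1426 | 22.2%
(not preserved) | 1146 | 17.9%
(not preserved) | 294 | 4.6%
(not preserved) | 95 | 1.5%
(not preserved) | 89 | 1.4%
(not preserved) | 88 | 1.4%
(not preserved) | 72 | 1.1%
(not preserved) | 72 | 1.1%
(not preserved) | 67 | 1.0%
(not preserved) | 62 | 1.0%
Other values (345) | 3008 | 46.9%

ASCII
Value | Count | Frequency (%)
2 | 40 | 31.2%
(space) | 26 | 20.3%
) | 20 | 15.6%
( | 20 | 15.6%
H | 3 | 2.3%
B | 3 | 2.3%
S | 2 | 1.6%
1 | 2 | 1.6%
K | 2 | 1.6%
A | 2 | 1.6%
Other values (8) | 8 | 6.2%

Number Forms
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%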

주사육업종
Categorical

IMBALANCE 

Distinct: 15
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB

Top categories: 한우 1217, 젖소 106, 돼지 104, 육계 42, 육우 17, Other values (10) 50

Length

Max length: 6
Median length: 2
Mean length: 2.0227865
Min length: 2

Unique

Unique: 5
Unique (%): 0.3%

Sample

1st row: 돼지
2nd row: 돼지
3rd row: 한우
4th row: 한우
5th row: 한우

Common Values

Value | Count | Frequency (%)
한우 | 1217 | 79.2%
젖소 | 106 | 6.9%
돼지 | 104 | 6.8%
육계 | 42 | 2.7%
육우 | 17 | 1.1%
사슴 | 16 | 1.0%
산양 | 12 | 0.8%
종계/산란계 | 8 | 0.5%
염소 | 7 | 0.5%
메추리 | 2 | 0.1%
Other values (5) | 5 | 0.3%
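
The layout of this table, ten most frequent values followed by a folded "Other values" row, can be sketched as follows. The counts come from the table above. The five single-occurrence category names are not listed in the report, so the `other_1` to `other_5` keys are placeholders.

```python
def frequency_table(counts, top_n=10):
    """Return (label, count, percent) rows, folding the tail into one row."""
    total = sum(counts.values())
    ranked = sorted(counts.items(), key=lambda kv: kv[1], reverse=True)
    rows = [(label, n, 100 * n / total) for label, n in ranked[:top_n]]
    tail = ranked[top_n:]
    if tail:
        tail_count = sum(n for _, n in tail)
        label = f"Other values ({len(tail)})"
        rows.append((label, tail_count, 100 * tail_count / total))
    return rows


counts = {
    "한우": 1217, "젖소": 106, "돼지": 104, "육계": 42, "육우": 17,
    "사슴": 16, "산양": 12, "종계/산란계": 8, "염소": 7, "메추리": 2,
}
# Placeholder names for the five categories the report does not list.
counts.update({f"other_{i}": 1 for i in range(1, 6)})

rows = frequency_table(counts)
for label, n, pct in rows:
    print(f"{label} | {n} | {pct:.1f}%")
```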

Length

[Figure: Histogram of lengths of the category]

사업장소재지(지번)
Text

Distinct: 1513
Distinct (%): 98.5%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB

Length

Max length: 99
Median length: 82
Mean length: 32.32487
Min length: 4

Characters and Unicode

Total characters: 49651
Distinct characters: 152
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1500
Unique (%): 97.7%

Sample

1st row: 충청남도 예산군 예산읍 간양리 492번지 2호
2nd row: 충청남도 예산군 응봉면 계정리 222번지 외1필지(237-6)
3rd row: 충청남도 예산군 오가면 분천리 16번지 4호 외3필지(-7,-11,-12)
4th row: 충청남도 예산군 봉산면 마교리 산 99번지 4호 외 1필지(마교리 산99-3)
5th row: 충청남도 예산군 오가면 원천리 844번지 19호

Value | Count | Frequency (%)
충청남도 | 1526 | 15.3%
예산군 | 1526 | 15.3%
(not preserved) | 292 | 2.9%
1호 | 276 | 2.8%
오가면 | 214 | 2.1%
고덕면 | 213 | 2.1%
신양면 | 207 | 2.1%
2호 | 169 | 1.7%
광시면 | 168 | 1.7%
삽교읍 | 166 | 1.7%
Other values (1818) | 5214 | 52.3%

Most occurring characters

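Value | Count | Frequency (%)
(space) | 10919 | 22.0%
(not preserved) | 2171 | 4.4%
1 | 2138 | 4.3%
(not preserved) | 1863 | 3.8%
(not preserved) | 1568 | 3.2%
(not preserved) | 1547 | 3.1%
(not preserved) | 1527 | 3.1%
(not preserved) | 1527 | 3.1%
(not preserved) | 1526 | 3.1%
(not preserved) | 1526 | 3.1%
Other values (142) | 23339 | 47.0%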

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 25621 | 51.6%
Space Separator | 10919 | 22.0%
Decimal Number | 10294 | 20.7%
Dash Punctuation | 1033 | 2.1%
Other Punctuation | 801 | 1.6%
Open Punctuation | 493 | 1.0%
Close Punctuation | 490 | 1.0%

Most frequent character per category

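Other Letter
Value | Count | Frequency (%)
(not preserved) | 2171 | 8.5%
(not preserved) | 1863 | 7.3%
(not preserved) | 1568 | 6.1%
(not preserved) | 1547 | 6.0%
(not preserved) | 1527 | 6.0%
(not preserved) | 1527 | 6.0%
(not preserved) | 1526 | 6.0%
(not preserved) | 1526 | 6.0%
(not preserved) | 1526 | 6.0%
(not preserved) | 1488 | 5.8%
Other values (126) | 9352 | 36.5%

Decimal Number
Value | Count | Frequency (%)
1 | 2138 | 20.8%
2 | 1474 | 14.3%
3 | 1271 | 12.3%
4 | 1089 | 10.6%
5 | 993 | 9.6%
6 | 781 | 7.6%
7 | 741 | 7.2%
8 | 644 | 6.3%
0 | 608 | 5.9%
9 | 555 | 5.4%

Other Punctuation
Value | Count | Frequency (%)
, | 797 | 99.5%
. | 4 | 0.5%

Space Separator
Value | Count | Frequency (%)
(space) | 10919 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1033 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 493 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 490 | 100.0%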

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 25621 | 51.6%
Common | 24030 | 48.4%

Most frequent character per script

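Hangul
Value | Count | Frequency (%)
(not preserved) | 2171 | 8.5%
(not preserved) | 1863 | 7.3%
(not preserved) | 1568 | 6.1%
(not preserved) | 1547 | 6.0%
(not preserved) | 1527 | 6.0%
(not preserved) | 1527 | 6.0%
(not preserved) | 1526 | 6.0%
(not preserved) | 1526 | 6.0%
(not preserved) | 1526 | 6.0%
(not preserved) | 1488 | 5.8%
Other values (126) | 9352 | 36.5%

Common
Value | Count | Frequency (%)
(space) | 10919 | 45.4%
1 | 2138 | 8.9%
2 | 1474 | 6.1%
3 | 1271 | 5.3%
4 | 1089 | 4.5%
- | 1033 | 4.3%
5 | 993 | 4.1%
, | 797 | 3.3%
6 | 781 | 3.3%
7 | 741 | 3.1%
Other values (6) | 2794 | 11.6%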

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 25621 | 51.6%
ASCII | 24030 | 48.4%

Most frequent character per block

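ASCII
Value | Count | Frequency (%)
(space) | 10919 | 45.4%
1 | 2138 | 8.9%
2 | 1474 | 6.1%
3 | 1271 | 5.3%
4 | 1089 | 4.5%
- | 1033 | 4.3%
5 | 993 | 4.1%
, | 797 | 3.3%
6 | 781 | 3.3%
7 | 741 | 3.1%
Other values (6) | 2794 | 11.6%

Hangul
Value | Count | Frequency (%)
(not preserved) | 2171 | 8.5%
(not preserved) | 1863 | 7.3%
(not preserved) | 1568 | 6.1%
(not preserved) | 1547 | 6.0%
(not preserved) | 1527 | 6.0%
(not preserved) | 1527 | 6.0%
(not preserved) | 1526 | 6.0%
(not preserved) | 1526 | 6.0%
(not preserved) | 1526 | 6.0%
(not preserved) | 1488 | 5.8%
Other values (126) | 9352 | 36.5%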

사육두수
Real number (ℝ)

ZEROS 

Distinct: 257
Distinct (%): 16.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2699.0898
Minimum: 0
Maximum: 645400
Zeros: 19
Zeros (%): 1.2%
Negative: 0
Negative (%): 0.0%
Memory size: 13.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 4
Q1: 16
Median: 41.5
Q3: 110.25
95-th percentile: 2425
Maximum: 645400
Range: 645400
Interquartile range (IQR): 94.25

Descriptive statistics

Standard deviation: 24839.68
Coefficient of variation (CV): 9.2029838
Kurtosis: 409.28697
Mean: 2699.0898
Median Absolute Deviation (MAD): 31.5
Skewness: 18.4935
Sum: 4145802
Variance: 6.170097 × 10^8
Monotonicity: Not monotonic
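
Several of these figures are derived from the others, which gives a quick consistency check. The sketch below uses only numbers stated in this report (the observation count comes from the overview). Because the standard deviation is already rounded, the recomputed values can differ in the last digits.

```python
# Figures copied from the 사육두수 statistics in this report.
n_observations = 1536          # from the dataset overview
total = 4145802                # Sum
minimum, maximum = 0, 645400
q1, q3 = 16, 110.25
std = 24839.68                 # already rounded in the report

mean = total / n_observations  # Mean
value_range = maximum - minimum
iqr = q3 - q1                  # Interquartile range
cv = std / mean                # Coefficient of variation
variance = std ** 2            # Variance

print(f"mean={mean:.4f} range={value_range} iqr={iqr}")
print(f"cv={cv:.4f} variance={variance:.6e}")
```

The median (41.5) sits far below the mean (about 2699), which matches the strong right skew reported above: a handful of very large farms pull the mean up.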
[Figure: Histogram with fixed size bins (bins=50)]
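
With 50 equal-width bins over such a skewed column, nearly every row lands in the first bin. A minimal sketch, assuming the bins span the minimum to the maximum and the last bin includes the maximum; `fixed_bin_index` is a hypothetical helper, not part of the report's tooling.

```python
def fixed_bin_index(value, lo, hi, bins):
    """Index of the equal-width bin holding `value`; the last bin includes `hi`."""
    if not lo <= value <= hi:
        raise ValueError("value outside histogram range")
    width = (hi - lo) / bins
    return min(int((value - lo) // width), bins - 1)


lo, hi, bins = 0, 645400, 50   # minimum, maximum, and bin count from the report
width = (hi - lo) / bins

print(f"bin width: {width}")
print(fixed_bin_index(2425, lo, hi, bins))    # the 95th percentile
print(fixed_bin_index(645400, lo, hi, bins))  # the maximum
```

Each bin is 12908 wide, and the 95th percentile (2425) is well inside the first one, so at least 95% of farms share a single bar. A log-scaled axis or clipping the largest values would show more of the distribution.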

Value | Count | Frequency (%)
50 | 71 | 4.6%
30 | 63 | 4.1%
20 | 51 | 3.3%
15 | 46 | 3.0%
40 | 46 | 3.0%
100 | 45 | 2.9%
10 | 42 | 2.7%
150 | 41 | 2.7%
60 | 35 | 2.3%
4 | 30 | 2.0%
Other values (247) | 1066 | 69.4%

Minimum 10 values
Value | Count | Frequency (%)
0 | 19 | 1.2%
1 | 5 | 0.3%
2 | 21 | 1.4%
3 | 14 | 0.9%
4 | 30 | 2.0%
5 | 26 | 1.7%
6 | 22 | 1.4%
7 | 16 | 1.0%
8 | 23 | 1.5%
9 | 16 | 1.0%

Maximum 10 values
Value | Count | Frequency (%)
645400 | 1 | 0.1%
480000 | 1 | 0.1%
357000 | 1 | 0.1%
160000 | 1 | 0.1%
140000 | 1 | 0.1%
100000 | 3 | 0.2%
98000 | 1 | 0.1%
97000 | 1 | 0.1%
80000 | 4 | 0.3%
75000 | 2 | 0.1%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]
Variable | 주사육업종 | 사육두수
주사육업종 | 1.000 | 0.769
사육두수 | 0.769 | 1.000

[Figure: correlation heatmap]
Variable | 사육두수 | 주사육업종
사육두수 | 1.000 | 0.491
주사육업종 | 0.491 | 1.000

Missing values

[Figure: nullity count] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.

Sample

First rows
Index | 사업장명칭 | 주사육업종 | 사업장소재지(지번) | 사육두수
0 | 동방축산 | 돼지 | 충청남도 예산군 예산읍 간양리 492번지 2호 | 2300
1 | 응봉농장 | 돼지 | 충청남도 예산군 응봉면 계정리 222번지 외1필지(237-6) | 1100
2 | 은현농장 | 한우 | 충청남도 예산군 오가면 분천리 16번지 4호 외3필지(-7,-11,-12) | 117
3 | 은곡농장 | 한우 | 충청남도 예산군 봉산면 마교리 산 99번지 4호 외 1필지(마교리 산99-3) | 60
4 | 자영농장 | 한우 | 충청남도 예산군 오가면 원천리 844번지 19호 | 300
5 | 한택농장 | 한우 | 충청남도 예산군 대술면 궐곡리 353번지 외5(353-1, 353-2, 353-3, 613-1, 613-3) | 150
6 | 동진농장 | 한우 | 충청남도 예산군 오가면 양막리 144번지 1호 | 193
7 | 덕은농장 | 한우 | 충청남도 예산군 오가면 양막리 105번지 3호 외 3필지(71-22, 105-2, 71) | 96
8 | 기품농장 | 한우 | 충청남도 예산군 오가면 양막리 139번지 2호 | 60
9 | 황소와농부 | 한우 | 충청남도 예산군 오가면 양막리 38번지 5호 | 80

Last rows
Index | 사업장명칭 | 주사육업종 | 사업장소재지(지번) | 사육두수
1526 | 다산농장 | 한우 | 충청남도 예산군 봉산면 하평리 177번지 5호 외 1필지(177-2) | 11
1527 | 해우2농장 | 한우 | 충청남도 예산군 대술면 산정리 394번지 | 153
1528 | 명호농장 | 한우 | 충청남도 예산군 대술면 궐곡리 561번지 외 1필지(560-2) | 263
1529 | 은하목장 | 한우 | 충청남도 예산군 고덕면 용리 747번지 | 229
1530 | 계촌농장 | 한우 | 충청남도 예산군 신암면 계촌리 260번지 21호 | 13
1531 | 축산예일농장 | 한우 | 충청남도 예산군 신암면 조곡리 380번지 7호 | 15
1532 | 진수농장 | 한우 | 충청남도 예산군 신양면 무봉리 77번지 6호 | 19
1533 | 별리목장 | 육우 | 충청남도 예산군 신암면 별리 493번지 | 20
1534 | 유일농장 | 한우 | 충청남도 예산군 고덕면 사리 854번지 1호 외 2필지(854-2, 봉산면 효교리 127) | 182
1535 | 창정농장 | 한우 | 충청남도 예산군 삽교읍 상하리 273번지 2호 | 195