Overview

Dataset statistics

Number of variables5
Number of observations943
Missing cells145
Missing cells (%)3.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory37.9 KiB
Average record size in memory41.1 B

Variable types

Text3
Categorical1
Numeric1

Dataset

Description전라북도 고창군에 위치한 축산업 관리현황에 대한 정보입니다. 사업장 명칭, 주사육업종, 사업장 소재지, 사육두수에 대한 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15034241/fileData.do

Alerts

주사육업종 is highly imbalanced (55.6%)Imbalance
사업장소재지(도로명) has 145 (15.4%) missing valuesMissing
사육두수 has 165 (17.5%) zerosZeros

Reproduction

Analysis started2023-12-12 11:28:54.985113
Analysis finished2023-12-12 11:28:56.202892
Duration1.22 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct840
Distinct (%)89.1%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
2023-12-12T20:28:56.516727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length4
Mean length4.8441145
Min length2

Characters and Unicode

Total characters4568
Distinct characters306
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique764 ?
Unique (%)81.0%

Sample

1st row상하하군목장
2nd row상하 유남목장
3rd row농업회사법인 유한회사 태흥축산
4th row상하한일목장
5th row오월농장
ValueCountFrequency (%)
농장 257
 
20.1%
목장 23
 
1.8%
축산 8
 
0.6%
상하 7
 
0.5%
형제농장 5
 
0.4%
한우 5
 
0.4%
유성농장 4
 
0.3%
덕암목장 4
 
0.3%
대성 4
 
0.3%
혜원농장 4
 
0.3%
Other values (851) 956
74.9%
2023-12-12T20:28:57.193649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
892
19.5%
745
 
16.3%
334
 
7.3%
147
 
3.2%
99
 
2.2%
85
 
1.9%
66
 
1.4%
53
 
1.2%
48
 
1.1%
47
 
1.0%
Other values (296) 2052
44.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4171
91.3%
Space Separator 334
 
7.3%
Decimal Number 34
 
0.7%
Open Punctuation 12
 
0.3%
Close Punctuation 12
 
0.3%
Uppercase Letter 4
 
0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
892
21.4%
745
 
17.9%
147
 
3.5%
99
 
2.4%
85
 
2.0%
66
 
1.6%
53
 
1.3%
48
 
1.2%
47
 
1.1%
46
 
1.1%
Other values (283) 1943
46.6%
Decimal Number
ValueCountFrequency (%)
2 23
67.6%
5 4
 
11.8%
1 3
 
8.8%
3 2
 
5.9%
4 1
 
2.9%
0 1
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
J 2
50.0%
K 1
25.0%
H 1
25.0%
Space Separator
ValueCountFrequency (%)
334
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4171
91.3%
Common 392
 
8.6%
Latin 5
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
892
21.4%
745
 
17.9%
147
 
3.5%
99
 
2.4%
85
 
2.0%
66
 
1.6%
53
 
1.3%
48
 
1.2%
47
 
1.1%
46
 
1.1%
Other values (283) 1943
46.6%
Common
ValueCountFrequency (%)
334
85.2%
2 23
 
5.9%
( 12
 
3.1%
) 12
 
3.1%
5 4
 
1.0%
1 3
 
0.8%
3 2
 
0.5%
4 1
 
0.3%
0 1
 
0.3%
Latin
ValueCountFrequency (%)
J 2
40.0%
1
20.0%
K 1
20.0%
H 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4171
91.3%
ASCII 396
 
8.7%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
892
21.4%
745
 
17.9%
147
 
3.5%
99
 
2.4%
85
 
2.0%
66
 
1.6%
53
 
1.3%
48
 
1.2%
47
 
1.1%
46
 
1.1%
Other values (283) 1943
46.6%
ASCII
ValueCountFrequency (%)
334
84.3%
2 23
 
5.8%
( 12
 
3.0%
) 12
 
3.0%
5 4
 
1.0%
1 3
 
0.8%
3 2
 
0.5%
J 2
 
0.5%
K 1
 
0.3%
4 1
 
0.3%
Other values (2) 2
 
0.5%
Number Forms
ValueCountFrequency (%)
1
100.0%

주사육업종
Categorical

IMBALANCE 

Distinct15
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
한우
660 
젖소
82 
육계
68 
오리
 
40
돼지
 
26
Other values (10)
67 

Length

Max length6
Median length2
Mean length2.1049841
Min length2

Unique

Unique4 ?
Unique (%)0.4%

Sample

1st row젖소
2nd row젖소
3rd row돼지
4th row한우
5th row젖소

Common Values

ValueCountFrequency (%)
한우 660
70.0%
젖소 82
 
8.7%
육계 68
 
7.2%
오리 40
 
4.2%
돼지 26
 
2.8%
종계/산란계 20
 
2.1%
염소 15
 
1.6%
종계업 12
 
1.3%
육우 10
 
1.1%
종돈업 4
 
0.4%
Other values (5) 6
 
0.6%

Length

2023-12-12T20:28:57.475832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 660
70.0%
젖소 82
 
8.7%
육계 68
 
7.2%
오리 40
 
4.2%
돼지 26
 
2.8%
종계/산란계 20
 
2.1%
염소 15
 
1.6%
종계업 12
 
1.3%
육우 10
 
1.1%
종돈업 4
 
0.4%
Other values (5) 6
 
0.6%
Distinct911
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
2023-12-12T20:28:58.041241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length48
Mean length25.602333
Min length4

Characters and Unicode

Total characters24143
Distinct characters140
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique884 ?
Unique (%)93.7%

Sample

1st row전라북도 고창군 공음면 군유리 584번지 2호
2nd row전라북도 고창군 대산면 상금리 산 156번지
3rd row전라북도 고창군 성송면 낙양리 575번지 1호
4th row전라북도 고창군 대산면 성남리 634번지 1호
5th row전라북도 고창군 대산면 중산리 1192번지 10호
ValueCountFrequency (%)
전라북도 937
 
18.0%
고창군 937
 
18.0%
1호 167
 
3.2%
대산면 121
 
2.3%
흥덕면 113
 
2.2%
공음면 92
 
1.8%
무장면 83
 
1.6%
아산면 76
 
1.5%
2호 71
 
1.4%
부안면 66
 
1.3%
Other values (862) 2536
48.8%
2023-12-12T20:28:58.879648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6137
25.4%
1069
 
4.4%
997
 
4.1%
992
 
4.1%
967
 
4.0%
956
 
4.0%
955
 
4.0%
947
 
3.9%
942
 
3.9%
937
 
3.9%
Other values (130) 9244
38.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14563
60.3%
Space Separator 6137
25.4%
Decimal Number 3417
 
14.2%
Dash Punctuation 24
 
0.1%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1069
 
7.3%
997
 
6.8%
992
 
6.8%
967
 
6.6%
956
 
6.6%
955
 
6.6%
947
 
6.5%
942
 
6.5%
937
 
6.4%
925
 
6.4%
Other values (116) 4876
33.5%
Decimal Number
ValueCountFrequency (%)
1 726
21.2%
2 398
11.6%
3 339
9.9%
4 325
9.5%
5 323
9.5%
6 292
8.5%
7 287
 
8.4%
8 284
 
8.3%
0 241
 
7.1%
9 202
 
5.9%
Space Separator
ValueCountFrequency (%)
6137
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14563
60.3%
Common 9580
39.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1069
 
7.3%
997
 
6.8%
992
 
6.8%
967
 
6.6%
956
 
6.6%
955
 
6.6%
947
 
6.5%
942
 
6.5%
937
 
6.4%
925
 
6.4%
Other values (116) 4876
33.5%
Common
ValueCountFrequency (%)
6137
64.1%
1 726
 
7.6%
2 398
 
4.2%
3 339
 
3.5%
4 325
 
3.4%
5 323
 
3.4%
6 292
 
3.0%
7 287
 
3.0%
8 284
 
3.0%
0 241
 
2.5%
Other values (4) 228
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14563
60.3%
ASCII 9580
39.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6137
64.1%
1 726
 
7.6%
2 398
 
4.2%
3 339
 
3.5%
4 325
 
3.4%
5 323
 
3.4%
6 292
 
3.0%
7 287
 
3.0%
8 284
 
3.0%
0 241
 
2.5%
Other values (4) 228
 
2.4%
Hangul
ValueCountFrequency (%)
1069
 
7.3%
997
 
6.8%
992
 
6.8%
967
 
6.6%
956
 
6.6%
955
 
6.6%
947
 
6.5%
942
 
6.5%
937
 
6.4%
925
 
6.4%
Other values (116) 4876
33.5%
Distinct758
Distinct (%)95.0%
Missing145
Missing (%)15.4%
Memory size7.5 KiB
2023-12-12T20:28:59.418817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length29
Mean length22.399749
Min length18

Characters and Unicode

Total characters17875
Distinct characters209
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique721 ?
Unique (%)90.4%

Sample

1st row전라북도 고창군 공음면 청보리로 825
2nd row전라북도 고창군 대산면 고산성로 117-35
3rd row전라북도 고창군 성송면 학천로 657
4th row전라북도 고창군 대산면 칠거리로 411-2
5th row전라북도 고창군 대산면 덕천칠거리길 310-25
ValueCountFrequency (%)
전라북도 798
19.9%
고창군 798
19.9%
대산면 109
 
2.7%
흥덕면 90
 
2.2%
공음면 77
 
1.9%
무장면 67
 
1.7%
아산면 67
 
1.7%
고수면 63
 
1.6%
부안면 54
 
1.3%
해리면 52
 
1.3%
Other values (1025) 1835
45.8%
2023-12-12T20:29:00.255926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3224
18.0%
916
 
5.1%
840
 
4.7%
829
 
4.6%
809
 
4.5%
807
 
4.5%
798
 
4.5%
798
 
4.5%
764
 
4.3%
1 667
 
3.7%
Other values (199) 7423
41.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10956
61.3%
Space Separator 3224
 
18.0%
Decimal Number 3133
 
17.5%
Dash Punctuation 546
 
3.1%
Close Punctuation 8
 
< 0.1%
Open Punctuation 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
916
 
8.4%
840
 
7.7%
829
 
7.6%
809
 
7.4%
807
 
7.4%
798
 
7.3%
798
 
7.3%
764
 
7.0%
500
 
4.6%
321
 
2.9%
Other values (185) 3574
32.6%
Decimal Number
ValueCountFrequency (%)
1 667
21.3%
2 447
14.3%
3 347
11.1%
4 296
9.4%
5 275
8.8%
6 239
 
7.6%
7 237
 
7.6%
8 217
 
6.9%
9 205
 
6.5%
0 203
 
6.5%
Space Separator
ValueCountFrequency (%)
3224
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 546
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10956
61.3%
Common 6919
38.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
916
 
8.4%
840
 
7.7%
829
 
7.6%
809
 
7.4%
807
 
7.4%
798
 
7.3%
798
 
7.3%
764
 
7.0%
500
 
4.6%
321
 
2.9%
Other values (185) 3574
32.6%
Common
ValueCountFrequency (%)
3224
46.6%
1 667
 
9.6%
- 546
 
7.9%
2 447
 
6.5%
3 347
 
5.0%
4 296
 
4.3%
5 275
 
4.0%
6 239
 
3.5%
7 237
 
3.4%
8 217
 
3.1%
Other values (4) 424
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10956
61.3%
ASCII 6919
38.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3224
46.6%
1 667
 
9.6%
- 546
 
7.9%
2 447
 
6.5%
3 347
 
5.0%
4 296
 
4.3%
5 275
 
4.0%
6 239
 
3.5%
7 237
 
3.4%
8 217
 
3.1%
Other values (4) 424
 
6.1%
Hangul
ValueCountFrequency (%)
916
 
8.4%
840
 
7.7%
829
 
7.6%
809
 
7.4%
807
 
7.4%
798
 
7.3%
798
 
7.3%
764
 
7.0%
500
 
4.6%
321
 
2.9%
Other values (185) 3574
32.6%

사육두수
Real number (ℝ)

ZEROS 

Distinct246
Distinct (%)26.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6169.1145
Minimum0
Maximum220000
Zeros165
Zeros (%)17.5%
Negative0
Negative (%)0.0%
Memory size8.4 KiB
2023-12-12T20:29:00.558408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q16
median33
Q3120
95-th percentile53670
Maximum220000
Range220000
Interquartile range (IQR)114

Descriptive statistics

Standard deviation21293.982
Coefficient of variation (CV)3.4517079
Kurtosis25.400631
Mean6169.1145
Median Absolute Deviation (MAD)33
Skewness4.5960123
Sum5817475
Variance4.5343365 × 108
MonotonicityNot monotonic
2023-12-12T20:29:00.832958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 165
 
17.5%
10 25
 
2.7%
20 24
 
2.5%
5 21
 
2.2%
3 20
 
2.1%
50 20
 
2.1%
30 18
 
1.9%
4 18
 
1.9%
100 15
 
1.6%
40 15
 
1.6%
Other values (236) 602
63.8%
ValueCountFrequency (%)
0 165
17.5%
1 1
 
0.1%
2 4
 
0.4%
3 20
 
2.1%
4 18
 
1.9%
5 21
 
2.2%
6 14
 
1.5%
7 13
 
1.4%
8 12
 
1.3%
9 11
 
1.2%
ValueCountFrequency (%)
220000 1
0.1%
160000 1
0.1%
155000 1
0.1%
150000 1
0.1%
120000 1
0.1%
115000 1
0.1%
113386 1
0.1%
112000 1
0.1%
110000 1
0.1%
104976 1
0.1%

Interactions

2023-12-12T20:28:55.702118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:29:00.983614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주사육업종사육두수
주사육업종1.0000.631
사육두수0.6311.000
2023-12-12T20:29:01.127960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육두수주사육업종
사육두수1.0000.315
주사육업종0.3151.000

Missing values

2023-12-12T20:28:55.923570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:28:56.126648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명칭주사육업종사업장소재지(지번)사업장소재지(도로명)사육두수
0상하하군목장젖소전라북도 고창군 공음면 군유리 584번지 2호전라북도 고창군 공음면 청보리로 825300
1상하 유남목장젖소전라북도 고창군 대산면 상금리 산 156번지전라북도 고창군 대산면 고산성로 117-35235
2농업회사법인 유한회사 태흥축산돼지전라북도 고창군 성송면 낙양리 575번지 1호전라북도 고창군 성송면 학천로 65715350
3상하한일목장한우전라북도 고창군 대산면 성남리 634번지 1호전라북도 고창군 대산면 칠거리로 411-294
4오월농장젖소전라북도 고창군 대산면 중산리 1192번지 10호전라북도 고창군 대산면 덕천칠거리길 310-25165
5노원농장돼지전라북도 고창군 신림면 세곡리 540번지전라북도 고창군 신림면 관은정길 481400
6청강농원한우전라북도 고창군 아산면 남산리 408번지 14호<NA>31
7덕암목장젖소전라북도 고창군 공음면 덕암리 182번지 11호전라북도 고창군 공음면 덕암로 286160
8태봉농장육계전라북도 고창군 고수면 예지리 461번지 18호전라북도 고창군 고수면 태봉로 190113386
9청룡농장한우전라북도 고창군 대산면 연동리 446번지 2호전라북도 고창군 대산면 장자산로 498100
사업장명칭주사육업종사업장소재지(지번)사업장소재지(도로명)사육두수
933건흥농장종계업전라북도 고창군 고창읍 신월리 30번지 7호전라북도 고창군 고창읍 신월길 35-5427500
934오송농장종계업전라북도 고창군 성송면 괴치리 472번지전라북도 고창군 성송면 주산길 32-1115100
935벧엘농장종계업전라북도 고창군 아산면 학전리 796번지 1호전라북도 고창군 아산면 월성길 18-22827500
936(주)행복한농장종돈업전라북도 고창군 무장면 송계리 352번지전라북도 고창군 무장면 송림산로 354-91575
937농업회사법인(주)해림종돈종돈업전라북도 고창군 무장면 덕림리 829번지 3호전라북도 고창군 무장면 칠거리로 4220
938성식농장종계업전라북도 고창군 고창읍 율계리 320번지 1호전라북도 고창군 고창읍 전봉준로 152-180
939월계농장종계업전라북도 고창군 고수면 장두리 592번지 21호 고창종난장전라북도 고창군 고수면 오산학산로 208-59 고창종난장16000
940산들농장종계업전라북도 고창군 무장면 옥산리 447번지전라북도 고창군 무장면 가라1길 12-220
941지혜농장종돈업전라북도 고창군 심원면 궁산리 2번지 심원양돈단지전라북도 고창군 심원면 궁산1길 174-44 심원양돈단지311
942신촌농장종계업전라북도 고창군 무장면 신촌리 562번지 2호전라북도 고창군 무장면 신촌농장길 9816500