Overview

Dataset statistics

Number of variables: 10
Number of observations: 336
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 28.0 KiB
Average record size in memory: 85.4 B

Variable types

Categorical: 6
Numeric: 4

Dataset

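Description: Data on the registration of incoming and outgoing inspection samples of government-supplied seed, published by the Korea Seed and Variety Service (국립종자원). It provides the fields 년산 (production year), 채종단계 (seed production stage), 담당지원 (responsible branch office), 생산지원 (producing branch office), 작물명 (crop name), 종자무게 (seed weight), 소독여부 (disinfection status), 처분일자 (disposal date), 입고량 (quantity received), and 출고량 (quantity released).
URL: https://www.data.go.kr/data/15065770/fileData.do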

Alerts

담당지원 is highly overall correlated with 생산지원 (High correlation)
생산지원 is highly overall correlated with 처분일자 and 2 other fields (High correlation)
종자무게 is highly overall correlated with 처분일자 (High correlation)
처분일자 is highly overall correlated with 종자무게 and 3 other fields (High correlation)
채종단계 is highly overall correlated with 처분일자 and 1 other field (High correlation)
작물명 is highly overall correlated with 처분일자 (High correlation)
처분일자 has 173 (51.5%) zeros (Zeros)
입고량 has 295 (87.8%) zeros (Zeros)
출고량 has 185 (55.1%) zeros (Zeros)
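
The zero percentages in the alerts follow directly from the zero counts and the 336 observations reported under Dataset statistics. A minimal sketch of that arithmetic, where the helper name `zero_share` is illustrative and not part of ydata-profiling:

```python
# Zero counts copied from the Alerts list; 336 is the reported number of observations.
N_OBSERVATIONS = 336
zero_counts = {"처분일자": 173, "입고량": 295, "출고량": 185}


def zero_share(count: int, total: int = N_OBSERVATIONS) -> float:
    """Return the percentage of zero-valued rows, rounded to one decimal place."""
    return round(100 * count / total, 1)


zero_shares = {column: zero_share(count) for column, count in zero_counts.items()}
for column, share in zero_shares.items():
    print(f"{column}: {share}% zeros")
```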

Reproduction

Analysis started: 2023-12-12 18:33:19.582376
Analysis finished: 2023-12-12 18:33:23.884106
Duration: 4.3 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
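
A report of this kind can be regenerated along the following lines. This is a sketch under assumptions, not the script that produced this report: the function name `build_report`, the CSV path argument, the report title, and the CP949 default encoding (common for files from data.go.kr, but not confirmed for this dataset) are all illustrative.

```python
def build_report(csv_path: str, output_path: str = "report.html",
                 encoding: str = "cp949") -> None:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so this module still loads where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is CP949-encoded; pass encoding="utf-8" if it is not.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Default settings; the original run's config.json is not reproduced here.
    profile = ProfileReport(df, title="Dataset profile")
    profile.to_file(output_path)
```

Calling `build_report("samples.csv")` with a locally downloaded copy of the dataset (the file name is hypothetical) would write `report.html` to the working directory.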

Variables

년산
Categorical

Distinct: 3
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Value | Count
2022 | 134
2021 | 129
2020 | 73

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020
2nd row: 2020
3rd row: 2020
4th row: 2020
5th row: 2020

Common Values

Value | Count | Frequency (%)
2022 | 134 | 39.9%
2021 | 129 | 38.4%
2020 | 73 | 21.7%
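
The Frequency (%) column is each count divided by the 336 observations. A small sketch that rebuilds the 년산 table from its counts, using only the standard library (the variable names are illustrative):

```python
from collections import Counter

# Rebuild the 년산 column from the counts in the table (336 rows in total).
values = ["2022"] * 134 + ["2021"] * 129 + ["2020"] * 73
counts = Counter(values)
total = sum(counts.values())

# most_common() orders by descending count, matching the table layout.
frequency_table = [
    (value, count, round(100 * count / total, 1))
    for value, count in counts.most_common()
]
for value, count, pct in frequency_table:
    print(f"{value} | {count} | {pct}%")
```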

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values tabulated above]

채종단계
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Value | Count
보급종 | 159
원원종 | 90
원종 | 87

Length

Max length: 3
Median length: 3
Mean length: 2.7410714
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 보급종
2nd row: 보급종
3rd row: 보급종
4th row: 보급종
5th row: 보급종

Common Values

Value | Count | Frequency (%)
보급종 | 159 | 47.3%
원원종 | 90 | 26.8%
원종 | 87 | 25.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values tabulated above]

담당지원
Categorical

HIGH CORRELATION 

Distinct: 9
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Value | Count
전북지원 | 73
충남지원 | 52
전남지원 | 41
경남지원 | 41
경북지원 | 39
Other values (4) | 90

Length

Max length: 7
Median length: 4
Mean length: 4.125
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기종자관리소
2nd row: 경기종자관리소
3rd row: 경기종자관리소
4th row: 경기종자관리소
5th row: 경기종자관리소

Common Values

Value | Count | Frequency (%)
전북지원 | 73 | 21.7%
충남지원 | 52 | 15.5%
전남지원 | 41 | 12.2%
경남지원 | 41 | 12.2%
경북지원 | 39 | 11.6%
강원지원 | 39 | 11.6%
충북지원 | 33 | 9.8%
경기종자관리소 | 14 | 4.2%
제주지원 | 4 | 1.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values tabulated above]

생산지원
Categorical

HIGH CORRELATION 

Distinct: 28
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Value | Count
전라북도농업기술원 종자사업소 | 46
전북지원 | 27
경남지원 | 23
전남지원 | 22
충남지원 | 21
Other values (23) | 197

Length

Max length: 15
Median length: 13
Mean length: 7.7857143
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기종자관리소
2nd row: 경기종자관리소
3rd row: 경기종자관리소
4th row: 경기종자관리소
5th row: 경기종자관리소

Common Values

Value | Count | Frequency (%)
전라북도농업기술원 종자사업소 | 46 | 13.7%
전북지원 | 27 | 8.0%
경남지원 | 23 | 6.8%
전남지원 | 22 | 6.5%
충남지원 | 21 | 6.2%
강원지원 | 15 | 4.5%
충북지원 | 15 | 4.5%
경기종자관리소 | 14 | 4.2%
경북지원 | 14 | 4.2%
강원도 농산물원종장 | 12 | 3.6%
Other values (18) | 127 | 37.8%

Length

[Figure: Histogram of lengths of the category]

Words

Value | Count | Frequency (%)
전라북도농업기술원 | 46 | 11.1%
종자사업소 | 46 | 11.1%
전북지원 | 27 | 6.5%
경남지원 | 23 | 5.5%
전남지원 | 22 | 5.3%
충남지원 | 21 | 5.1%
강원지원 | 15 | 3.6%
충북지원 | 15 | 3.6%
경기종자관리소 | 14 | 3.4%
경북지원 | 14 | 3.4%
Other values (22) | 172 | 41.4%

작물명
Categorical

HIGH CORRELATION 

Distinct: 14
Distinct (%): 4.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Value | Count
 | 85
 | 72
보리 | 40
 | 40
 | 26
Other values (9) | 73

Length

Max length: 8
Median length: 1
Mean length: 1.639881
Min length: 1

Unique

Unique: 3
Unique (%): 0.9%

Sample

1st row:
2nd row:
3rd row: 보리
4th row: 보리
5th row:

Common Values

Value | Count | Frequency (%)
 | 85 | 25.3%
 | 72 | 21.4%
보리 | 40 | 11.9%
 | 40 | 11.9%
 | 26 | 7.7%
겉보리 | 22 | 6.5%
쌀보리 | 18 | 5.4%
호밀 | 15 | 4.5%
맥주보리 | 7 | 2.1%
청보리(사료용) | 6 | 1.8%
Other values (4) | 5 | 1.5%

Length

[Figure: Histogram of lengths of the category]

Words

Value | Count | Frequency (%)
 | 85 | 25.3%
 | 72 | 21.4%
보리 | 40 | 11.9%
 | 40 | 11.9%
 | 26 | 7.7%
겉보리 | 22 | 6.5%
쌀보리 | 18 | 5.4%
호밀 | 15 | 4.5%
맥주보리 | 7 | 2.1%
청보리(사료용 | 6 | 1.8%
Other values (4) | 5 | 1.5%

종자무게
Real number (ℝ)

HIGH CORRELATION 

Distinct: 287
Distinct (%): 85.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 81.925536
Minimum: 0
Maximum: 1314.8
Zeros: 2
Zeros (%): 0.6%
Negative: 0
Negative (%): 0.0%
Memory size: 3.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0.8
Q1: 1.745
median: 5.175
Q3: 23.97
95-th percentile: 584.835
Maximum: 1314.8
Range: 1314.8
Interquartile range (IQR): 22.225

Descriptive statistics

Standard deviation: 207.66704
Coefficient of variation (CV): 2.5348267
Kurtosis: 11.884386
Mean: 81.925536
Median Absolute Deviation (MAD): 4.175
Skewness: 3.4003956
Sum: 27526.98
Variance: 43125.598
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
0.98 | 9 | 2.7%
1.28 | 5 | 1.5%
1.0 | 3 | 0.9%
0.8 | 3 | 0.9%
0.9 | 3 | 0.9%
4.5 | 3 | 0.9%
1.6 | 3 | 0.9%
1.36 | 3 | 0.9%
1.3 | 3 | 0.9%
3.4 | 2 | 0.6%
Other values (277) | 299 | 89.0%

Minimum values

Value | Count | Frequency (%)
0.0 | 2 | 0.6%
0.18 | 1 | 0.3%
0.37 | 1 | 0.3%
0.43 | 1 | 0.3%
0.53 | 1 | 0.3%
0.6 | 2 | 0.6%
0.64 | 1 | 0.3%
0.7 | 1 | 0.3%
0.71 | 1 | 0.3%
0.74 | 1 | 0.3%

Maximum values

Value | Count | Frequency (%)
1314.8 | 1 | 0.3%
1149.68 | 1 | 0.3%
1061.004 | 1 | 0.3%
1027.51 | 1 | 0.3%
1008.04 | 1 | 0.3%
974.56 | 1 | 0.3%
886.52 | 1 | 0.3%
822.03 | 1 | 0.3%
807.31 | 1 | 0.3%
783.75 | 1 | 0.3%
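
Several of the 종자무게 figures are derived from others in the same section: the range from the extremes, the IQR from the quartiles, and the CV and variance from the mean and standard deviation. A short sketch of those relationships, with the inputs copied from this report and variable names chosen for illustration:

```python
# Values copied from the 종자무게 statistics in this report.
minimum, maximum = 0.0, 1314.8
q1, q3 = 1.745, 23.97
mean, std = 81.925536, 207.66704

value_range = maximum - minimum  # spread between the extremes
iqr = q3 - q1                    # spread of the middle 50% of rows
cv = std / mean                  # dispersion relative to the mean
variance = std ** 2              # square of the standard deviation

print(f"Range: {value_range:.1f}")
print(f"IQR: {iqr:.3f}")
print(f"CV: {cv:.4f}")
print(f"Variance: {variance:.1f}")
```

A CV above 2 together with a median (5.175) far below the mean (81.9) is consistent with the strong right skew the report lists for this column.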

소독여부
Categorical

Distinct: 2
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Value | Count
미소독 | 279
소독 | 57

Length

Max length: 3
Median length: 3
Mean length: 2.8303571
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 소독
2nd row: 미소독
3rd row: 소독
4th row: 미소독
5th row: 소독

Common Values

Value | Count | Frequency (%)
미소독 | 279 | 83.0%
소독 | 57 | 17.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values tabulated above]

처분일자
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 6
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 980.93155
Minimum: 0
Maximum: 2025
Zeros: 173
Zeros (%): 51.5%
Negative: 0
Negative (%): 0.0%
Memory size: 3.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 2022
95-th percentile: 2023
Maximum: 2025
Range: 2025
Interquartile range (IQR): 2022

Descriptive statistics

Standard deviation: 1012.081
Coefficient of variation (CV): 1.031755
Kurtosis: -2.0084108
Mean: 980.93155
Median Absolute Deviation (MAD): 0
Skewness: 0.059818712
Sum: 329593
Variance: 1024307.9
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=6)]

Common Values

Value | Count | Frequency (%)
0 | 173 | 51.5%
2021 | 56 | 16.7%
2023 | 54 | 16.1%
2022 | 49 | 14.6%
2024 | 3 | 0.9%
2025 | 1 | 0.3%

Minimum values

Value | Count | Frequency (%)
0 | 173 | 51.5%
2021 | 56 | 16.7%
2022 | 49 | 14.6%
2023 | 54 | 16.1%
2024 | 3 | 0.9%
2025 | 1 | 0.3%

Maximum values

Value | Count | Frequency (%)
2025 | 1 | 0.3%
2024 | 3 | 0.9%
2023 | 54 | 16.1%
2022 | 49 | 14.6%
2021 | 56 | 16.7%
0 | 173 | 51.5%
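
The 처분일자 column holds year-like values (2021 to 2025) alongside 173 zeros, so statistics such as the mean of 980.93 mix the two. The sketch below recomputes the mean from the value counts, with and without the zeros. Treating 0 as a placeholder rather than a real year is an assumption made here for illustration; the report itself does not say what 0 encodes.

```python
# Value counts copied from the 처분일자 table in this report.
value_counts = {0: 173, 2021: 56, 2023: 54, 2022: 49, 2024: 3, 2025: 1}


def weighted_mean(counts: dict[int, int]) -> float:
    """Mean of a column reconstructed from its value counts."""
    total_rows = sum(counts.values())
    return sum(value * n for value, n in counts.items()) / total_rows


mean_all = weighted_mean(value_counts)

# Assumption: 0 is a placeholder, not a real year, so it is dropped here.
nonzero_counts = {value: n for value, n in value_counts.items() if value != 0}
mean_nonzero = weighted_mean(nonzero_counts)

print(f"Mean over all rows: {mean_all:.2f}")
print(f"Mean over non-zero rows: {mean_nonzero:.2f}")
```

Under that assumption, the non-zero mean lands near 2022, which is easier to interpret than the all-rows figure.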

입고량
Real number (ℝ)

ZEROS 

Distinct: 40
Distinct (%): 11.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.8890169
Minimum: 0
Maximum: 1222.78
Zeros: 295
Zeros (%): 87.8%
Negative: 0
Negative (%): 0.0%
Memory size: 3.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 0
95-th percentile: 12.125
Maximum: 1222.78
Range: 1222.78
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 71.717352
Coefficient of variation (CV): 9.0907845
Kurtosis: 249.32675
Mean: 7.8890169
Median Absolute Deviation (MAD): 0
Skewness: 15.091235
Sum: 2650.7097
Variance: 5143.3786
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=40)]

Common Values

Value | Count | Frequency (%)
0.0 | 295 | 87.8%
2.0 | 2 | 0.6%
5.0 | 2 | 0.6%
8.5 | 1 | 0.3%
29.0 | 1 | 0.3%
7.3 | 1 | 0.3%
1.48 | 1 | 0.3%
81.99 | 1 | 0.3%
5.78 | 1 | 0.3%
10.76 | 1 | 0.3%
Other values (30) | 30 | 8.9%

Minimum values

Value | Count | Frequency (%)
0.0 | 295 | 87.8%
0.79767 | 1 | 0.3%
0.83 | 1 | 0.3%
1.05 | 1 | 0.3%
1.244 | 1 | 0.3%
1.38 | 1 | 0.3%
1.48 | 1 | 0.3%
1.504 | 1 | 0.3%
1.77 | 1 | 0.3%
2.0 | 2 | 0.6%

Maximum values

Value | Count | Frequency (%)
1222.78 | 1 | 0.3%
353.15 | 1 | 0.3%
248.74 | 1 | 0.3%
150.297 | 1 | 0.3%
117.0 | 1 | 0.3%
105.721 | 1 | 0.3%
81.99 | 1 | 0.3%
47.4 | 1 | 0.3%
29.694 | 1 | 0.3%
29.0 | 1 | 0.3%

출고량
Real number (ℝ)

ZEROS 

Distinct: 136
Distinct (%): 40.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 46.727612
Minimum: 0
Maximum: 1327.71
Zeros: 185
Zeros (%): 55.1%
Negative: 0
Negative (%): 0.0%
Memory size: 3.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 4.2075
95-th percentile: 283.0375
Maximum: 1327.71
Range: 1327.71
Interquartile range (IQR): 4.2075

Descriptive statistics

Standard deviation: 166.9282
Coefficient of variation (CV): 3.5723675
Kurtosis: 24.628973
Mean: 46.727612
Median Absolute Deviation (MAD): 0
Skewness: 4.7621307
Sum: 15700.478
Variance: 27865.025
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
0.0 | 185 | 55.1%
0.3 | 5 | 1.5%
0.6 | 4 | 1.2%
0.36 | 3 | 0.9%
5.0 | 2 | 0.6%
4.2 | 2 | 0.6%
1.3 | 2 | 0.6%
0.44 | 2 | 0.6%
0.15 | 2 | 0.6%
0.5 | 2 | 0.6%
Other values (126) | 127 | 37.8%

Minimum values

Value | Count | Frequency (%)
0.0 | 185 | 55.1%
0.03 | 1 | 0.3%
0.09 | 1 | 0.3%
0.1 | 1 | 0.3%
0.105 | 1 | 0.3%
0.15 | 2 | 0.6%
0.162 | 1 | 0.3%
0.18 | 1 | 0.3%
0.2 | 1 | 0.3%
0.22 | 1 | 0.3%

Maximum values

Value | Count | Frequency (%)
1327.71 | 1 | 0.3%
1061.004 | 1 | 0.3%
1051.3 | 1 | 0.3%
1008.04 | 1 | 0.3%
822.03 | 1 | 0.3%
783.75 | 1 | 0.3%
655.409 | 1 | 0.3%
650.06 | 1 | 0.3%
614.99 | 1 | 0.3%
597.93 | 1 | 0.3%

Interactions

[Figure: 16 pairwise interaction plots; plot contents not recoverable]

Correlations

[Figure: correlation heatmap]

 | 년산 | 채종단계 | 담당지원 | 생산지원 | 작물명 | 종자무게 | 소독여부 | 처분일자 | 입고량 | 출고량
년산 | 1.000 | 0.507 | 0.267 | 0.412 | 0.111 | 0.000 | 0.127 | 0.206 | 0.120 | 0.386
채종단계 | 0.507 | 1.000 | 0.333 | 0.968 | 0.541 | 0.408 | 0.292 | 0.612 | 0.000 | 0.303
담당지원 | 0.267 | 0.333 | 1.000 | 1.000 | 0.393 | 0.259 | 0.104 | 0.239 | 0.000 | 0.101
생산지원 | 0.412 | 0.968 | 1.000 | 1.000 | 0.631 | 0.381 | 0.561 | 0.993 | 0.000 | 0.000
작물명 | 0.111 | 0.541 | 0.393 | 0.631 | 1.000 | 0.000 | 0.474 | 0.663 | 0.000 | 0.000
종자무게 | 0.000 | 0.408 | 0.259 | 0.381 | 0.000 | 1.000 | 0.125 | 0.524 | 0.631 | 0.844
소독여부 | 0.127 | 0.292 | 0.104 | 0.561 | 0.474 | 0.125 | 1.000 | 0.656 | 0.000 | 0.000
처분일자 | 0.206 | 0.612 | 0.239 | 0.993 | 0.663 | 0.524 | 0.656 | 1.000 | 0.094 | 0.244
입고량 | 0.120 | 0.000 | 0.000 | 0.000 | 0.000 | 0.631 | 0.000 | 0.094 | 1.000 | 0.735
출고량 | 0.386 | 0.303 | 0.101 | 0.000 | 0.000 | 0.844 | 0.000 | 0.244 | 0.735 | 1.000

[Figure: correlation heatmap]

 | 소독여부 | 채종단계 | 년산 | 담당지원 | 생산지원 | 작물명
소독여부 | 1.000 | 0.471 | 0.210 | 0.102 | 0.431 | 0.365
채종단계 | 0.471 | 1.000 | 0.207 | 0.154 | 0.866 | 0.352
년산 | 0.210 | 0.207 | 1.000 | 0.120 | 0.222 | 0.059
담당지원 | 0.102 | 0.154 | 0.120 | 1.000 | 0.971 | 0.176
생산지원 | 0.431 | 0.866 | 0.222 | 0.971 | 1.000 | 0.210
작물명 | 0.365 | 0.352 | 0.059 | 0.176 | 0.210 | 1.000

[Figure: correlation heatmap]

 | 종자무게 | 처분일자 | 입고량 | 출고량 | 년산 | 채종단계 | 담당지원 | 생산지원 | 작물명 | 소독여부
종자무게 | 1.000 | 0.542 | 0.274 | 0.478 | 0.000 | 0.266 | 0.120 | 0.141 | 0.000 | 0.094
처분일자 | 0.542 | 1.000 | 0.144 | 0.333 | 0.337 | 0.879 | 0.236 | 0.897 | 0.517 | 0.455
입고량 | 0.274 | 0.144 | 1.000 | 0.463 | 0.113 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000
출고량 | 0.478 | 0.333 | 0.463 | 1.000 | 0.183 | 0.138 | 0.032 | 0.000 | 0.000 | 0.000
년산 | 0.000 | 0.337 | 0.113 | 0.183 | 1.000 | 0.207 | 0.120 | 0.222 | 0.059 | 0.210
채종단계 | 0.266 | 0.879 | 0.000 | 0.138 | 0.207 | 1.000 | 0.154 | 0.866 | 0.352 | 0.471
담당지원 | 0.120 | 0.236 | 0.000 | 0.032 | 0.120 | 0.154 | 1.000 | 0.971 | 0.176 | 0.102
생산지원 | 0.141 | 0.897 | 0.000 | 0.000 | 0.222 | 0.866 | 0.971 | 1.000 | 0.210 | 0.431
작물명 | 0.000 | 0.517 | 0.000 | 0.000 | 0.059 | 0.352 | 0.176 | 0.210 | 1.000 | 0.365
소독여부 | 0.094 | 0.455 | 0.000 | 0.000 | 0.210 | 0.471 | 0.102 | 0.431 | 0.365 | 1.000
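
The High correlation alerts near the top of the report can be cross-checked against the last matrix above (the one whose header starts with 종자무게). The sketch below lists every variable pair in that matrix whose coefficient reaches a cutoff. The 0.5 cutoff is an assumption, chosen because it yields exactly the pairs named in the alerts; the report does not state which threshold was used. The function name `strong_pairs` is illustrative.

```python
labels = ["종자무게", "처분일자", "입고량", "출고량", "년산",
          "채종단계", "담당지원", "생산지원", "작물명", "소독여부"]

# Coefficients copied row by row from the last correlation matrix in this report.
matrix = [
    [1.000, 0.542, 0.274, 0.478, 0.000, 0.266, 0.120, 0.141, 0.000, 0.094],
    [0.542, 1.000, 0.144, 0.333, 0.337, 0.879, 0.236, 0.897, 0.517, 0.455],
    [0.274, 0.144, 1.000, 0.463, 0.113, 0.000, 0.000, 0.000, 0.000, 0.000],
    [0.478, 0.333, 0.463, 1.000, 0.183, 0.138, 0.032, 0.000, 0.000, 0.000],
    [0.000, 0.337, 0.113, 0.183, 1.000, 0.207, 0.120, 0.222, 0.059, 0.210],
    [0.266, 0.879, 0.000, 0.138, 0.207, 1.000, 0.154, 0.866, 0.352, 0.471],
    [0.120, 0.236, 0.000, 0.032, 0.120, 0.154, 1.000, 0.971, 0.176, 0.102],
    [0.141, 0.897, 0.000, 0.000, 0.222, 0.866, 0.971, 1.000, 0.210, 0.431],
    [0.000, 0.517, 0.000, 0.000, 0.059, 0.352, 0.176, 0.210, 1.000, 0.365],
    [0.094, 0.455, 0.000, 0.000, 0.210, 0.471, 0.102, 0.431, 0.365, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, not taken from the report's configuration


def strong_pairs(names, coefficients, threshold):
    """Return (a, b, value) for each pair above the diagonal at or over the threshold."""
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            if coefficients[i][j] >= threshold:
                pairs.append((names[i], names[j], coefficients[i][j]))
    # Strongest relationships first.
    return sorted(pairs, key=lambda pair: pair[2], reverse=True)


pairs = strong_pairs(labels, matrix, THRESHOLD)
for first, second, value in pairs:
    print(f"{first} ~ {second}: {value:.3f}")
```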

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

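First rows

 | 년산 | 채종단계 | 담당지원 | 생산지원 | 작물명 | 종자무게 | 소독여부 | 처분일자 | 입고량 | 출고량
0 | 2020 | 보급종 | 경기종자관리소 | 경기종자관리소 |  | 34.65 | 소독 | 2021 | 0.0 | 34.65
1 | 2020 | 보급종 | 경기종자관리소 | 경기종자관리소 |  | 525.44 | 미소독 | 2021 | 0.0 | 525.44
2 | 2020 | 보급종 | 경기종자관리소 | 경기종자관리소 | 보리 | 1.62 | 소독 | 2021 | 0.0 | 1.62
3 | 2020 | 보급종 | 경기종자관리소 | 경기종자관리소 | 보리 | 18.39 | 미소독 | 2021 | 0.0 | 18.39
4 | 2020 | 보급종 | 경기종자관리소 | 경기종자관리소 |  | 2.42 | 소독 | 2021 | 0.0 | 2.42
5 | 2020 | 보급종 | 경기종자관리소 | 경기종자관리소 |  | 129.33 | 미소독 | 2021 | 0.0 | 129.33
6 | 2020 | 보급종 | 충북지원 | 충북지원 |  | 1.595 | 소독 | 2021 | 25.538 | 27.126
7 | 2020 | 보급종 | 충북지원 | 충북지원 |  | 505.121 | 미소독 | 2021 | 150.297 | 655.409
8 | 2020 | 보급종 | 충북지원 | 충북지원 | 보리 | 16.6 | 미소독 | 2021 | 0.0 | 16.597
9 | 2020 | 보급종 | 충북지원 | 충북지원 |  | 11.492 | 소독 | 2021 | 0.0 | 11.489

Last rows

 | 년산 | 채종단계 | 담당지원 | 생산지원 | 작물명 | 종자무게 | 소독여부 | 처분일자 | 입고량 | 출고량
326 | 2022 | 원원종 | 강원지원 | 강원도농업기술원 |  | 0.8 | 미소독 | 0 | 0.0 | 0.0
327 | 2022 | 원원종 | 강원지원 | 강원도농업기술원 |  | 4.1 | 미소독 | 0 | 0.0 | 0.0
328 | 2022 | 원원종 | 강원지원 | 강원도농업기술원 |  | 1.3 | 미소독 | 0 | 0.0 | 0.0
329 | 2022 | 원원종 | 강원지원 | 강원도농업기술원 | 호밀 | 0.8 | 미소독 | 0 | 0.0 | 0.0
330 | 2022 | 원종 | 강원지원 | 강원도 농산물원종장 |  | 4.9 | 미소독 | 0 | 0.0 | 0.0
331 | 2022 | 원종 | 강원지원 | 강원도 농산물원종장 | 겉보리 | 0.7 | 미소독 | 0 | 0.0 | 0.0
332 | 2022 | 원종 | 강원지원 | 강원도 농산물원종장 |  | 1.0 | 미소독 | 0 | 0.0 | 0.0
333 | 2022 | 원종 | 강원지원 | 강원도 농산물원종장 |  | 2.9 | 미소독 | 0 | 0.0 | 0.0
334 | 2022 | 원종 | 강원지원 | 강원도 농산물원종장 | 옥수수 | 2.7 | 미소독 | 0 | 0.0 | 0.0
335 | 2022 | 원종 | 강원지원 | 강원도 농산물원종장 |  | 1.4 | 미소독 | 0 | 0.0 | 0.0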