Overview

Dataset statistics

Number of variables: 16
Number of observations: 10000
Missing cells: 10008
Missing cells (%): 6.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.3 MiB
Average record size in memory: 139.0 B
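
The missing-cell percentage can be checked from the counts above. The sketch below assumes the percentage is the number of missing cells divided by the total number of cells (variables × observations); the function name is illustrative and not part of any library.

```python
def missing_cell_pct(missing_cells: int, n_variables: int, n_observations: int) -> float:
    """Share of missing cells, in percent, over all cells in the table."""
    total_cells = n_variables * n_observations
    return 100.0 * missing_cells / total_cells


# Figures taken from the dataset statistics of this report.
pct = missing_cell_pct(missing_cells=10008, n_variables=16, n_observations=10000)
print(f"{pct:.1f}%")
```

With 16 variables and 10000 observations there are 160000 cells, so 10008 missing cells correspond to the 6.3% reported.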

Variable types

Text: 4
Unsupported: 3
Categorical: 9

Dataset

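Description: Genomic resource information (유전체자원 정보)
Author: 농림수산식품교육문화정보원 (Korea Agency of Education, Promotion and Information Service in Food, Agriculture, Forestry and Fisheries, EPIS)
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220210000000001801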

Alerts

INSTT_CD_KOREA_NM is highly overall correlated with INSTT_CD and 5 other fields (High correlation)
LIFE_RESRCE_STLE_CD_NM is highly overall correlated with LIFE_RESRCE_KND_CD_NM and 4 other fields (High correlation)
DETAIL_INFO_URL is highly overall correlated with LIFE_RESRCE_STLE_CD_NM and 7 other fields (High correlation)
LAST_UPDT_DE is highly overall correlated with LIFE_RESRCE_STLE_CD_NM and 7 other fields (High correlation)
OUTNATN_TKOUT_AT is highly overall correlated with INSTT_CD and 5 other fields (High correlation)
IMAGE_URL is highly overall correlated with LIFE_RESRCE_STLE_CD_NM and 7 other fields (High correlation)
LIFE_RESRCE_LTTOT_AT is highly overall correlated with LIFE_RESRCE_STLE_CD_NM and 7 other fields (High correlation)
LIFE_RESRCE_KND_CD_NM is highly overall correlated with LIFE_RESRCE_STLE_CD_NM and 4 other fields (High correlation)
INSTT_CD is highly overall correlated with INSTT_CD_KOREA_NM and 5 other fields (High correlation)
LIFE_RESRCE_STLE_CD_NM is highly imbalanced (99.2%) (Imbalance)
LIFE_RESRCE_KND_CD_NM is highly imbalanced (99.2%) (Imbalance)
INSTT_CD is highly imbalanced (91.9%) (Imbalance)
INSTT_CD_KOREA_NM is highly imbalanced (91.9%) (Imbalance)
DETAIL_INFO_URL is highly imbalanced (96.7%) (Imbalance)
OUTNATN_TKOUT_AT is highly imbalanced (89.7%) (Imbalance)
LIFE_RESRCE_LTTOT_AT is highly imbalanced (93.6%) (Imbalance)
LAST_UPDT_DE is highly imbalanced (57.5%) (Imbalance)
SPCIES_PRTC_APLC_AT has 10000 (100.0%) missing values (Missing)
RESRCE_NO has unique values (Unique)
LIFE_RESRCE_STLE_CD is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
LIFE_RESRCE_KND_CD is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
SPCIES_PRTC_APLC_AT is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
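
The imbalance percentages in the alerts are consistent with a score of one minus the normalized Shannon entropy of the class counts. The sketch below works under that assumption; the function name and the handling of single-class columns are illustrative choices, not a statement of how the profiling tool is implemented.

```python
import math


def imbalance_score(counts):
    """Return 1 - (Shannon entropy in bits / log2(number of classes)).

    0 means perfectly balanced classes, values near 1 mean one class dominates.
    """
    n_classes = len(counts)
    if n_classes <= 1:
        return 0.0  # assumption: a single-class column is scored as 0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)


# Class counts copied from the variable sections of this report.
examples = {
    "LIFE_RESRCE_STLE_CD_NM": [9993, 7],
    "INSTT_CD": [9803, 119, 70, 8],
    "OUTNATN_TKOUT_AT": [9865, 135],
}
for name, counts in examples.items():
    print(f"{name}: {100 * imbalance_score(counts):.1f}%")
```

For these three variables the sketch gives 99.2%, 91.9%, and 89.7%, which matches the values listed in the alerts.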

Reproduction

Analysis started: 2023-12-11 03:28:33.984128
Analysis finished: 2023-12-11 03:28:36.225802
Duration: 2.24 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

RESRCE_NO
Text

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 27
Median length: 27
Mean length: 26.9097
Min length: 20

Characters and Unicode

Total characters: 269097
Distinct characters: 14
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
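
As a small illustration of such character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using one of the RESRCE_NO sample values from this report. The helper name is hypothetical, and the two-letter codes are the standard abbreviations (`Nd` for Decimal Number, `Pd` for Dash Punctuation, `Lu` for Uppercase Letter).

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in the given strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# One sample value of RESRCE_NO, copied from the report.
sample = ["14005732610300-030-00054899"]
print(category_counts(sample))
```

Applied to a whole column, the same loop would yield totals comparable to the category tables in this section.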

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: 14005732610300-030-00054899
2nd row: 14005732610300-030-00017179
3rd row: 14005732610300-030-00055333
4th row: 14005732610300-030-00052302
5th row: 14005732610300-030-00046974

Value | Count | Frequency (%)
14005732610300-030-00054899 | 1 | < 0.1%
14005732610300-030-00020135 | 1 | < 0.1%
15410022610gc0000408 | 1 | < 0.1%
14005732610300-030-00055906 | 1 | < 0.1%
14005732610300-030-00012532 | 1 | < 0.1%
14005732610300-030-00041319 | 1 | < 0.1%
14005732610300-030-00016505 | 1 | < 0.1%
14005732610300-030-00003863 | 1 | < 0.1%
14005732610300-030-00022666 | 1 | < 0.1%
14005732610300-030-00050995 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Most occurring characters

Value | Count | Frequency (%)
0 | 106030 | 39.4%
3 | 34645 | 12.9%
1 | 26358 | 9.8%
- | 19746 | 7.3%
2 | 15588 | 5.8%
6 | 15122 | 5.6%
5 | 14995 | 5.6%
4 | 14752 | 5.5%
7 | 13927 | 5.2%
8 | 3855 | 1.4%
Other values (4) | 4079 | 1.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 249099 | 92.6%
Dash Punctuation | 19746 | 7.3%
Uppercase Letter | 252 | 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 106030 | 42.6%
3 | 34645 | 13.9%
1 | 26358 | 10.6%
2 | 15588 | 6.3%
6 | 15122 | 6.1%
5 | 14995 | 6.0%
4 | 14752 | 5.9%
7 | 13927 | 5.6%
8 | 3855 | 1.5%
9 | 3827 | 1.5%

Uppercase Letter
Value | Count | Frequency (%)
G | 119 | 47.2%
C | 119 | 47.2%
Z | 14 | 5.6%

Dash Punctuation
Value | Count | Frequency (%)
- | 19746 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 268845 | 99.9%
Latin | 252 | 0.1%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 106030 | 39.4%
3 | 34645 | 12.9%
1 | 26358 | 9.8%
- | 19746 | 7.3%
2 | 15588 | 5.8%
6 | 15122 | 5.6%
5 | 14995 | 5.6%
4 | 14752 | 5.5%
7 | 13927 | 5.2%
8 | 3855 | 1.4%

Latin
Value | Count | Frequency (%)
G | 119 | 47.2%
C | 119 | 47.2%
Z | 14 | 5.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 269097 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 106030 | 39.4%
3 | 34645 | 12.9%
1 | 26358 | 9.8%
- | 19746 | 7.3%
2 | 15588 | 5.8%
6 | 15122 | 5.6%
5 | 14995 | 5.6%
4 | 14752 | 5.5%
7 | 13927 | 5.2%
8 | 3855 | 1.4%
Other values (4) | 4079 | 1.5%

LIFE_RESRCE_STLE_CD
Unsupported

REJECTED  UNSUPPORTED 

Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

LIFE_RESRCE_STLE_CD_NM
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
유전자 (gene) | 9993
기타 (other) | 7

Length

Max length: 3
Median length: 3
Mean length: 2.9993
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 유전자
2nd row: 유전자
3rd row: 유전자
4th row: 유전자
5th row: 유전자

Common Values

Value | Count | Frequency (%)
유전자 | 9993 | 99.9%
기타 | 7 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
유전자 | 9993 | 99.9%
기타 | 7 | 0.1%

LIFE_RESRCE_KND_CD
Unsupported

REJECTED  UNSUPPORTED 

Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

LIFE_RESRCE_KND_CD_NM
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
핵산정보 (nucleic acid information) | 9993
기타(대사체/칩) (other: metabolite/chip) | 7

Length

Max length: 9
Median length: 4
Mean length: 4.0035
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 핵산정보
2nd row: 핵산정보
3rd row: 핵산정보
4th row: 핵산정보
5th row: 핵산정보

Common Values

Value | Count | Frequency (%)
핵산정보 | 9993 | 99.9%
기타(대사체/칩) | 7 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
핵산정보 | 9993 | 99.9%
기타(대사체/칩 | 7 | 0.1%

SCNCENM_CD
Text

Distinct: 60
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 13
Median length: 13
Mean length: 13
Min length: 13

Characters and Unicode

Total characters: 130000
Distinct characters: 13
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 21
Unique (%): 0.2%

Sample

1st row: BSD0002425818
2nd row: BSD0002997219
3rd row: BSD0000442344
4th row: BSD0002425818
5th row: BSD0001596891

Value | Count | Frequency (%)
bsd0001596891 | 3859 | 38.6%
bsd0002997219 | 2123 | 21.2%
bsd0000128072 | 1071 | 10.7%
bsd0001934957 | 915 | 9.2%
bsd0000135721 | 400 | 4.0%
bsd0002425818 | 360 | 3.6%
bsd0003841606 | 343 | 3.4%
bsd0003765910 | 332 | 3.3%
bsd0002165525 | 221 | 2.2%
bsd0003395965 | 85 | 0.9%
Other values (50) | 291 | 2.9%

Most occurring characters

Value | Count | Frequency (%)
0 | 33443 | 25.7%
9 | 16579 | 12.8%
1 | 14120 | 10.9%
B | 10000 | 7.7%
S | 10000 | 7.7%
D | 10000 | 7.7%
2 | 8302 | 6.4%
5 | 6778 | 5.2%
8 | 6133 | 4.7%
6 | 5310 | 4.1%
Other values (3) | 9335 | 7.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 100000 | 76.9%
Uppercase Letter | 30000 | 23.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 33443 | 33.4%
9 | 16579 | 16.6%
1 | 14120 | 14.1%
2 | 8302 | 8.3%
5 | 6778 | 6.8%
8 | 6133 | 6.1%
6 | 5310 | 5.3%
7 | 4943 | 4.9%
3 | 2364 | 2.4%
4 | 2028 | 2.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 10000 | 33.3%
S | 10000 | 33.3%
D | 10000 | 33.3%

Most occurring scripts

Value | Count | Frequency (%)
Common | 100000 | 76.9%
Latin | 30000 | 23.1%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 33443 | 33.4%
9 | 16579 | 16.6%
1 | 14120 | 14.1%
2 | 8302 | 8.3%
5 | 6778 | 6.8%
8 | 6133 | 6.1%
6 | 5310 | 5.3%
7 | 4943 | 4.9%
3 | 2364 | 2.4%
4 | 2028 | 2.0%

Latin
Value | Count | Frequency (%)
B | 10000 | 33.3%
S | 10000 | 33.3%
D | 10000 | 33.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 130000 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 33443 | 25.7%
9 | 16579 | 12.8%
1 | 14120 | 10.9%
B | 10000 | 7.7%
S | 10000 | 7.7%
D | 10000 | 7.7%
2 | 8302 | 6.4%
5 | 6778 | 5.2%
8 | 6133 | 4.7%
6 | 5310 | 4.1%
Other values (3) | 9335 | 7.2%

SCNCENM
Text

Distinct: 62
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 44
Median length: 37
Mean length: 30.9579
Min length: 12

Characters and Unicode

Total characters: 309579
Distinct characters: 53
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 23
Unique (%): 0.2%

Sample

1st row: Betula platyphylla var. japonica (Miq.) Hara
2nd row: Pinus koraiensis Siebold & Zucc.
3rd row: Zelkova serrata (Thunb.) Makino
4th row: Betula platyphylla var. japonica (Miq.) Hara
5th row: Pinus densiflora Siebold & Zucc.

Value | Count | Frequency (%)
pinus | 6327 | 13.5%
zucc | 5526 | 11.8%
siebold | 5526 | 11.8%
 | 5526 | 11.8%
densiflora | 3861 | 8.3%
koraiensis | 2123 | 4.5%
et | 1529 | 3.3%
z | 1529 | 3.3%
s | 1529 | 3.3%
quercus | 1288 | 2.8%
Other values (116) | 12012 | 25.7%

Most occurring characters

Value | Count | Frequency (%)
(space) | 36776 | 11.9%
i | 28187 | 9.1%
s | 20196 | 6.5%
e | 19521 | 6.3%
a | 19445 | 6.3%
u | 18064 | 5.8%
l | 15133 | 4.9%
n | 14654 | 4.7%
c | 14325 | 4.6%
o | 13916 | 4.5%
Other values (43) | 109362 | 35.3%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 222640 | 71.9%
Space Separator | 36776 | 11.9%
Uppercase Letter | 28890 | 9.3%
Other Punctuation | 17378 | 5.6%
Close Punctuation | 1943 | 0.6%
Open Punctuation | 1943 | 0.6%
Dash Punctuation | 8 | < 0.1%
Decimal Number | 1 | < 0.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
i | 28187 | 12.7%
s | 20196 | 9.1%
e | 19521 | 8.8%
a | 19445 | 8.7%
u | 18064 | 8.1%
l | 15133 | 6.8%
n | 14654 | 6.6%
c | 14325 | 6.4%
o | 13916 | 6.3%
r | 13829 | 6.2%
Other values (15) | 45370 | 20.4%

Uppercase Letter
Value | Count | Frequency (%)
Z | 7118 | 24.6%
S | 7084 | 24.5%
P | 6687 | 23.1%
C | 1827 | 6.3%
Q | 1288 | 4.5%
B | 1286 | 4.5%
E | 1112 | 3.8%
L | 834 | 2.9%
M | 649 | 2.2%
H | 372 | 1.3%
Other values (10) | 633 | 2.2%

Other Punctuation
Value | Count | Frequency (%)
. | 11847 | 68.2%
& | 5526 | 31.8%
 | 5 | < 0.1%

Space Separator
Value | Count | Frequency (%)
(space) | 36776 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1943 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1943 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 8 | 100.0%

Decimal Number
Value | Count | Frequency (%)
2 | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 251530 | 81.2%
Common | 58049 | 18.8%

Most frequent character per script

Latin
Value | Count | Frequency (%)
i | 28187 | 11.2%
s | 20196 | 8.0%
e | 19521 | 7.8%
a | 19445 | 7.7%
u | 18064 | 7.2%
l | 15133 | 6.0%
n | 14654 | 5.8%
c | 14325 | 5.7%
o | 13916 | 5.5%
r | 13829 | 5.5%
Other values (35) | 74260 | 29.5%

Common
Value | Count | Frequency (%)
(space) | 36776 | 63.4%
. | 11847 | 20.4%
& | 5526 | 9.5%
) | 1943 | 3.3%
( | 1943 | 3.3%
- | 8 | < 0.1%
 | 5 | < 0.1%
2 | 1 | < 0.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 309574 | > 99.9%
None | 5 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 36776 | 11.9%
i | 28187 | 9.1%
s | 20196 | 6.5%
e | 19521 | 6.3%
a | 19445 | 6.3%
u | 18064 | 5.8%
l | 15133 | 4.9%
n | 14654 | 4.7%
c | 14325 | 4.6%
o | 13916 | 4.5%
Other values (42) | 109357 | 35.3%

None
Value | Count | Frequency (%)
 | 5 | 100.0%

TNOAC
Text

Distinct: 56
Distinct (%): 0.6%
Missing: 8
Missing (%): 0.1%
Memory size: 156.2 KiB

Length

Max length: 31
Median length: 3
Mean length: 3.3938151
Min length: 1

Characters and Unicode

Total characters: 33911
Distinct characters: 72
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 22
Unique (%): 0.2%

Sample

1st row: 자작나무
2nd row: 잣나무
3rd row: 느티나무
4th row: 자작나무
5th row: 소나무

Value | Count | Frequency (%)
소나무 | 3861 | 38.1%
잣나무 | 2123 | 21.0%
편백 | 1071 | 10.6%
굴참나무 | 915 | 9.0%
일본잎갈나무 | 405 | 4.0%
자작나무 | 360 | 3.6%
곰솔 | 343 | 3.4%
상수리나무 | 332 | 3.3%
전나무 | 221 | 2.2%
구상나무 | 85 | 0.8%
Other values (70) | 405 | 4.0%

Most occurring characters

ValueCountFrequency (%)
8459
24.9%
8459
24.9%
3861
11.4%
2128
 
6.3%
1071
 
3.2%
1071
 
3.2%
915
 
2.7%
915
 
2.7%
424
 
1.3%
417
 
1.2%
Other values (62) 6191
18.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31634
93.3%
Lowercase Letter 2019
 
6.0%
Space Separator 131
 
0.4%
Uppercase Letter 118
 
0.3%
Dash Punctuation 8
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8459
26.7%
8459
26.7%
3861
12.2%
2128
 
6.7%
1071
 
3.4%
1071
 
3.4%
915
 
2.9%
915
 
2.9%
424
 
1.3%
417
 
1.3%
Other values (23) 3914
12.4%
Lowercase Letter
ValueCountFrequency (%)
a 226
11.2%
e 201
10.0%
i 196
9.7%
s 180
8.9%
l 158
 
7.8%
r 153
 
7.6%
o 134
 
6.6%
c 115
 
5.7%
n 114
 
5.6%
u 105
 
5.2%
Other values (14) 437
21.6%
Uppercase Letter
ValueCountFrequency (%)
E 41
34.7%
S 24
20.3%
C 14
 
11.9%
P 9
 
7.6%
B 8
 
6.8%
H 6
 
5.1%
L 5
 
4.2%
F 4
 
3.4%
A 3
 
2.5%
V 2
 
1.7%
Other values (2) 2
 
1.7%
Space Separator
ValueCountFrequency (%)
131
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31634
93.3%
Latin 2137
 
6.3%
Common 140
 
0.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 226
10.6%
e 201
 
9.4%
i 196
 
9.2%
s 180
 
8.4%
l 158
 
7.4%
r 153
 
7.2%
o 134
 
6.3%
c 115
 
5.4%
n 114
 
5.3%
u 105
 
4.9%
Other values (26) 555
26.0%
Hangul
ValueCountFrequency (%)
8459
26.7%
8459
26.7%
3861
12.2%
2128
 
6.7%
1071
 
3.4%
1071
 
3.4%
915
 
2.9%
915
 
2.9%
424
 
1.3%
417
 
1.3%
Other values (23) 3914
12.4%
Common
ValueCountFrequency (%)
131
93.6%
- 8
 
5.7%
2 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31634
93.3%
ASCII 2277
 
6.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8459
26.7%
8459
26.7%
3861
12.2%
2128
 
6.7%
1071
 
3.4%
1071
 
3.4%
915
 
2.9%
915
 
2.9%
424
 
1.3%
417
 
1.3%
Other values (23) 3914
12.4%
ASCII
ValueCountFrequency (%)
a 226
 
9.9%
e 201
 
8.8%
i 196
 
8.6%
s 180
 
7.9%
l 158
 
6.9%
r 153
 
6.7%
o 134
 
5.9%
131
 
5.8%
c 115
 
5.1%
n 114
 
5.0%
Other values (29) 669
29.4%

INSTT_CD
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
1400573 | 9803
1541002 | 119
1400377 | 70
1390860 | 8

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1400573
2nd row: 1400573
3rd row: 1400573
4th row: 1400573
5th row: 1400573

Common Values

Value | Count | Frequency (%)
1400573 | 9803 | 98.0%
1541002 | 119 | 1.2%
1400377 | 70 | 0.7%
1390860 | 8 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
1400573 | 9803 | 98.0%
1541002 | 119 | 1.2%
1400377 | 70 | 0.7%
1390860 | 8 | 0.1%

INSTT_CD_KOREA_NM
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
국립산림품종관리센터 (National Forest Seed Variety Center) | 9803
농림축산검역본부 (Animal and Plant Quarantine Agency) | 119
국립산림과학원 (National Institute of Forest Science) | 70
국립농업과학원 (National Institute of Agricultural Sciences) | 8

Length

Max length: 10
Median length: 10
Mean length: 9.9528
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국립산림품종관리센터
2nd row: 국립산림품종관리센터
3rd row: 국립산림품종관리센터
4th row: 국립산림품종관리센터
5th row: 국립산림품종관리센터

Common Values

Value | Count | Frequency (%)
국립산림품종관리센터 | 9803 | 98.0%
농림축산검역본부 | 119 | 1.2%
국립산림과학원 | 70 | 0.7%
국립농업과학원 | 8 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
국립산림품종관리센터 | 9803 | 98.0%
농림축산검역본부 | 119 | 1.2%
국립산림과학원 | 70 | 0.7%
국립농업과학원 | 8 | 0.1%

DETAIL_INFO_URL
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 9
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
<NA> | 9873
http://kvcc.kahis.go.kr/pms/web/gene/pmsGeneMain.do?iNo=3&iType=05&searchSale=0&searchCondition=&searchKeyword=&searchKeyword1=&searchKeyword2=&searchKeyword3= | 118
http://www.genebank.go.kr/gp/resourceInfoSearch/microbe/microbe_search_view.jsp?sFlag=ONE&sStrainsn=2054 | 2
http://www.genebank.go.kr/gp/resourceInfoSearch/microbe/microbe_search_view.jsp?sFlag=ONE&sStrainsn=2043 | 2
http://www.genebank.go.kr/gp/resourceInfoSearch/microbe/microbe_search_view.jsp?sFlag=ONE&sStrainsn=2332 | 1
Other values (4) | 4

Length

Max length: 177
Median length: 4
Mean length: 5.9263
Min length: 4

Unique

Unique: 5
Unique (%): < 0.1%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 9873 | 98.7%
http://kvcc.kahis.go.kr/pms/web/gene/pmsGeneMain.do?iNo=3&iType=05&searchSale=0&searchCondition=&searchKeyword=&searchKeyword1=&searchKeyword2=&searchKeyword3= | 118 | 1.2%
http://www.genebank.go.kr/gp/resourceInfoSearch/microbe/microbe_search_view.jsp?sFlag=ONE&sStrainsn=2054 | 2 | < 0.1%
http://www.genebank.go.kr/gp/resourceInfoSearch/microbe/microbe_search_view.jsp?sFlag=ONE&sStrainsn=2043 | 2 | < 0.1%
http://www.genebank.go.kr/gp/resourceInfoSearch/microbe/microbe_search_view.jsp?sFlag=ONE&sStrainsn=2332 | 1 | < 0.1%
http://www.genebank.go.kr/gp/resourceInfoSearch/microbe/microbe_search_view.jsp?sFlag=ONE&sStrainsn=2333 | 1 | < 0.1%
http://www.genebank.go.kr/gp/resourceInfoSearch/microbe/microbe_search_view.jsp?sFlag=ONE&sStrainsn=2351 | 1 | < 0.1%
http://www.genebank.go.kr/gp/resourceInfoSearch/microbe/microbe_search_view.jsp?sFlag=ONE&sStrainsn=2331 | 1 | < 0.1%
http://kvcc.kahis.go.kr/pms/web/gene/pmsGeneMain.do?iNo=3&iType=05&searchKeyword1=&searchKeyword2=&searchKeyword3=&pageIndex=1&searchCondition=1&searchKeyword=horse&searchSale=0 | 1 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 9873 | 98.7%
http://kvcc.kahis.go.kr/pms/web/gene/pmsgenemain.do?ino=3&itype=05&searchsale=0&searchcondition=&searchkeyword=&searchkeyword1=&searchkeyword2=&searchkeyword3 | 118 | 1.2%
http://www.genebank.go.kr/gp/resourceinfosearch/microbe/microbe_search_view.jsp?sflag=one&sstrainsn=2054 | 2 | < 0.1%
http://www.genebank.go.kr/gp/resourceinfosearch/microbe/microbe_search_view.jsp?sflag=one&sstrainsn=2043 | 2 | < 0.1%
http://www.genebank.go.kr/gp/resourceinfosearch/microbe/microbe_search_view.jsp?sflag=one&sstrainsn=2332 | 1 | < 0.1%
http://www.genebank.go.kr/gp/resourceinfosearch/microbe/microbe_search_view.jsp?sflag=one&sstrainsn=2333 | 1 | < 0.1%
http://www.genebank.go.kr/gp/resourceinfosearch/microbe/microbe_search_view.jsp?sflag=one&sstrainsn=2351 | 1 | < 0.1%
http://www.genebank.go.kr/gp/resourceinfosearch/microbe/microbe_search_view.jsp?sflag=one&sstrainsn=2331 | 1 | < 0.1%
http://kvcc.kahis.go.kr/pms/web/gene/pmsgenemain.do?ino=3&itype=05&searchkeyword1=&searchkeyword2=&searchkeyword3=&pageindex=1&searchcondition=1&searchkeyword=horse&searchsale=0 | 1 | < 0.1%

IMAGE_URL
Categorical

HIGH CORRELATION 

Distinct: 19
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg | 3859
http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 2123
<NA> | 1657
http://www.forest.go.kr/images/fgri/2012/image/10009446-23798-01.jpg | 1071
http://www.forest.go.kr/images/fgri/2012/image/10006529-22541-01.jpg | 360
Other values (14) | 930

Length

Max length: 77
Median length: 68
Mean length: 57.4069
Min length: 4

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: http://www.forest.go.kr/images/fgri/2012/image/10006529-22541-01.jpg
2nd row: http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg
3rd row: http://www.forest.go.kr/images/fgri/2012/image/10008840-24804-01.jpg
4th row: http://www.forest.go.kr/images/fgri/2012/image/10006529-22541-01.jpg
5th row: http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg

Common Values

Value | Count | Frequency (%)
http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg | 3859 | 38.6%
http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 2123 | 21.2%
<NA> | 1657 | 16.6%
http://www.forest.go.kr/images/fgri/2012/image/10009446-23798-01.jpg | 1071 | 10.7%
http://www.forest.go.kr/images/fgri/2012/image/10006529-22541-01.jpg | 360 | 3.6%
http://www.forest.go.kr/images/fgri/2012/image/10000003-xxxxx-01.jpg | 343 | 3.4%
http://www.forest.go.kr/images/fgri/2012/image/10005032-21580-01.jpg | 332 | 3.3%
http://www.forest.go.kr/images/fgri/2012/image/10012832-28583-01.jpg | 85 | 0.9%
http://www.forest.go.kr/images/fgri/2012/image/10008840-24804-01.jpg | 63 | 0.6%
http://www.forest.go.kr/images/fgri/2012/image/10000548-xxxxx-02.jpg | 39 | 0.4%
Other values (9) | 68 | 0.7%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg | 3859 | 38.6%
http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 2123 | 21.2%
na | 1657 | 16.6%
http://www.forest.go.kr/images/fgri/2012/image/10009446-23798-01.jpg | 1071 | 10.7%
http://www.forest.go.kr/images/fgri/2012/image/10006529-22541-01.jpg | 360 | 3.6%
http://www.forest.go.kr/images/fgri/2012/image/10000003-xxxxx-01.jpg | 343 | 3.4%
http://www.forest.go.kr/images/fgri/2012/image/10005032-21580-01.jpg | 332 | 3.3%
http://www.forest.go.kr/images/fgri/2012/image/10012832-28583-01.jpg | 85 | 0.9%
http://www.forest.go.kr/images/fgri/2012/image/10008840-24804-01.jpg | 63 | 0.6%
http://www.forest.go.kr/images/fgri/2012/image/10000548-xxxxx-02.jpg | 39 | 0.4%
Other values (9) | 68 | 0.7%

OUTNATN_TKOUT_AT
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
불가능 (not possible) | 9865
가능 (possible) | 135

Length

Max length: 3
Median length: 3
Mean length: 2.9865
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 불가능
2nd row: 불가능
3rd row: 불가능
4th row: 불가능
5th row: 불가능

Common Values

Value | Count | Frequency (%)
불가능 | 9865 | 98.7%
가능 | 135 | 1.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
불가능 | 9865 | 98.7%
가능 | 135 | 1.4%

SPCIES_PRTC_APLC_AT
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

LIFE_RESRCE_LTTOT_AT
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
- | 9874
불가능 (not possible) | 119
가능 (possible) | 7

Length

Max length: 3
Median length: 1
Mean length: 1.0245
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: -
2nd row: -
3rd row: -
4th row: -
5th row: -

Common Values

Value | Count | Frequency (%)
- | 9874 | 98.7%
불가능 | 119 | 1.2%
가능 | 7 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
 | 9874 | 98.7%
불가능 | 119 | 1.2%
가능 | 7 | 0.1%

LAST_UPDT_DE
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
20121216 | 4963
<NA> | 4903
20120105 | 118
20141014 | 8
20191115 | 7

Length

Max length: 8
Median length: 8
Mean length: 6.0388
Min length: 4

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: <NA>
2nd row: 20121216
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
20121216 | 4963 | 49.6%
<NA> | 4903 | 49.0%
20120105 | 118 | 1.2%
20141014 | 8 | 0.1%
20191115 | 7 | 0.1%
20140515 | 1 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
20121216 | 4963 | 49.6%
na | 4903 | 49.0%
20120105 | 118 | 1.2%
20141014 | 8 | 0.1%
20191115 | 7 | 0.1%
20140515 | 1 | < 0.1%
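
LAST_UPDT_DE is profiled as a categorical string, although its values look like compact dates. The sketch below assumes they are in YYYYMMDD form and that the literal string "<NA>" stands for a missing date; both are assumptions read off the values above, and the function name is illustrative.

```python
from datetime import date, datetime


def parse_yyyymmdd(value):
    """Parse a compact YYYYMMDD string into a date; return None for missing markers."""
    if value is None or value == "<NA>":
        return None
    return datetime.strptime(value, "%Y%m%d").date()


# Values copied from the Common Values table of LAST_UPDT_DE.
for raw in ["20121216", "<NA>", "20120105", "20191115"]:
    print(raw, "->", parse_yyyymmdd(raw))
```

Converting the column this way before profiling would let it be treated as a date rather than as a six-level category.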

Correlations

Variable | LIFE_RESRCE_STLE_CD_NM | LIFE_RESRCE_KND_CD_NM | SCNCENM_CD | SCNCENM | TNOAC | INSTT_CD | INSTT_CD_KOREA_NM | DETAIL_INFO_URL | IMAGE_URL | OUTNATN_TKOUT_AT | LIFE_RESRCE_LTTOT_AT | LAST_UPDT_DE
LIFE_RESRCE_STLE_CD_NM | 1.000 | 0.994 | 1.000 | 0.950 | 0.066 | 0.000 | 0.000 | NaN | 1.000 | 0.323 | 1.000 | 1.000
LIFE_RESRCE_KND_CD_NM | 0.994 | 1.000 | 1.000 | 0.950 | 0.066 | 0.000 | 0.000 | NaN | 1.000 | 0.323 | 1.000 | 1.000
SCNCENM_CD | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.995 | 0.995 | 0.977 | 1.000 | 1.000 | 1.000 | 0.999
SCNCENM | 0.950 | 0.950 | 1.000 | 1.000 | 1.000 | 0.988 | 0.988 | 0.975 | 1.000 | 1.000 | 0.986 | 0.994
TNOAC | 0.066 | 0.066 | 1.000 | 1.000 | 1.000 | 0.979 | 0.979 | 1.000 | 1.000 | 0.999 | 0.893 | 0.971
INSTT_CD | 0.000 | 0.000 | 0.995 | 0.988 | 0.979 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999 | 0.676 | 0.841
INSTT_CD_KOREA_NM | 0.000 | 0.000 | 0.995 | 0.988 | 0.979 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999 | 0.676 | 0.841
DETAIL_INFO_URL | NaN | NaN | 0.977 | 0.975 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | NaN | 1.000 | 1.000
IMAGE_URL | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
OUTNATN_TKOUT_AT | 0.323 | 0.323 | 1.000 | 1.000 | 0.999 | 0.999 | 0.999 | NaN | 1.000 | 1.000 | 0.732 | 0.919
LIFE_RESRCE_LTTOT_AT | 1.000 | 1.000 | 1.000 | 0.986 | 0.893 | 0.676 | 0.676 | 1.000 | 1.000 | 0.732 | 1.000 | 1.000
LAST_UPDT_DE | 1.000 | 1.000 | 0.999 | 0.994 | 0.971 | 0.841 | 0.841 | 1.000 | 1.000 | 0.919 | 1.000 | 1.000

Variable | INSTT_CD_KOREA_NM | LIFE_RESRCE_STLE_CD_NM | DETAIL_INFO_URL | LAST_UPDT_DE | OUTNATN_TKOUT_AT | IMAGE_URL | LIFE_RESRCE_LTTOT_AT | LIFE_RESRCE_KND_CD_NM | INSTT_CD
INSTT_CD_KOREA_NM | 1.000 | 0.000 | 0.976 | 0.816 | 0.969 | 0.999 | 0.707 | 0.000 | 1.000
LIFE_RESRCE_STLE_CD_NM | 0.000 | 1.000 | 1.000 | 1.000 | 0.210 | 0.999 | 1.000 | 0.929 | 0.000
DETAIL_INFO_URL | 0.976 | 1.000 | 1.000 | 0.980 | 1.000 | 0.905 | 0.976 | 1.000 | 0.976
LAST_UPDT_DE | 0.816 | 1.000 | 0.980 | 1.000 | 0.996 | 0.999 | 1.000 | 1.000 | 0.816
OUTNATN_TKOUT_AT | 0.969 | 0.210 | 1.000 | 0.996 | 1.000 | 0.999 | 0.966 | 0.210 | 0.969
IMAGE_URL | 0.999 | 0.999 | 0.905 | 0.999 | 0.999 | 1.000 | 0.999 | 0.999 | 0.999
LIFE_RESRCE_LTTOT_AT | 0.707 | 1.000 | 0.976 | 1.000 | 0.966 | 0.999 | 1.000 | 1.000 | 0.707
LIFE_RESRCE_KND_CD_NM | 0.000 | 0.929 | 1.000 | 1.000 | 0.210 | 0.999 | 1.000 | 1.000 | 0.000
INSTT_CD | 1.000 | 0.000 | 0.976 | 0.816 | 0.969 | 0.999 | 0.707 | 0.000 | 1.000

Variable | LIFE_RESRCE_STLE_CD_NM | LIFE_RESRCE_KND_CD_NM | INSTT_CD | INSTT_CD_KOREA_NM | DETAIL_INFO_URL | IMAGE_URL | OUTNATN_TKOUT_AT | LIFE_RESRCE_LTTOT_AT | LAST_UPDT_DE
LIFE_RESRCE_STLE_CD_NM | 1.000 | 0.929 | 0.000 | 0.000 | 1.000 | 0.999 | 0.210 | 1.000 | 1.000
LIFE_RESRCE_KND_CD_NM | 0.929 | 1.000 | 0.000 | 0.000 | 1.000 | 0.999 | 0.210 | 1.000 | 1.000
INSTT_CD | 0.000 | 0.000 | 1.000 | 1.000 | 0.976 | 0.999 | 0.969 | 0.707 | 0.816
INSTT_CD_KOREA_NM | 0.000 | 0.000 | 1.000 | 1.000 | 0.976 | 0.999 | 0.969 | 0.707 | 0.816
DETAIL_INFO_URL | 1.000 | 1.000 | 0.976 | 0.976 | 1.000 | 0.905 | 1.000 | 0.976 | 0.980
IMAGE_URL | 0.999 | 0.999 | 0.999 | 0.999 | 0.905 | 1.000 | 0.999 | 0.999 | 0.999
OUTNATN_TKOUT_AT | 0.210 | 0.210 | 0.969 | 0.969 | 1.000 | 0.999 | 1.000 | 0.966 | 0.996
LIFE_RESRCE_LTTOT_AT | 1.000 | 1.000 | 0.707 | 0.707 | 0.976 | 0.999 | 0.966 | 1.000 | 1.000
LAST_UPDT_DE | 1.000 | 1.000 | 0.816 | 0.816 | 0.980 | 0.999 | 0.996 | 1.000 | 1.000
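
The report does not state which association measure produced these values. Cramér's V is one common choice for pairs of categorical variables, and the sketch below uses the plain (uncorrected) form purely as an illustration. The function name is hypothetical, and the pairing of institution codes with institution names is an assumption inferred from their matching counts.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V for two equal-length categorical sequences."""
    n = len(xs)
    rows = Counter(xs)
    cols = Counter(ys)
    joint = Counter(zip(xs, ys))
    k = min(len(rows), len(cols)) - 1
    if n == 0 or k == 0:
        return float("nan")  # undefined when either variable is constant
    chi2 = 0.0
    for r, r_count in rows.items():
        for c, c_count in cols.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * k))


# Assumed one-to-one pairing of INSTT_CD and INSTT_CD_KOREA_NM, with report counts.
pairs = [
    ("1400573", "국립산림품종관리센터", 9803),
    ("1541002", "농림축산검역본부", 119),
    ("1400377", "국립산림과학원", 70),
    ("1390860", "국립농업과학원", 8),
]
codes = [code for code, _, count in pairs for _ in range(count)]
names = [name for _, name, count in pairs for _ in range(count)]
print(round(cramers_v(codes, names), 3))
```

A one-to-one mapping such as code to name gives a V of 1, in line with the 1.000 shown for INSTT_CD and INSTT_CD_KOREA_NM. Other cells in the tables may come from a bias-corrected or different measure, so they need not match this plain form.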

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
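
Several columns in this report list "<NA>" as an ordinary value while showing 0 missing cells (for example DETAIL_INFO_URL, with 9873 occurrences). One possible reading is that the placeholder was loaded as a literal string. The sketch below counts such placeholders as missing; the helper name and the placeholder set are assumptions, and the rows are abbreviated from the sample rows of this report.

```python
def count_missing(rows, placeholders=("<NA>",)):
    """Count, per column, cells that are None or equal to a placeholder string."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            is_missing = value is None or value in placeholders
            counts[column] = counts.get(column, 0) + int(is_missing)
    return counts


# Three rows reduced to four columns, values copied from the report's sample.
rows = [
    {"RESRCE_NO": "14005732610300-030-00054899", "DETAIL_INFO_URL": "<NA>",
     "IMAGE_URL": "http://www.forest.go.kr/images/fgri/2012/image/10006529-22541-01.jpg",
     "LAST_UPDT_DE": "<NA>"},
    {"RESRCE_NO": "14005732610300-030-00017179", "DETAIL_INFO_URL": "<NA>",
     "IMAGE_URL": "http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg",
     "LAST_UPDT_DE": "20121216"},
    {"RESRCE_NO": "14005732610300-030-00015038", "DETAIL_INFO_URL": "<NA>",
     "IMAGE_URL": "<NA>", "LAST_UPDT_DE": "20121216"},
]
print(count_missing(rows))
```

Normalizing placeholders before profiling would move these cells from the value tables into the missing-value counts.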

Sample

(index) | RESRCE_NO | LIFE_RESRCE_STLE_CD | LIFE_RESRCE_STLE_CD_NM | LIFE_RESRCE_KND_CD | LIFE_RESRCE_KND_CD_NM | SCNCENM_CD | SCNCENM | TNOAC | INSTT_CD | INSTT_CD_KOREA_NM | DETAIL_INFO_URL | IMAGE_URL | OUTNATN_TKOUT_AT | SPCIES_PRTC_APLC_AT | LIFE_RESRCE_LTTOT_AT | LAST_UPDT_DE
45464 | 14005732610300-030-00054899 | 1 | 유전자 | 1 | 핵산정보 | BSD0002425818 | Betula platyphylla var. japonica (Miq.) Hara | 자작나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10006529-22541-01.jpg | 불가능 | <NA> | - | <NA>
19882 | 14005732610300-030-00017179 | 1 | 유전자 | 1 | 핵산정보 | BSD0002997219 | Pinus koraiensis Siebold & Zucc. | 잣나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 불가능 | <NA> | - | 20121216
43213 | 14005732610300-030-00055333 | 1 | 유전자 | 1 | 핵산정보 | BSD0000442344 | Zelkova serrata (Thunb.) Makino | 느티나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10008840-24804-01.jpg | 불가능 | <NA> | - | <NA>
45064 | 14005732610300-030-00052302 | 1 | 유전자 | 1 | 핵산정보 | BSD0002425818 | Betula platyphylla var. japonica (Miq.) Hara | 자작나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10006529-22541-01.jpg | 불가능 | <NA> | - | <NA>
12569 | 14005732610300-030-00046974 | 1 | 유전자 | 1 | 핵산정보 | BSD0001596891 | Pinus densiflora Siebold & Zucc. | 소나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg | 불가능 | <NA> | - | <NA>
19678 | 14005732610300-030-00021582 | 1 | 유전자 | 1 | 핵산정보 | BSD0002997219 | Pinus koraiensis Siebold & Zucc. | 잣나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 불가능 | <NA> | - | 20121216
37136 | 14005732610300-030-00035754 | 1 | 유전자 | 1 | 핵산정보 | BSD0002997219 | Pinus koraiensis Siebold & Zucc. | 잣나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 불가능 | <NA> | - | <NA>
31662 | 14005732610300-030-00015038 | 1 | 유전자 | 1 | 핵산정보 | BSD0000135721 | Larix kaempferi (Lamb.) Carriere | 일본잎갈나무 | 1400573 | 국립산림품종관리센터 | <NA> | <NA> | 불가능 | <NA> | - | 20121216
37828 | 14005732610300-030-00065010 | 1 | 유전자 | 1 | 핵산정보 | BSD0000128072 | Chamaecyparis obtusa (Siebold & Zucc.) Endl. | 편백 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10009446-23798-01.jpg | 불가능 | <NA> | - | <NA>
2252 | 14005732610300-030-00009616 | 1 | 유전자 | 1 | 핵산정보 | BSD0001596891 | Pinus densiflora S. et Z. | 소나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg | 불가능 | <NA> | - | 20121216

(index) | RESRCE_NO | LIFE_RESRCE_STLE_CD | LIFE_RESRCE_STLE_CD_NM | LIFE_RESRCE_KND_CD | LIFE_RESRCE_KND_CD_NM | SCNCENM_CD | SCNCENM | TNOAC | INSTT_CD | INSTT_CD_KOREA_NM | DETAIL_INFO_URL | IMAGE_URL | OUTNATN_TKOUT_AT | SPCIES_PRTC_APLC_AT | LIFE_RESRCE_LTTOT_AT | LAST_UPDT_DE
10750 | 14005732610300-030-00045734 | 1 | 유전자 | 1 | 핵산정보 | BSD0001596891 | Pinus densiflora Siebold & Zucc. | 소나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg | 불가능 | <NA> | - | <NA>
30165 | 14005732610300-030-00069848 | 1 | 유전자 | 1 | 핵산정보 | BSD0000128072 | Chamaecyparis obtusa (Siebold & Zucc.) Endl. | 편백 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10009446-23798-01.jpg | 불가능 | <NA> | - | <NA>
30888 | 14005732610300-030-00012121 | 1 | 유전자 | 1 | 핵산정보 | BSD0001596891 | Pinus densiflora S. et Z. | 소나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg | 불가능 | <NA> | - | 20121216
27593 | 14005732610300-030-00017225 | 1 | 유전자 | 1 | 핵산정보 | BSD0002997219 | Pinus koraiensis Siebold & Zucc. | 잣나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 불가능 | <NA> | - | 20121216
37260 | 14005732610300-030-00066705 | 1 | 유전자 | 1 | 핵산정보 | BSD0000128072 | Chamaecyparis obtusa (Siebold & Zucc.) Endl. | 편백 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10009446-23798-01.jpg | 불가능 | <NA> | - | <NA>
46659 | 14005732610300-030-00056333 | 1 | 유전자 | 1 | 핵산정보 | BSD0001934957 | Quercus variabilis Blume | 굴참나무 | 1400573 | 국립산림품종관리센터 | <NA> | <NA> | 불가능 | <NA> | - | <NA>
25249 | 14005732610300-030-00019449 | 1 | 유전자 | 1 | 핵산정보 | BSD0002997219 | Pinus koraiensis Siebold & Zucc. | 잣나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 불가능 | <NA> | - | 20121216
18398 | 14005732610300-030-00021634 | 1 | 유전자 | 1 | 핵산정보 | BSD0002997219 | Pinus koraiensis Siebold & Zucc. | 잣나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 불가능 | <NA> | - | 20121216
35894 | 14005732610300-030-00036450 | 1 | 유전자 | 1 | 핵산정보 | BSD0002997219 | Pinus koraiensis Siebold & Zucc. | 잣나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 불가능 | <NA> | - | <NA>
19429 | 14005732610300-030-00017144 | 1 | 유전자 | 1 | 핵산정보 | BSD0002997219 | Pinus koraiensis Siebold & Zucc. | 잣나무 | 1400573 | 국립산림품종관리센터 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 불가능 | <NA> | - | 20121216