Overview

Dataset statistics

Number of variables7
Number of observations601
Missing cells428
Missing cells (%)10.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory33.0 KiB
Average record size in memory56.2 B

Variable types

Text7

Dataset

Description한국한의약진흥원에서 생산 및 도출된 실험정보기반의 정형데이터(표준용어 : 한약재ID, 일반명, 영문명, 라틴명, 기원종, 학명, 약용부위)로 이루어져 있습니다.
Author한국한의약진흥원
URLhttps://www.data.go.kr/data/15109111/fileData.do

Alerts

영문명 has 333 (55.4%) missing valuesMissing
라틴명 has 10 (1.7%) missing valuesMissing
학명목록 has 40 (6.7%) missing valuesMissing
약용부위 has 45 (7.5%) missing valuesMissing
한약재아이디 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:55:09.460704
Analysis finished2023-12-12 09:55:10.571134
Duration1.11 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

한약재아이디
Text

UNIQUE 

Distinct601
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2023-12-12T18:55:10.914248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length5.8702163
Min length4

Characters and Unicode

Total characters3528
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique601 ?
Unique (%)100.0%

Sample

1st rowKMH2
2nd rowKMH3
3rd rowKMH5
4th rowKMH6
5th rowKMH7
ValueCountFrequency (%)
kmh2 1
 
0.2%
kmh578 1
 
0.2%
kmh581 1
 
0.2%
kmh582 1
 
0.2%
kmh583 1
 
0.2%
kmh584 1
 
0.2%
kmh585 1
 
0.2%
kmh587 1
 
0.2%
kmh588 1
 
0.2%
kmh591 1
 
0.2%
Other values (591) 591
98.3%
2023-12-12T18:55:11.497102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
K 601
17.0%
M 601
17.0%
H 601
17.0%
5 197
 
5.6%
3 195
 
5.5%
6 195
 
5.5%
7 194
 
5.5%
1 189
 
5.4%
4 189
 
5.4%
2 178
 
5.0%
Other values (3) 388
11.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 1803
51.1%
Decimal Number 1725
48.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 197
11.4%
3 195
11.3%
6 195
11.3%
7 194
11.2%
1 189
11.0%
4 189
11.0%
2 178
10.3%
8 168
9.7%
9 110
6.4%
0 110
6.4%
Uppercase Letter
ValueCountFrequency (%)
K 601
33.3%
M 601
33.3%
H 601
33.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 1803
51.1%
Common 1725
48.9%

Most frequent character per script

Common
ValueCountFrequency (%)
5 197
11.4%
3 195
11.3%
6 195
11.3%
7 194
11.2%
1 189
11.0%
4 189
11.0%
2 178
10.3%
8 168
9.7%
9 110
6.4%
0 110
6.4%
Latin
ValueCountFrequency (%)
K 601
33.3%
M 601
33.3%
H 601
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3528
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
K 601
17.0%
M 601
17.0%
H 601
17.0%
5 197
 
5.6%
3 195
 
5.5%
6 195
 
5.5%
7 194
 
5.5%
1 189
 
5.4%
4 189
 
5.4%
2 178
 
5.0%
Other values (3) 388
11.0%
Distinct600
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2023-12-12T18:55:11.885727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length2.81198
Min length2

Characters and Unicode

Total characters1690
Distinct characters291
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique599 ?
Unique (%)99.7%

Sample

1st row가자
2nd row갈근
3rd row갈화
4th row감국
5th row감송향
ValueCountFrequency (%)
진피 2
 
0.3%
익모초 1
 
0.2%
자소엽 1
 
0.2%
일당귀 1
 
0.2%
익지 1
 
0.2%
인도사목 1
 
0.2%
인동 1
 
0.2%
인삼 1
 
0.2%
인삼가루 1
 
0.2%
인진호 1
 
0.2%
Other values (590) 590
98.2%
2023-12-12T18:55:12.474565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
91
 
5.4%
59
 
3.5%
55
 
3.3%
54
 
3.2%
36
 
2.1%
31
 
1.8%
30
 
1.8%
28
 
1.7%
27
 
1.6%
26
 
1.5%
Other values (281) 1253
74.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1688
99.9%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
91
 
5.4%
59
 
3.5%
55
 
3.3%
54
 
3.2%
36
 
2.1%
31
 
1.8%
30
 
1.8%
28
 
1.7%
27
 
1.6%
26
 
1.5%
Other values (279) 1251
74.1%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1688
99.9%
Common 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
91
 
5.4%
59
 
3.5%
55
 
3.3%
54
 
3.2%
36
 
2.1%
31
 
1.8%
30
 
1.8%
28
 
1.7%
27
 
1.6%
26
 
1.5%
Other values (279) 1251
74.1%
Common
ValueCountFrequency (%)
( 1
50.0%
) 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1688
99.9%
ASCII 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
91
 
5.4%
59
 
3.5%
55
 
3.3%
54
 
3.2%
36
 
2.1%
31
 
1.8%
30
 
1.8%
28
 
1.7%
27
 
1.6%
26
 
1.5%
Other values (279) 1251
74.1%
ASCII
ValueCountFrequency (%)
( 1
50.0%
) 1
50.0%

영문명
Text

MISSING 

Distinct268
Distinct (%)100.0%
Missing333
Missing (%)55.4%
Memory size4.8 KiB
2023-12-12T18:55:12.881630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length26
Mean length14.738806
Min length4

Characters and Unicode

Total characters3950
Distinct characters53
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique268 ?
Unique (%)100.0%

Sample

1st rowTerminalia Fruit
2nd rowPueraria Root
3rd rowPueraria Flower
4th rowLicorice
5th rowLicorice Powder
ValueCountFrequency (%)
powder 51
 
8.8%
root 51
 
8.8%
rhizome 31
 
5.3%
fruit 27
 
4.7%
seed 17
 
2.9%
bark 16
 
2.8%
herb 10
 
1.7%
leaf 8
 
1.4%
and 7
 
1.2%
peel 5
 
0.9%
Other values (260) 357
61.6%
2023-12-12T18:55:13.453231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 351
 
8.9%
e 346
 
8.8%
312
 
7.9%
r 291
 
7.4%
a 283
 
7.2%
i 268
 
6.8%
t 186
 
4.7%
n 174
 
4.4%
u 162
 
4.1%
l 144
 
3.6%
Other values (43) 1433
36.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3070
77.7%
Uppercase Letter 563
 
14.3%
Space Separator 312
 
7.9%
Other Punctuation 5
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 351
11.4%
e 346
11.3%
r 291
 
9.5%
a 283
 
9.2%
i 268
 
8.7%
t 186
 
6.1%
n 174
 
5.7%
u 162
 
5.3%
l 144
 
4.7%
d 131
 
4.3%
Other values (15) 734
23.9%
Uppercase Letter
ValueCountFrequency (%)
R 94
16.7%
P 93
16.5%
S 51
9.1%
A 43
7.6%
F 42
7.5%
C 40
7.1%
G 30
 
5.3%
B 30
 
5.3%
L 24
 
4.3%
T 17
 
3.0%
Other values (14) 99
17.6%
Other Punctuation
ValueCountFrequency (%)
, 3
60.0%
' 1
 
20.0%
. 1
 
20.0%
Space Separator
ValueCountFrequency (%)
312
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3633
92.0%
Common 317
 
8.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 351
 
9.7%
e 346
 
9.5%
r 291
 
8.0%
a 283
 
7.8%
i 268
 
7.4%
t 186
 
5.1%
n 174
 
4.8%
u 162
 
4.5%
l 144
 
4.0%
d 131
 
3.6%
Other values (39) 1297
35.7%
Common
ValueCountFrequency (%)
312
98.4%
, 3
 
0.9%
' 1
 
0.3%
. 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3950
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 351
 
8.9%
e 346
 
8.8%
312
 
7.9%
r 291
 
7.4%
a 283
 
7.2%
i 268
 
6.8%
t 186
 
4.7%
n 174
 
4.4%
u 162
 
4.1%
l 144
 
3.6%
Other values (43) 1433
36.3%

라틴명
Text

MISSING 

Distinct591
Distinct (%)100.0%
Missing10
Missing (%)1.7%
Memory size4.8 KiB
2023-12-12T18:55:13.858679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length42
Mean length19.162437
Min length4

Characters and Unicode

Total characters11325
Distinct characters62
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique591 ?
Unique (%)100.0%

Sample

1st rowTerminaliae Fructus
2nd rowPuerariae Radix
3rd rowPuerariae Flos
4th rowChrysanthemi Indici Flos
5th rowNardostachyos Radix et Rhizoma
ValueCountFrequency (%)
radix 98
 
6.7%
rhizoma 68
 
4.6%
fructus 62
 
4.2%
herba 59
 
4.0%
semen 59
 
4.0%
pulvis 47
 
3.2%
et 37
 
2.5%
cortex 25
 
1.7%
preparata 22
 
1.5%
radicis 20
 
1.4%
Other values (609) 975
66.2%
2023-12-12T18:55:14.695138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 1231
 
10.9%
a 1124
 
9.9%
e 906
 
8.0%
881
 
7.8%
r 643
 
5.7%
u 552
 
4.9%
s 552
 
4.9%
o 516
 
4.6%
t 450
 
4.0%
n 440
 
3.9%
Other values (52) 4030
35.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 9016
79.6%
Uppercase Letter 1416
 
12.5%
Space Separator 881
 
7.8%
Decimal Number 5
 
< 0.1%
Other Punctuation 3
 
< 0.1%
Other Letter 2
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 1231
13.7%
a 1124
12.5%
e 906
10.0%
r 643
 
7.1%
u 552
 
6.1%
s 552
 
6.1%
o 516
 
5.7%
t 450
 
5.0%
n 440
 
4.9%
l 431
 
4.8%
Other values (16) 2171
24.1%
Uppercase Letter
ValueCountFrequency (%)
R 245
17.3%
P 163
11.5%
C 155
10.9%
S 151
10.7%
F 124
8.8%
A 87
 
6.1%
H 79
 
5.6%
G 58
 
4.1%
L 53
 
3.7%
M 47
 
3.3%
Other values (15) 254
17.9%
Decimal Number
ValueCountFrequency (%)
2 2
40.0%
0 1
20.0%
1 1
20.0%
9 1
20.0%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
' 1
33.3%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
881
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 10432
92.1%
Common 891
 
7.9%
Hangul 2
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 1231
 
11.8%
a 1124
 
10.8%
e 906
 
8.7%
r 643
 
6.2%
u 552
 
5.3%
s 552
 
5.3%
o 516
 
4.9%
t 450
 
4.3%
n 440
 
4.2%
l 431
 
4.1%
Other values (41) 3587
34.4%
Common
ValueCountFrequency (%)
881
98.9%
2 2
 
0.2%
. 2
 
0.2%
) 1
 
0.1%
0 1
 
0.1%
1 1
 
0.1%
9 1
 
0.1%
' 1
 
0.1%
( 1
 
0.1%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 11323
> 99.9%
Hangul 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 1231
 
10.9%
a 1124
 
9.9%
e 906
 
8.0%
881
 
7.8%
r 643
 
5.7%
u 552
 
4.9%
s 552
 
4.9%
o 516
 
4.6%
t 450
 
4.0%
n 440
 
3.9%
Other values (50) 4028
35.6%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct494
Distinct (%)82.2%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2023-12-12T18:55:15.023600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length33
Mean length7.1281198
Min length1

Characters and Unicode

Total characters4284
Distinct characters460
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique417 ?
Unique (%)69.4%

Sample

1st row가자, 융모가자
2nd row
3rd row
4th row감국
5th row감송, 시엽감송
ValueCountFrequency (%)
기타 35
 
3.4%
동속식물 14
 
1.4%
근연식물 13
 
1.3%
동속근연식물 11
 
1.1%
동속 10
 
1.0%
근연동물 6
 
0.6%
변종 6
 
0.6%
황산염광물 6
 
0.6%
두충 5
 
0.5%
인삼 5
 
0.5%
Other values (712) 910
89.1%
2023-12-12T18:55:15.518636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
420
 
9.8%
, 330
 
7.7%
111
 
2.6%
105
 
2.5%
81
 
1.9%
69
 
1.6%
68
 
1.6%
68
 
1.6%
52
 
1.2%
52
 
1.2%
Other values (450) 2928
68.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3169
74.0%
Space Separator 420
 
9.8%
Lowercase Letter 334
 
7.8%
Other Punctuation 333
 
7.8%
Uppercase Letter 21
 
0.5%
Decimal Number 5
 
0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
111
 
3.5%
105
 
3.3%
81
 
2.6%
69
 
2.2%
68
 
2.1%
68
 
2.1%
52
 
1.6%
52
 
1.6%
51
 
1.6%
47
 
1.5%
Other values (412) 2465
77.8%
Lowercase Letter
ValueCountFrequency (%)
a 50
15.0%
i 34
10.2%
e 28
8.4%
t 27
 
8.1%
l 25
 
7.5%
r 22
 
6.6%
s 21
 
6.3%
c 20
 
6.0%
u 20
 
6.0%
n 18
 
5.4%
Other values (11) 69
20.7%
Uppercase Letter
ValueCountFrequency (%)
A 7
33.3%
C 4
19.0%
G 2
 
9.5%
P 2
 
9.5%
O 2
 
9.5%
H 1
 
4.8%
B 1
 
4.8%
V 1
 
4.8%
S 1
 
4.8%
Decimal Number
ValueCountFrequency (%)
1 3
60.0%
3 1
 
20.0%
7 1
 
20.0%
Other Punctuation
ValueCountFrequency (%)
, 330
99.1%
. 3
 
0.9%
Space Separator
ValueCountFrequency (%)
420
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3169
74.0%
Common 760
 
17.7%
Latin 355
 
8.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
111
 
3.5%
105
 
3.3%
81
 
2.6%
69
 
2.2%
68
 
2.1%
68
 
2.1%
52
 
1.6%
52
 
1.6%
51
 
1.6%
47
 
1.5%
Other values (412) 2465
77.8%
Latin
ValueCountFrequency (%)
a 50
14.1%
i 34
 
9.6%
e 28
 
7.9%
t 27
 
7.6%
l 25
 
7.0%
r 22
 
6.2%
s 21
 
5.9%
c 20
 
5.6%
u 20
 
5.6%
n 18
 
5.1%
Other values (20) 90
25.4%
Common
ValueCountFrequency (%)
420
55.3%
, 330
43.4%
. 3
 
0.4%
1 3
 
0.4%
3 1
 
0.1%
( 1
 
0.1%
7 1
 
0.1%
) 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3169
74.0%
ASCII 1115
 
26.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
420
37.7%
, 330
29.6%
a 50
 
4.5%
i 34
 
3.0%
e 28
 
2.5%
t 27
 
2.4%
l 25
 
2.2%
r 22
 
2.0%
s 21
 
1.9%
c 20
 
1.8%
Other values (28) 138
 
12.4%
Hangul
ValueCountFrequency (%)
111
 
3.5%
105
 
3.3%
81
 
2.6%
69
 
2.2%
68
 
2.1%
68
 
2.1%
52
 
1.6%
52
 
1.6%
51
 
1.6%
47
 
1.5%
Other values (412) 2465
77.8%

학명목록
Text

MISSING 

Distinct462
Distinct (%)82.4%
Missing40
Missing (%)6.7%
Memory size4.8 KiB
2023-12-12T18:55:15.790890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length183
Median length117
Mean length43.046346
Min length9

Characters and Unicode

Total characters24149
Distinct characters60
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique389 ?
Unique (%)69.3%

Sample

1st rowTerminalia chebula Retzins, Terminalia chebula Retzins var. tomentella Kurt.
2nd rowPueraria lobata Ohwi
3rd rowPueraria lobata Ohwi
4th rowChrysanthemum indicum Linn.
5th rowNardostachys chinensis Batal, Nardostachys jatamansi DC.
ValueCountFrequency (%)
linn 160
 
5.3%
et 63
 
2.1%
var 61
 
2.0%
ex 26
 
0.9%
japonica 26
 
0.9%
maximowicz 23
 
0.8%
siebold 21
 
0.7%
c 20
 
0.7%
nakai 19
 
0.6%
thunberg 19
 
0.6%
Other values (1317) 2563
85.4%
2023-12-12T18:55:16.236651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2441
 
10.1%
a 2304
 
9.5%
i 2120
 
8.8%
n 1609
 
6.7%
e 1541
 
6.4%
r 1280
 
5.3%
o 1154
 
4.8%
s 1152
 
4.8%
u 1117
 
4.6%
l 1028
 
4.3%
Other values (50) 8403
34.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 18927
78.4%
Space Separator 2441
 
10.1%
Uppercase Letter 1981
 
8.2%
Other Punctuation 693
 
2.9%
Close Punctuation 52
 
0.2%
Open Punctuation 52
 
0.2%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 2304
12.2%
i 2120
11.2%
n 1609
 
8.5%
e 1541
 
8.1%
r 1280
 
6.8%
o 1154
 
6.1%
s 1152
 
6.1%
u 1117
 
5.9%
l 1028
 
5.4%
t 894
 
4.7%
Other values (16) 4728
25.0%
Uppercase Letter
ValueCountFrequency (%)
L 265
13.4%
C 194
 
9.8%
S 163
 
8.2%
P 144
 
7.3%
M 141
 
7.1%
A 137
 
6.9%
B 130
 
6.6%
H 94
 
4.7%
G 88
 
4.4%
R 85
 
4.3%
Other values (16) 540
27.3%
Other Punctuation
ValueCountFrequency (%)
. 424
61.2%
, 266
38.4%
& 3
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 51
98.1%
1
 
1.9%
Space Separator
ValueCountFrequency (%)
2441
100.0%
Close Punctuation
ValueCountFrequency (%)
) 52
100.0%
Math Symbol
ValueCountFrequency (%)
= 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 20908
86.6%
Common 3241
 
13.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 2304
 
11.0%
i 2120
 
10.1%
n 1609
 
7.7%
e 1541
 
7.4%
r 1280
 
6.1%
o 1154
 
5.5%
s 1152
 
5.5%
u 1117
 
5.3%
l 1028
 
4.9%
t 894
 
4.3%
Other values (42) 6709
32.1%
Common
ValueCountFrequency (%)
2441
75.3%
. 424
 
13.1%
, 266
 
8.2%
) 52
 
1.6%
( 51
 
1.6%
= 3
 
0.1%
& 3
 
0.1%
1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 24148
> 99.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2441
 
10.1%
a 2304
 
9.5%
i 2120
 
8.8%
n 1609
 
6.7%
e 1541
 
6.4%
r 1280
 
5.3%
o 1154
 
4.8%
s 1152
 
4.8%
u 1117
 
4.6%
l 1028
 
4.3%
Other values (49) 8402
34.8%
None
ValueCountFrequency (%)
1
100.0%

약용부위
Text

MISSING 

Distinct149
Distinct (%)26.8%
Missing45
Missing (%)7.5%
Memory size4.8 KiB
2023-12-12T18:55:16.590839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length34
Mean length4.1564748
Min length1

Characters and Unicode

Total characters2311
Distinct characters225
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique104 ?
Unique (%)18.7%

Sample

1st row열매
2nd row뿌리
3rd row꽃봉오리, 막 피기 시작한 꽃
4th row
5th row뿌리, 뿌리줄기
ValueCountFrequency (%)
뿌리 103
 
13.0%
뿌리줄기 74
 
9.4%
열매 59
 
7.5%
59
 
7.5%
전초 29
 
3.7%
지상부 27
 
3.4%
24
 
3.0%
줄기껍질 18
 
2.3%
16
 
2.0%
몸체 14
 
1.8%
Other values (213) 368
46.5%
2023-12-12T18:55:17.056765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
235
 
10.2%
221
 
9.6%
201
 
8.7%
130
 
5.6%
127
 
5.5%
70
 
3.0%
, 70
 
3.0%
69
 
3.0%
65
 
2.8%
55
 
2.4%
Other values (215) 1068
46.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2004
86.7%
Space Separator 235
 
10.2%
Other Punctuation 70
 
3.0%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
221
 
11.0%
201
 
10.0%
130
 
6.5%
127
 
6.3%
70
 
3.5%
69
 
3.4%
65
 
3.2%
55
 
2.7%
52
 
2.6%
46
 
2.3%
Other values (211) 968
48.3%
Space Separator
ValueCountFrequency (%)
235
100.0%
Other Punctuation
ValueCountFrequency (%)
, 70
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2003
86.7%
Common 307
 
13.3%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
221
 
11.0%
201
 
10.0%
130
 
6.5%
127
 
6.3%
70
 
3.5%
69
 
3.4%
65
 
3.2%
55
 
2.7%
52
 
2.6%
46
 
2.3%
Other values (210) 967
48.3%
Common
ValueCountFrequency (%)
235
76.5%
, 70
 
22.8%
) 1
 
0.3%
( 1
 
0.3%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2003
86.7%
ASCII 307
 
13.3%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
235
76.5%
, 70
 
22.8%
) 1
 
0.3%
( 1
 
0.3%
Hangul
ValueCountFrequency (%)
221
 
11.0%
201
 
10.0%
130
 
6.5%
127
 
6.3%
70
 
3.5%
69
 
3.4%
65
 
3.2%
55
 
2.7%
52
 
2.6%
46
 
2.3%
Other values (210) 967
48.3%
CJK
ValueCountFrequency (%)
1
100.0%

Missing values

2023-12-12T18:55:10.260233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:55:10.384016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T18:55:10.503055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

한약재아이디일반명영문명라틴명기원종목록학명목록약용부위
0KMH2가자Terminalia FruitTerminaliae Fructus가자, 융모가자Terminalia chebula Retzins, Terminalia chebula Retzins var. tomentella Kurt.열매
1KMH3갈근Pueraria RootPuerariae RadixPueraria lobata Ohwi뿌리
2KMH5갈화Pueraria FlowerPuerariae FlosPueraria lobata Ohwi꽃봉오리, 막 피기 시작한 꽃
3KMH6감국<NA>Chrysanthemi Indici Flos감국Chrysanthemum indicum Linn.
4KMH7감송향<NA>Nardostachyos Radix et Rhizoma감송, 시엽감송Nardostachys chinensis Batal, Nardostachys jatamansi DC.뿌리, 뿌리줄기
5KMH8감수<NA>Euphorbiae Kansui Radix감수Euphorbia kansui Liou ex Wang덩이뿌리
6KMH9감초LicoriceGlycyrrhizae Radix et Rhizoma감초, 광과감초, 창과감초Glycyrrhiza uralensis Fischer, Glycyrrhiza glabra Linn., Glycyrrhiza inflata Batal.뿌리, 뿌리줄기
7KMH10감초가루Licorice PowderPulvis Glycyrrhizae Radicis et Rhizomatis감초, 광과감초, 창과감초Glycyrrhiza uralensis Fischer, Glycyrrhiza glabra Linn., Glycyrrhiza inflata Batal.뿌리, 뿌리줄기
8KMH11감초밀자<NA>Glycyrrhizae Radix Preparata cum Mel감초, 광과감초, 창과감초Glycyrrhiza uralensis Fischer, Glycyrrhiza glabra Linn., Glycyrrhiza inflata Batal.뿌리, 뿌리줄기
9KMH12감초초<NA>Glycyrrhizae Radix Preparata감초, 광과감초, 창과감초Glycyrrhiza uralensis Fischer, Glycyrrhiza glabra Linn., Glycyrrhiza inflata Batal.뿌리, 뿌리줄기
한약재아이디일반명영문명라틴명기원종목록학명목록약용부위
591KMH858회향FennelFoeniculi Fructus회향Foeniculum vulgare Miller열매
592KMH860후박Magnolia BarkMagnoliae Cortex일본목련, 후박, 요엽후박Magnolia ovobata Thunberg, Magnolia officinalis Rehder et Wilson, Magnolia officinalis Rehder et Wilson var. biloba Rehder et Wilson줄기껍질
593KMH862후추Black pepperPiperis Nigri Fructus후추Piper nigrum Linn.열매
594KMH863후추가루Black Pepper PowderPulvis Piperis Nigri Fructus후추Piper nigrum Linn.열매
595KMH864훤초근<NA>Hemerocallidis Radix et Rhizoma원추리Hemerocallis fulva Linn.뿌리, 뿌리줄기
596KMH865흑두<NA>Glycine Semen NigraGlycine max Merrill
597KMH866흑사당Brown sugarSaccharum Nigrum사탕수수Saccharum sinensis Roxburg조결정체
598KMH867흑지마<NA>Sesami Semen Nigra참깨Sesamum indicum Linn.
599KMH868희렴<NA>Siegesbeckiae Herba털진득찰, 진득찰Siegesbeckia pubescens Makino, Siegesbeckia glabrescens Makino지상부
600KMH869희렴주증<NA>Siegesbeckiae Herba Preparata cum Vinum털진득찰, 진득찰Siegesbeckia pubescens Makino, Siegesbeckia glabrescens Makino지상부