Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 2
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 634.8 KiB
Average record size in memory: 65.0 B
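
The two derived figures in this list can be cross-checked from the raw ones. The sketch below assumes that the average record size is the total memory divided by the number of observations, and that the missing-cell percentage is taken over all cells (variables × observations). The variable names are illustrative and do not come from the report.

```python
# Cross-check the derived dataset statistics from the figures reported above.
n_variables = 7
n_observations = 10_000
missing_cells = 2
total_size_kib = 634.8

# Assumption: average record size = total memory / number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

# Assumption: the percentage is taken over every cell in the table.
missing_pct = 100 * missing_cells / (n_variables * n_observations)

print(f"{avg_record_bytes:.1f} B per record")
print(f"{missing_pct:.4f}% of cells missing")
```

Under these assumptions both results agree with the report: about 65.0 B per record, and a missing share well below the 0.1% display threshold.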

Variable types

Numeric: 1
Text: 6

Dataset

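Description: Information on the books and materials held by the KEPCO (Korea Electric Power Corporation) electronic library. The materials on this list are also available to the general public. (Title, author, publisher, publication year, classification number)
URL: https://www.data.go.kr/data/15069176/fileData.do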

Alerts

연번 has unique values (Unique)
등록번호 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 01:15:57.215001
Analysis finished: 2023-12-12 01:16:00.981566
Duration: 3.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
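
A report of this kind can be regenerated along the following lines. This is a minimal sketch rather than the exact procedure used here: the CSV path, the `cp949` encoding and the report title are assumptions, and the settings stored in config.json are not reproduced. The 10,000 observations may also be a sample of a larger file, which this sketch does not attempt to replicate.

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html",
                 encoding: str = "cp949") -> Path:
    """Profile a CSV file and write the HTML report to out_path."""
    # Imported lazily so the function can be defined without the
    # third-party packages (pandas, ydata-profiling) being installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the source file is a CSV in a Korean legacy encoding.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Default settings; the original config.json is not available here.
    profile = ProfileReport(df, title="KEPCO electronic library holdings")
    profile.to_file(out_path)
    return Path(out_path)
```

Calling `build_report("library.csv")` with a hypothetical local copy of the data would write `report.html` to the working directory.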

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 26801.778
Minimum: 14
Maximum: 53481
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 14
5-th percentile: 2902.7
Q1: 13711.75
Median: 26605
Q3: 39904.5
95-th percentile: 50867.25
Maximum: 53481
Range: 53467
Interquartile range (IQR): 26192.75

Descriptive statistics

Standard deviation: 15265.135
Coefficient of variation (CV): 0.56955682
Kurtosis: -1.1737292
Mean: 26801.778
Median Absolute Deviation (MAD): 13099
Skewness: 0.0099416884
Sum: 2.6801778 × 10^8
Variance: 2.3302436 × 10^8
Monotonicity: Not monotonic
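
Several of the figures above follow directly from others in the same section. The sketch below recomputes them from the reported (rounded) values, so small differences in the last digit are expected. The variable names are illustrative.

```python
# Recompute derived statistics for this variable from the reported values.
minimum, maximum = 14, 53481
q1, q3 = 13711.75, 39904.5
mean, std = 26801.778, 15265.135
n = 10_000

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
total = mean * n                  # Sum
variance = std ** 2               # Variance

print(value_range, iqr)
print(f"CV = {cv:.8f}")
print(f"Sum = {total:.7e}, Variance = {variance:.7e}")
```

The range and IQR match exactly; the CV, sum and variance match the report to within the rounding of the inputs.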
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
634 | 1 | < 0.1%
12120 | 1 | < 0.1%
13339 | 1 | < 0.1%
22062 | 1 | < 0.1%
33941 | 1 | < 0.1%
48031 | 1 | < 0.1%
16738 | 1 | < 0.1%
13745 | 1 | < 0.1%
6019 | 1 | < 0.1%
22363 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%
Value | Count | Frequency (%)
14 | 1 | < 0.1%
34 | 1 | < 0.1%
36 | 1 | < 0.1%
41 | 1 | < 0.1%
43 | 1 | < 0.1%
44 | 1 | < 0.1%
48 | 1 | < 0.1%
68 | 1 | < 0.1%
81 | 1 | < 0.1%
85 | 1 | < 0.1%
Value | Count | Frequency (%)
53481 | 1 | < 0.1%
53476 | 1 | < 0.1%
53464 | 1 | < 0.1%
53458 | 1 | < 0.1%
53455 | 1 | < 0.1%
53447 | 1 | < 0.1%
53444 | 1 | < 0.1%
53442 | 1 | < 0.1%
53429 | 1 | < 0.1%
53428 | 1 | < 0.1%

등록번호
Text

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 11
Median length: 11
Mean length: 11
Min length: 11

Characters and Unicode

Total characters: 110000
Distinct characters: 12
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
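
The category counts in this section can be reproduced in miniature with Python's standard `unicodedata` module. The sketch below uses the five sample values listed for this variable. `unicodedata.category` returns the two-letter abbreviations (`Lu` for Uppercase Letter, `Nd` for Decimal Number) rather than the long names shown in the report.

```python
import unicodedata
from collections import Counter

# The five sample values listed for this variable.
samples = [
    "AA199101833",
    "AA201110376",
    "AA199510840",
    "AA199510711",
    "AA199510951",
]

# Count the Unicode general category of every character.
categories = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)

for name, count in categories.most_common():
    print(name, count)
```

Each sample contributes two uppercase letters and nine decimal digits, which is the same 2:9 split the report shows for the column as a whole.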

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: AA199101833
2nd row: AA201110376
3rd row: AA199510840
4th row: AA199510711
5th row: AA199510951
Value | Count | Frequency (%)
aa199101833 | 1 | < 0.1%
aa200110197 | 1 | < 0.1%
aa201610856 | 1 | < 0.1%
aa199130399 | 1 | < 0.1%
aa200010580 | 1 | < 0.1%
aa200810789 | 1 | < 0.1%
aa201410612 | 1 | < 0.1%
aa201912037 | 1 | < 0.1%
aa200410259 | 1 | < 0.1%
aa199910389 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Most occurring characters

ValueCountFrequency (%)
0 22268
20.2%
A 20000
18.2%
1 19919
18.1%
2 14451
13.1%
9 8076
 
7.3%
6 4738
 
4.3%
4 4654
 
4.2%
5 4231
 
3.8%
3 4177
 
3.8%
7 3910
 
3.6%
Other values (2) 3576
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 89999
81.8%
Uppercase Letter 20001
 
18.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 22268
24.7%
1 19919
22.1%
2 14451
16.1%
9 8076
 
9.0%
6 4738
 
5.3%
4 4654
 
5.2%
5 4231
 
4.7%
3 4177
 
4.6%
7 3910
 
4.3%
8 3575
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
A 20000
> 99.9%
O 1
 
< 0.1%
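
The single uppercase O counted here suggests that one value contains the letter O where the digit 0 was probably intended. This is an inference from the counts; the report does not show the affected value. A minimal sketch for flagging such entries follows, assuming that well-formed values are `AA` followed by nine ASCII digits. The second test value is hypothetical and exists only to exercise the check.

```python
import re

# Assumed format: the prefix "AA" followed by exactly nine ASCII digits.
PATTERN = re.compile(r"AA[0-9]{9}")


def is_well_formed(value: str) -> bool:
    """Return True when the whole value matches the assumed format."""
    return PATTERN.fullmatch(value) is not None


values = [
    "AA199101833",   # sample value from the report
    "AA20O110376",   # hypothetical value with the letter O, not a zero
]

suspect = [v for v in values if not is_well_formed(v)]
print(suspect)
```

Applied to the full column, a filter like this would list the entries that need manual correction.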

Most occurring scripts

ValueCountFrequency (%)
Common 89999
81.8%
Latin 20001
 
18.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 22268
24.7%
1 19919
22.1%
2 14451
16.1%
9 8076
 
9.0%
6 4738
 
5.3%
4 4654
 
5.2%
5 4231
 
4.7%
3 4177
 
4.6%
7 3910
 
4.3%
8 3575
 
4.0%
Latin
ValueCountFrequency (%)
A 20000
> 99.9%
O 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 110000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 22268
20.2%
A 20000
18.2%
1 19919
18.1%
2 14451
13.1%
9 8076
 
7.3%
6 4738
 
4.3%
4 4654
 
4.2%
5 4231
 
3.8%
3 4177
 
3.8%
7 3910
 
3.6%
Other values (2) 3576
 
3.3%

청구기호
Text

Distinct: 9252
Distinct (%): 92.5%
Missing: 2
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length: 27
Median length: 24
Mean length: 12.14823
Min length: 6

Characters and Unicode

Total characters: 121458
Distinct characters: 582
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
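
The mean length reported for this variable is consistent with the total character count divided by the number of non-missing observations. The sketch below assumes that this is how the figure is derived, and that the two missing values are excluded from the denominator.

```python
# Reported figures for this variable.
total_characters = 121_458
n_observations = 10_000
n_missing = 2

# Assumption: missing values are excluded when averaging.
mean_length = total_characters / (n_observations - n_missing)

print(f"{mean_length:.5f}")
```

Dividing by all 10,000 observations instead would give 12.1458, which does not match the report, so the exclusion of missing values appears to be the right reading.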

Unique

Unique: 8831
Unique (%): 88.3%

Sample

1st row: 31:62 U58i
2nd row: 82-3(08) 레31ㅅ V.246
3rd row: 340.13(51) 김14ㄱ
4th row: 347.1(076) 홍53ㅁ
5th row: 802.0 배79ㅇ
ValueCountFrequency (%)
82-31 441
 
2.0%
82-4 438
 
2.0%
82-34 267
 
1.2%
62 266
 
1.2%
c.2 232
 
1.1%
82-311.6 188
 
0.9%
171 188
 
0.9%
658 178
 
0.8%
82-3(08 171
 
0.8%
082.2 144
 
0.7%
Other values (8778) 19541
88.6%

Most occurring characters

ValueCountFrequency (%)
12056
 
9.9%
1 11444
 
9.4%
3 10294
 
8.5%
2 10004
 
8.2%
8 8189
 
6.7%
. 7912
 
6.5%
6 7523
 
6.2%
5 6523
 
5.4%
9 6218
 
5.1%
0 4853
 
4.0%
Other values (572) 36442
30.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 73608
60.6%
Other Letter 17759
 
14.6%
Space Separator 12056
 
9.9%
Other Punctuation 8494
 
7.0%
Dash Punctuation 2471
 
2.0%
Lowercase Letter 2059
 
1.7%
Open Punctuation 1771
 
1.5%
Close Punctuation 1766
 
1.5%
Uppercase Letter 1385
 
1.1%
Math Symbol 89
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1581
 
8.9%
1323
 
7.4%
987
 
5.6%
984
 
5.5%
802
 
4.5%
765
 
4.3%
697
 
3.9%
539
 
3.0%
530
 
3.0%
525
 
3.0%
Other values (500) 9026
50.8%
Lowercase Letter
ValueCountFrequency (%)
v 796
38.7%
c 319
15.5%
e 135
 
6.6%
p 112
 
5.4%
w 78
 
3.8%
i 67
 
3.3%
m 61
 
3.0%
t 60
 
2.9%
s 57
 
2.8%
a 54
 
2.6%
Other values (15) 320
15.5%
Uppercase Letter
ValueCountFrequency (%)
E 173
12.5%
V 126
 
9.1%
C 119
 
8.6%
R 111
 
8.0%
T 93
 
6.7%
I 91
 
6.6%
W 86
 
6.2%
M 69
 
5.0%
O 62
 
4.5%
S 56
 
4.0%
Other values (14) 399
28.8%
Decimal Number
ValueCountFrequency (%)
1 11444
15.5%
3 10294
14.0%
2 10004
13.6%
8 8189
11.1%
6 7523
10.2%
5 6523
8.9%
9 6218
8.4%
0 4853
6.6%
7 4294
 
5.8%
4 4266
 
5.8%
Other Punctuation
ValueCountFrequency (%)
. 7912
93.1%
: 549
 
6.5%
/ 16
 
0.2%
' 11
 
0.1%
, 4
 
< 0.1%
; 1
 
< 0.1%
1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 58
65.2%
+ 31
34.8%
Space Separator
ValueCountFrequency (%)
12056
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2471
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1771
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1766
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 100255
82.5%
Hangul 17759
 
14.6%
Latin 3444
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1581
 
8.9%
1323
 
7.4%
987
 
5.6%
984
 
5.5%
802
 
4.5%
765
 
4.3%
697
 
3.9%
539
 
3.0%
530
 
3.0%
525
 
3.0%
Other values (500) 9026
50.8%
Latin
ValueCountFrequency (%)
v 796
23.1%
c 319
 
9.3%
E 173
 
5.0%
e 135
 
3.9%
V 126
 
3.7%
C 119
 
3.5%
p 112
 
3.3%
R 111
 
3.2%
T 93
 
2.7%
I 91
 
2.6%
Other values (39) 1369
39.8%
Common
ValueCountFrequency (%)
12056
12.0%
1 11444
11.4%
3 10294
10.3%
2 10004
10.0%
8 8189
8.2%
. 7912
7.9%
6 7523
7.5%
5 6523
6.5%
9 6218
6.2%
0 4853
 
4.8%
Other values (13) 15239
15.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 103698
85.4%
Hangul 8950
 
7.4%
Compat Jamo 8809
 
7.3%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12056
11.6%
1 11444
11.0%
3 10294
9.9%
2 10004
9.6%
8 8189
7.9%
. 7912
7.6%
6 7523
7.3%
5 6523
 
6.3%
9 6218
 
6.0%
0 4853
 
4.7%
Other values (61) 18682
18.0%
Compat Jamo
ValueCountFrequency (%)
1581
17.9%
1323
15.0%
987
11.2%
984
11.2%
765
8.7%
539
 
6.1%
530
 
6.0%
525
 
6.0%
523
 
5.9%
266
 
3.0%
Other values (9) 786
8.9%
Hangul
ValueCountFrequency (%)
802
 
9.0%
697
 
7.8%
365
 
4.1%
313
 
3.5%
222
 
2.5%
186
 
2.1%
169
 
1.9%
149
 
1.7%
134
 
1.5%
114
 
1.3%
Other values (481) 5799
64.8%
None
ValueCountFrequency (%)
1
100.0%

서명
Text

Distinct: 9874
Distinct (%): 98.7%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 171
Median length: 131
Mean length: 26.8962
Min length: 1

Characters and Unicode

Total characters: 268962
Distinct characters: 2270
Distinct categories: 18
Distinct scripts: 6
Distinct blocks: 12
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9761
Unique (%): 97.6%

Sample

1st row: Industry and Development Global Report 1988/89
2nd row: 사랑할 때와 죽을 때. V.246 ; 세계문학전집 ; 246
3rd row: 중국 외자기업 세법편람
4th row: 民法 및 民事特別法
5th row: 영어회화 삼국지1
ValueCountFrequency (%)
5621
 
9.2%
1 408
 
0.7%
2 390
 
0.6%
of 338
 
0.6%
and 298
 
0.5%
위한 283
 
0.5%
the 250
 
0.4%
이야기 224
 
0.4%
3 200
 
0.3%
for 149
 
0.2%
Other values (23882) 52894
86.6%

Most occurring characters

ValueCountFrequency (%)
53979
 
20.1%
e 5172
 
1.9%
n 4005
 
1.5%
: 4000
 
1.5%
3970
 
1.5%
o 3851
 
1.4%
i 3843
 
1.4%
a 3479
 
1.3%
t 3450
 
1.3%
r 3125
 
1.2%
Other values (2260) 180088
67.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 132688
49.3%
Space Separator 53979
20.1%
Lowercase Letter 43899
 
16.3%
Uppercase Letter 12688
 
4.7%
Decimal Number 12033
 
4.5%
Other Punctuation 10063
 
3.7%
Close Punctuation 1250
 
0.5%
Open Punctuation 1245
 
0.5%
Dash Punctuation 646
 
0.2%
Math Symbol 350
 
0.1%
Other values (8) 121
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3970
 
3.0%
2782
 
2.1%
2478
 
1.9%
2243
 
1.7%
1883
 
1.4%
1844
 
1.4%
1796
 
1.4%
1752
 
1.3%
1734
 
1.3%
1356
 
1.0%
Other values (2140) 110850
83.5%
Lowercase Letter
ValueCountFrequency (%)
e 5172
11.8%
n 4005
 
9.1%
o 3851
 
8.8%
i 3843
 
8.8%
a 3479
 
7.9%
t 3450
 
7.9%
r 3125
 
7.1%
s 2744
 
6.3%
l 2004
 
4.6%
c 1799
 
4.1%
Other values (16) 10427
23.8%
Uppercase Letter
ValueCountFrequency (%)
E 1223
 
9.6%
S 1111
 
8.8%
T 948
 
7.5%
C 935
 
7.4%
A 865
 
6.8%
P 844
 
6.7%
I 837
 
6.6%
R 704
 
5.5%
O 581
 
4.6%
M 533
 
4.2%
Other values (16) 4107
32.4%
Other Punctuation
ValueCountFrequency (%)
: 4000
39.7%
; 2056
20.4%
. 1954
19.4%
, 980
 
9.7%
' 203
 
2.0%
! 176
 
1.7%
113
 
1.1%
112
 
1.1%
· 109
 
1.1%
93
 
0.9%
Other values (12) 267
 
2.7%
Decimal Number
ValueCountFrequency (%)
1 2791
23.2%
2 2194
18.2%
0 2115
17.6%
3 1014
 
8.4%
9 975
 
8.1%
5 708
 
5.9%
4 672
 
5.6%
8 529
 
4.4%
6 519
 
4.3%
7 516
 
4.3%
Math Symbol
ValueCountFrequency (%)
= 282
80.6%
~ 35
 
10.0%
+ 26
 
7.4%
> 2
 
0.6%
< 2
 
0.6%
1
 
0.3%
| 1
 
0.3%
1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 1218
97.4%
] 20
 
1.6%
5
 
0.4%
} 4
 
0.3%
1
 
0.1%
1
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1217
97.8%
[ 20
 
1.6%
5
 
0.4%
1
 
0.1%
1
 
0.1%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
36
45.0%
22
27.5%
11
 
13.8%
9
 
11.2%
2
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 522
80.8%
124
 
19.2%
Space Separator
ValueCountFrequency (%)
53979
100.0%
Final Punctuation
ValueCountFrequency (%)
13
100.0%
Initial Punctuation
ValueCountFrequency (%)
12
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 11
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
^ 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 123199
45.8%
Common 79607
29.6%
Latin 56667
21.1%
Han 8472
 
3.1%
Katakana 650
 
0.2%
Hiragana 367
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3970
 
3.2%
2782
 
2.3%
2478
 
2.0%
2243
 
1.8%
1883
 
1.5%
1844
 
1.5%
1796
 
1.5%
1752
 
1.4%
1734
 
1.4%
1356
 
1.1%
Other values (1197) 101361
82.3%
Han
ValueCountFrequency (%)
236
 
2.8%
181
 
2.1%
156
 
1.8%
145
 
1.7%
138
 
1.6%
131
 
1.5%
125
 
1.5%
116
 
1.4%
100
 
1.2%
94
 
1.1%
Other values (821) 7050
83.2%
Katakana
ValueCountFrequency (%)
43
 
6.6%
39
 
6.0%
39
 
6.0%
34
 
5.2%
27
 
4.2%
26
 
4.0%
25
 
3.8%
25
 
3.8%
24
 
3.7%
23
 
3.5%
Other values (55) 345
53.1%
Common
ValueCountFrequency (%)
53979
67.8%
: 4000
 
5.0%
1 2791
 
3.5%
2 2194
 
2.8%
0 2115
 
2.7%
; 2056
 
2.6%
. 1954
 
2.5%
) 1218
 
1.5%
( 1217
 
1.5%
3 1014
 
1.3%
Other values (53) 7069
 
8.9%
Latin
ValueCountFrequency (%)
e 5172
 
9.1%
n 4005
 
7.1%
o 3851
 
6.8%
i 3843
 
6.8%
a 3479
 
6.1%
t 3450
 
6.1%
r 3125
 
5.5%
s 2744
 
4.8%
l 2004
 
3.5%
c 1799
 
3.2%
Other values (47) 23195
40.9%
Hiragana
ValueCountFrequency (%)
133
36.2%
40
 
10.9%
16
 
4.4%
16
 
4.4%
11
 
3.0%
10
 
2.7%
9
 
2.5%
9
 
2.5%
8
 
2.2%
8
 
2.2%
Other values (37) 107
29.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 135515
50.4%
Hangul 123174
45.8%
CJK 8260
 
3.1%
None 652
 
0.2%
Katakana 650
 
0.2%
Hiragana 367
 
0.1%
CJK Compat Ideographs 212
 
0.1%
Number Forms 80
 
< 0.1%
Punctuation 25
 
< 0.1%
Compat Jamo 25
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
53979
39.8%
e 5172
 
3.8%
n 4005
 
3.0%
: 4000
 
3.0%
o 3851
 
2.8%
i 3843
 
2.8%
a 3479
 
2.6%
t 3450
 
2.5%
r 3125
 
2.3%
1 2791
 
2.1%
Other values (79) 47820
35.3%
Hangul
ValueCountFrequency (%)
3970
 
3.2%
2782
 
2.3%
2478
 
2.0%
2243
 
1.8%
1883
 
1.5%
1844
 
1.5%
1796
 
1.5%
1752
 
1.4%
1734
 
1.4%
1356
 
1.1%
Other values (1188) 101336
82.3%
CJK
ValueCountFrequency (%)
236
 
2.9%
181
 
2.2%
156
 
1.9%
145
 
1.8%
138
 
1.7%
131
 
1.6%
125
 
1.5%
116
 
1.4%
100
 
1.2%
94
 
1.1%
Other values (784) 6838
82.8%
Hiragana
ValueCountFrequency (%)
133
36.2%
40
 
10.9%
16
 
4.4%
16
 
4.4%
11
 
3.0%
10
 
2.7%
9
 
2.5%
9
 
2.5%
8
 
2.2%
8
 
2.2%
Other values (37) 107
29.2%
None
ValueCountFrequency (%)
124
19.0%
113
17.3%
112
17.2%
· 109
16.7%
93
14.3%
41
 
6.3%
13
 
2.0%
13
 
2.0%
9
 
1.4%
5
 
0.8%
Other values (12) 20
 
3.1%
CJK Compat Ideographs
ValueCountFrequency (%)
50
23.6%
28
13.2%
19
 
9.0%
18
 
8.5%
14
 
6.6%
11
 
5.2%
8
 
3.8%
5
 
2.4%
5
 
2.4%
5
 
2.4%
Other values (27) 49
23.1%
Katakana
ValueCountFrequency (%)
43
 
6.6%
39
 
6.0%
39
 
6.0%
34
 
5.2%
27
 
4.2%
26
 
4.0%
25
 
3.8%
25
 
3.8%
24
 
3.7%
23
 
3.5%
Other values (55) 345
53.1%
Number Forms
ValueCountFrequency (%)
36
45.0%
22
27.5%
11
 
13.8%
9
 
11.2%
2
 
2.5%
Punctuation
ValueCountFrequency (%)
13
52.0%
12
48.0%
Compat Jamo
ValueCountFrequency (%)
9
36.0%
6
24.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct: 7940
Distinct (%): 79.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 94
Median length: 70
Mean length: 11.5517
Min length: 2

Characters and Unicode

Total characters: 115517
Distinct characters: 1703
Distinct categories: 12
Distinct scripts: 6
Distinct blocks: 10
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7057
Unique (%): 70.6%

Sample

1st row: United Nations Industrial Development Organization
2nd row: 에리히 마리아,레마르크 ; 장희창
3rd row: 김경직
4th row: 洪性徹
5th row: 배진용
ValueCountFrequency (%)
4040
 
13.4%
지음 3459
 
11.4%
옮김 1253
 
4.1%
그림 241
 
0.8%
institute 226
 
0.7%
research 212
 
0.7%
electric 209
 
0.7%
power 207
 
0.7%
189
 
0.6%
159
 
0.5%
Other values (11523) 20044
66.3%

Most occurring characters

ValueCountFrequency (%)
21478
 
18.6%
; 4112
 
3.6%
4059
 
3.5%
3672
 
3.2%
2731
 
2.4%
e 2086
 
1.8%
2043
 
1.8%
t 1444
 
1.3%
r 1413
 
1.2%
i 1297
 
1.1%
Other values (1693) 71182
61.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67182
58.2%
Space Separator 21478
 
18.6%
Lowercase Letter 15113
 
13.1%
Other Punctuation 5733
 
5.0%
Uppercase Letter 5310
 
4.6%
Open Punctuation 278
 
0.2%
Close Punctuation 277
 
0.2%
Dash Punctuation 76
 
0.1%
Decimal Number 51
 
< 0.1%
Math Symbol 17
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4059
 
6.0%
3672
 
5.5%
2731
 
4.1%
2043
 
3.0%
1292
 
1.9%
1124
 
1.7%
939
 
1.4%
815
 
1.2%
767
 
1.1%
608
 
0.9%
Other values (1607) 49132
73.1%
Lowercase Letter
ValueCountFrequency (%)
e 2086
13.8%
t 1444
9.6%
r 1413
9.3%
i 1297
8.6%
n 1265
8.4%
a 1097
 
7.3%
o 1043
 
6.9%
c 953
 
6.3%
s 867
 
5.7%
l 728
 
4.8%
Other values (16) 2920
19.3%
Uppercase Letter
ValueCountFrequency (%)
E 668
12.6%
I 539
 
10.2%
R 514
 
9.7%
A 415
 
7.8%
P 398
 
7.5%
S 292
 
5.5%
C 287
 
5.4%
D 209
 
3.9%
M 203
 
3.8%
O 189
 
3.6%
Other values (16) 1596
30.1%
Other Punctuation
ValueCountFrequency (%)
; 4112
71.7%
, 768
 
13.4%
. 635
 
11.1%
· 118
 
2.1%
/ 45
 
0.8%
& 23
 
0.4%
10
 
0.2%
' 9
 
0.2%
8
 
0.1%
" 3
 
0.1%
Other values (2) 2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 11
21.6%
1 10
19.6%
2 8
15.7%
6 7
13.7%
3 6
11.8%
5 5
9.8%
4 1
 
2.0%
9 1
 
2.0%
7 1
 
2.0%
8 1
 
2.0%
Math Symbol
ValueCountFrequency (%)
> 8
47.1%
< 8
47.1%
| 1
 
5.9%
Close Punctuation
ValueCountFrequency (%)
] 217
78.3%
) 60
 
21.7%
Open Punctuation
ValueCountFrequency (%)
[ 217
78.1%
( 61
 
21.9%
Dash Punctuation
ValueCountFrequency (%)
38
50.0%
- 38
50.0%
Space Separator
ValueCountFrequency (%)
21478
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 61813
53.5%
Common 27911
24.2%
Latin 20424
 
17.7%
Han 5099
 
4.4%
Katakana 251
 
0.2%
Hiragana 19
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4059
 
6.6%
3672
 
5.9%
2731
 
4.4%
2043
 
3.3%
1292
 
2.1%
1124
 
1.8%
939
 
1.5%
815
 
1.3%
767
 
1.2%
608
 
1.0%
Other values (822) 43763
70.8%
Han
ValueCountFrequency (%)
234
 
4.6%
128
 
2.5%
116
 
2.3%
99
 
1.9%
95
 
1.9%
80
 
1.6%
79
 
1.5%
76
 
1.5%
72
 
1.4%
68
 
1.3%
Other values (717) 4052
79.5%
Latin
ValueCountFrequency (%)
e 2086
 
10.2%
t 1444
 
7.1%
r 1413
 
6.9%
i 1297
 
6.4%
n 1265
 
6.2%
a 1097
 
5.4%
o 1043
 
5.1%
c 953
 
4.7%
s 867
 
4.2%
l 728
 
3.6%
Other values (43) 8231
40.3%
Katakana
ValueCountFrequency (%)
31
12.4%
28
11.2%
26
 
10.4%
26
 
10.4%
18
 
7.2%
15
 
6.0%
12
 
4.8%
11
 
4.4%
10
 
4.0%
9
 
3.6%
Other values (33) 65
25.9%
Common
ValueCountFrequency (%)
21478
77.0%
; 4112
 
14.7%
, 768
 
2.8%
. 635
 
2.3%
] 217
 
0.8%
[ 217
 
0.8%
· 118
 
0.4%
( 61
 
0.2%
) 60
 
0.2%
/ 45
 
0.2%
Other values (23) 200
 
0.7%
Hiragana
ValueCountFrequency (%)
4
21.1%
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (5) 5
26.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 61812
53.5%
ASCII 48159
41.7%
CJK 4914
 
4.3%
Katakana 251
 
0.2%
CJK Compat Ideographs 185
 
0.2%
None 174
 
0.2%
Hiragana 19
 
< 0.1%
Number Forms 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
21478
44.6%
; 4112
 
8.5%
e 2086
 
4.3%
t 1444
 
3.0%
r 1413
 
2.9%
i 1297
 
2.7%
n 1265
 
2.6%
a 1097
 
2.3%
o 1043
 
2.2%
c 953
 
2.0%
Other values (70) 11971
24.9%
Hangul
ValueCountFrequency (%)
4059
 
6.6%
3672
 
5.9%
2731
 
4.4%
2043
 
3.3%
1292
 
2.1%
1124
 
1.8%
939
 
1.5%
815
 
1.3%
767
 
1.2%
608
 
1.0%
Other values (821) 43762
70.8%
CJK
ValueCountFrequency (%)
234
 
4.8%
128
 
2.6%
116
 
2.4%
99
 
2.0%
95
 
1.9%
79
 
1.6%
76
 
1.5%
72
 
1.5%
68
 
1.4%
68
 
1.4%
Other values (684) 3879
78.9%
None
ValueCountFrequency (%)
· 118
67.8%
38
 
21.8%
10
 
5.7%
8
 
4.6%
CJK Compat Ideographs
ValueCountFrequency (%)
80
43.2%
15
 
8.1%
14
 
7.6%
7
 
3.8%
6
 
3.2%
6
 
3.2%
5
 
2.7%
5
 
2.7%
5
 
2.7%
4
 
2.2%
Other values (23) 38
20.5%
Katakana
ValueCountFrequency (%)
31
12.4%
28
11.2%
26
 
10.4%
26
 
10.4%
18
 
7.2%
15
 
6.0%
12
 
4.8%
11
 
4.4%
10
 
4.0%
9
 
3.6%
Other values (33) 65
25.9%
Hiragana
ValueCountFrequency (%)
4
21.1%
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (5) 5
26.3%
Number Forms
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%

출판사
Text

Distinct: 3246
Distinct (%): 32.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 60
Median length: 53
Mean length: 5.6661
Min length: 1

Characters and Unicode

Total characters: 56661
Distinct characters: 1106
Distinct categories: 11
Distinct scripts: 6
Distinct blocks: 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1979
Unique (%): 19.8%

Sample

1st row: UNIDO
2nd row: 민음사
3rd row: 대외투자개발원
4th row: 傳文閣
5th row: 도솔
ValueCountFrequency (%)
epri 248
 
2.2%
민음사 199
 
1.8%
김영사 133
 
1.2%
문학동네 129
 
1.1%
113
 
1.0%
에너지경제연구원 103
 
0.9%
위즈덤하우스 94
 
0.8%
21세기북스 87
 
0.8%
살림 76
 
0.7%
시공사 66
 
0.6%
Other values (3442) 10116
89.0%

Most occurring characters

ValueCountFrequency (%)
1786
 
3.2%
1502
 
2.7%
1430
 
2.5%
e 876
 
1.5%
860
 
1.5%
o 849
 
1.5%
802
 
1.4%
787
 
1.4%
n 783
 
1.4%
i 747
 
1.3%
Other values (1096) 46239
81.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40737
71.9%
Lowercase Letter 8269
 
14.6%
Uppercase Letter 4530
 
8.0%
Space Separator 1502
 
2.7%
Open Punctuation 412
 
0.7%
Close Punctuation 411
 
0.7%
Other Punctuation 410
 
0.7%
Decimal Number 285
 
0.5%
Dash Punctuation 100
 
0.2%
Other Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1786
 
4.4%
1430
 
3.5%
860
 
2.1%
802
 
2.0%
787
 
1.9%
743
 
1.8%
589
 
1.4%
587
 
1.4%
563
 
1.4%
496
 
1.2%
Other values (1015) 32094
78.8%
Lowercase Letter
ValueCountFrequency (%)
e 876
10.6%
o 849
10.3%
n 783
9.5%
i 747
9.0%
s 655
 
7.9%
r 636
 
7.7%
a 564
 
6.8%
t 481
 
5.8%
l 460
 
5.6%
c 383
 
4.6%
Other values (16) 1835
22.2%
Uppercase Letter
ValueCountFrequency (%)
E 568
12.5%
I 541
11.9%
P 481
10.6%
R 426
 
9.4%
C 272
 
6.0%
A 263
 
5.8%
B 237
 
5.2%
O 219
 
4.8%
S 194
 
4.3%
M 175
 
3.9%
Other values (15) 1154
25.5%
Other Punctuation
ValueCountFrequency (%)
: 144
35.1%
& 94
22.9%
, 56
 
13.7%
. 51
 
12.4%
/ 31
 
7.6%
9
 
2.2%
' 5
 
1.2%
# 4
 
1.0%
3
 
0.7%
; 3
 
0.7%
Other values (5) 10
 
2.4%
Decimal Number
ValueCountFrequency (%)
2 138
48.4%
1 125
43.9%
0 9
 
3.2%
6 6
 
2.1%
3 4
 
1.4%
5 1
 
0.4%
9 1
 
0.4%
8 1
 
0.4%
Dash Punctuation
ValueCountFrequency (%)
- 66
66.0%
34
34.0%
Space Separator
ValueCountFrequency (%)
1502
100.0%
Open Punctuation
ValueCountFrequency (%)
( 412
100.0%
Close Punctuation
ValueCountFrequency (%)
) 411
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 36746
64.9%
Latin 12799
 
22.6%
Han 3797
 
6.7%
Common 3121
 
5.5%
Katakana 189
 
0.3%
Hiragana 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1786
 
4.9%
1430
 
3.9%
860
 
2.3%
802
 
2.2%
787
 
2.1%
743
 
2.0%
589
 
1.6%
587
 
1.6%
563
 
1.5%
496
 
1.3%
Other values (605) 28103
76.5%
Han
ValueCountFrequency (%)
317
 
8.3%
140
 
3.7%
130
 
3.4%
123
 
3.2%
116
 
3.1%
96
 
2.5%
95
 
2.5%
84
 
2.2%
74
 
1.9%
74
 
1.9%
Other values (347) 2548
67.1%
Latin
ValueCountFrequency (%)
e 876
 
6.8%
o 849
 
6.6%
n 783
 
6.1%
i 747
 
5.8%
s 655
 
5.1%
r 636
 
5.0%
E 568
 
4.4%
a 564
 
4.4%
I 541
 
4.2%
P 481
 
3.8%
Other values (41) 6099
47.7%
Katakana
ValueCountFrequency (%)
15
 
7.9%
14
 
7.4%
14
 
7.4%
13
 
6.9%
13
 
6.9%
12
 
6.3%
11
 
5.8%
10
 
5.3%
9
 
4.8%
8
 
4.2%
Other values (36) 70
37.0%
Common
ValueCountFrequency (%)
1502
48.1%
( 412
 
13.2%
) 411
 
13.2%
: 144
 
4.6%
2 138
 
4.4%
1 125
 
4.0%
& 94
 
3.0%
- 66
 
2.1%
, 56
 
1.8%
. 51
 
1.6%
Other values (19) 122
 
3.9%
Hiragana
ValueCountFrequency (%)
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 36742
64.8%
ASCII 15867
28.0%
CJK 3756
 
6.6%
Katakana 189
 
0.3%
None 57
 
0.1%
CJK Compat Ideographs 41
 
0.1%
Hiragana 9
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1786
 
4.9%
1430
 
3.9%
860
 
2.3%
802
 
2.2%
787
 
2.1%
743
 
2.0%
589
 
1.6%
587
 
1.6%
563
 
1.5%
496
 
1.3%
Other values (604) 28099
76.5%
ASCII
ValueCountFrequency (%)
1502
 
9.5%
e 876
 
5.5%
o 849
 
5.4%
n 783
 
4.9%
i 747
 
4.7%
s 655
 
4.1%
r 636
 
4.0%
E 568
 
3.6%
a 564
 
3.6%
I 541
 
3.4%
Other values (63) 8146
51.3%
CJK
ValueCountFrequency (%)
317
 
8.4%
140
 
3.7%
130
 
3.5%
123
 
3.3%
116
 
3.1%
96
 
2.6%
95
 
2.5%
84
 
2.2%
74
 
2.0%
74
 
2.0%
Other values (334) 2507
66.7%
None
ValueCountFrequency (%)
34
59.6%
9
 
15.8%
4
 
7.0%
3
 
5.3%
2
 
3.5%
· 2
 
3.5%
2
 
3.5%
1
 
1.8%
Katakana
ValueCountFrequency (%)
15
 
7.9%
14
 
7.4%
14
 
7.4%
13
 
6.9%
13
 
6.9%
12
 
6.3%
11
 
5.8%
10
 
5.3%
9
 
4.8%
8
 
4.2%
Other values (36) 70
37.0%
CJK Compat Ideographs
ValueCountFrequency (%)
10
24.4%
6
14.6%
6
14.6%
5
12.2%
4
 
9.8%
3
 
7.3%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (3) 3
 
7.3%
Hiragana
ValueCountFrequency (%)
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

출판년
Text

Distinct: 69
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 4
Median length: 4
Mean length: 3.993
Min length: 2

Characters and Unicode

Total characters: 39930
Distinct characters: 13
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7
Unique (%): 0.1%

Sample

1st row: 1988
2nd row: 2010
3rd row: 1995
4th row: 1995
5th row: 1995
Value | Count | Frequency (%)
2017 | 541 | 5.4%
2015 | 499 | 5.0%
2014 | 489 | 4.9%
2016 | 479 | 4.8%
2008 | 470 | 4.7%
2013 | 458 | 4.6%
2012 | 396 | 4.0%
2011 | 386 | 3.9%
2007 | 378 | 3.8%
2018 | 339 | 3.4%
Other values (59) | 5565 | 55.6%

Most occurring characters

ValueCountFrequency (%)
0 11074
27.7%
2 8953
22.4%
1 7554
18.9%
9 4989
12.5%
8 1641
 
4.1%
7 1373
 
3.4%
4 1087
 
2.7%
5 1080
 
2.7%
6 1048
 
2.6%
3 948
 
2.4%
Other values (3) 183
 
0.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 39747
99.5%
Other Letter 183
 
0.5%
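
The small share of Other Letter characters is presumably why this column is typed as Text rather than as a number. This is an inference, since the report does not show the non-numeric entries. A minimal sketch for separating parseable years from everything else before numeric analysis follows. The string `"unknown"` is a stand-in for the Hangul entries, whose actual text does not appear in the report.

```python
def split_years(values):
    """Split strings into integer years and entries that are not plain digits."""
    numeric, other = [], []
    for value in values:
        text = value.strip()
        # Accept only ASCII digit strings; everything else is set aside.
        if text.isascii() and text.isdigit():
            numeric.append(int(text))
        else:
            other.append(value)
    return numeric, other


# Three sample values from the report plus one hypothetical placeholder.
years, leftovers = split_years(["1988", "2010", "1995", "unknown"])
print(years)
print(leftovers)
```

Keeping the rejected entries in a separate list makes it easy to inspect them before deciding whether to treat them as missing.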

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 11074
27.9%
2 8953
22.5%
1 7554
19.0%
9 4989
12.6%
8 1641
 
4.1%
7 1373
 
3.5%
4 1087
 
2.7%
5 1080
 
2.7%
6 1048
 
2.6%
3 948
 
2.4%
Other Letter
ValueCountFrequency (%)
61
33.3%
61
33.3%
61
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common 39747
99.5%
Hangul 183
 
0.5%

Most frequent character per script

Common
ValueCountFrequency (%)
0 11074
27.9%
2 8953
22.5%
1 7554
19.0%
9 4989
12.6%
8 1641
 
4.1%
7 1373
 
3.5%
4 1087
 
2.7%
5 1080
 
2.7%
6 1048
 
2.6%
3 948
 
2.4%
Hangul
ValueCountFrequency (%)
61
33.3%
61
33.3%
61
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 39747
99.5%
Hangul 183
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11074
27.9%
2 8953
22.5%
1 7554
19.0%
9 4989
12.6%
8 1641
 
4.1%
7 1373
 
3.5%
4 1087
 
2.7%
5 1080
 
2.7%
6 1048
 
2.6%
3 948
 
2.4%
Hangul
ValueCountFrequency (%)
61
33.3%
61
33.3%
61
33.3%

Interactions


Correlations

 | 연번 | 출판년
연번 | 1.000 | 0.957
출판년 | 0.957 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

index | 연번 | 등록번호 | 청구기호 | 서명 | 저자 | 출판사 | 출판년
49420 | 634 | AA199101833 | 31:62 U58i | Industry and Development Global Report 1988/89 | United Nations Industrial Development Organization | UNIDO | 1988
17818 | 26034 | AA201110376 | 82-3(08) 레31ㅅ V.246 | 사랑할 때와 죽을 때. V.246 ; 세계문학전집 ; 246 | 에리히 마리아,레마르크 ; 장희창 | 민음사 | 2010
50448 | 7265 | AA199510840 | 340.13(51) 김14ㄱ | 중국 외자기업 세법편람 | 김경직 | 대외투자개발원 | 1995
50339 | 7167 | AA199510711 | 347.1(076) 홍53ㅁ | 民法 및 民事特別法 | 洪性徹 | 傳文閣 | 1995
50562 | 7368 | AA199510951 | 802.0 배79ㅇ | 영어회화 삼국지1 | 배진용 | 도솔 | 1995
41080 | 46971 | AA201910976 | 82-311.9 윌239ㅇ 2 | 올클리어 2 | 코니 윌리스 지음 | 아작 | 2019
20404 | 28362 | AA201310519 | 620.9 에213ㅅ | 세계 에너지시장 인사이트 2012 : 국가별 정책 및 시장동향 | 에너지경제연구원 | 에너지경제연구원 | 2012
8162 | 17344 | AA200410885 | 624.131 김51ㅌ | 토질역학 | 김상규 | 청문각 | 2004
368 | 10329 | AA199710129 | 951.9:32 서67ㅎ | 한국역사와 개혁정치 | 서울대학교 사회발전연구소 | 서울대학교 사회발전연구소 | 1997
23520 | 31166 | AA201402532 | 082.2 빛11ㄷ v.121 | 동신당 ; 빛깔있는 책들 ; 121 | 김태곤 글·사진 | 대원사 | 2003
index | 연번 | 등록번호 | 청구기호 | 서명 | 저자 | 출판사 | 출판년
49026 | 5986 | AA199410719 | 72.021.2 집37ㄱ | 建築設計資料集成;9.地域 | 集文社 | 集文社 | 1993
7951 | 17154 | AA200410690 | 659.3 브232ㅁ | 미디어랩;MIT에서 미래만들기 | 스튜어트 브랜드 ; 김창현 공역 | 한울 아카데미 | 2004
11994 | 20793 | AA200711064 | 908(08) 드233ㅋ V.41 | 큐리어스시리즈 41 : 인도네시아 | 캐시 드레인 ; 바버라 홀 ; 박영원 | 휘슬러 | 2005
43598 | 49237 | AA202010833 | 5(08) 곰225ㄴ v.48 | 내일은 실험왕 48 : 방사능 물질 | 스토리 a. 지음 | 아이세움 | 2019
48598 | 560 | AA199101687 | 336.76 재228ㄷ | 大和證券 | 紊藤裕 | かんき出版 | 1984
8739 | 17864 | AA200510177 | 31:628(058) 환14ㅎ | 환경통계연감 2004 | 환경부 | 환경부 | 2004
30477 | 37428 | AA201560085 | 551 D711e V.1 | Eyewonder 1 : Earth by Penelope York. V.1 ; DK Eyewonder | Dorling Kindersley Limited | DorlingKindersleyLimited | 2004
23919 | 31525 | AA201402892 | 082.2 살239ㅅ v.93 | 한국의 연출가들 ; 살림지식총서 ; 93 | 김남석 지음 | 살림 | 2010
6079 | 1547 | AA199103197 | 5/6(038) D133l | Longman Dictionary of Scientific Usage | John Daintith | Longman Group Ltd | 1979
47406 | 52664 | AA202210280 | 809.51 제69ㅁ 6 | 맛있는 중국어 Level 6 중국통 : 최신 개정 | JRC 중국어연구소 지음 | 맛있는Books(JRC북스) | 2021