Overview

Dataset statistics

Number of variables7
Number of observations1977
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory112.1 KiB
Average record size in memory58.1 B

Variable types

Numeric2
Text5

Dataset

Description인천소방학교에서 보유하고 있는 도서 현황으로 나무, 아름다운 소풍길, 파피용, 마음을 다스리는 기술, 회사가 당신에게 알려주지 않는 50가지 비밀 등을 포함하는 도서 목록 입니다.
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15105749&srcSe=7661IVAWM27C61E190

Alerts

분류기호 is highly skewed (γ1 = 38.84499424)Skewed
번호 has unique valuesUnique
등록번호 has unique valuesUnique

Reproduction

Analysis started2024-01-28 13:55:29.052821
Analysis finished2024-01-28 13:55:30.900474
Duration1.85 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct1977
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean989
Minimum1
Maximum1977
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.5 KiB
2024-01-28T22:55:30.966734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile99.8
Q1495
median989
Q31483
95-th percentile1878.2
Maximum1977
Range1976
Interquartile range (IQR)988

Descriptive statistics

Standard deviation570.85506
Coefficient of variation (CV)0.57720431
Kurtosis-1.2
Mean989
Median Absolute Deviation (MAD)494
Skewness0
Sum1955253
Variance325875.5
MonotonicityStrictly increasing
2024-01-28T22:55:31.083744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1329 1
 
0.1%
1327 1
 
0.1%
1326 1
 
0.1%
1325 1
 
0.1%
1324 1
 
0.1%
1323 1
 
0.1%
1322 1
 
0.1%
1321 1
 
0.1%
1320 1
 
0.1%
Other values (1967) 1967
99.5%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1977 1
0.1%
1976 1
0.1%
1975 1
0.1%
1974 1
0.1%
1973 1
0.1%
1972 1
0.1%
1971 1
0.1%
1970 1
0.1%
1969 1
0.1%
1968 1
0.1%

등록번호
Text

UNIQUE 

Distinct1977
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
2024-01-28T22:55:31.273812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters19770
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1977 ?
Unique (%)100.0%

Sample

1st rowEM00000001
2nd rowEM00000002
3rd rowEM00000003
4th rowEM00000004
5th rowEM00000005
ValueCountFrequency (%)
em00000001 1
 
0.1%
sa00000070 1
 
0.1%
sa00000068 1
 
0.1%
sa00000067 1
 
0.1%
sa00000066 1
 
0.1%
sa00000065 1
 
0.1%
sa00000064 1
 
0.1%
sa00000063 1
 
0.1%
sa00000062 1
 
0.1%
sa00000061 1
 
0.1%
Other values (1967) 1967
99.5%
2024-01-28T22:55:31.543737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10326
52.2%
E 1258
 
6.4%
M 1258
 
6.4%
1 966
 
4.9%
S 719
 
3.6%
A 719
 
3.6%
2 662
 
3.3%
3 598
 
3.0%
4 598
 
3.0%
5 597
 
3.0%
Other values (4) 2069
 
10.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 15816
80.0%
Uppercase Letter 3954
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10326
65.3%
1 966
 
6.1%
2 662
 
4.2%
3 598
 
3.8%
4 598
 
3.8%
5 597
 
3.8%
6 587
 
3.7%
7 508
 
3.2%
8 488
 
3.1%
9 486
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
E 1258
31.8%
M 1258
31.8%
S 719
18.2%
A 719
18.2%

Most occurring scripts

ValueCountFrequency (%)
Common 15816
80.0%
Latin 3954
 
20.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10326
65.3%
1 966
 
6.1%
2 662
 
4.2%
3 598
 
3.8%
4 598
 
3.8%
5 597
 
3.8%
6 587
 
3.7%
7 508
 
3.2%
8 488
 
3.1%
9 486
 
3.1%
Latin
ValueCountFrequency (%)
E 1258
31.8%
M 1258
31.8%
S 719
18.2%
A 719
18.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 19770
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10326
52.2%
E 1258
 
6.4%
M 1258
 
6.4%
1 966
 
4.9%
S 719
 
3.6%
A 719
 
3.6%
2 662
 
3.3%
3 598
 
3.0%
4 598
 
3.0%
5 597
 
3.0%
Other values (4) 2069
 
10.5%

서명
Text

Distinct1766
Distinct (%)89.3%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
2024-01-28T22:55:31.798094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length150
Median length65
Mean length15.144664
Min length1

Characters and Unicode

Total characters29941
Distinct characters1017
Distinct categories12 ?
Distinct scripts6 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1645 ?
Unique (%)83.2%

Sample

1st row나무
2nd row아름다운 소풍길
3rd row회사가 당신에게 알려주지 않는 50가지 비밀
4th row삼성을 생각한다
5th row쉼터
ValueCountFrequency (%)
348
 
4.9%
필기 49
 
0.7%
실기 48
 
0.7%
이야기 42
 
0.6%
2 41
 
0.6%
1 39
 
0.5%
신의 38
 
0.5%
위한 34
 
0.5%
26
 
0.4%
2018 25
 
0.4%
Other values (3839) 6424
90.3%
2024-01-28T22:55:32.195149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5632
 
18.8%
615
 
2.1%
538
 
1.8%
459
 
1.5%
( 427
 
1.4%
) 427
 
1.4%
2 426
 
1.4%
423
 
1.4%
350
 
1.2%
0 339
 
1.1%
Other values (1007) 20305
67.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18954
63.3%
Space Separator 5632
 
18.8%
Lowercase Letter 1901
 
6.3%
Decimal Number 1526
 
5.1%
Other Punctuation 613
 
2.0%
Open Punctuation 429
 
1.4%
Close Punctuation 429
 
1.4%
Uppercase Letter 370
 
1.2%
Math Symbol 43
 
0.1%
Dash Punctuation 34
 
0.1%
Other values (2) 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
615
 
3.2%
538
 
2.8%
459
 
2.4%
423
 
2.2%
350
 
1.8%
263
 
1.4%
246
 
1.3%
237
 
1.3%
229
 
1.2%
209
 
1.1%
Other values (920) 15385
81.2%
Uppercase Letter
ValueCountFrequency (%)
S 41
 
11.1%
P 34
 
9.2%
E 32
 
8.6%
H 28
 
7.6%
C 26
 
7.0%
F 20
 
5.4%
M 19
 
5.1%
O 18
 
4.9%
A 17
 
4.6%
T 17
 
4.6%
Other values (16) 118
31.9%
Lowercase Letter
ValueCountFrequency (%)
o 212
11.2%
e 192
10.1%
n 170
 
8.9%
a 165
 
8.7%
i 164
 
8.6%
t 163
 
8.6%
r 144
 
7.6%
s 109
 
5.7%
d 75
 
3.9%
h 65
 
3.4%
Other values (15) 442
23.3%
Other Punctuation
ValueCountFrequency (%)
: 305
49.8%
, 138
22.5%
. 56
 
9.1%
· 46
 
7.5%
! 35
 
5.7%
' 11
 
1.8%
& 10
 
1.6%
/ 4
 
0.7%
; 4
 
0.7%
1
 
0.2%
Other values (3) 3
 
0.5%
Decimal Number
ValueCountFrequency (%)
2 426
27.9%
0 339
22.2%
1 324
21.2%
3 103
 
6.7%
8 68
 
4.5%
4 66
 
4.3%
5 60
 
3.9%
9 58
 
3.8%
7 43
 
2.8%
6 39
 
2.6%
Letter Number
ValueCountFrequency (%)
3
37.5%
3
37.5%
1
 
12.5%
1
 
12.5%
Open Punctuation
ValueCountFrequency (%)
( 427
99.5%
2
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 427
99.5%
2
 
0.5%
Math Symbol
ValueCountFrequency (%)
= 40
93.0%
+ 3
 
7.0%
Space Separator
ValueCountFrequency (%)
5632
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 34
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18683
62.4%
Common 8708
29.1%
Latin 2279
 
7.6%
Han 195
 
0.7%
Hiragana 46
 
0.2%
Katakana 30
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
615
 
3.3%
538
 
2.9%
459
 
2.5%
423
 
2.3%
350
 
1.9%
263
 
1.4%
246
 
1.3%
237
 
1.3%
229
 
1.2%
209
 
1.1%
Other values (788) 15114
80.9%
Han
ValueCountFrequency (%)
14
 
7.2%
12
 
6.2%
11
 
5.6%
10
 
5.1%
10
 
5.1%
10
 
5.1%
10
 
5.1%
8
 
4.1%
5
 
2.6%
5
 
2.6%
Other values (79) 100
51.3%
Latin
ValueCountFrequency (%)
o 212
 
9.3%
e 192
 
8.4%
n 170
 
7.5%
a 165
 
7.2%
i 164
 
7.2%
t 163
 
7.2%
r 144
 
6.3%
s 109
 
4.8%
d 75
 
3.3%
h 65
 
2.9%
Other values (45) 820
36.0%
Common
ValueCountFrequency (%)
5632
64.7%
( 427
 
4.9%
) 427
 
4.9%
2 426
 
4.9%
0 339
 
3.9%
1 324
 
3.7%
: 305
 
3.5%
, 138
 
1.6%
3 103
 
1.2%
8 68
 
0.8%
Other values (22) 519
 
6.0%
Katakana
ValueCountFrequency (%)
3
 
10.0%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (12) 12
40.0%
Hiragana
ValueCountFrequency (%)
10
21.7%
6
13.0%
6
13.0%
4
 
8.7%
2
 
4.3%
2
 
4.3%
2
 
4.3%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Other values (11) 11
23.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18682
62.4%
ASCII 10924
36.5%
CJK 195
 
0.7%
None 53
 
0.2%
Hiragana 46
 
0.2%
Katakana 30
 
0.1%
Number Forms 8
 
< 0.1%
Box Drawing 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5632
51.6%
( 427
 
3.9%
) 427
 
3.9%
2 426
 
3.9%
0 339
 
3.1%
1 324
 
3.0%
: 305
 
2.8%
o 212
 
1.9%
e 192
 
1.8%
n 170
 
1.6%
Other values (66) 2470
22.6%
Hangul
ValueCountFrequency (%)
615
 
3.3%
538
 
2.9%
459
 
2.5%
423
 
2.3%
350
 
1.9%
263
 
1.4%
246
 
1.3%
237
 
1.3%
229
 
1.2%
209
 
1.1%
Other values (787) 15113
80.9%
None
ValueCountFrequency (%)
· 46
86.8%
2
 
3.8%
2
 
3.8%
1
 
1.9%
1
 
1.9%
1
 
1.9%
CJK
ValueCountFrequency (%)
14
 
7.2%
12
 
6.2%
11
 
5.6%
10
 
5.1%
10
 
5.1%
10
 
5.1%
10
 
5.1%
8
 
4.1%
5
 
2.6%
5
 
2.6%
Other values (79) 100
51.3%
Hiragana
ValueCountFrequency (%)
10
21.7%
6
13.0%
6
13.0%
4
 
8.7%
2
 
4.3%
2
 
4.3%
2
 
4.3%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Other values (11) 11
23.9%
Number Forms
ValueCountFrequency (%)
3
37.5%
3
37.5%
1
 
12.5%
1
 
12.5%
Katakana
ValueCountFrequency (%)
3
 
10.0%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (12) 12
40.0%
Box Drawing
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct1356
Distinct (%)68.6%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
2024-01-28T22:55:32.399525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length304
Median length146
Mean length11.830551
Min length2

Characters and Unicode

Total characters23389
Distinct characters667
Distinct categories10 ?
Distinct scripts5 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1142 ?
Unique (%)57.8%

Sample

1st row베르나르 베르베르 뫼비우스 그림 이세욱
2nd row윤동례
3rd row신시아 샤피로 공혜진
4th row변호사 김용철 씀
5th row박성철
ValueCountFrequency (%)
지음 443
 
7.6%
옮김 251
 
4.3%
그림 116
 
2.0%
109
 
1.9%
78
 
1.3%
51
 
0.9%
공하성 50
 
0.9%
편저 50
 
0.9%
43
 
0.7%
설은미 38
 
0.6%
Other values (2452) 4633
79.0%
2024-01-28T22:55:32.746785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4696
 
20.1%
681
 
2.9%
671
 
2.9%
602
 
2.6%
583
 
2.5%
292
 
1.2%
291
 
1.2%
248
 
1.1%
220
 
0.9%
a 218
 
0.9%
Other values (657) 14887
63.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15863
67.8%
Space Separator 4696
 
20.1%
Lowercase Letter 1658
 
7.1%
Uppercase Letter 575
 
2.5%
Other Punctuation 341
 
1.5%
Open Punctuation 107
 
0.5%
Close Punctuation 107
 
0.5%
Decimal Number 22
 
0.1%
Math Symbol 15
 
0.1%
Dash Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
681
 
4.3%
671
 
4.2%
602
 
3.8%
583
 
3.7%
292
 
1.8%
291
 
1.8%
248
 
1.6%
220
 
1.4%
203
 
1.3%
201
 
1.3%
Other values (583) 11871
74.8%
Uppercase Letter
ValueCountFrequency (%)
A 74
12.9%
T 69
12.0%
S 63
11.0%
O 40
 
7.0%
B 36
 
6.3%
M 32
 
5.6%
K 32
 
5.6%
J 31
 
5.4%
Y 25
 
4.3%
C 24
 
4.2%
Other values (15) 149
25.9%
Lowercase Letter
ValueCountFrequency (%)
a 218
13.1%
i 206
12.4%
o 155
 
9.3%
e 139
 
8.4%
h 114
 
6.9%
s 111
 
6.7%
r 96
 
5.8%
t 79
 
4.8%
k 71
 
4.3%
u 70
 
4.2%
Other values (14) 399
24.1%
Decimal Number
ValueCountFrequency (%)
1 5
22.7%
8 3
13.6%
2 3
13.6%
9 3
13.6%
6 2
 
9.1%
4 2
 
9.1%
5 2
 
9.1%
0 1
 
4.5%
7 1
 
4.5%
Other Punctuation
ValueCountFrequency (%)
, 153
44.9%
. 113
33.1%
· 52
 
15.2%
: 22
 
6.5%
/ 1
 
0.3%
Open Punctuation
ValueCountFrequency (%)
[ 95
88.8%
( 9
 
8.4%
3
 
2.8%
Close Punctuation
ValueCountFrequency (%)
] 95
88.8%
) 9
 
8.4%
3
 
2.8%
Math Symbol
ValueCountFrequency (%)
< 7
46.7%
> 7
46.7%
= 1
 
6.7%
Space Separator
ValueCountFrequency (%)
4696
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15816
67.6%
Common 5293
 
22.6%
Latin 2233
 
9.5%
Han 42
 
0.2%
Hiragana 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
681
 
4.3%
671
 
4.2%
602
 
3.8%
583
 
3.7%
292
 
1.8%
291
 
1.8%
248
 
1.6%
220
 
1.4%
203
 
1.3%
201
 
1.3%
Other values (547) 11824
74.8%
Latin
ValueCountFrequency (%)
a 218
 
9.8%
i 206
 
9.2%
o 155
 
6.9%
e 139
 
6.2%
h 114
 
5.1%
s 111
 
5.0%
r 96
 
4.3%
t 79
 
3.5%
A 74
 
3.3%
k 71
 
3.2%
Other values (39) 970
43.4%
Han
ValueCountFrequency (%)
5
 
11.9%
3
 
7.1%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (21) 21
50.0%
Common
ValueCountFrequency (%)
4696
88.7%
, 153
 
2.9%
. 113
 
2.1%
[ 95
 
1.8%
] 95
 
1.8%
· 52
 
1.0%
: 22
 
0.4%
) 9
 
0.2%
( 9
 
0.2%
< 7
 
0.1%
Other values (15) 42
 
0.8%
Hiragana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15816
67.6%
ASCII 7468
31.9%
None 58
 
0.2%
CJK 41
 
0.2%
Hiragana 5
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4696
62.9%
a 218
 
2.9%
i 206
 
2.8%
o 155
 
2.1%
, 153
 
2.0%
e 139
 
1.9%
h 114
 
1.5%
. 113
 
1.5%
s 111
 
1.5%
r 96
 
1.3%
Other values (61) 1467
 
19.6%
Hangul
ValueCountFrequency (%)
681
 
4.3%
671
 
4.2%
602
 
3.8%
583
 
3.7%
292
 
1.8%
291
 
1.8%
248
 
1.6%
220
 
1.4%
203
 
1.3%
201
 
1.3%
Other values (547) 11824
74.8%
None
ValueCountFrequency (%)
· 52
89.7%
3
 
5.2%
3
 
5.2%
CJK
ValueCountFrequency (%)
5
 
12.2%
3
 
7.3%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (20) 20
48.8%
Hiragana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct1150
Distinct (%)58.2%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
2024-01-28T22:55:33.067435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length4
Mean length4.1254426
Min length3

Characters and Unicode

Total characters8156
Distinct characters255
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique892 ?
Unique (%)45.1%

Sample

1st row베297ㄴ
2nd row윤동례
3rd row신58ㅎ
4th row변95ㅅ
5th row박54ㅅ
ValueCountFrequency (%)
공92ㅅ 49
 
2.5%
타22ㅅ 38
 
1.9%
박65ㅅ 22
 
1.1%
허64ㅅ 18
 
0.9%
시65ㄹ 15
 
0.8%
최73ㄹ 15
 
0.8%
아44ㅅ 14
 
0.7%
다68ㅁ 14
 
0.7%
이36ㅅ 14
 
0.7%
야32ㄷ 12
 
0.6%
Other values (1140) 1766
89.3%
2024-01-28T22:55:33.531906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8 636
 
7.8%
6 621
 
7.6%
2 619
 
7.6%
5 570
 
7.0%
4 555
 
6.8%
7 503
 
6.2%
442
 
5.4%
9 411
 
5.0%
342
 
4.2%
3 212
 
2.6%
Other values (245) 3245
39.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4314
52.9%
Other Letter 3812
46.7%
Uppercase Letter 29
 
0.4%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
442
 
11.6%
342
 
9.0%
175
 
4.6%
169
 
4.4%
147
 
3.9%
147
 
3.9%
121
 
3.2%
119
 
3.1%
105
 
2.8%
89
 
2.3%
Other values (224) 1956
51.3%
Uppercase Letter
ValueCountFrequency (%)
N 7
24.1%
H 7
24.1%
J 3
10.3%
F 3
10.3%
G 2
 
6.9%
I 2
 
6.9%
L 1
 
3.4%
E 1
 
3.4%
U 1
 
3.4%
M 1
 
3.4%
Decimal Number
ValueCountFrequency (%)
8 636
14.7%
6 621
14.4%
2 619
14.3%
5 570
13.2%
4 555
12.9%
7 503
11.7%
9 411
9.5%
3 212
 
4.9%
1 187
 
4.3%
Lowercase Letter
ValueCountFrequency (%)
c 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4314
52.9%
Hangul 3812
46.7%
Latin 30
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
442
 
11.6%
342
 
9.0%
175
 
4.6%
169
 
4.4%
147
 
3.9%
147
 
3.9%
121
 
3.2%
119
 
3.1%
105
 
2.8%
89
 
2.3%
Other values (224) 1956
51.3%
Latin
ValueCountFrequency (%)
N 7
23.3%
H 7
23.3%
J 3
10.0%
F 3
10.0%
G 2
 
6.7%
I 2
 
6.7%
c 1
 
3.3%
L 1
 
3.3%
E 1
 
3.3%
U 1
 
3.3%
Other values (2) 2
 
6.7%
Common
ValueCountFrequency (%)
8 636
14.7%
6 621
14.4%
2 619
14.3%
5 570
13.2%
4 555
12.9%
7 503
11.7%
9 411
9.5%
3 212
 
4.9%
1 187
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4344
53.3%
Hangul 1964
24.1%
Compat Jamo 1848
22.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 636
14.6%
6 621
14.3%
2 619
14.2%
5 570
13.1%
4 555
12.8%
7 503
11.6%
9 411
9.5%
3 212
 
4.9%
1 187
 
4.3%
N 7
 
0.2%
Other values (11) 23
 
0.5%
Compat Jamo
ValueCountFrequency (%)
442
23.9%
342
18.5%
147
 
8.0%
147
 
8.0%
121
 
6.5%
119
 
6.4%
105
 
5.7%
88
 
4.8%
87
 
4.7%
56
 
3.0%
Other values (9) 194
10.5%
Hangul
ValueCountFrequency (%)
175
 
8.9%
169
 
8.6%
89
 
4.5%
63
 
3.2%
55
 
2.8%
53
 
2.7%
51
 
2.6%
47
 
2.4%
47
 
2.4%
35
 
1.8%
Other values (205) 1180
60.1%

분류기호
Real number (ℝ)

SKEWED 

Distinct509
Distinct (%)25.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean600.41764
Minimum1
Maximum37000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.5 KiB
2024-01-28T22:55:33.671925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile151.916
Q1351.18077
median560
Q3814.6
95-th percentile911.034
Maximum37000
Range36999
Interquartile range (IQR)463.41923

Descriptive statistics

Standard deviation856.65533
Coefficient of variation (CV)1.4267658
Kurtosis1651.6131
Mean600.41764
Median Absolute Deviation (MAD)234.9
Skewness38.844994
Sum1187025.7
Variance733858.36
MonotonicityNot monotonic
2024-01-28T22:55:33.822084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
813.6 99
 
5.0%
539.99077 86
 
4.4%
818.0 69
 
3.5%
833.6 69
 
3.5%
843.0 59
 
3.0%
657.1 48
 
2.4%
539.99 44
 
2.2%
813.7 40
 
2.0%
539.0 34
 
1.7%
199.1 27
 
1.4%
Other values (499) 1402
70.9%
ValueCountFrequency (%)
1.0 9
0.5%
1.3 8
0.4%
4.0 3
 
0.2%
4.076 1
 
0.1%
4.077 7
0.4%
4.16 1
 
0.1%
4.5 1
 
0.1%
4.61 2
 
0.1%
4.75 1
 
0.1%
4.76 5
0.3%
ValueCountFrequency (%)
37000.0 1
 
0.1%
989.11 1
 
0.1%
989.0 1
 
0.1%
986.802 1
 
0.1%
986.6102 1
 
0.1%
982.889 1
 
0.1%
982.6302 1
 
0.1%
982.02 4
0.2%
982.0 1
 
0.1%
981.4602 2
0.1%
Distinct761
Distinct (%)38.5%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
2024-01-28T22:55:34.064869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length24
Mean length5.2498735
Min length1

Characters and Unicode

Total characters10379
Distinct characters489
Distinct categories9 ?
Distinct scripts5 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique511 ?
Unique (%)25.8%

Sample

1st row열린책들
2nd row데이터 없음
3rd row서돌
4th row사회평론
5th row지원북클럽
ValueCountFrequency (%)
62
 
2.7%
학산문화사 60
 
2.6%
위즈덤하우스 58
 
2.6%
bm성안당 52
 
2.3%
시대고시기획 40
 
1.8%
문학동네 36
 
1.6%
민음사 35
 
1.5%
김영사 34
 
1.5%
데이터 32
 
1.4%
없음 32
 
1.4%
Other values (760) 1827
80.6%
2024-01-28T22:55:34.445860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
423
 
4.1%
412
 
4.0%
292
 
2.8%
272
 
2.6%
245
 
2.4%
187
 
1.8%
181
 
1.7%
169
 
1.6%
154
 
1.5%
: 137
 
1.3%
Other values (479) 7907
76.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8818
85.0%
Lowercase Letter 529
 
5.1%
Uppercase Letter 335
 
3.2%
Space Separator 292
 
2.8%
Other Punctuation 154
 
1.5%
Close Punctuation 84
 
0.8%
Open Punctuation 84
 
0.8%
Decimal Number 80
 
0.8%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
423
 
4.8%
412
 
4.7%
272
 
3.1%
245
 
2.8%
187
 
2.1%
181
 
2.1%
169
 
1.9%
154
 
1.7%
131
 
1.5%
131
 
1.5%
Other values (419) 6513
73.9%
Lowercase Letter
ValueCountFrequency (%)
o 101
19.1%
s 52
 
9.8%
e 38
 
7.2%
k 35
 
6.6%
a 35
 
6.6%
b 33
 
6.2%
n 30
 
5.7%
i 27
 
5.1%
r 24
 
4.5%
m 23
 
4.3%
Other values (12) 131
24.8%
Uppercase Letter
ValueCountFrequency (%)
B 90
26.9%
M 63
18.8%
A 18
 
5.4%
P 18
 
5.4%
K 15
 
4.5%
H 15
 
4.5%
R 15
 
4.5%
S 13
 
3.9%
O 12
 
3.6%
N 11
 
3.3%
Other values (12) 65
19.4%
Decimal Number
ValueCountFrequency (%)
2 32
40.0%
1 30
37.5%
0 6
 
7.5%
4 4
 
5.0%
8 3
 
3.8%
3 3
 
3.8%
9 2
 
2.5%
Other Punctuation
ValueCountFrequency (%)
: 137
89.0%
. 10
 
6.5%
& 3
 
1.9%
# 2
 
1.3%
; 2
 
1.3%
Space Separator
ValueCountFrequency (%)
292
100.0%
Close Punctuation
ValueCountFrequency (%)
) 84
100.0%
Open Punctuation
ValueCountFrequency (%)
( 84
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8770
84.5%
Latin 864
 
8.3%
Common 697
 
6.7%
Han 30
 
0.3%
Katakana 18
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
423
 
4.8%
412
 
4.7%
272
 
3.1%
245
 
2.8%
187
 
2.1%
181
 
2.1%
169
 
1.9%
154
 
1.8%
131
 
1.5%
131
 
1.5%
Other values (393) 6465
73.7%
Latin
ValueCountFrequency (%)
o 101
 
11.7%
B 90
 
10.4%
M 63
 
7.3%
s 52
 
6.0%
e 38
 
4.4%
k 35
 
4.1%
a 35
 
4.1%
b 33
 
3.8%
n 30
 
3.5%
i 27
 
3.1%
Other values (34) 360
41.7%
Han
ValueCountFrequency (%)
6
20.0%
4
13.3%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (7) 7
23.3%
Common
ValueCountFrequency (%)
292
41.9%
: 137
19.7%
) 84
 
12.1%
( 84
 
12.1%
2 32
 
4.6%
1 30
 
4.3%
. 10
 
1.4%
0 6
 
0.9%
4 4
 
0.6%
8 3
 
0.4%
Other values (6) 15
 
2.2%
Katakana
ValueCountFrequency (%)
2
11.1%
2
11.1%
2
11.1%
2
11.1%
2
11.1%
2
11.1%
2
11.1%
2
11.1%
2
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8770
84.5%
ASCII 1561
 
15.0%
CJK 30
 
0.3%
Katakana 18
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
423
 
4.8%
412
 
4.7%
272
 
3.1%
245
 
2.8%
187
 
2.1%
181
 
2.1%
169
 
1.9%
154
 
1.8%
131
 
1.5%
131
 
1.5%
Other values (393) 6465
73.7%
ASCII
ValueCountFrequency (%)
292
18.7%
: 137
 
8.8%
o 101
 
6.5%
B 90
 
5.8%
) 84
 
5.4%
( 84
 
5.4%
M 63
 
4.0%
s 52
 
3.3%
e 38
 
2.4%
k 35
 
2.2%
Other values (50) 585
37.5%
CJK
ValueCountFrequency (%)
6
20.0%
4
13.3%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (7) 7
23.3%
Katakana
ValueCountFrequency (%)
2
11.1%
2
11.1%
2
11.1%
2
11.1%
2
11.1%
2
11.1%
2
11.1%
2
11.1%
2
11.1%

Interactions

2024-01-28T22:55:30.239619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:55:30.069588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:55:30.326010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:55:30.156007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T22:55:34.531627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호분류기호
번호1.0000.000
분류기호0.0001.000
2024-01-28T22:55:34.601447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호분류기호
번호1.000-0.197
분류기호-0.1971.000

Missing values

2024-01-28T22:55:30.761342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T22:55:30.859653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호등록번호서명저자저자기호분류기호출판사
01EM00000001나무베르나르 베르베르 뫼비우스 그림 이세욱베297ㄴ863.0열린책들
12EM00000002아름다운 소풍길윤동례윤동례814.0데이터 없음
23EM00000003회사가 당신에게 알려주지 않는 50가지 비밀신시아 샤피로 공혜진신58ㅎ325.3서돌
34EM00000004삼성을 생각한다변호사 김용철 씀변95ㅅ325.3사회평론
45EM00000005쉼터박성철박54ㅅ818.0지원북클럽
56EM00000006(한 권으로 읽는)삼국유사일연 김길형일64ㅅ911.03아이템북스
67EM00000007(theme study) 일본어 中級점프 Reading荒井禮子 ...[等]著시52ㅇ737.4시사일본어사
78EM00000008요깝정홍수정95ㅇ326.162한진
89EM00000009마음을 다스리는 기술이지드로 페르낭데이78ㅁ804.0핸디북
910EM00000010아프니까 청춘이다김난도김192ㅇ199.1쌤앤파커스
번호등록번호서명저자저자기호분류기호출판사
19671968SA00000713뉴욕 현대 미술관박혜성 글, 이정화 그림을892ㅋ69.0을파소(북이십일)
19681969SA00000714이상한 과자 가게 전천당.히로시마 레이코 글, 쟈쟈 그림, 김정화 옮김히295ㅇ833.8길벗스쿨
19691970SA00000715이상한 과자 가게 전천당.히로시마 레이코 글, 쟈쟈 그림, 김정화 옮김히295ㅇ833.8길벗스쿨
19701971SA00000716이상한 과자 가게 전천당.히로시마 레이코 글, 쟈쟈 그림, 김정화 옮김히295ㅇ833.8길벗스쿨
19711972SA00000717이상한 과자 가게 전천당.히로시마 레이코 글, 쟈쟈 그림, 김정화 옮김히295ㅇ833.8길벗스쿨
19721973SA00000718(한 눈에 펼쳐보는)우리나라 지도 그림책민병준 글, 최선웅 지도, 구연산 그림민44ㅇ989.11진선아이
19731974SA00000719(한눈에 펼쳐보는)세계지도 그림책최선웅 글·지도이병용 그림최54ㅅ989.0진선아이
19741975SA00000720(한 권으로 보는)그림 한국사 백과지호진 글, 이혁 그림지95ㄱ911.0진선아이
19751976SA00000721(한 권으로 보는)그림 세계사 백과정연 외 글, 이병용 그림정64ㄱ909.0진선아이
19761977SA00000722거꾸로 읽는 세계사유시민 지음유58ㄱ909.0돌베개