Overview

Dataset statistics

Number of variables: 22
Number of observations: 10000
Missing cells: 22692
Missing cells (%): 10.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.8 MiB
Average record size in memory: 193.0 B
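
The percentages above follow from the raw counts. A minimal Python sketch of that arithmetic, assuming the missing-cell percentage is the share of missing cells among all cells (variables × observations); the variable names are illustrative:

```python
# Figures copied from the dataset statistics above.
n_variables = 22
n_observations = 10_000
missing_cells = 22_692
avg_record_bytes = 193.0

# Share of missing cells among all cells (assumed definition).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Approximate total size implied by the average record size.
total_mib = avg_record_bytes * n_observations / 2**20

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Total size: {total_mib:.1f} MiB")
```

Both results round to the figures reported above (10.3% and 1.8 MiB).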

Variable types

Categorical: 5
Numeric: 9
Text: 6
Boolean: 2

Dataset

Description: 행정구역명, 학원/교습소, 학원지정번호, 학원명, 도로명주소, 도로명상세주소, 분야명, 교습계열명, 교습과정목록명, 교습과정명, 정원합계, 일시수용능력인원합계, 인당수강료내용, 수강료공개여부, 기숙사학원여부, 도로명우편번호, 등록상태명, 등록일자, 휴원시작일자, 휴원종료일자, 개설일자, 적재일시
Author: 서울특별시교육청 (Seoul Metropolitan Office of Education)
URL: https://data.seoul.go.kr/dataList/OA-20528/S/1/datasetView.do

Alerts

등록상태명 has constant value "" [Constant]
수강료공개여부 is highly imbalanced (55.3%) [Imbalance]
기숙사학원여부 is highly imbalanced (98.3%) [Imbalance]
교습과정목록명 has 2927 (29.3%) missing values [Missing]
교습과정명 has 885 (8.8%) missing values [Missing]
인당수강료내용 has 7724 (77.2%) missing values [Missing]
기숙사학원여부 has 504 (5.0%) missing values [Missing]
휴원시작일자 has 9858 (98.6%) missing values [Missing]
휴원종료일자 has 761 (7.6%) missing values [Missing]
정원합계 is highly skewed (γ1 = 65.71614301) [Skewed]
일시수용능력인원합계 is highly skewed (γ1 = 91.42071342) [Skewed]
학원지정번호 has unique values [Unique]
정원합계 has 665 (6.7%) zeros [Zeros]
일시수용능력인원합계 has 221 (2.2%) zeros [Zeros]
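
The two imbalance figures in the alerts are consistent with one minus the normalized Shannon entropy of the class frequencies. A minimal Python sketch under that assumption; `imbalance_score` is an illustrative name, and the code is not taken from the profiler's source. The counts are the non-missing True/False counts reported for these two variables in their own sections of this report:

```python
import math

def imbalance_score(counts):
    """Return 1 - normalized Shannon entropy of the class counts."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

# Non-missing class counts from the two Boolean variables.
fee_disclosed = imbalance_score([9069, 931])  # 수강료공개여부: True, False
boarding = imbalance_score([9481, 15])        # 기숙사학원여부: False, True

print(f"{fee_disclosed:.1%}")
print(f"{boarding:.1%}")
```

With these counts the sketch prints 55.3% and 98.3%, matching the alerts. A score near 100% means almost all rows fall into one class.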

Reproduction

Analysis started: 2024-05-18 07:21:49.763066
Analysis finished: 2024-05-18 07:21:55.795892
Duration: 6.03 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

행정구역명
Categorical

Distinct: 26
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

강남구: 1379
양천구: 821
송파구: 744
서초구: 702
노원구: 540
Other values (21): 5814

Length

Max length: 4
Median length: 3
Mean length: 3.0837
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 용산구
2nd row: 마포구
3rd row: 양천구
4th row: 영등포구
5th row: 강북구

Common Values

Value | Count | Frequency (%)
강남구 | 1379 | 13.8%
양천구 | 821 | 8.2%
송파구 | 744 | 7.4%
서초구 | 702 | 7.0%
노원구 | 540 | 5.4%
강동구 | 539 | 5.4%
강서구 | 510 | 5.1%
마포구 | 463 | 4.6%
은평구 | 441 | 4.4%
성북구 | 405 | 4.0%
Other values (16) | 3456 | 34.6%

Length

Histogram of lengths of the category

학원/교습소
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

학원: 5865
교습소: 4135

Length

Max length: 3
Median length: 2
Mean length: 2.4135
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 학원
2nd row: 교습소
3rd row: 교습소
4th row: 학원
5th row: 학원

Common Values

Value | Count | Frequency (%)
학원 | 5865 | 58.7%
교습소 | 4135 | 41.3%

Length

Histogram of lengths of the category


학원지정번호
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.2576288 × 10^9
Minimum: 272
Maximum: 3.0000504 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 272
5-th percentile: 10681.95
Q1: 1.0000412 × 10^9
Median: 3.0000272 × 10^9
Q3: 3.0000385 × 10^9
95-th percentile: 3.0000466 × 10^9
Maximum: 3.0000504 × 10^9
Range: 3.0000501 × 10^9
Interquartile range (IQR): 1.9999973 × 10^9

Descriptive statistics

Standard deviation: 1.2311005 × 10^9
Coefficient of variation (CV): 0.54530686
Kurtosis: -0.55081269
Mean: 2.2576288 × 10^9
Median Absolute Deviation (MAD): 13711.5
Skewness: -1.1519058
Sum: 2.2576288 × 10^13
Variance: 1.5156084 × 10^18
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
3000015397 | 1 | < 0.1%
3000032043 | 1 | < 0.1%
3000024093 | 1 | < 0.1%
3000039477 | 1 | < 0.1%
3000041620 | 1 | < 0.1%
3000026145 | 1 | < 0.1%
3000016024 | 1 | < 0.1%
3000038979 | 1 | < 0.1%
3000025926 | 1 | < 0.1%
3000016540 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
272 | 1 | < 0.1%
301 | 1 | < 0.1%
303 | 1 | < 0.1%
305 | 1 | < 0.1%
320 | 1 | < 0.1%
327 | 1 | < 0.1%
331 | 1 | < 0.1%
397 | 1 | < 0.1%
404 | 1 | < 0.1%
416 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
3000050381 | 1 | < 0.1%
3000050378 | 1 | < 0.1%
3000050377 | 1 | < 0.1%
3000050375 | 1 | < 0.1%
3000050361 | 1 | < 0.1%
3000050360 | 1 | < 0.1%
3000050358 | 1 | < 0.1%
3000050357 | 1 | < 0.1%
3000050355 | 1 | < 0.1%
3000050354 | 1 | < 0.1%

학원명
Text

Distinct: 9626
Distinct (%): 96.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 41
Median length: 36
Mean length: 9.6333
Min length: 1

Characters and Unicode

Total characters: 96333
Distinct characters: 969
Distinct categories: 14
Distinct scripts: 5
Distinct blocks: 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9328
Unique (%): 93.3%

Sample

1st row: 셀파우등생교실효창학원
2nd row: 로제타스톤영어교실망원캠퍼스영어교습소
3rd row: 수학하는사람들수학교습소
4th row: 여의도영재음악학원
5th row: NEW아이비학원
ValueCountFrequency (%)
english)영어교습소 19
 
0.2%
academy)학원 7
 
0.1%
툰스테이션미술학원 5
 
< 0.1%
english)학원 5
 
< 0.1%
math)수학교습소 5
 
< 0.1%
미술로생각하기학원 4
 
< 0.1%
아이지에스이아카데미(igse 4
 
< 0.1%
edu)학원 4
 
< 0.1%
포르테음악교습소 4
 
< 0.1%
예음음악교습소 4
 
< 0.1%
Other values (9741) 10124
99.4%

Most occurring characters

ValueCountFrequency (%)
7684
 
8.0%
6150
 
6.4%
4459
 
4.6%
4313
 
4.5%
4296
 
4.5%
2345
 
2.4%
2049
 
2.1%
1982
 
2.1%
1940
 
2.0%
1793
 
1.9%
Other values (959) 59322
61.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 89386
92.8%
Uppercase Letter 2485
 
2.6%
Lowercase Letter 1887
 
2.0%
Close Punctuation 751
 
0.8%
Open Punctuation 751
 
0.8%
Decimal Number 694
 
0.7%
Space Separator 185
 
0.2%
Other Punctuation 164
 
0.2%
Dash Punctuation 19
 
< 0.1%
Math Symbol 7
 
< 0.1%
Other values (4) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7684
 
8.6%
6150
 
6.9%
4459
 
5.0%
4313
 
4.8%
4296
 
4.8%
2345
 
2.6%
2049
 
2.3%
1982
 
2.2%
1940
 
2.2%
1793
 
2.0%
Other values (874) 52375
58.6%
Lowercase Letter
ValueCountFrequency (%)
e 208
11.0%
i 169
 
9.0%
a 159
 
8.4%
n 158
 
8.4%
s 135
 
7.2%
l 125
 
6.6%
t 115
 
6.1%
o 112
 
5.9%
h 98
 
5.2%
r 95
 
5.0%
Other values (17) 513
27.2%
Uppercase Letter
ValueCountFrequency (%)
E 268
 
10.8%
S 247
 
9.9%
A 218
 
8.8%
M 193
 
7.8%
C 154
 
6.2%
T 133
 
5.4%
I 124
 
5.0%
B 110
 
4.4%
N 109
 
4.4%
L 102
 
4.1%
Other values (16) 827
33.3%
Other Punctuation
ValueCountFrequency (%)
& 52
31.7%
. 46
28.0%
' 15
 
9.1%
, 15
 
9.1%
? 15
 
9.1%
/ 5
 
3.0%
% 4
 
2.4%
: 4
 
2.4%
! 4
 
2.4%
# 3
 
1.8%
Decimal Number
ValueCountFrequency (%)
2 232
33.4%
1 181
26.1%
3 87
 
12.5%
0 85
 
12.2%
4 43
 
6.2%
7 23
 
3.3%
5 17
 
2.4%
9 12
 
1.7%
6 8
 
1.2%
8 6
 
0.9%
Close Punctuation
ValueCountFrequency (%)
) 749
99.7%
] 2
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 749
99.7%
[ 2
 
0.3%
Space Separator
ValueCountFrequency (%)
185
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%
Math Symbol
ValueCountFrequency (%)
+ 7
100.0%
Other Number
ValueCountFrequency (%)
² 1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 89370
92.8%
Latin 4372
 
4.5%
Common 2574
 
2.7%
Han 16
 
< 0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7684
 
8.6%
6150
 
6.9%
4459
 
5.0%
4313
 
4.8%
4296
 
4.8%
2345
 
2.6%
2049
 
2.3%
1982
 
2.2%
1940
 
2.2%
1793
 
2.0%
Other values (859) 52359
58.6%
Latin
ValueCountFrequency (%)
E 268
 
6.1%
S 247
 
5.6%
A 218
 
5.0%
e 208
 
4.8%
M 193
 
4.4%
i 169
 
3.9%
a 159
 
3.6%
n 158
 
3.6%
C 154
 
3.5%
s 135
 
3.1%
Other values (43) 2463
56.3%
Common
ValueCountFrequency (%)
) 749
29.1%
( 749
29.1%
2 232
 
9.0%
185
 
7.2%
1 181
 
7.0%
3 87
 
3.4%
0 85
 
3.3%
& 52
 
2.0%
. 46
 
1.8%
4 43
 
1.7%
Other values (21) 165
 
6.4%
Han
ValueCountFrequency (%)
2
 
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (5) 5
31.2%
Greek
ValueCountFrequency (%)
α 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 89370
92.8%
ASCII 6943
 
7.2%
CJK 14
 
< 0.1%
None 2
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%
Punctuation 1
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7684
 
8.6%
6150
 
6.9%
4459
 
5.0%
4313
 
4.8%
4296
 
4.8%
2345
 
2.6%
2049
 
2.3%
1982
 
2.2%
1940
 
2.2%
1793
 
2.0%
Other values (859) 52359
58.6%
ASCII
ValueCountFrequency (%)
) 749
 
10.8%
( 749
 
10.8%
E 268
 
3.9%
S 247
 
3.6%
2 232
 
3.3%
A 218
 
3.1%
e 208
 
3.0%
M 193
 
2.8%
185
 
2.7%
1 181
 
2.6%
Other values (71) 3713
53.5%
CJK
ValueCountFrequency (%)
2
14.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (3) 3
21.4%
None
ValueCountFrequency (%)
² 1
50.0%
α 1
50.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%

도로명주소
Text

Distinct: 6516
Distinct (%): 65.2%
Missing: 3
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length: 39
Median length: 28
Mean length: 18.469741
Min length: 1

Characters and Unicode

Total characters: 184642
Distinct characters: 298
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 4984
Unique (%): 49.9%

Sample

1st row: 서울특별시 용산구 효창원로 136
2nd row: 서울특별시 마포구 망원로 60
3rd row: 서울특별시 양천구 목동서로 349
4th row: 서울특별시 영등포구 국제금융로 78
5th row: 서울특별시 강북구 삼양로 231
ValueCountFrequency (%)
서울특별시 9988
25.0%
강남구 1369
 
3.4%
양천구 820
 
2.1%
송파구 744
 
1.9%
서초구 716
 
1.8%
노원구 540
 
1.4%
강동구 539
 
1.3%
강서구 511
 
1.3%
마포구 464
 
1.2%
은평구 442
 
1.1%
Other values (3753) 23820
59.6%

Most occurring characters

ValueCountFrequency (%)
29957
16.2%
12323
 
6.7%
10475
 
5.7%
10161
 
5.5%
10068
 
5.5%
10014
 
5.4%
9989
 
5.4%
9989
 
5.4%
1 6401
 
3.5%
2 4733
 
2.6%
Other values (288) 70532
38.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 120838
65.4%
Decimal Number 32883
 
17.8%
Space Separator 29957
 
16.2%
Dash Punctuation 895
 
0.5%
Other Punctuation 44
 
< 0.1%
Close Punctuation 12
 
< 0.1%
Open Punctuation 12
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12323
 
10.2%
10475
 
8.7%
10161
 
8.4%
10068
 
8.3%
10014
 
8.3%
9989
 
8.3%
9989
 
8.3%
4643
 
3.8%
2855
 
2.4%
2639
 
2.2%
Other values (270) 37682
31.2%
Decimal Number
ValueCountFrequency (%)
1 6401
19.5%
2 4733
14.4%
3 4007
12.2%
4 3122
9.5%
5 3075
9.4%
6 2733
8.3%
7 2463
 
7.5%
0 2249
 
6.8%
8 2113
 
6.4%
9 1987
 
6.0%
Other Punctuation
ValueCountFrequency (%)
? 33
75.0%
. 9
 
20.5%
, 2
 
4.5%
Space Separator
ValueCountFrequency (%)
29957
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 895
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 120838
65.4%
Common 63803
34.6%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12323
 
10.2%
10475
 
8.7%
10161
 
8.4%
10068
 
8.3%
10014
 
8.3%
9989
 
8.3%
9989
 
8.3%
4643
 
3.8%
2855
 
2.4%
2639
 
2.2%
Other values (270) 37682
31.2%
Common
ValueCountFrequency (%)
29957
47.0%
1 6401
 
10.0%
2 4733
 
7.4%
3 4007
 
6.3%
4 3122
 
4.9%
5 3075
 
4.8%
6 2733
 
4.3%
7 2463
 
3.9%
0 2249
 
3.5%
8 2113
 
3.3%
Other values (7) 2950
 
4.6%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 120838
65.4%
ASCII 63804
34.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
29957
47.0%
1 6401
 
10.0%
2 4733
 
7.4%
3 4007
 
6.3%
4 3122
 
4.9%
5 3075
 
4.8%
6 2733
 
4.3%
7 2463
 
3.9%
0 2249
 
3.5%
8 2113
 
3.3%
Other values (8) 2951
 
4.6%
Hangul
ValueCountFrequency (%)
12323
 
10.2%
10475
 
8.7%
10161
 
8.4%
10068
 
8.3%
10014
 
8.3%
9989
 
8.3%
9989
 
8.3%
4643
 
3.8%
2855
 
2.4%
2639
 
2.2%
Other values (270) 37682
31.2%

도로명상세주소
Text

Distinct: 8446
Distinct (%): 84.7%
Missing: 30
Missing (%): 0.3%
Memory size: 156.2 KiB

Length

Max length: 76
Median length: 53
Mean length: 17.542126
Min length: 1

Characters and Unicode

Total characters: 174895
Distinct characters: 594
Distinct categories: 13
Distinct scripts: 5
Distinct blocks: 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7798
Unique (%): 78.2%

Sample

1st row: , 3층 (효창동)
2nd row: , 2층 전부 (망원동, 희망주택)
3rd row: 외1필지 센트럴프라자 1402호 (신정동)
4th row: , 909호,홍우빌딩 (여의도동)
5th row: , 3층 (미아동)
ValueCountFrequency (%)
7812
 
21.1%
2층 1870
 
5.1%
3층 1307
 
3.5%
일부 1154
 
3.1%
1층 681
 
1.8%
4층 660
 
1.8%
대치동 504
 
1.4%
상가동 354
 
1.0%
5층 340
 
0.9%
목동 338
 
0.9%
Other values (6370) 21959
59.4%

Most occurring characters

ValueCountFrequency (%)
27183
 
15.5%
, 13565
 
7.8%
11616
 
6.6%
) 10119
 
5.8%
( 10115
 
5.8%
7062
 
4.0%
2 6479
 
3.7%
5828
 
3.3%
1 5107
 
2.9%
0 4928
 
2.8%
Other values (584) 72893
41.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 84341
48.2%
Decimal Number 27710
 
15.8%
Space Separator 27183
 
15.5%
Other Punctuation 13737
 
7.9%
Close Punctuation 10119
 
5.8%
Open Punctuation 10115
 
5.8%
Uppercase Letter 842
 
0.5%
Dash Punctuation 421
 
0.2%
Math Symbol 287
 
0.2%
Lowercase Letter 123
 
0.1%
Other values (3) 17
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11616
 
13.8%
7062
 
8.4%
5828
 
6.9%
2446
 
2.9%
2440
 
2.9%
2252
 
2.7%
2246
 
2.7%
1650
 
2.0%
1608
 
1.9%
1554
 
1.8%
Other values (509) 45639
54.1%
Uppercase Letter
ValueCountFrequency (%)
B 189
22.4%
A 159
18.9%
S 58
 
6.9%
M 55
 
6.5%
C 53
 
6.3%
K 37
 
4.4%
D 35
 
4.2%
E 33
 
3.9%
O 30
 
3.6%
I 29
 
3.4%
Other values (16) 164
19.5%
Lowercase Letter
ValueCountFrequency (%)
e 21
17.1%
r 15
12.2%
i 12
9.8%
s 11
8.9%
o 9
 
7.3%
l 9
 
7.3%
n 9
 
7.3%
a 6
 
4.9%
u 4
 
3.3%
t 4
 
3.3%
Other values (11) 23
18.7%
Decimal Number
ValueCountFrequency (%)
2 6479
23.4%
1 5107
18.4%
0 4928
17.8%
3 4212
15.2%
4 2519
 
9.1%
5 1631
 
5.9%
6 1059
 
3.8%
7 800
 
2.9%
8 535
 
1.9%
9 440
 
1.6%
Other Punctuation
ValueCountFrequency (%)
, 13565
98.7%
? 72
 
0.5%
. 42
 
0.3%
@ 36
 
0.3%
/ 18
 
0.1%
' 2
 
< 0.1%
& 2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 285
99.3%
> 1
 
0.3%
< 1
 
0.3%
Letter Number
ValueCountFrequency (%)
13
86.7%
2
 
13.3%
Space Separator
ValueCountFrequency (%)
27183
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10119
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10115
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 421
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 89573
51.2%
Hangul 84339
48.2%
Latin 979
 
0.6%
Han 3
 
< 0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11616
 
13.8%
7062
 
8.4%
5828
 
6.9%
2446
 
2.9%
2440
 
2.9%
2252
 
2.7%
2246
 
2.7%
1650
 
2.0%
1608
 
1.9%
1554
 
1.8%
Other values (507) 45637
54.1%
Latin
ValueCountFrequency (%)
B 189
19.3%
A 159
16.2%
S 58
 
5.9%
M 55
 
5.6%
C 53
 
5.4%
K 37
 
3.8%
D 35
 
3.6%
E 33
 
3.4%
O 30
 
3.1%
I 29
 
3.0%
Other values (38) 301
30.7%
Common
ValueCountFrequency (%)
27183
30.3%
, 13565
15.1%
) 10119
 
11.3%
( 10115
 
11.3%
2 6479
 
7.2%
1 5107
 
5.7%
0 4928
 
5.5%
3 4212
 
4.7%
4 2519
 
2.8%
5 1631
 
1.8%
Other values (15) 3715
 
4.1%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Greek
ValueCountFrequency (%)
Ι 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 90537
51.8%
Hangul 84329
48.2%
Number Forms 15
 
< 0.1%
Compat Jamo 9
 
< 0.1%
CJK 2
 
< 0.1%
None 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
27183
30.0%
, 13565
15.0%
) 10119
 
11.2%
( 10115
 
11.2%
2 6479
 
7.2%
1 5107
 
5.6%
0 4928
 
5.4%
3 4212
 
4.7%
4 2519
 
2.8%
5 1631
 
1.8%
Other values (61) 4679
 
5.2%
Hangul
ValueCountFrequency (%)
11616
 
13.8%
7062
 
8.4%
5828
 
6.9%
2446
 
2.9%
2440
 
2.9%
2252
 
2.7%
2246
 
2.7%
1650
 
2.0%
1608
 
1.9%
1554
 
1.8%
Other values (505) 45627
54.1%
Number Forms
ValueCountFrequency (%)
13
86.7%
2
 
13.3%
Compat Jamo
ValueCountFrequency (%)
9
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
1
50.0%
Ι 1
50.0%

분야명
Categorical

Distinct: 11
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

입시.검정 및 보습: 5403
예능(대): 2671
국제화: 483
직업기술: 334
기타(대): 325
Other values (6): 784

Length

Max length: 10
Median length: 10
Mean length: 7.544
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 입시.검정 및 보습
2nd row: 입시.검정 및 보습
3rd row: 입시.검정 및 보습
4th row: 예능(대)
5th row: 입시.검정 및 보습

Common Values

Value | Count | Frequency (%)
입시.검정 및 보습 | 5403 | 54.0%
예능(대) | 2671 | 26.7%
국제화 | 483 | 4.8%
직업기술 | 334 | 3.3%
기타(대) | 325 | 3.2%
기예(대) | 290 | 2.9%
독서실 | 218 | 2.2%
종합(대) | 174 | 1.7%
인문사회(대) | 93 | 0.9%
정보 | 8 | 0.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
입시.검정 5403
26.0%
5403
26.0%
보습 5403
26.0%
예능(대 2671
12.8%
국제화 483
 
2.3%
직업기술 334
 
1.6%
기타(대 325
 
1.6%
기예(대 290
 
1.4%
독서실 218
 
1.0%
종합(대 174
 
0.8%
Other values (3) 102
 
0.5%

교습계열명
Categorical

Distinct: 19
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

보통교과: 4923
예능(중): 2527
<NA>: 887
외국어: 384
기타(중): 301
Other values (14): 978

Length

Max length: 7
Median length: 4
Mean length: 4.2744
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 보통교과
2nd row: 보통교과
3rd row: 보통교과
4th row: 예능(중)
5th row: 보통교과

Common Values

Value | Count | Frequency (%)
보통교과 | 4923 | 49.2%
예능(중) | 2527 | 25.3%
<NA> | 887 | 8.9%
외국어 | 384 | 3.8%
기타(중) | 301 | 3.0%
기예(중) | 261 | 2.6%
독서 | 213 | 2.1%
산업응용기술 | 125 | 1.2%
인문사회(중) | 84 | 0.8%
국제 | 73 | 0.7%
Other values (9) | 222 | 2.2%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
보통교과 4923
49.2%
예능(중 2527
25.3%
na 887
 
8.9%
외국어 384
 
3.8%
기타(중 301
 
3.0%
기예(중 261
 
2.6%
독서 213
 
2.1%
산업응용기술 125
 
1.2%
인문사회(중 84
 
0.8%
국제 73
 
0.7%
Other values (9) 222
 
2.2%

교습과정목록명
Text

MISSING 

Distinct: 1664
Distinct (%): 23.5%
Missing: 2927
Missing (%): 29.3%
Memory size: 156.2 KiB

Length

Max length: 39
Median length: 36
Mean length: 6.1778595
Min length: 1

Characters and Unicode

Total characters: 43696
Distinct characters: 350
Distinct categories: 13
Distinct scripts: 3
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1342
Unique (%): 19.0%

Sample

1st row: 보습
2nd row: 초등수학1
3rd row: 음악,
4th row: 보습,
5th row: 보습,
ValueCountFrequency (%)
보습 1681
22.9%
보습?논술 596
 
8.1%
음악 425
 
5.8%
미술 319
 
4.3%
실용외국어(유아/초?중?고 233
 
3.2%
초등수학 134
 
1.8%
독서실(유아/초?중?고 109
 
1.5%
피아노 100
 
1.4%
초등영어 68
 
0.9%
무용 66
 
0.9%
Other values (1594) 3615
49.2%

Most occurring characters

ValueCountFrequency (%)
, 3554
 
8.1%
2520
 
5.8%
2449
 
5.6%
2012
 
4.6%
( 1906
 
4.4%
) 1905
 
4.4%
? 1547
 
3.5%
1507
 
3.4%
1309
 
3.0%
1 911
 
2.1%
Other values (340) 24076
55.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29778
68.1%
Other Punctuation 6110
 
14.0%
Decimal Number 3008
 
6.9%
Open Punctuation 1909
 
4.4%
Close Punctuation 1908
 
4.4%
Uppercase Letter 447
 
1.0%
Space Separator 276
 
0.6%
Lowercase Letter 187
 
0.4%
Math Symbol 32
 
0.1%
Dash Punctuation 27
 
0.1%
Other values (3) 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2520
 
8.5%
2449
 
8.2%
2012
 
6.8%
1507
 
5.1%
1309
 
4.4%
884
 
3.0%
847
 
2.8%
836
 
2.8%
833
 
2.8%
816
 
2.7%
Other values (273) 15765
52.9%
Uppercase Letter
ValueCountFrequency (%)
A 366
81.9%
B 16
 
3.6%
P 14
 
3.1%
C 7
 
1.6%
E 7
 
1.6%
W 5
 
1.1%
S 5
 
1.1%
H 3
 
0.7%
L 3
 
0.7%
O 3
 
0.7%
Other values (10) 18
 
4.0%
Lowercase Letter
ValueCountFrequency (%)
e 25
13.4%
i 24
12.8%
a 16
8.6%
s 15
8.0%
r 15
8.0%
h 13
7.0%
n 13
7.0%
t 12
 
6.4%
o 12
 
6.4%
c 10
 
5.3%
Other values (9) 32
17.1%
Decimal Number
ValueCountFrequency (%)
1 911
30.3%
0 475
15.8%
2 431
14.3%
4 319
 
10.6%
5 318
 
10.6%
3 200
 
6.6%
6 192
 
6.4%
9 74
 
2.5%
8 47
 
1.6%
7 41
 
1.4%
Other Punctuation
ValueCountFrequency (%)
, 3554
58.2%
? 1547
25.3%
/ 411
 
6.7%
* 396
 
6.5%
. 201
 
3.3%
& 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 1906
99.8%
[ 3
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1905
99.8%
] 3
 
0.2%
Space Separator
ValueCountFrequency (%)
274
99.3%
  2
 
0.7%
Math Symbol
ValueCountFrequency (%)
~ 29
90.6%
+ 3
 
9.4%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%
Letter Number
ValueCountFrequency (%)
5
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29778
68.1%
Common 13279
30.4%
Latin 639
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2520
 
8.5%
2449
 
8.2%
2012
 
6.8%
1507
 
5.1%
1309
 
4.4%
884
 
3.0%
847
 
2.8%
836
 
2.8%
833
 
2.8%
816
 
2.7%
Other values (273) 15765
52.9%
Latin
ValueCountFrequency (%)
A 366
57.3%
e 25
 
3.9%
i 24
 
3.8%
B 16
 
2.5%
a 16
 
2.5%
s 15
 
2.3%
r 15
 
2.3%
P 14
 
2.2%
h 13
 
2.0%
n 13
 
2.0%
Other values (30) 122
 
19.1%
Common
ValueCountFrequency (%)
, 3554
26.8%
( 1906
14.4%
) 1905
14.3%
? 1547
11.6%
1 911
 
6.9%
0 475
 
3.6%
2 431
 
3.2%
/ 411
 
3.1%
* 396
 
3.0%
4 319
 
2.4%
Other values (17) 1424
10.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29778
68.1%
ASCII 13910
31.8%
Number Forms 5
 
< 0.1%
None 2
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 3554
25.5%
( 1906
13.7%
) 1905
13.7%
? 1547
11.1%
1 911
 
6.5%
0 475
 
3.4%
2 431
 
3.1%
/ 411
 
3.0%
* 396
 
2.8%
A 366
 
2.6%
Other values (54) 2008
14.4%
Hangul
ValueCountFrequency (%)
2520
 
8.5%
2449
 
8.2%
2012
 
6.8%
1507
 
5.1%
1309
 
4.4%
884
 
3.0%
847
 
2.8%
836
 
2.8%
833
 
2.8%
816
 
2.7%
Other values (273) 15765
52.9%
Number Forms
ValueCountFrequency (%)
5
100.0%
None
ValueCountFrequency (%)
  2
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%

교습과정명
Text

MISSING 

Distinct: 95
Distinct (%): 1.0%
Missing: 885
Missing (%): 8.8%
Memory size: 156.2 KiB

Length

Max length: 24
Median length: 2
Mean length: 3.5445968
Min length: 2

Characters and Unicode

Total characters: 32309
Distinct characters: 152
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 10
Unique (%): 0.1%

Sample

1st row: 보습
2nd row: 보습
3rd row: 보습
4th row: 음악
5th row: 보습
ValueCountFrequency (%)
보습 3901
42.8%
음악 1434
 
15.7%
미술 1006
 
11.0%
보습?논술 990
 
10.9%
실용외국어(유아/초?중?고 373
 
4.1%
독서실(유아/초?중?고 141
 
1.5%
무용 114
 
1.3%
기타(소 95
 
1.0%
실용음악(성악 83
 
0.9%
독서실(일반인 67
 
0.7%
Other values (85) 911
 
10.0%

Most occurring characters

ValueCountFrequency (%)
4960
15.4%
4891
15.1%
? 2085
 
6.5%
2021
 
6.3%
1603
 
5.0%
1564
 
4.8%
1079
 
3.3%
) 1023
 
3.2%
( 1023
 
3.2%
999
 
3.1%
Other values (142) 11061
34.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27369
84.7%
Other Punctuation 2894
 
9.0%
Close Punctuation 1023
 
3.2%
Open Punctuation 1023
 
3.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4960
18.1%
4891
17.9%
2021
 
7.4%
1603
 
5.9%
1564
 
5.7%
1079
 
3.9%
999
 
3.7%
714
 
2.6%
669
 
2.4%
570
 
2.1%
Other values (137) 8299
30.3%
Other Punctuation
ValueCountFrequency (%)
? 2085
72.0%
/ 514
 
17.8%
, 295
 
10.2%
Close Punctuation
ValueCountFrequency (%)
) 1023
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1023
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27369
84.7%
Common 4940
 
15.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4960
18.1%
4891
17.9%
2021
 
7.4%
1603
 
5.9%
1564
 
5.7%
1079
 
3.9%
999
 
3.7%
714
 
2.6%
669
 
2.4%
570
 
2.1%
Other values (137) 8299
30.3%
Common
ValueCountFrequency (%)
? 2085
42.2%
) 1023
20.7%
( 1023
20.7%
/ 514
 
10.4%
, 295
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27369
84.7%
ASCII 4940
 
15.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4960
18.1%
4891
17.9%
2021
 
7.4%
1603
 
5.9%
1564
 
5.7%
1079
 
3.9%
999
 
3.7%
714
 
2.6%
669
 
2.4%
570
 
2.1%
Other values (137) 8299
30.3%
ASCII
ValueCountFrequency (%)
? 2085
42.2%
) 1023
20.7%
( 1023
20.7%
/ 514
 
10.4%
, 295
 
6.0%

정원합계
Real number (ℝ)

SKEWED  ZEROS 

Distinct: 848
Distinct (%): 8.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1219.4825
Minimum: 0
Maximum: 3999996
Zeros: 665
Zeros (%): 6.7%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 16
Median: 45
Q3: 135
95-th percentile: 700
Maximum: 3999996
Range: 3999996
Interquartile range (IQR): 119

Descriptive statistics

Standard deviation: 50063.197
Coefficient of variation (CV): 41.052821
Kurtosis: 4709.1424
Mean: 1219.4825
Median Absolute Deviation (MAD): 36
Skewness: 65.716143
Sum: 12194825
Variance: 2.5063237 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
0 | 665 | 6.7%
12 | 304 | 3.0%
20 | 285 | 2.9%
30 | 278 | 2.8%
18 | 252 | 2.5%
24 | 245 | 2.5%
15 | 240 | 2.4%
60 | 219 | 2.2%
40 | 191 | 1.9%
10 | 188 | 1.9%
Other values (838) | 7133 | 71.3%

Minimum 10 values

Value | Count | Frequency (%)
0 | 665 | 6.7%
1 | 14 | 0.1%
2 | 42 | 0.4%
3 | 55 | 0.5%
4 | 99 | 1.0%
5 | 96 | 1.0%
6 | 171 | 1.7%
7 | 53 | 0.5%
8 | 159 | 1.6%
9 | 175 | 1.8%

Maximum 10 values

Value | Count | Frequency (%)
3999996 | 1 | < 0.1%
2369763 | 1 | < 0.1%
1711000 | 1 | < 0.1%
489510 | 1 | < 0.1%
400045 | 1 | < 0.1%
281572 | 1 | < 0.1%
109890 | 1 | < 0.1%
103896 | 1 | < 0.1%
65292 | 1 | < 0.1%
59994 | 1 | < 0.1%
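
Several of the 정원합계 statistics are derived from others, which makes them easy to sanity-check. A minimal Python sketch of those relationships; the 1.5 × IQR upper fence is a common outlier convention added here for illustration and is not part of the report:

```python
# Figures copied from the 정원합계 statistics in this report.
q1, q3 = 16, 135
mean = 1219.4825
std = 50063.197

iqr = q3 - q1                 # interquartile range
cv = std / mean               # coefficient of variation
upper_fence = q3 + 1.5 * iqr  # Tukey-style outlier fence (assumption)

# Ten largest values listed in the report.
largest = [3999996, 2369763, 1711000, 489510, 400045,
           281572, 109890, 103896, 65292, 59994]
flagged = [v for v in largest if v > upper_fence]

print(iqr, round(cv, 4), upper_fence, len(flagged))
```

The IQR and CV agree with the reported 119 and 41.052821, and all ten listed maxima lie far above the fence, which fits the very high skewness.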

일시수용능력인원합계
Real number (ℝ)

SKEWED  ZEROS 

Distinct: 404
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 75.3379
Minimum: 0
Maximum: 99999
Zeros: 221
Zeros (%): 2.2%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 3
Q1: 6
Median: 41.5
Q3: 77
95-th percentile: 185
Maximum: 99999
Range: 99999
Interquartile range (IQR): 71

Descriptive statistics

Standard deviation: 1031.5445
Coefficient of variation (CV): 13.692238
Kurtosis: 8812.6653
Mean: 75.3379
Median Absolute Deviation (MAD): 35.5
Skewness: 91.420713
Sum: 753379
Variance: 1064084
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
9 | 901 | 9.0%
5 | 769 | 7.7%
6 | 596 | 6.0%
4 | 565 | 5.7%
7 | 454 | 4.5%
3 | 328 | 3.3%
8 | 311 | 3.1%
70 | 288 | 2.9%
0 | 221 | 2.2%
50 | 218 | 2.2%
Other values (394) | 5349 | 53.5%

Minimum 10 values

Value | Count | Frequency (%)
0 | 221 | 2.2%
1 | 37 | 0.4%
2 | 115 | 1.1%
3 | 328 | 3.3%
4 | 565 | 5.7%
5 | 769 | 7.7%
6 | 596 | 6.0%
7 | 454 | 4.5%
8 | 311 | 3.1%
9 | 901 | 9.0%

Maximum 10 values

Value | Count | Frequency (%)
99999 | 1 | < 0.1%
9999 | 5 | 0.1%
8100 | 1 | < 0.1%
2000 | 1 | < 0.1%
1744 | 1 | < 0.1%
1701 | 1 | < 0.1%
1600 | 2 | < 0.1%
1500 | 2 | < 0.1%
1373 | 1 | < 0.1%
1313 | 1 | < 0.1%
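
The largest values of 일시수용능력인원합계 include 9999 (five rows) and 99999, which look like placeholder codes rather than real capacities. That reading is an assumption, not something the report states. A minimal Python sketch of a hypothetical helper, `is_all_nines`, that flags such values for manual review:

```python
def is_all_nines(value, min_digits=4):
    """True if value is written only with the digit 9.

    min_digits is an illustrative threshold so that small, plausible
    capacities such as 9 or 99 are not flagged.
    """
    text = str(value)
    return len(text) >= min_digits and set(text) == {"9"}

# Ten largest distinct values listed in the report.
largest = [99999, 9999, 8100, 2000, 1744, 1701, 1600, 1500, 1373, 1313]
suspects = [v for v in largest if is_all_nines(v)]
print(suspects)
```

Flagged values are only candidates; whether to drop, cap, or keep them depends on how the source system encodes unknown capacity.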

인당수강료내용
Text

MISSING 

Distinct: 2239
Distinct (%): 98.4%
Missing: 7724
Missing (%): 77.2%
Memory size: 156.2 KiB

Length

Max length: 958
Median length: 345
Mean length: 101.43278
Min length: 4

Characters and Unicode

Total characters: 230861
Distinct characters: 452
Distinct categories: 13
Distinct scripts: 3
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2209
Unique (%): 97.1%

Sample

1st row: 초등수학1:300000, 중등수학:380000, 초등수학2:330000, 고등수학:420000
2nd row: 피아노초급A:120000, 피아노초급B:130000, 피아노초급C:140000, 피아노중급A:140000, 피아노중급B:150000, 피아노중급C:160000, 피아노고급A:100000, 피아노고급B:110000, 피아노고급C:120000, 플루트초급:40000, 플루트중급:70000
3rd row: 중등과학:100000, 통합과학:150000, 화학1, 생명1:150000
4th row: 초3,4(주4회,회150분):220000, 초5,6(주4회,회150분):250000, 중1(주4회,회180분):270000, 중2,3(주4회,회240분):290000, 고등(주5회,회240분):350000, 고등(주5회,회240분):400000, 고등수학(주5회,회당240분):450000
5th row: 중2수학:470000, 중3수학:470000, 중1수학:450000, 고1수학:500000, 고2수학:500000, 고3수학(1):450000, 고3수학(2):220000
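
Each sample row packs several `course:fee` pairs into one string, and course names can themselves contain commas (for example `화학1, 생명1`). A minimal Python sketch of one way to split such a string, assuming every fee is a run of digits that closes its pair; `parse_fees` is a hypothetical helper, not part of the dataset or the profiler:

```python
import re

# A pair is: a lazily matched name, a colon, digits, then a comma or end of string.
PAIR = re.compile(r"(.+?):(\d+)(?:,\s*|$)")

def parse_fees(text):
    """Return a list of (course_name, fee) tuples parsed from text."""
    return [(name.strip(), int(fee)) for name, fee in PAIR.findall(text)]

row = "중등과학:100000, 통합과학:150000, 화학1, 생명1:150000"
for name, fee in parse_fees(row):
    print(name, fee)
```

Splitting on the fee rather than on every comma keeps names such as `초3,4(주4회,회150분)` intact. Rows whose fees are not plain digits would need extra handling.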
ValueCountFrequency (%)
피아노 209
 
1.4%
초등 104
 
0.7%
중등 64
 
0.4%
미술 37
 
0.2%
고등 32
 
0.2%
수학 26
 
0.2%
중등수학:250000 24
 
0.2%
초등수학:200000 24
 
0.2%
중등수학:300000 21
 
0.1%
고급 20
 
0.1%
Other values (11460) 14432
96.3%

Most occurring characters

ValueCountFrequency (%)
0 56612
24.5%
: 13388
 
5.8%
12760
 
5.5%
, 12647
 
5.5%
1 10464
 
4.5%
2 8912
 
3.9%
6713
 
2.9%
( 6340
 
2.7%
) 6330
 
2.7%
4 5153
 
2.2%
Other values (442) 91542
39.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 99234
43.0%
Other Letter 70098
30.4%
Other Punctuation 30955
 
13.4%
Space Separator 12760
 
5.5%
Open Punctuation 6370
 
2.8%
Close Punctuation 6359
 
2.8%
Uppercase Letter 3242
 
1.4%
Lowercase Letter 1159
 
0.5%
Dash Punctuation 342
 
0.1%
Math Symbol 192
 
0.1%
Other values (3) 150
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6713
 
9.6%
5050
 
7.2%
4501
 
6.4%
4332
 
6.2%
3567
 
5.1%
3189
 
4.5%
3055
 
4.4%
2841
 
4.1%
2647
 
3.8%
2523
 
3.6%
Other values (354) 31680
45.2%
Uppercase Letter
ValueCountFrequency (%)
A 956
29.5%
B 888
27.4%
C 440
13.6%
D 223
 
6.9%
E 121
 
3.7%
H 92
 
2.8%
F 72
 
2.2%
S 70
 
2.2%
G 58
 
1.8%
K 47
 
1.4%
Other values (15) 275
 
8.5%
Lowercase Letter
ValueCountFrequency (%)
e 160
13.8%
a 101
 
8.7%
i 98
 
8.5%
n 92
 
7.9%
r 80
 
6.9%
s 70
 
6.0%
c 70
 
6.0%
t 69
 
6.0%
l 68
 
5.9%
o 58
 
5.0%
Other values (14) 293
25.3%
Decimal Number
ValueCountFrequency (%)
0 56612
57.0%
1 10464
 
10.5%
2 8912
 
9.0%
4 5153
 
5.2%
3 5146
 
5.2%
5 5131
 
5.2%
6 2600
 
2.6%
8 2042
 
2.1%
7 1760
 
1.8%
9 1414
 
1.4%
Other Punctuation
ValueCountFrequency (%)
: 13388
43.2%
, 12647
40.9%
* 3176
 
10.3%
. 1585
 
5.1%
/ 105
 
0.3%
? 35
 
0.1%
& 12
 
< 0.1%
# 6
 
< 0.1%
; 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 135
70.3%
+ 43
 
22.4%
< 7
 
3.6%
> 7
 
3.6%
Letter Number
ValueCountFrequency (%)
25
45.5%
24
43.6%
5
 
9.1%
1
 
1.8%
Open Punctuation
ValueCountFrequency (%)
( 6340
99.5%
[ 28
 
0.4%
{ 2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 6330
99.5%
] 27
 
0.4%
} 2
 
< 0.1%
Other Number
ValueCountFrequency (%)
5
50.0%
4
40.0%
1
 
10.0%
Space Separator
ValueCountFrequency (%)
12760
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 342
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 85
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 156307
67.7%
Hangul 70098
30.4%
Latin 4456
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6713
 
9.6%
5050
 
7.2%
4501
 
6.4%
4332
 
6.2%
3567
 
5.1%
3189
 
4.5%
3055
 
4.4%
2841
 
4.1%
2647
 
3.8%
2523
 
3.6%
Other values (354) 31680
45.2%
Latin
ValueCountFrequency (%)
A 956
21.5%
B 888
19.9%
C 440
 
9.9%
D 223
 
5.0%
e 160
 
3.6%
E 121
 
2.7%
a 101
 
2.3%
i 98
 
2.2%
n 92
 
2.1%
H 92
 
2.1%
Other values (43) 1285
28.8%
Common
ValueCountFrequency (%)
0 56612
36.2%
: 13388
 
8.6%
12760
 
8.2%
, 12647
 
8.1%
1 10464
 
6.7%
2 8912
 
5.7%
( 6340
 
4.1%
) 6330
 
4.0%
4 5153
 
3.3%
3 5146
 
3.3%
Other values (25) 18555
 
11.9%

Most occurring blocks

Value               Count    Frequency (%)
ASCII               160698   69.6%
Hangul              70097    30.4%
Number Forms        55       < 0.1%
Enclosed Alphanum   10       < 0.1%
Compat Jamo         1        < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 56612
35.2%
: 13388
 
8.3%
12760
 
7.9%
, 12647
 
7.9%
1 10464
 
6.5%
2 8912
 
5.5%
( 6340
 
3.9%
) 6330
 
3.9%
4 5153
 
3.2%
3 5146
 
3.2%
Other values (71) 22946
14.3%
Hangul
ValueCountFrequency (%)
6713
 
9.6%
5050
 
7.2%
4501
 
6.4%
4332
 
6.2%
3567
 
5.1%
3189
 
4.5%
3055
 
4.4%
2841
 
4.1%
2647
 
3.8%
2523
 
3.6%
Other values (353) 31679
45.2%
Number Forms
ValueCountFrequency (%)
25
45.5%
24
43.6%
5
 
9.1%
1
 
1.8%
Enclosed Alphanum
ValueCountFrequency (%)
5
50.0%
4
40.0%
1
 
10.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

수강료공개여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
True
9069 
False
931 
ValueCountFrequency (%)
True 9069
90.7%
False 931
 
9.3%
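
The imbalance figure reported in the Alerts for this variable (55.3%) is consistent with one minus the normalized Shannon entropy of the class counts. The sketch below reproduces that number from the counts above. Whether the profiling tool computes the score in exactly this way is an assumption, and `imbalance_score` is a hypothetical name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts.

    H is the Shannon entropy (in bits) of the class distribution and k is
    the number of classes. 0.0 means perfectly balanced; values near 1.0
    mean almost all observations fall in one class.
    Assumed formula, checked only against the figures in this report.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


fee_disclosure = imbalance_score([9069, 931])  # True / False counts above
dormitory = imbalance_score([9481, 15])        # non-missing 기숙사학원여부 counts
```

Under this assumption the two results round to 55.3% and 98.3%, matching the alert values for 수강료공개여부 and 기숙사학원여부.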

기숙사학원여부
Boolean

IMBALANCE  MISSING 

Distinct2
Distinct (%)< 0.1%
Missing504
Missing (%)5.0%
Memory size97.7 KiB
False
9481 
True
 
15
(Missing)
 
504
Value       Count   Frequency (%)
False       9481    94.8%
True        15      0.1%
(Missing)   504     5.0%

도로명우편번호
Real number (ℝ)

Distinct3654
Distinct (%)36.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean45068.74
Minimum0
Maximum158887
Zeros11
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum0
5-th percentile1765
Q14735.75
median6665
Q3132752
95-th percentile153859.05
Maximum158887
Range158887
Interquartile range (IQR)128016.25

Descriptive statistics

Standard deviation61858.181
Coefficient of variation (CV)1.3725296
Kurtosis-1.0728413
Mean45068.74
Median Absolute Deviation (MAD)2611
Skewness0.93646001
Sum4.506874 × 10^8
Variance3.8264346 × 10^9
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7983 110
 
1.1%
6593 61
 
0.6%
6202 37
 
0.4%
5849 35
 
0.4%
6280 33
 
0.3%
135998 32
 
0.3%
6199 31
 
0.3%
6279 31
 
0.3%
5269 30
 
0.3%
6512 30
 
0.3%
Other values (3644) 9570
95.7%
ValueCountFrequency (%)
0 11
0.1%
1006 2
 
< 0.1%
1021 1
 
< 0.1%
1030 1
 
< 0.1%
1031 2
 
< 0.1%
1033 1
 
< 0.1%
1041 3
 
< 0.1%
1042 3
 
< 0.1%
1043 3
 
< 0.1%
1047 1
 
< 0.1%
ValueCountFrequency (%)
158887 1
 
< 0.1%
158885 13
0.1%
158884 2
 
< 0.1%
158883 1
 
< 0.1%
158881 1
 
< 0.1%
158879 1
 
< 0.1%
158878 2
 
< 0.1%
158877 12
0.1%
158876 1
 
< 0.1%
158875 1
 
< 0.1%
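
Because 도로명우편번호 is profiled as a number, the statistics above mix what appear to be two code formats: values such as 1006 look like five-digit postal codes that lost a leading zero, while values such as 158885 look like older six-digit codes, and 0 occurs 11 times. The sketch below shows one way to restore a string form before analysis. The split at 100000 and the treatment of 0 as missing are assumptions, and `normalize_postcode` is a hypothetical helper.

```python
from typing import Optional


def normalize_postcode(value) -> Optional[str]:
    """Convert a numeric postal code to a zero-padded string.

    Assumptions (not confirmed by the source data dictionary):
    - 0, None and NaN mean "no postal code";
    - values below 100000 are five-digit codes whose leading zero was lost;
    - larger values are six-digit codes and are kept as they are.
    """
    if value is None or value != value:  # value != value is True only for NaN
        return None
    number = int(value)
    if number <= 0:
        return None
    if number < 100_000:
        return f"{number:05d}"
    return f"{number:06d}"


codes = [normalize_postcode(v) for v in (1006, 7983, 158885, 0)]
```

Keeping the column as text also avoids summary statistics such as the mean or skewness, which carry no meaning for postal codes.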

등록상태명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
개원
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0
Unique (%)0.0%

Sample

1st row개원
2nd row개원
3rd row개원
4th row개원
5th row개원

Common Values

ValueCountFrequency (%)
개원 10000
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
개원 10000
100.0%

등록일자
Real number (ℝ)

Distinct4775
Distinct (%)47.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20140753
Minimum19561209
Maximum20240510
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum19561209
5-th percentile19970619
Q120100112
median20160831
Q320210211
95-th percentile20231024
Maximum20240510
Range679301
Interquartile range (IQR)110098.75

Descriptive statistics

Standard deviation85575.784
Coefficient of variation (CV)0.004248887
Kurtosis2.6751678
Mean20140753
Median Absolute Deviation (MAD)50273.5
Skewness-1.4444825
Sum2.0140753 × 10^11
Variance7.3232148 × 10^9
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20220711 13
 
0.1%
20240229 11
 
0.1%
20231222 10
 
0.1%
20231228 9
 
0.1%
20140513 9
 
0.1%
20221208 9
 
0.1%
20230206 9
 
0.1%
20231012 9
 
0.1%
20211231 9
 
0.1%
20231106 9
 
0.1%
Other values (4765) 9903
99.0%
ValueCountFrequency (%)
19561209 1
< 0.1%
19581203 1
< 0.1%
19620127 1
< 0.1%
19620213 1
< 0.1%
19631010 1
< 0.1%
19700417 1
< 0.1%
19701118 1
< 0.1%
19710508 1
< 0.1%
19710520 1
< 0.1%
19710624 1
< 0.1%
ValueCountFrequency (%)
20240510 1
 
< 0.1%
20240509 2
 
< 0.1%
20240508 4
< 0.1%
20240507 5
0.1%
20240503 4
< 0.1%
20240502 2
 
< 0.1%
20240501 5
0.1%
20240430 6
0.1%
20240429 4
< 0.1%
20240426 1
 
< 0.1%

휴원시작일자
Real number (ℝ)

MISSING 

Distinct112
Distinct (%)78.9%
Missing9858
Missing (%)98.6%
Infinite0
Infinite (%)0.0%
Mean20047255
Minimum0
Maximum20240424
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum0
5-th percentile20120724
Q120170630
median20200224
Q320201122
95-th percentile20230711
Maximum20240424
Range20240424
Interquartile range (IQR)30492

Descriptive statistics

Standard deviation1694555.4
Coefficient of variation (CV)0.084528052
Kurtosis141.89877
Mean20047255
Median Absolute Deviation (MAD)19054
Skewness-11.910054
Sum2.8467102 × 10^9
Variance2.8715179 × 10^12
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20200224 15
 
0.1%
20200225 11
 
0.1%
20200302 3
 
< 0.1%
20220801 2
 
< 0.1%
20200301 2
 
< 0.1%
20200831 2
 
< 0.1%
20200226 2
 
< 0.1%
20200324 1
 
< 0.1%
20231007 1
 
< 0.1%
20160604 1
 
< 0.1%
Other values (102) 102
 
1.0%
(Missing) 9858
98.6%
ValueCountFrequency (%)
0 1
< 0.1%
20110418 1
< 0.1%
20110614 1
< 0.1%
20110912 1
< 0.1%
20120101 1
< 0.1%
20120409 1
< 0.1%
20120430 1
< 0.1%
20120719 1
< 0.1%
20120823 1
< 0.1%
20120824 1
< 0.1%
ValueCountFrequency (%)
20240424 1
< 0.1%
20240320 1
< 0.1%
20231109 1
< 0.1%
20231007 1
< 0.1%
20230818 1
< 0.1%
20230807 1
< 0.1%
20230717 1
< 0.1%
20230712 1
< 0.1%
20230701 1
< 0.1%
20230521 1
< 0.1%

휴원종료일자
Real number (ℝ)

MISSING 

Distinct113
Distinct (%)1.2%
Missing761
Missing (%)7.6%
Infinite0
Infinite (%)0.0%
Mean98782017
Minimum20110424
Maximum99999999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum20110424
5-th percentile99991231
Q199991231
median99991231
Q399991231
95-th percentile99991231
Maximum99999999
Range79889575
Interquartile range (IQR)0

Descriptive statistics

Standard deviation9748995.6
Coefficient of variation (CV)0.098692008
Kurtosis61.041965
Mean98782017
Median Absolute Deviation (MAD)0
Skewness-7.9390642
Sum9.1264705 × 10^11
Variance9.5042915 × 10^13
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99991231 9098
91.0%
20200405 11
 
0.1%
20200419 4
 
< 0.1%
20200415 4
 
< 0.1%
20210228 3
 
< 0.1%
20200420 2
 
< 0.1%
20200322 2
 
< 0.1%
20160930 2
 
< 0.1%
20231231 2
 
< 0.1%
20200331 2
 
< 0.1%
Other values (103) 109
 
1.1%
(Missing) 761
 
7.6%
ValueCountFrequency (%)
20110424 1
< 0.1%
20110713 1
< 0.1%
20110925 1
< 0.1%
20120228 1
< 0.1%
20120415 1
< 0.1%
20120513 1
< 0.1%
20121231 1
< 0.1%
20130101 1
< 0.1%
20130705 1
< 0.1%
20130719 1
< 0.1%
ValueCountFrequency (%)
99999999 1
 
< 0.1%
99991231 9098
91.0%
20240520 1
 
< 0.1%
20240430 1
 
< 0.1%
20240228 1
 
< 0.1%
20240131 1
 
< 0.1%
20240107 1
 
< 0.1%
20231231 2
 
< 0.1%
20231124 1
 
< 0.1%
20230731 1
 
< 0.1%
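
The date variables in this report (등록일자, 휴원시작일자, 휴원종료일자, 개설일자, 적재일시) are stored as integers in YYYYMMDD form, so their means, ranges and variances are computed on the raw numbers rather than on calendar dates. In 휴원종료일자 the value 99991231 accounts for 9098 rows and 99999999 for one, and 휴원시작일자 contains a single 0. The sketch below converts such values to dates using only the standard library. Treating these three values as "no date" placeholders is an assumption, and `parse_yyyymmdd` is a hypothetical helper.

```python
from datetime import date, datetime
from typing import Optional

# Assumed placeholder values seen in this report; not documented by the source.
PLACEHOLDERS = {0, 99991231, 99999999}


def parse_yyyymmdd(value) -> Optional[date]:
    """Parse an integer such as 20200405 into a date.

    Returns None for missing values, assumed placeholders, and any number
    that is not a valid calendar date.
    """
    if value is None or value != value:  # value != value is True only for NaN
        return None
    number = int(value)
    if number in PLACEHOLDERS:
        return None
    try:
        return datetime.strptime(f"{number:08d}", "%Y%m%d").date()
    except ValueError:
        return None


closure_end = [parse_yyyymmdd(v) for v in (20200405, 99991231, 99999999, None)]
```

The explicit placeholder check matters because 99991231 is itself a valid calendar date (31 December 9999) and would otherwise be parsed without error.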

개설일자
Real number (ℝ)

Distinct4781
Distinct (%)47.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20141157
Minimum19561209
Maximum20240510
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum19561209
5-th percentile19970731
Q120100216
median20160909
Q320210218
95-th percentile20231024
Maximum20240510
Range679301
Interquartile range (IQR)110002.25

Descriptive statistics

Standard deviation85319.443
Coefficient of variation (CV)0.0042360745
Kurtosis2.7467849
Mean20141157
Median Absolute Deviation (MAD)50205
Skewness-1.4573109
Sum2.0141157 × 10^11
Variance7.2794073 × 10^9
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20220711 12
 
0.1%
20240229 10
 
0.1%
20231030 9
 
0.1%
20221208 9
 
0.1%
20231222 9
 
0.1%
20231106 9
 
0.1%
20240320 9
 
0.1%
20220125 9
 
0.1%
20211231 9
 
0.1%
20231012 9
 
0.1%
Other values (4771) 9906
99.1%
ValueCountFrequency (%)
19561209 1
< 0.1%
19581203 1
< 0.1%
19620127 1
< 0.1%
19620213 1
< 0.1%
19631010 1
< 0.1%
19700417 1
< 0.1%
19701118 1
< 0.1%
19710508 1
< 0.1%
19710520 1
< 0.1%
19710624 1
< 0.1%
ValueCountFrequency (%)
20240510 1
 
< 0.1%
20240509 2
 
< 0.1%
20240508 4
< 0.1%
20240507 5
0.1%
20240503 4
< 0.1%
20240502 2
 
< 0.1%
20240501 7
0.1%
20240430 6
0.1%
20240429 3
< 0.1%
20240426 1
 
< 0.1%

적재일시
Real number (ℝ)

Distinct21
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20233095
Minimum20231018
Maximum20240512
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum20231018
5-th percentile20231018
Q120231018
median20231018
Q320231210
95-th percentile20240428
Maximum20240512
Range9494
Interquartile range (IQR)192

Descriptive statistics

Standard deviation3860.0998
Coefficient of variation (CV)0.00019078147
Kurtosis-0.20871861
Mean20233095
Median Absolute Deviation (MAD)0
Skewness1.337684
Sum2.0233095 × 10^11
Variance14900370
MonotonicityNot monotonic
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
20231018 6779
67.8%
20240225 419
 
4.2%
20240317 399
 
4.0%
20240428 375
 
3.8%
20240128 353
 
3.5%
20231029 225
 
2.2%
20240505 158
 
1.6%
20240331 147
 
1.5%
20240324 136
 
1.4%
20240407 118
 
1.2%
Other values (11) 891
 
8.9%
ValueCountFrequency (%)
20231018 6779
67.8%
20231023 29
 
0.3%
20231029 225
 
2.2%
20231105 93
 
0.9%
20231113 78
 
0.8%
20231119 70
 
0.7%
20231126 73
 
0.7%
20231206 111
 
1.1%
20231210 54
 
0.5%
20231217 91
 
0.9%
ValueCountFrequency (%)
20240512 116
 
1.2%
20240505 158
 
1.6%
20240428 375
3.8%
20240407 118
 
1.2%
20240331 147
 
1.5%
20240324 136
 
1.4%
20240317 399
4.0%
20240225 419
4.2%
20240128 353
3.5%
20231231 106
 
1.1%

Sample

행정구역명학원/교습소학원지정번호학원명도로명주소도로명상세주소분야명교습계열명교습과정목록명교습과정명정원합계일시수용능력인원합계인당수강료내용수강료공개여부기숙사학원여부도로명우편번호등록상태명등록일자휴원시작일자휴원종료일자개설일자적재일시
7487용산구학원3000015397셀파우등생교실효창학원서울특별시 용산구 효창원로 136, 3층 (효창동)입시.검정 및 보습보통교과보습보습16570<NA>YN140896개원20121011<NA>999912312012101120231105
9397마포구교습소3000021498로제타스톤영어교실망원캠퍼스영어교습소서울특별시 마포구 망원로 60, 2층 전부 (망원동, 희망주택)입시.검정 및 보습보통교과<NA>보습129<NA>YN4009개원20141111<NA>999912312014111120231018
4524양천구교습소23356수학하는사람들수학교습소서울특별시 양천구 목동서로 349외1필지 센트럴프라자 1402호 (신정동)입시.검정 및 보습보통교과초등수학1보습248초등수학1:300000, 중등수학:380000, 초등수학2:330000, 고등수학:420000YN158885개원20090304<NA>999912312009030420231018
10965영등포구학원3000025607여의도영재음악학원서울특별시 영등포구 국제금융로 78, 909호,홍우빌딩 (여의도동)예능(대)예능(중)음악,음악7043<NA>YN7333개원20160212<NA>999912312016021220231018
4045강북구학원20981NEW아이비학원서울특별시 강북구 삼양로 231, 3층 (미아동)입시.검정 및 보습보통교과보습,보습16080<NA>YN1181개원20080616<NA>999912312008061620231018
2808강동구학원18379종로엠스쿨암사학원서울특별시 강동구 고덕로 140, 4층일부 (암사동)입시.검정 및 보습보통교과보습,보습12060<NA>YN134050개원20071001<NA>999912312007100120231018
5415강서구학원272강서더배움학원서울특별시 강서구 강서로45라길 55, 2층 (내발산동,미라어린이집)입시.검정 및 보습보통교과보습,보습3034<NA>YN157835개원20020829<NA>999912312004101520231018
4806노원구교습소24721바흐피아노음악교습소서울특별시 노원구 덕릉로 669302호 (중계동, 세양빌딩)예능(대)예능(중)바이엘음악275<NA>YN1699개원20090723<NA>999912312009072320231018
5251강동구학원26569뉴이엠영어보습학원서울특별시 강동구 고덕로 140, 403, 404호 (암사동, 프라이어팰리스)입시.검정 및 보습보통교과보습,보습5467<NA>YN5256개원20091119<NA>999912312009111920231018
683송파구학원1000029164유박스독서실서울특별시 송파구 동남로 202, (6층601,602호) (가락동)독서실독서독서실(유아/초?중?고),독서실(유아/초?중?고)84092<NA>YN138812개원19990427<NA>999912311999042720231018
행정구역명학원/교습소학원지정번호학원명도로명주소도로명상세주소분야명교습계열명교습과정목록명교습과정명정원합계일시수용능력인원합계인당수강료내용수강료공개여부기숙사학원여부도로명우편번호등록상태명등록일자휴원시작일자휴원종료일자개설일자적재일시
11744중랑구교습소3000027446올컴컴퓨터교습소서울특별시 중랑구 면목로91길 19-11, 2층 (상봉동)기타(대)기타(중)<NA>기타(소)124자격증과정Ⅰ:200001, 자격증과정Ⅱ:250000, 퓨터활용과정(고급):300000, 컴퓨터활용과정(기초):150000YN2138개원20161110<NA>999912312016111020231018
16935송파구학원3000036742잇올스파르타프리미엄관리형독서실서울특별시 송파구 백제고분로 446, 4층 일부 (방이동, 송암빌딩)독서실독서독서실,독서실(유아/초?중?고)독서실(유아/초?중?고)550110<NA>YN5641개원20200420<NA>999912312020042020240225
2446동작구학원16539밤비니뮤직스튜디오학원서울특별시 동작구 동작대로29길 69, 307호, 308호일부예능(대)예능(중)음악,음악1749<NA>YN6999개원2007021220140929201410122007021220231018
20467강동구교습소3000041494놀작후(逅)아트미술교습소서울특별시 강동구 고덕로 353, 107호 일부 (고덕동, 고덕그라시움(제1상가))예능(대)예능(중)미술(주1회,회175분)미술366미술(주1회,회175분):120000, 미술(주1회,회190분):130000, 미술(주1회,회205분):140000, 미술(주1회,회220분):150000, 미술(주2회,회140분):190000, 미술(주2회,회145분):200000YN5224개원20220523<NA>999912312022052320231018
13995송파구교습소3000031976이루다사고력논술교습소서울특별시 송파구 올림픽로 119, 5층 5B01호 (잠실동, 잠실파인애플상가)입시.검정 및 보습보통교과<NA>보습?논술55<NA>YN5501개원20180612<NA>999912312018061220231018
8882강북구학원3000019962엠엘(ML)댄스스포츠학원서울특별시 강북구 노해로27길 3, 3층 (수유동)기예(대)기예(중)댄스,댄스15050<NA>YN142879개원20140409<NA>999912312014040920231018
25380송파구교습소5631새소리음악교습소서울특별시 송파구 마천로 193, 1층일부 (오금동)예능(대)예능(중)피아노(주4회,회60분)음악105<NA>YN138858개원20040419<NA>999912312004041920231018
7346강동구학원3000014930천호한가람보습학원서울특별시 강동구 성안로 201, 3층 (천호동)입시.검정 및 보습보통교과보습,보습6050<NA>YN134864개원20120801<NA>999912312012080120231018
4105도봉구교습소21281신창해법영어교습소서울특별시 도봉구 덕릉로60길 88, 2층 (창동)입시.검정 및 보습보통교과초등영어보습199초등영어:170000, 초등문법포함(고):180000, 중등영어:210000YN1479개원20080714<NA>999912312008071420231018
1835강북구학원13211영춘학원서울특별시 강북구 솔샘로 2123층 (미아동)입시.검정 및 보습보통교과보습,보습13055<NA>YN142100개원20060310<NA>999912312006031020231018