Overview

Dataset statistics

Number of variables20
Number of observations10000
Missing cells70288
Missing cells (%)35.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 MiB
Average record size in memory178.0 B

Variable types

Numeric8
Categorical5
Text5
Unsupported2

Dataset

Description인덱스,연도,지역구,시설군,건물명,주소1,주소2,우편번호1,우편번호2,도로명주소1,도로명주소2,연면적,준공일자,부서명,상주인원수,일일사용자수,침상수,최근교육이수일,교육종류코드,교육종류코드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15380/S/1/datasetView.do

Alerts

교육종류코드 is highly overall correlated with 우편번호1 and 2 other fieldsHigh correlation
교육종류코드.1 is highly overall correlated with 우편번호1 and 2 other fieldsHigh correlation
인덱스 is highly overall correlated with 연도 and 3 other fieldsHigh correlation
연도 is highly overall correlated with 인덱스 and 3 other fieldsHigh correlation
우편번호1 is highly overall correlated with 인덱스 and 6 other fieldsHigh correlation
우편번호2 is highly overall correlated with 인덱스 and 5 other fieldsHigh correlation
상주인원수 is highly overall correlated with 일일사용자수 and 1 other fieldsHigh correlation
일일사용자수 is highly overall correlated with 상주인원수 and 1 other fieldsHigh correlation
침상수 is highly overall correlated with 연도 and 1 other fieldsHigh correlation
지역구 is highly overall correlated with 우편번호1High correlation
시설군 is highly overall correlated with 우편번호1High correlation
부서명 is highly overall correlated with 인덱스 and 4 other fieldsHigh correlation
부서명 is highly imbalanced (93.0%)Imbalance
교육종류코드 is highly imbalanced (70.1%)Imbalance
주소2 has 1552 (15.5%) missing valuesMissing
우편번호1 has 9931 (99.3%) missing valuesMissing
우편번호2 has 9931 (99.3%) missing valuesMissing
도로명주소1 has 10000 (100.0%) missing valuesMissing
도로명주소2 has 10000 (100.0%) missing valuesMissing
연면적 has 3359 (33.6%) missing valuesMissing
준공일자 has 8946 (89.5%) missing valuesMissing
상주인원수 has 3359 (33.6%) missing valuesMissing
일일사용자수 has 3359 (33.6%) missing valuesMissing
침상수 has 3359 (33.6%) missing valuesMissing
최근교육이수일 has 6479 (64.8%) missing valuesMissing
일일사용자수 is highly skewed (γ1 = 37.50086395)Skewed
인덱스 has unique valuesUnique
도로명주소1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
도로명주소2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
상주인원수 has 5425 (54.2%) zerosZeros
일일사용자수 has 5381 (53.8%) zerosZeros
침상수 has 4491 (44.9%) zerosZeros

Reproduction

Analysis started2024-05-18 02:26:51.783388
Analysis finished2024-05-18 02:27:16.537490
Duration24.75 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

인덱스
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean41800.253
Minimum5
Maximum73024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T11:27:16.924524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile2573.85
Q124690.75
median50476.5
Q359039.25
95-th percentile71325.1
Maximum73024
Range73019
Interquartile range (IQR)34348.5

Descriptive statistics

Standard deviation23320.859
Coefficient of variation (CV)0.55791191
Kurtosis-1.1464343
Mean41800.253
Median Absolute Deviation (MAD)17516.5
Skewness-0.52821455
Sum4.1800253 × 108
Variance5.4386245 × 108
MonotonicityNot monotonic
2024-05-18T11:27:17.582034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1774 1
 
< 0.1%
1119 1
 
< 0.1%
46591 1
 
< 0.1%
60849 1
 
< 0.1%
58095 1
 
< 0.1%
3548 1
 
< 0.1%
59084 1
 
< 0.1%
24747 1
 
< 0.1%
68660 1
 
< 0.1%
2510 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
5 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
12 1
< 0.1%
17 1
< 0.1%
20 1
< 0.1%
34 1
< 0.1%
41 1
< 0.1%
50 1
< 0.1%
55 1
< 0.1%
ValueCountFrequency (%)
73024 1
< 0.1%
73019 1
< 0.1%
73018 1
< 0.1%
73017 1
< 0.1%
73012 1
< 0.1%
73007 1
< 0.1%
73006 1
< 0.1%
73003 1
< 0.1%
73002 1
< 0.1%
73000 1
< 0.1%

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2018.5277
Minimum2013
Maximum2024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T11:27:18.275448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2013
5-th percentile2014
Q12015
median2017
Q32023
95-th percentile2024
Maximum2024
Range11
Interquartile range (IQR)8

Descriptive statistics

Standard deviation3.9084729
Coefficient of variation (CV)0.0019362989
Kurtosis-1.5366309
Mean2018.5277
Median Absolute Deviation (MAD)3
Skewness0.24582072
Sum20185277
Variance15.27616
MonotonicityNot monotonic
2024-05-18T11:27:18.775899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
2014 2190
21.9%
2024 1727
17.3%
2023 1631
16.3%
2019 1502
15.0%
2017 1401
14.0%
2015 1397
14.0%
2013 72
 
0.7%
2016 41
 
0.4%
2020 26
 
0.3%
2018 13
 
0.1%
ValueCountFrequency (%)
2013 72
 
0.7%
2014 2190
21.9%
2015 1397
14.0%
2016 41
 
0.4%
2017 1401
14.0%
2018 13
 
0.1%
2019 1502
15.0%
2020 26
 
0.3%
2023 1631
16.3%
2024 1727
17.3%
ValueCountFrequency (%)
2024 1727
17.3%
2023 1631
16.3%
2020 26
 
0.3%
2019 1502
15.0%
2018 13
 
0.1%
2017 1401
14.0%
2016 41
 
0.4%
2015 1397
14.0%
2014 2190
21.9%
2013 72
 
0.7%

지역구
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
강남구
1211 
중구
672 
영등포구
668 
서초구
 
615
송파구
 
567
Other values (21)
6267 

Length

Max length4
Median length3
Mean length3.0545
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광진구
2nd row종로구
3rd row영등포구
4th row중구
5th row동작구

Common Values

ValueCountFrequency (%)
강남구 1211
 
12.1%
중구 672
 
6.7%
영등포구 668
 
6.7%
서초구 615
 
6.2%
송파구 567
 
5.7%
마포구 523
 
5.2%
강서구 519
 
5.2%
구로구 404
 
4.0%
강동구 391
 
3.9%
종로구 383
 
3.8%
Other values (16) 4047
40.5%

Length

2024-05-18T11:27:19.277506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
강남구 1211
 
12.1%
중구 672
 
6.7%
영등포구 668
 
6.7%
서초구 615
 
6.2%
송파구 567
 
5.7%
마포구 523
 
5.2%
강서구 519
 
5.2%
구로구 404
 
4.0%
강동구 391
 
3.9%
종로구 383
 
3.8%
Other values (16) 4047
40.5%

시설군
Categorical

HIGH CORRELATION 

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
실내주차장
3901 
보육시설
1002 
어린이집
774 
의료기관
747 
대규모점포
715 
Other values (19)
2861 

Length

Max length9
Median length5
Mean length4.5461
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row실내주차장
2nd row실내주차장
3rd row지하역사
4th row대규모점포
5th row의료기관

Common Values

ValueCountFrequency (%)
실내주차장 3901
39.0%
보육시설 1002
 
10.0%
어린이집 774
 
7.7%
의료기관 747
 
7.5%
대규모점포 715
 
7.1%
지하역사 607
 
6.1%
PC영업시설 515
 
5.1%
목욕장 433
 
4.3%
학원 323
 
3.2%
산후조리원 239
 
2.4%
Other values (14) 744
 
7.4%

Length

2024-05-18T11:27:19.828560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
실내주차장 3901
39.0%
보육시설 1002
 
10.0%
어린이집 774
 
7.7%
의료기관 747
 
7.5%
대규모점포 715
 
7.1%
지하역사 607
 
6.1%
pc영업시설 515
 
5.1%
목욕장 433
 
4.3%
학원 323
 
3.2%
산후조리원 239
 
2.4%
Other values (14) 744
 
7.4%
Distinct5828
Distinct (%)58.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-18T11:27:20.542884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length30
Mean length8.3735
Min length2

Characters and Unicode

Total characters83735
Distinct characters783
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3223 ?
Unique (%)32.2%

Sample

1st row화양타워
2nd row코리안리빌딩
3rd row영등포시장역 (5호선)
4th row롯데마트서울역점
5th row서울요양병원
ValueCountFrequency (%)
어린이집 132
 
1.0%
pc방 104
 
0.8%
구립 86
 
0.7%
공영주차장 80
 
0.6%
산후조리원 61
 
0.5%
pc 61
 
0.5%
롯데시네마 48
 
0.4%
이마트 43
 
0.3%
cgv 33
 
0.3%
홈플러스 30
 
0.2%
Other values (6263) 11999
94.7%
2024-05-18T11:27:21.742136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2795
 
3.3%
2766
 
3.3%
2024
 
2.4%
1937
 
2.3%
1831
 
2.2%
1769
 
2.1%
1546
 
1.8%
) 1318
 
1.6%
( 1311
 
1.6%
1073
 
1.3%
Other values (773) 65365
78.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 72233
86.3%
Uppercase Letter 3359
 
4.0%
Space Separator 2766
 
3.3%
Decimal Number 1625
 
1.9%
Close Punctuation 1319
 
1.6%
Open Punctuation 1312
 
1.6%
Lowercase Letter 614
 
0.7%
Other Symbol 234
 
0.3%
Other Punctuation 156
 
0.2%
Dash Punctuation 69
 
0.1%
Other values (4) 48
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2795
 
3.9%
2024
 
2.8%
1937
 
2.7%
1831
 
2.5%
1769
 
2.4%
1546
 
2.1%
1073
 
1.5%
949
 
1.3%
919
 
1.3%
885
 
1.2%
Other values (684) 56505
78.2%
Uppercase Letter
ValueCountFrequency (%)
C 672
20.0%
P 552
16.4%
S 258
 
7.7%
G 177
 
5.3%
T 171
 
5.1%
K 162
 
4.8%
E 158
 
4.7%
A 145
 
4.3%
I 127
 
3.8%
L 106
 
3.2%
Other values (15) 831
24.7%
Lowercase Letter
ValueCountFrequency (%)
e 91
14.8%
c 81
13.2%
p 59
 
9.6%
o 40
 
6.5%
r 35
 
5.7%
n 35
 
5.7%
a 34
 
5.5%
s 30
 
4.9%
t 26
 
4.2%
k 24
 
3.9%
Other values (14) 159
25.9%
Other Punctuation
ValueCountFrequency (%)
& 51
32.7%
, 32
20.5%
. 26
16.7%
? 17
 
10.9%
: 15
 
9.6%
/ 5
 
3.2%
4
 
2.6%
% 2
 
1.3%
! 2
 
1.3%
1
 
0.6%
Decimal Number
ValueCountFrequency (%)
2 367
22.6%
1 358
22.0%
3 210
12.9%
5 146
 
9.0%
4 134
 
8.2%
7 95
 
5.8%
9 92
 
5.7%
0 89
 
5.5%
6 82
 
5.0%
8 52
 
3.2%
Math Symbol
ValueCountFrequency (%)
+ 5
41.7%
= 3
25.0%
> 2
 
16.7%
~ 1
 
8.3%
1
 
8.3%
Other Number
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%
Close Punctuation
ValueCountFrequency (%)
) 1318
99.9%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1311
99.9%
[ 1
 
0.1%
Letter Number
ValueCountFrequency (%)
10
90.9%
1
 
9.1%
Space Separator
ValueCountFrequency (%)
2766
100.0%
Other Symbol
ValueCountFrequency (%)
234
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 69
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 72462
86.5%
Common 7284
 
8.7%
Latin 3984
 
4.8%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2795
 
3.9%
2024
 
2.8%
1937
 
2.7%
1831
 
2.5%
1769
 
2.4%
1546
 
2.1%
1073
 
1.5%
949
 
1.3%
919
 
1.3%
885
 
1.2%
Other values (683) 56734
78.3%
Latin
ValueCountFrequency (%)
C 672
16.9%
P 552
 
13.9%
S 258
 
6.5%
G 177
 
4.4%
T 171
 
4.3%
K 162
 
4.1%
E 158
 
4.0%
A 145
 
3.6%
I 127
 
3.2%
L 106
 
2.7%
Other values (41) 1456
36.5%
Common
ValueCountFrequency (%)
2766
38.0%
) 1318
18.1%
( 1311
18.0%
2 367
 
5.0%
1 358
 
4.9%
3 210
 
2.9%
5 146
 
2.0%
4 134
 
1.8%
7 95
 
1.3%
9 92
 
1.3%
Other values (27) 487
 
6.7%
Han
ValueCountFrequency (%)
4
80.0%
1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 72228
86.3%
ASCII 11245
 
13.4%
None 239
 
0.3%
Number Forms 11
 
< 0.1%
Enclosed Alphanum 6
 
< 0.1%
CJK 5
 
< 0.1%
Arrows 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2795
 
3.9%
2024
 
2.8%
1937
 
2.7%
1831
 
2.5%
1769
 
2.4%
1546
 
2.1%
1073
 
1.5%
949
 
1.3%
919
 
1.3%
885
 
1.2%
Other values (682) 56500
78.2%
ASCII
ValueCountFrequency (%)
2766
24.6%
) 1318
11.7%
( 1311
11.7%
C 672
 
6.0%
P 552
 
4.9%
2 367
 
3.3%
1 358
 
3.2%
S 258
 
2.3%
3 210
 
1.9%
G 177
 
1.6%
Other values (69) 3256
29.0%
None
ValueCountFrequency (%)
234
97.9%
4
 
1.7%
1
 
0.4%
Number Forms
ValueCountFrequency (%)
10
90.9%
1
 
9.1%
CJK
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
Enclosed Alphanum
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%
Arrows
ValueCountFrequency (%)
1
100.0%
Distinct1519
Distinct (%)15.2%
Missing13
Missing (%)0.1%
Memory size156.2 KiB
2024-05-18T11:27:22.442214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length7
Mean length8.8032442
Min length3

Characters and Unicode

Total characters87918
Distinct characters404
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1437 ?
Unique (%)14.4%

Sample

1st row서울시 광진구
2nd row서울시 종로구
3rd row서울시 영등포구
4th row서울시 중구
5th row서울시 동작구
ValueCountFrequency (%)
서울시 7295
32.0%
서울 1217
 
5.3%
강남구 1211
 
5.3%
중구 672
 
3.0%
서초구 610
 
2.7%
서울특별시 606
 
2.7%
영등포구 565
 
2.5%
송파구 564
 
2.5%
마포구 529
 
2.3%
강서구 429
 
1.9%
Other values (2108) 9079
39.9%
2024-05-18T11:27:23.686634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12948
14.7%
10533
 
12.0%
9892
 
11.3%
9149
 
10.4%
7977
 
9.1%
2324
 
2.6%
2298
 
2.6%
2210
 
2.5%
1279
 
1.5%
1204
 
1.4%
Other values (394) 28104
32.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 66794
76.0%
Space Separator 12948
 
14.7%
Decimal Number 5579
 
6.3%
Close Punctuation 924
 
1.1%
Open Punctuation 923
 
1.0%
Other Punctuation 470
 
0.5%
Dash Punctuation 194
 
0.2%
Math Symbol 49
 
0.1%
Uppercase Letter 35
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10533
15.8%
9892
14.8%
9149
13.7%
7977
11.9%
2324
 
3.5%
2298
 
3.4%
2210
 
3.3%
1279
 
1.9%
1204
 
1.8%
927
 
1.4%
Other values (356) 19001
28.4%
Uppercase Letter
ValueCountFrequency (%)
B 16
45.7%
A 3
 
8.6%
K 2
 
5.7%
S 2
 
5.7%
D 2
 
5.7%
J 2
 
5.7%
M 2
 
5.7%
H 1
 
2.9%
P 1
 
2.9%
F 1
 
2.9%
Other values (3) 3
 
8.6%
Decimal Number
ValueCountFrequency (%)
1 1180
21.2%
2 782
14.0%
3 666
11.9%
4 533
9.6%
5 512
9.2%
6 442
 
7.9%
0 424
 
7.6%
7 379
 
6.8%
8 353
 
6.3%
9 308
 
5.5%
Other Punctuation
ValueCountFrequency (%)
, 448
95.3%
. 10
 
2.1%
8
 
1.7%
? 2
 
0.4%
: 1
 
0.2%
/ 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 917
99.2%
] 7
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 916
99.2%
[ 7
 
0.8%
Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
k 1
50.0%
Space Separator
ValueCountFrequency (%)
12948
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 194
100.0%
Math Symbol
ValueCountFrequency (%)
~ 49
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 66794
76.0%
Common 21087
 
24.0%
Latin 37
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10533
15.8%
9892
14.8%
9149
13.7%
7977
11.9%
2324
 
3.5%
2298
 
3.4%
2210
 
3.3%
1279
 
1.9%
1204
 
1.8%
927
 
1.4%
Other values (356) 19001
28.4%
Common
ValueCountFrequency (%)
12948
61.4%
1 1180
 
5.6%
) 917
 
4.3%
( 916
 
4.3%
2 782
 
3.7%
3 666
 
3.2%
4 533
 
2.5%
5 512
 
2.4%
, 448
 
2.1%
6 442
 
2.1%
Other values (13) 1743
 
8.3%
Latin
ValueCountFrequency (%)
B 16
43.2%
A 3
 
8.1%
K 2
 
5.4%
S 2
 
5.4%
D 2
 
5.4%
J 2
 
5.4%
M 2
 
5.4%
H 1
 
2.7%
P 1
 
2.7%
F 1
 
2.7%
Other values (5) 5
 
13.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 66794
76.0%
ASCII 21116
 
24.0%
None 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12948
61.3%
1 1180
 
5.6%
) 917
 
4.3%
( 916
 
4.3%
2 782
 
3.7%
3 666
 
3.2%
4 533
 
2.5%
5 512
 
2.4%
, 448
 
2.1%
6 442
 
2.1%
Other values (27) 1772
 
8.4%
Hangul
ValueCountFrequency (%)
10533
15.8%
9892
14.8%
9149
13.7%
7977
11.9%
2324
 
3.5%
2298
 
3.4%
2210
 
3.3%
1279
 
1.9%
1204
 
1.8%
927
 
1.4%
Other values (356) 19001
28.4%
None
ValueCountFrequency (%)
8
100.0%

주소2
Text

MISSING 

Distinct7124
Distinct (%)84.3%
Missing1552
Missing (%)15.5%
Memory size156.2 KiB
2024-05-18T11:27:24.604773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length48
Mean length14.332031
Min length3

Characters and Unicode

Total characters121077
Distinct characters543
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6059 ?
Unique (%)71.7%

Sample

1st row화양동 110-37
2nd row수송동 80
3rd row양산로 지하200 (영등포동5가)
4th row 봉래동 2가 122-11
5th row상도4동 255-4
ValueCountFrequency (%)
영등포구 174
 
0.7%
여의도동 174
 
0.7%
강남구 174
 
0.7%
지하 161
 
0.7%
테헤란로 131
 
0.6%
지하1층 127
 
0.5%
남부순환로 122
 
0.5%
도봉로 114
 
0.5%
강남대로 97
 
0.4%
서초동 97
 
0.4%
Other values (6544) 21944
94.1%
2024-05-18T11:27:26.142828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17216
 
14.2%
7265
 
6.0%
1 6903
 
5.7%
6265
 
5.2%
2 4830
 
4.0%
( 4371
 
3.6%
) 4368
 
3.6%
3 3809
 
3.1%
4 3074
 
2.5%
5 2957
 
2.4%
Other values (533) 60019
49.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 57613
47.6%
Decimal Number 33151
27.4%
Space Separator 17216
 
14.2%
Open Punctuation 4418
 
3.6%
Close Punctuation 4415
 
3.6%
Other Punctuation 1985
 
1.6%
Dash Punctuation 1812
 
1.5%
Uppercase Letter 235
 
0.2%
Math Symbol 214
 
0.2%
Lowercase Letter 14
 
< 0.1%
Other values (2) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7265
 
12.6%
6265
 
10.9%
2653
 
4.6%
1438
 
2.5%
1261
 
2.2%
1236
 
2.1%
1159
 
2.0%
1132
 
2.0%
840
 
1.5%
750
 
1.3%
Other values (472) 33614
58.3%
Uppercase Letter
ValueCountFrequency (%)
B 73
31.1%
S 25
 
10.6%
C 19
 
8.1%
A 11
 
4.7%
D 11
 
4.7%
T 10
 
4.3%
L 9
 
3.8%
K 9
 
3.8%
I 7
 
3.0%
G 6
 
2.6%
Other values (14) 55
23.4%
Lowercase Letter
ValueCountFrequency (%)
c 2
14.3%
k 2
14.3%
s 2
14.3%
v 1
7.1%
n 1
7.1%
b 1
7.1%
w 1
7.1%
p 1
7.1%
r 1
7.1%
e 1
7.1%
Decimal Number
ValueCountFrequency (%)
1 6903
20.8%
2 4830
14.6%
3 3809
11.5%
4 3074
9.3%
5 2957
8.9%
6 2638
 
8.0%
0 2510
 
7.6%
7 2393
 
7.2%
8 2076
 
6.3%
9 1961
 
5.9%
Other Punctuation
ValueCountFrequency (%)
, 1894
95.4%
. 44
 
2.2%
? 22
 
1.1%
12
 
0.6%
: 7
 
0.4%
/ 6
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 4371
98.9%
[ 47
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 4368
98.9%
] 47
 
1.1%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
17216
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1812
100.0%
Math Symbol
ValueCountFrequency (%)
~ 214
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 63211
52.2%
Hangul 57614
47.6%
Latin 252
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7265
 
12.6%
6265
 
10.9%
2653
 
4.6%
1438
 
2.5%
1261
 
2.2%
1236
 
2.1%
1159
 
2.0%
1132
 
2.0%
840
 
1.5%
750
 
1.3%
Other values (473) 33615
58.3%
Latin
ValueCountFrequency (%)
B 73
29.0%
S 25
 
9.9%
C 19
 
7.5%
A 11
 
4.4%
D 11
 
4.4%
T 10
 
4.0%
L 9
 
3.6%
K 9
 
3.6%
I 7
 
2.8%
G 6
 
2.4%
Other values (27) 72
28.6%
Common
ValueCountFrequency (%)
17216
27.2%
1 6903
10.9%
2 4830
 
7.6%
( 4371
 
6.9%
) 4368
 
6.9%
3 3809
 
6.0%
4 3074
 
4.9%
5 2957
 
4.7%
6 2638
 
4.2%
0 2510
 
4.0%
Other values (13) 10535
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 63448
52.4%
Hangul 57610
47.6%
None 13
 
< 0.1%
Compat Jamo 3
 
< 0.1%
Number Forms 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17216
27.1%
1 6903
10.9%
2 4830
 
7.6%
( 4371
 
6.9%
) 4368
 
6.9%
3 3809
 
6.0%
4 3074
 
4.8%
5 2957
 
4.7%
6 2638
 
4.2%
0 2510
 
4.0%
Other values (47) 10772
17.0%
Hangul
ValueCountFrequency (%)
7265
 
12.6%
6265
 
10.9%
2653
 
4.6%
1438
 
2.5%
1261
 
2.2%
1236
 
2.1%
1159
 
2.0%
1132
 
2.0%
840
 
1.5%
750
 
1.3%
Other values (470) 33611
58.3%
None
ValueCountFrequency (%)
12
92.3%
1
 
7.7%
Compat Jamo
ValueCountFrequency (%)
2
66.7%
1
33.3%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%

우편번호1
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct9
Distinct (%)13.0%
Missing9931
Missing (%)99.3%
Infinite0
Infinite (%)0.0%
Mean103.91304
Minimum2
Maximum156
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T11:27:26.517637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile11
Q111
median151
Q3151
95-th percentile156
Maximum156
Range154
Interquartile range (IQR)140

Descriptive statistics

Standard deviation63.24991
Coefficient of variation (CV)0.60868114
Kurtosis-1.3308725
Mean103.91304
Median Absolute Deviation (MAD)5
Skewness-0.77216362
Sum7170
Variance4000.5512
MonotonicityNot monotonic
2024-05-18T11:27:26.936625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
151 28
 
0.3%
11 20
 
0.2%
156 7
 
0.1%
121 7
 
0.1%
152 2
 
< 0.1%
111 2
 
< 0.1%
2 1
 
< 0.1%
132 1
 
< 0.1%
123 1
 
< 0.1%
(Missing) 9931
99.3%
ValueCountFrequency (%)
2 1
 
< 0.1%
11 20
0.2%
111 2
 
< 0.1%
121 7
 
0.1%
123 1
 
< 0.1%
132 1
 
< 0.1%
151 28
0.3%
152 2
 
< 0.1%
156 7
 
0.1%
ValueCountFrequency (%)
156 7
 
0.1%
152 2
 
< 0.1%
151 28
0.3%
132 1
 
< 0.1%
123 1
 
< 0.1%
121 7
 
0.1%
111 2
 
< 0.1%
11 20
0.2%
2 1
 
< 0.1%

우편번호2
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct41
Distinct (%)59.4%
Missing9931
Missing (%)99.3%
Infinite0
Infinite (%)0.0%
Mean561.76812
Minimum15
Maximum907
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T11:27:27.320466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile111
Q1111
median807
Q3829
95-th percentile895
Maximum907
Range892
Interquartile range (IQR)718

Descriptive statistics

Standard deviation354.71493
Coefficient of variation (CV)0.63142588
Kurtosis-1.7355962
Mean561.76812
Median Absolute Deviation (MAD)83
Skewness-0.50060228
Sum38762
Variance125822.68
MonotonicityNot monotonic
2024-05-18T11:27:27.785058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
111 21
 
0.2%
807 3
 
< 0.1%
829 3
 
< 0.1%
809 2
 
< 0.1%
895 2
 
< 0.1%
890 2
 
< 0.1%
826 2
 
< 0.1%
801 1
 
< 0.1%
899 1
 
< 0.1%
888 1
 
< 0.1%
Other values (31) 31
 
0.3%
(Missing) 9931
99.3%
ValueCountFrequency (%)
15 1
 
< 0.1%
21 1
 
< 0.1%
111 21
0.2%
123 1
 
< 0.1%
212 1
 
< 0.1%
222 1
 
< 0.1%
701 1
 
< 0.1%
725 1
 
< 0.1%
742 1
 
< 0.1%
783 1
 
< 0.1%
ValueCountFrequency (%)
907 1
< 0.1%
904 1
< 0.1%
899 1
< 0.1%
895 2
< 0.1%
893 1
< 0.1%
890 2
< 0.1%
888 1
< 0.1%
883 1
< 0.1%
882 1
< 0.1%
872 1
< 0.1%

도로명주소1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

도로명주소2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

연면적
Real number (ℝ)

MISSING 

Distinct3942
Distinct (%)59.4%
Missing3359
Missing (%)33.6%
Infinite0
Infinite (%)0.0%
Mean8601.7775
Minimum0
Maximum604089.5
Zeros88
Zeros (%)0.9%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T11:27:28.256740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile359.56
Q1771.33
median2887.07
Q37793
95-th percentile28900
Maximum604089.5
Range604089.5
Interquartile range (IQR)7021.67

Descriptive statistics

Standard deviation25245.342
Coefficient of variation (CV)2.9348983
Kurtosis161.10838
Mean8601.7775
Median Absolute Deviation (MAD)2368.93
Skewness10.678689
Sum57124404
Variance6.3732732 × 108
MonotonicityNot monotonic
2024-05-18T11:27:28.734963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2251.3 171
 
1.7%
0.0 88
 
0.9%
353.88 86
 
0.9%
11043.4 55
 
0.5%
2700.0 53
 
0.5%
606.0 52
 
0.5%
499.0 40
 
0.4%
9345.69 39
 
0.4%
14133.0 27
 
0.3%
579.0 27
 
0.3%
Other values (3932) 6003
60.0%
(Missing) 3359
33.6%
ValueCountFrequency (%)
0.0 88
0.9%
4.358 4
 
< 0.1%
130.0 2
 
< 0.1%
189.0 5
 
0.1%
194.0 2
 
< 0.1%
283.0 1
 
< 0.1%
300.0 2
 
< 0.1%
300.99 1
 
< 0.1%
301.0 1
 
< 0.1%
301.2 1
 
< 0.1%
ValueCountFrequency (%)
604089.5 1
< 0.1%
521000.0 1
< 0.1%
439000.0 1
< 0.1%
438913.3 1
< 0.1%
426635.0 2
< 0.1%
418000.0 1
< 0.1%
341000.0 1
< 0.1%
315000.0 1
< 0.1%
305934.0 1
< 0.1%
283843.2 1
< 0.1%

준공일자
Text

MISSING 

Distinct936
Distinct (%)88.8%
Missing8946
Missing (%)89.5%
Memory size156.2 KiB
2024-05-18T11:27:29.466483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length9.1413662
Min length4

Characters and Unicode

Total characters9635
Distinct characters14
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique853 ?
Unique (%)80.9%

Sample

1st row1999-06-28
2nd row2007-11-16
3rd row1991-12-1
4th row1995-12-11
5th row1994-10-20
ValueCountFrequency (%)
2000-12-15 8
 
0.8%
2009-7-22 7
 
0.7%
2013 6
 
0.6%
1905-7-2 6
 
0.6%
1905-6-29 5
 
0.5%
1996-11-23 4
 
0.4%
1984-3-31 4
 
0.4%
2009-05-31 4
 
0.4%
2008-7-18 3
 
0.3%
2012-12-15 3
 
0.3%
Other values (925) 1004
95.3%
2024-05-18T11:27:30.751925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 2078
21.6%
1 1712
17.8%
0 1625
16.9%
2 1307
13.6%
9 986
10.2%
8 388
 
4.0%
3 362
 
3.8%
7 332
 
3.4%
5 306
 
3.2%
6 283
 
2.9%
Other values (4) 256
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7539
78.2%
Dash Punctuation 2078
 
21.6%
Other Punctuation 16
 
0.2%
Space Separator 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1712
22.7%
0 1625
21.6%
2 1307
17.3%
9 986
13.1%
8 388
 
5.1%
3 362
 
4.8%
7 332
 
4.4%
5 306
 
4.1%
6 283
 
3.8%
4 238
 
3.2%
Other Punctuation
ValueCountFrequency (%)
. 15
93.8%
, 1
 
6.2%
Dash Punctuation
ValueCountFrequency (%)
- 2078
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9635
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 2078
21.6%
1 1712
17.8%
0 1625
16.9%
2 1307
13.6%
9 986
10.2%
8 388
 
4.0%
3 362
 
3.8%
7 332
 
3.4%
5 306
 
3.2%
6 283
 
2.9%
Other values (4) 256
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9635
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 2078
21.6%
1 1712
17.8%
0 1625
16.9%
2 1307
13.6%
9 986
10.2%
8 388
 
4.0%
3 362
 
3.8%
7 332
 
3.4%
5 306
 
3.2%
6 283
 
2.9%
Other values (4) 256
 
2.7%

부서명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9747 
new
 
235
관리부서
 
13
O
 
3
 
1

Length

Max length6
Median length4
Mean length3.9755
Min length1

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9747
97.5%
new 235
 
2.4%
관리부서 13
 
0.1%
O 3
 
< 0.1%
1
 
< 0.1%
제노피플관리 1
 
< 0.1%

Length

2024-05-18T11:27:31.147632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-18T11:27:31.530507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9747
97.5%
new 235
 
2.4%
관리부서 13
 
0.1%
o 3
 
< 0.1%
1
 
< 0.1%
제노피플관리 1
 
< 0.1%

상주인원수
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct68
Distinct (%)1.0%
Missing3359
Missing (%)33.6%
Infinite0
Infinite (%)0.0%
Mean4.1584099
Minimum0
Maximum850
Zeros5425
Zeros (%)54.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T11:27:31.907751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile12
Maximum850
Range850
Interquartile range (IQR)0

Descriptive statistics

Standard deviation37.505296
Coefficient of variation (CV)9.0191435
Kurtosis277.38464
Mean4.1584099
Median Absolute Deviation (MAD)0
Skewness15.797117
Sum27616
Variance1406.6472
MonotonicityNot monotonic
2024-05-18T11:27:32.399725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 5425
54.2%
3 744
 
7.4%
12 311
 
3.1%
80 27
 
0.3%
10 8
 
0.1%
20 6
 
0.1%
30 5
 
0.1%
500 5
 
0.1%
6 4
 
< 0.1%
50 4
 
< 0.1%
Other values (58) 102
 
1.0%
(Missing) 3359
33.6%
ValueCountFrequency (%)
0 5425
54.2%
1 2
 
< 0.1%
2 3
 
< 0.1%
3 744
 
7.4%
4 3
 
< 0.1%
5 4
 
< 0.1%
6 4
 
< 0.1%
7 1
 
< 0.1%
8 2
 
< 0.1%
9 2
 
< 0.1%
ValueCountFrequency (%)
850 1
 
< 0.1%
800 2
 
< 0.1%
780 1
 
< 0.1%
755 1
 
< 0.1%
700 2
 
< 0.1%
660 2
 
< 0.1%
600 2
 
< 0.1%
508 2
 
< 0.1%
500 5
0.1%
400 3
< 0.1%

일일사용자수
Real number (ℝ)

HIGH CORRELATION  MISSING  SKEWED  ZEROS 

Distinct71
Distinct (%)1.1%
Missing3359
Missing (%)33.6%
Infinite0
Infinite (%)0.0%
Mean59.167896
Minimum0
Maximum27979
Zeros5381
Zeros (%)53.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T11:27:32.948125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile150
Maximum27979
Range27979
Interquartile range (IQR)0

Descriptive statistics

Standard deviation582.11288
Coefficient of variation (CV)9.838323
Kurtosis1588.4473
Mean59.167896
Median Absolute Deviation (MAD)0
Skewness37.500864
Sum392934
Variance338855.4
MonotonicityNot monotonic
2024-05-18T11:27:33.370501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 5381
53.8%
150 531
 
5.3%
45 321
 
3.2%
700 198
 
2.0%
100 41
 
0.4%
200 16
 
0.2%
250 14
 
0.1%
800 12
 
0.1%
500 11
 
0.1%
300 10
 
0.1%
Other values (61) 106
 
1.1%
(Missing) 3359
33.6%
ValueCountFrequency (%)
0 5381
53.8%
15 1
 
< 0.1%
19 1
 
< 0.1%
20 2
 
< 0.1%
25 1
 
< 0.1%
30 6
 
0.1%
40 2
 
< 0.1%
44 1
 
< 0.1%
45 321
 
3.2%
48 2
 
< 0.1%
ValueCountFrequency (%)
27979 1
 
< 0.1%
25703 1
 
< 0.1%
18442 1
 
< 0.1%
15332 1
 
< 0.1%
6500 1
 
< 0.1%
5000 3
 
< 0.1%
2000 1
 
< 0.1%
1000 4
 
< 0.1%
850 3
 
< 0.1%
800 12
0.1%

침상수
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct63
Distinct (%)0.9%
Missing3359
Missing (%)33.6%
Infinite0
Infinite (%)0.0%
Mean71.128143
Minimum0
Maximum2091
Zeros4491
Zeros (%)44.9%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-18T11:27:33.955677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q3106
95-th percentile406
Maximum2091
Range2091
Interquartile range (IQR)106

Descriptive statistics

Standard deviation138.17978
Coefficient of variation (CV)1.9426879
Kurtosis17.695505
Mean71.128143
Median Absolute Deviation (MAD)0
Skewness3.1298053
Sum472362
Variance19093.652
MonotonicityNot monotonic
2024-05-18T11:27:34.404000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 4491
44.9%
171 348
 
3.5%
156 244
 
2.4%
103 234
 
2.3%
178 175
 
1.8%
636 165
 
1.7%
406 149
 
1.5%
106 142
 
1.4%
190 116
 
1.2%
367 100
 
1.0%
Other values (53) 477
 
4.8%
(Missing) 3359
33.6%
ValueCountFrequency (%)
0 4491
44.9%
30 2
 
< 0.1%
31 6
 
0.1%
35 56
 
0.6%
48 1
 
< 0.1%
60 1
 
< 0.1%
65 2
 
< 0.1%
79 3
 
< 0.1%
80 2
 
< 0.1%
85 2
 
< 0.1%
ValueCountFrequency (%)
2091 1
 
< 0.1%
1965 1
 
< 0.1%
684 1
 
< 0.1%
636 165
1.7%
500 1
 
< 0.1%
477 20
 
0.2%
406 149
1.5%
367 100
1.0%
305 1
 
< 0.1%
297 44
 
0.4%

최근교육이수일
Text

MISSING 

Distinct463
Distinct (%)13.1%
Missing6479
Missing (%)64.8%
Memory size156.2 KiB
2024-05-18T11:27:35.925566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.6836126
Min length1

Characters and Unicode

Total characters34096
Distinct characters24
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique174 ?
Unique (%)4.9%

Sample

1st row2013-04-04
2nd row2018-10-17
3rd row2014-05-30
4th row2015-12-02
5th row2016-05-25
ValueCountFrequency (%)
2016-07-20 68
 
1.9%
2016-10-26 65
 
1.8%
2016-11-23 62
 
1.8%
2016-11-15 61
 
1.7%
2012-11-14 52
 
1.5%
2015-12-02 49
 
1.4%
2015-05-28 46
 
1.3%
2016-08-24 45
 
1.3%
2014-11-21 44
 
1.2%
2015-11-27 43
 
1.2%
Other values (452) 2988
84.8%
2024-05-18T11:27:38.238647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 7290
21.4%
0 6364
18.7%
- 6202
18.2%
2 5915
17.3%
6 1292
 
3.8%
4 1279
 
3.8%
5 1257
 
3.7%
7 1104
 
3.2%
3 1001
 
2.9%
8 942
 
2.8%
Other values (14) 1450
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 27032
79.3%
Dash Punctuation 6202
 
18.2%
Other Punctuation 818
 
2.4%
Space Separator 27
 
0.1%
Other Letter 17
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 7290
27.0%
0 6364
23.5%
2 5915
21.9%
6 1292
 
4.8%
4 1279
 
4.7%
5 1257
 
4.7%
7 1104
 
4.1%
3 1001
 
3.7%
8 942
 
3.5%
9 588
 
2.2%
Other Letter
ValueCountFrequency (%)
3
17.6%
2
11.8%
2
11.8%
2
11.8%
2
11.8%
2
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other Punctuation
ValueCountFrequency (%)
. 788
96.3%
/ 30
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 6202
100.0%
Space Separator
ValueCountFrequency (%)
27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 34079
> 99.9%
Hangul 17
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 7290
21.4%
0 6364
18.7%
- 6202
18.2%
2 5915
17.4%
6 1292
 
3.8%
4 1279
 
3.8%
5 1257
 
3.7%
7 1104
 
3.2%
3 1001
 
2.9%
8 942
 
2.8%
Other values (4) 1433
 
4.2%
Hangul
ValueCountFrequency (%)
3
17.6%
2
11.8%
2
11.8%
2
11.8%
2
11.8%
2
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 34079
> 99.9%
Hangul 17
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 7290
21.4%
0 6364
18.7%
- 6202
18.2%
2 5915
17.4%
6 1292
 
3.8%
4 1279
 
3.8%
5 1257
 
3.7%
7 1104
 
3.2%
3 1001
 
2.9%
8 942
 
2.8%
Other values (4) 1433
 
4.2%
Hangul
ValueCountFrequency (%)
3
17.6%
2
11.8%
2
11.8%
2
11.8%
2
11.8%
2
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%

교육종류코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct15
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
7279 
보수
1987 
신규
 
575
실내공기질관리자교육
 
43
보수교육
 
41
Other values (10)
 
75

Length

Max length10
Median length4
Mean length3.5328
Min length1

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row보수
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 7279
72.8%
보수 1987
 
19.9%
신규 575
 
5.8%
실내공기질관리자교육 43
 
0.4%
보수교육 41
 
0.4%
환경보전협회사이버 41
 
0.4%
교육면제 9
 
0.1%
신규교육 8
 
0.1%
- 7
 
0.1%
사이버 3
 
< 0.1%
Other values (5) 7
 
0.1%

Length

2024-05-18T11:27:38.828280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 7279
72.8%
보수 1987
 
19.9%
신규 576
 
5.8%
실내공기질관리자교육 43
 
0.4%
보수교육 41
 
0.4%
환경보전협회사이버 41
 
0.4%
교육면제 9
 
0.1%
신규교육 8
 
0.1%
7
 
0.1%
사이버 3
 
< 0.1%
Other values (5) 7
 
0.1%

교육종류코드.1
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
7438 
보수
1987 
신규
 
575

Length

Max length4
Median length4
Mean length3.4876
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보수
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 7438
74.4%
보수 1987
 
19.9%
신규 575
 
5.8%

Length

2024-05-18T11:27:39.459629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-18T11:27:39.968812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 7438
74.4%
보수 1987
 
19.9%
신규 575
 
5.8%

Interactions

2024-05-18T11:27:11.816992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:26:56.749939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:26:58.870515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:01.364798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:04.055511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:05.692106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:07.599442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:09.697721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:12.139086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:26:57.043014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:26:59.140911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:01.622820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:04.259424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:05.955675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:07.850186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:09.986275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:12.442862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:26:57.309652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:26:59.451149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:01.935576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:04.460063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:06.234840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:08.110244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:10.254312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:12.738395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:26:57.562255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:26:59.807707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:02.516654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:04.662508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:06.491074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:08.369089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:10.519692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:13.019590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:26:57.787345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:00.034601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:02.863923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:04.790592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:06.635526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:08.710753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:10.754119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:13.299778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:26:58.050306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:00.324213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:03.160748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:05.004353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:06.811722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:08.933774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:11.037987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:13.560100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:26:58.302448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:00.599126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:03.533313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:05.248077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:07.062424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:09.177533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:11.291006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:13.847052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:26:58.619709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:01.011489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:03.782472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:05.482133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:07.325612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:09.494529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T11:27:11.556024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-18T11:27:40.204624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인덱스연도지역구시설군우편번호1우편번호2연면적부서명상주인원수일일사용자수침상수교육종류코드교육종류코드.1
인덱스1.0000.8840.3090.4060.6640.5850.0340.6430.1640.0000.3300.3700.215
연도0.8841.0000.3050.3930.7120.8620.0000.6320.1000.0000.2950.3290.155
지역구0.3090.3051.0000.4050.8510.6060.1870.5710.1900.1730.6630.7650.450
시설군0.4060.3930.4051.0000.8090.7090.3690.4840.0000.0000.1230.2660.185
우편번호10.6640.7120.8510.8091.0000.805NaN0.7521.0001.0000.0000.5140.514
우편번호20.5850.8620.6060.7090.8051.000NaN0.6180.8640.8640.0000.4940.494
연면적0.0340.0000.1870.369NaNNaN1.0000.0000.0000.0000.0000.0000.042
부서명0.6430.6320.5710.4840.7520.6180.0001.000NaNNaNNaN0.0000.000
상주인원수0.1640.1000.1900.0001.0000.8640.000NaN1.0000.5550.3180.0000.000
일일사용자수0.0000.0000.1730.0001.0000.8640.000NaN0.5551.0000.0160.0000.000
침상수0.3300.2950.6630.1230.0000.0000.000NaN0.3180.0161.0000.0000.026
교육종류코드0.3700.3290.7650.2660.5140.4940.0000.0000.0000.0000.0001.0001.000
교육종류코드.10.2150.1550.4500.1850.5140.4940.0420.0000.0000.0000.0261.0001.000
2024-05-18T11:27:40.620172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교육종류코드교육종류코드.1지역구시설군부서명
교육종류코드1.0000.9990.3590.0880.000
교육종류코드.10.9991.0000.3560.1460.000
지역구0.3590.3561.0000.1160.272
시설군0.0880.1460.1161.0000.222
부서명0.0000.0000.2720.2221.000
2024-05-18T11:27:41.285132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인덱스연도우편번호1우편번호2연면적상주인원수일일사용자수침상수지역구시설군부서명교육종류코드교육종류코드.1
인덱스1.0000.984-0.688-0.515-0.006-0.081-0.086-0.3910.1230.1690.5710.2170.143
연도0.9841.000-0.834-0.7370.003-0.092-0.091-0.5060.1250.1690.4910.1760.179
우편번호1-0.688-0.8341.0000.6750.077-0.3600.1520.3350.5850.6050.3960.7630.763
우편번호2-0.515-0.7370.6751.0000.153-0.405-0.2080.3350.2970.4570.5970.5640.564
연면적-0.0060.0030.0770.1531.000-0.218-0.213-0.1400.0670.1450.0000.0000.042
상주인원수-0.081-0.092-0.360-0.405-0.2181.0000.9600.3320.0730.0001.0000.0000.000
일일사용자수-0.086-0.0910.152-0.208-0.2130.9601.0000.2600.0780.0001.0000.0000.000
침상수-0.391-0.5060.3350.335-0.1400.3320.2601.0000.3510.0611.0000.0000.017
지역구0.1230.1250.5850.2970.0670.0730.0780.3511.0000.1160.2720.3590.356
시설군0.1690.1690.6050.4570.1450.0000.0000.0610.1161.0000.2220.0880.146
부서명0.5710.4910.3960.5970.0001.0001.0001.0000.2720.2221.0000.0000.000
교육종류코드0.2170.1760.7630.5640.0000.0000.0000.0000.3590.0880.0001.0000.999
교육종류코드.10.1430.1790.7630.5640.0420.0000.0000.0170.3560.1460.0000.9991.000

Missing values

2024-05-18T11:27:14.293978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-18T11:27:15.097450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-05-18T11:27:16.020249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

인덱스연도지역구시설군건물명주소1주소2우편번호1우편번호2도로명주소1도로명주소2연면적준공일자부서명상주인원수일일사용자수침상수최근교육이수일교육종류코드교육종류코드.1
2739617742014광진구실내주차장화양타워서울시 광진구화양동 110-37<NA><NA><NA><NA>4030.01999-06-28<NA>0002013-04-04보수보수
277269392014종로구실내주차장코리안리빌딩서울시 종로구수송동 80<NA><NA><NA><NA>6027.14<NA><NA>000<NA><NA><NA>
2817685502014영등포구지하역사영등포시장역 (5호선)서울시 영등포구양산로 지하200 (영등포동5가)<NA><NA><NA><NA>14029.0<NA><NA>00171<NA><NA><NA>
3111263632014중구대규모점포롯데마트서울역점서울시 중구봉래동 2가 122-11<NA><NA><NA><NA>26069.0<NA><NA>000<NA><NA><NA>
21368455422016동작구의료기관서울요양병원서울시 동작구상도4동 255-4<NA><NA><NA><NA>2271.0<NA><NA>01100<NA><NA><NA>
8187582182023강동구PC영업시설힐러PC서울시 강동구양재대로89길 17, 지층 (성내동)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
5961705982024관악구의료기관척편한병원서울시 관악구신림로 318, 4,5층 (신림동, 청암두산위브)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
11471528772019서대문구영화상영관필름포럼(사단법인 필레마)서울특별시 성산로 527 (대신동, 하늬솔빌딩 지하1층)<NA><NA><NA><NA><NA>8898.97<NA><NA>0002018-10-17보수보수
9485565852023성동구실내주차장(주)신세계이마트성수점서울시 성동구뚝섬로 377(성수2가1동)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
8472581632023동작구실내주차장롯데타워(지하5층)서울시 동작구보라매로5길 51<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
인덱스연도지역구시설군건물명주소1주소2우편번호1우편번호2도로명주소1도로명주소2연면적준공일자부서명상주인원수일일사용자수침상수최근교육이수일교육종류코드교육종류코드.1
3008072422014노원구보육시설한국성서대학교어린이집서울시 노원구동일로214길 32 (상계동)<NA><NA><NA><NA>1190.0<NA><NA>00253<NA><NA><NA>
16480507582018종로구어린이집상록수어린이집서울시 종로구송월1길 73-7<NA><NA><NA><NA>0.0<NA><NA>000<NA><NA><NA>
1715682832024송파구실내주차장문정역skv1서울시 송파구법원로 128<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
20313488552017노원구장례식장한국원자력의학원 장례식장서울시 노원구노원로 75 (공릉동)<NA><NA><NA><NA>3297.0<NA><NA>0002014-11-07보수보수
25953280062015중랑구목욕장두산사우나서울시 중랑구면목7동 1522 두산(아) 402동 지하1층<NA><NA><NA><NA>1320.0<NA><NA>00133<NA><NA><NA>
3088265952014성동구실내주차장성동종합행정마을(성동구청)서울시 성동구고산자로 270<NA><NA><NA><NA>9503.29<NA><NA>00135<NA><NA><NA>
2672153212014성동구목욕장월드사우나서울시 성동구독서당로 272<NA><NA><NA><NA>1420.02009-3-20<NA>001352010-4-21보수보수
2648147892014송파구장례식장서울아산병원장례식장서울시 송파구올림픽로43길 88<NA><NA><NA><NA>16200.01994-12-31<NA>00106<NA><NA><NA>
15781541912019강남구실내주차장엔씨소프트 R&D센타(엔씨타워1)서울특별시 강남구 테헤란로 509<NA><NA><NA><NA><NA>30902.0<NA><NA>000<NA><NA><NA>
14855539012019강남구학원비전21닿을관학원서울특별시 강남구 도곡로 505 , 지하1층 앞쪽 일부 및 6층 (대치동)<NA><NA><NA><NA><NA>1082.37<NA><NA>000<NA>신규신규