Dataset statistics
Number of variables | 20 |
---|---|
Number of observations | 10000 |
Missing cells | 70288 |
Missing cells (%) | 35.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.7 MiB |
Average record size in memory | 178.0 B |
Variable types
Numeric | 8 |
---|---|
Categorical | 5 |
Text | 5 |
Unsupported | 2 |
Dataset
Description | 인덱스,연도,지역구,시설군,건물명,주소1,주소2,우편번호1,우편번호2,도로명주소1,도로명주소2,연면적,준공일자,부서명,상주인원수,일일사용자수,침상수,최근교육이수일,교육종류코드,교육종류코드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15380/S/1/datasetView.do |
교육종류코드 is highly overall correlated with 우편번호1 and 2 other fields | High correlation |
교육종류코드.1 is highly overall correlated with 우편번호1 and 2 other fields | High correlation |
인덱스 is highly overall correlated with 연도 and 3 other fields | High correlation |
연도 is highly overall correlated with 인덱스 and 3 other fields | High correlation |
우편번호1 is highly overall correlated with 인덱스 and 6 other fields | High correlation |
우편번호2 is highly overall correlated with 인덱스 and 5 other fields | High correlation |
상주인원수 is highly overall correlated with 일일사용자수 and 1 other fields | High correlation |
일일사용자수 is highly overall correlated with 상주인원수 and 1 other fields | High correlation |
침상수 is highly overall correlated with 연도 and 1 other fields | High correlation |
지역구 is highly overall correlated with 우편번호1 | High correlation |
시설군 is highly overall correlated with 우편번호1 | High correlation |
부서명 is highly overall correlated with 인덱스 and 4 other fields | High correlation |
부서명 is highly imbalanced (93.0%) | Imbalance |
교육종류코드 is highly imbalanced (70.1%) | Imbalance |
주소2 has 1552 (15.5%) missing values | Missing |
우편번호1 has 9931 (99.3%) missing values | Missing |
우편번호2 has 9931 (99.3%) missing values | Missing |
도로명주소1 has 10000 (100.0%) missing values | Missing |
도로명주소2 has 10000 (100.0%) missing values | Missing |
연면적 has 3359 (33.6%) missing values | Missing |
준공일자 has 8946 (89.5%) missing values | Missing |
상주인원수 has 3359 (33.6%) missing values | Missing |
일일사용자수 has 3359 (33.6%) missing values | Missing |
침상수 has 3359 (33.6%) missing values | Missing |
최근교육이수일 has 6479 (64.8%) missing values | Missing |
일일사용자수 is highly skewed (γ1 = 37.50086395) | Skewed |
인덱스 has unique values | Unique |
도로명주소1 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
도로명주소2 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
상주인원수 has 5425 (54.2%) zeros | Zeros |
일일사용자수 has 5381 (53.8%) zeros | Zeros |
침상수 has 4491 (44.9%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-18 02:26:51.783388 |
---|---|
Analysis finished | 2024-05-18 02:27:16.537490 |
Duration | 24.75 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
인덱스
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 41800.253 |
Minimum | 5 |
---|---|
Maximum | 73024 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 2573.85 |
Q1 | 24690.75 |
median | 50476.5 |
Q3 | 59039.25 |
95-th percentile | 71325.1 |
Maximum | 73024 |
Range | 73019 |
Interquartile range (IQR) | 34348.5 |
Descriptive statistics
Standard deviation | 23320.859 |
---|---|
Coefficient of variation (CV) | 0.55791191 |
Kurtosis | -1.1464343 |
Mean | 41800.253 |
Median Absolute Deviation (MAD) | 17516.5 |
Skewness | -0.52821455 |
Sum | 4.1800253 × 108 |
Variance | 5.4386245 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1774 | 1 | < 0.1% |
1119 | 1 | < 0.1% |
46591 | 1 | < 0.1% |
60849 | 1 | < 0.1% |
58095 | 1 | < 0.1% |
3548 | 1 | < 0.1% |
59084 | 1 | < 0.1% |
24747 | 1 | < 0.1% |
68660 | 1 | < 0.1% |
2510 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
5 | 1 | |
7 | 1 | |
8 | 1 | |
12 | 1 | |
17 | 1 | |
20 | 1 | |
34 | 1 | |
41 | 1 | |
50 | 1 | |
55 | 1 |
Value | Count | Frequency (%) |
73024 | 1 | |
73019 | 1 | |
73018 | 1 | |
73017 | 1 | |
73012 | 1 | |
73007 | 1 | |
73006 | 1 | |
73003 | 1 | |
73002 | 1 | |
73000 | 1 |
연도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 10 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2018.5277 |
Minimum | 2013 |
---|---|
Maximum | 2024 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2013 |
---|---|
5-th percentile | 2014 |
Q1 | 2015 |
median | 2017 |
Q3 | 2023 |
95-th percentile | 2024 |
Maximum | 2024 |
Range | 11 |
Interquartile range (IQR) | 8 |
Descriptive statistics
Standard deviation | 3.9084729 |
---|---|
Coefficient of variation (CV) | 0.0019362989 |
Kurtosis | -1.5366309 |
Mean | 2018.5277 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0.24582072 |
Sum | 20185277 |
Variance | 15.27616 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2014 | 2190 | |
2024 | 1727 | |
2023 | 1631 | |
2019 | 1502 | |
2017 | 1401 | |
2015 | 1397 | |
2013 | 72 | 0.7% |
2016 | 41 | 0.4% |
2020 | 26 | 0.3% |
2018 | 13 | 0.1% |
Value | Count | Frequency (%) |
2013 | 72 | 0.7% |
2014 | 2190 | |
2015 | 1397 | |
2016 | 41 | 0.4% |
2017 | 1401 | |
2018 | 13 | 0.1% |
2019 | 1502 | |
2020 | 26 | 0.3% |
2023 | 1631 | |
2024 | 1727 |
Value | Count | Frequency (%) |
2024 | 1727 | |
2023 | 1631 | |
2020 | 26 | 0.3% |
2019 | 1502 | |
2018 | 13 | 0.1% |
2017 | 1401 | |
2016 | 41 | 0.4% |
2015 | 1397 | |
2014 | 2190 | |
2013 | 72 | 0.7% |
지역구
Categorical
HIGH CORRELATION
 
Distinct | 26 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
강남구 | |
---|---|
중구 | |
영등포구 | |
서초구 | 615 |
송파구 | 567 |
Other values (21) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0545 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 광진구 |
---|---|
2nd row | 종로구 |
3rd row | 영등포구 |
4th row | 중구 |
5th row | 동작구 |
Common Values
Value | Count | Frequency (%) |
강남구 | 1211 | 12.1% |
중구 | 672 | 6.7% |
영등포구 | 668 | 6.7% |
서초구 | 615 | 6.2% |
송파구 | 567 | 5.7% |
마포구 | 523 | 5.2% |
강서구 | 519 | 5.2% |
구로구 | 404 | 4.0% |
강동구 | 391 | 3.9% |
종로구 | 383 | 3.8% |
Other values (16) | 4047 |
Length
Value | Count | Frequency (%) |
강남구 | 1211 | 12.1% |
중구 | 672 | 6.7% |
영등포구 | 668 | 6.7% |
서초구 | 615 | 6.2% |
송파구 | 567 | 5.7% |
마포구 | 523 | 5.2% |
강서구 | 519 | 5.2% |
구로구 | 404 | 4.0% |
강동구 | 391 | 3.9% |
종로구 | 383 | 3.8% |
Other values (16) | 4047 |
시설군
Categorical
HIGH CORRELATION
 
Distinct | 24 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
실내주차장 | |
---|---|
보육시설 | |
어린이집 | |
의료기관 | |
대규모점포 | |
Other values (19) |
Length
Max length | 9 |
---|---|
Median length | 5 |
Mean length | 4.5461 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 실내주차장 |
---|---|
2nd row | 실내주차장 |
3rd row | 지하역사 |
4th row | 대규모점포 |
5th row | 의료기관 |
Common Values
Value | Count | Frequency (%) |
실내주차장 | 3901 | |
보육시설 | 1002 | 10.0% |
어린이집 | 774 | 7.7% |
의료기관 | 747 | 7.5% |
대규모점포 | 715 | 7.1% |
지하역사 | 607 | 6.1% |
PC영업시설 | 515 | 5.1% |
목욕장 | 433 | 4.3% |
학원 | 323 | 3.2% |
산후조리원 | 239 | 2.4% |
Other values (14) | 744 | 7.4% |
Length
Value | Count | Frequency (%) |
실내주차장 | 3901 | |
보육시설 | 1002 | 10.0% |
어린이집 | 774 | 7.7% |
의료기관 | 747 | 7.5% |
대규모점포 | 715 | 7.1% |
지하역사 | 607 | 6.1% |
pc영업시설 | 515 | 5.1% |
목욕장 | 433 | 4.3% |
학원 | 323 | 3.2% |
산후조리원 | 239 | 2.4% |
Other values (14) | 744 | 7.4% |
건물명
Text
Distinct | 5828 |
---|---|
Distinct (%) | 58.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
어린이집 | 132 | 1.0% |
pc방 | 104 | 0.8% |
구립 | 86 | 0.7% |
공영주차장 | 80 | 0.6% |
산후조리원 | 61 | 0.5% |
pc | 61 | 0.5% |
롯데시네마 | 48 | 0.4% |
이마트 | 43 | 0.3% |
cgv | 33 | 0.3% |
홈플러스 | 30 | 0.2% |
Other values (6263) | 11999 |
Most occurring characters
Value | Count | Frequency (%) |
이 | 2795 | 3.3% |
2766 | 3.3% | |
어 | 2024 | 2.4% |
원 | 1937 | 2.3% |
린 | 1831 | 2.2% |
집 | 1769 | 2.1% |
스 | 1546 | 1.8% |
) | 1318 | 1.6% |
( | 1311 | 1.6% |
리 | 1073 | 1.3% |
Other values (773) | 65365 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 72233 | |
Uppercase Letter | 3359 | 4.0% |
Space Separator | 2766 | 3.3% |
Decimal Number | 1625 | 1.9% |
Close Punctuation | 1319 | 1.6% |
Open Punctuation | 1312 | 1.6% |
Lowercase Letter | 614 | 0.7% |
Other Symbol | 234 | 0.3% |
Other Punctuation | 156 | 0.2% |
Dash Punctuation | 69 | 0.1% |
Other values (4) | 48 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 2795 | 3.9% |
어 | 2024 | 2.8% |
원 | 1937 | 2.7% |
린 | 1831 | 2.5% |
집 | 1769 | 2.4% |
스 | 1546 | 2.1% |
리 | 1073 | 1.5% |
서 | 949 | 1.3% |
동 | 919 | 1.3% |
타 | 885 | 1.2% |
Other values (684) | 56505 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 672 | |
P | 552 | |
S | 258 | 7.7% |
G | 177 | 5.3% |
T | 171 | 5.1% |
K | 162 | 4.8% |
E | 158 | 4.7% |
A | 145 | 4.3% |
I | 127 | 3.8% |
L | 106 | 3.2% |
Other values (15) | 831 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 91 | |
c | 81 | |
p | 59 | 9.6% |
o | 40 | 6.5% |
r | 35 | 5.7% |
n | 35 | 5.7% |
a | 34 | 5.5% |
s | 30 | 4.9% |
t | 26 | 4.2% |
k | 24 | 3.9% |
Other values (14) | 159 |
Other Punctuation
Value | Count | Frequency (%) |
& | 51 | |
, | 32 | |
. | 26 | |
? | 17 | 10.9% |
: | 15 | 9.6% |
/ | 5 | 3.2% |
? | 4 | 2.6% |
% | 2 | 1.3% |
! | 2 | 1.3% |
& | 1 | 0.6% |
Decimal Number
Value | Count | Frequency (%) |
2 | 367 | |
1 | 358 | |
3 | 210 | |
5 | 146 | 9.0% |
4 | 134 | 8.2% |
7 | 95 | 5.8% |
9 | 92 | 5.7% |
0 | 89 | 5.5% |
6 | 82 | 5.0% |
8 | 52 | 3.2% |
Math Symbol
Value | Count | Frequency (%) |
+ | 5 | |
= | 3 | |
> | 2 | 16.7% |
~ | 1 | 8.3% |
→ | 1 | 8.3% |
Other Number
Value | Count | Frequency (%) |
③ | 2 | |
⑥ | 2 | |
② | 1 | |
⑤ | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1318 | |
] | 1 | 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1311 | |
[ | 1 | 0.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 10 | |
Ⅲ | 1 | 9.1% |
Space Separator
Value | Count | Frequency (%) |
2766 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 234 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 69 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 19 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 72462 | |
Common | 7284 | 8.7% |
Latin | 3984 | 4.8% |
Han | 5 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 2795 | 3.9% |
어 | 2024 | 2.8% |
원 | 1937 | 2.7% |
린 | 1831 | 2.5% |
집 | 1769 | 2.4% |
스 | 1546 | 2.1% |
리 | 1073 | 1.5% |
서 | 949 | 1.3% |
동 | 919 | 1.3% |
타 | 885 | 1.2% |
Other values (683) | 56734 |
Latin
Value | Count | Frequency (%) |
C | 672 | |
P | 552 | 13.9% |
S | 258 | 6.5% |
G | 177 | 4.4% |
T | 171 | 4.3% |
K | 162 | 4.1% |
E | 158 | 4.0% |
A | 145 | 3.6% |
I | 127 | 3.2% |
L | 106 | 2.7% |
Other values (41) | 1456 |
Common
Value | Count | Frequency (%) |
2766 | ||
) | 1318 | |
( | 1311 | |
2 | 367 | 5.0% |
1 | 358 | 4.9% |
3 | 210 | 2.9% |
5 | 146 | 2.0% |
4 | 134 | 1.8% |
7 | 95 | 1.3% |
9 | 92 | 1.3% |
Other values (27) | 487 | 6.7% |
Han
Value | Count | Frequency (%) |
前 | 4 | |
秀 | 1 | 20.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 72228 | |
ASCII | 11245 | 13.4% |
None | 239 | 0.3% |
Number Forms | 11 | < 0.1% |
Enclosed Alphanum | 6 | < 0.1% |
CJK | 5 | < 0.1% |
Arrows | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
이 | 2795 | 3.9% |
어 | 2024 | 2.8% |
원 | 1937 | 2.7% |
린 | 1831 | 2.5% |
집 | 1769 | 2.4% |
스 | 1546 | 2.1% |
리 | 1073 | 1.5% |
서 | 949 | 1.3% |
동 | 919 | 1.3% |
타 | 885 | 1.2% |
Other values (682) | 56500 |
ASCII
Value | Count | Frequency (%) |
2766 | ||
) | 1318 | |
( | 1311 | |
C | 672 | 6.0% |
P | 552 | 4.9% |
2 | 367 | 3.3% |
1 | 358 | 3.2% |
S | 258 | 2.3% |
3 | 210 | 1.9% |
G | 177 | 1.6% |
Other values (69) | 3256 |
None
Value | Count | Frequency (%) |
㈜ | 234 | |
? | 4 | 1.7% |
& | 1 | 0.4% |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 10 | |
Ⅲ | 1 | 9.1% |
CJK
Value | Count | Frequency (%) |
前 | 4 | |
秀 | 1 | 20.0% |
Enclosed Alphanum
Value | Count | Frequency (%) |
③ | 2 | |
⑥ | 2 | |
② | 1 | |
⑤ | 1 |
Arrows
Value | Count | Frequency (%) |
→ | 1 |
주소1
Text
Distinct | 1519 |
---|---|
Distinct (%) | 15.2% |
Missing | 13 |
Missing (%) | 0.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
서울시 | 7295 | |
서울 | 1217 | 5.3% |
강남구 | 1211 | 5.3% |
중구 | 672 | 3.0% |
서초구 | 610 | 2.7% |
서울특별시 | 606 | 2.7% |
영등포구 | 565 | 2.5% |
송파구 | 564 | 2.5% |
마포구 | 529 | 2.3% |
강서구 | 429 | 1.9% |
Other values (2108) | 9079 |
Most occurring characters
Value | Count | Frequency (%) |
12948 | ||
서 | 10533 | 12.0% |
구 | 9892 | 11.3% |
울 | 9149 | 10.4% |
시 | 7977 | 9.1% |
동 | 2324 | 2.6% |
강 | 2298 | 2.6% |
로 | 2210 | 2.5% |
남 | 1279 | 1.5% |
포 | 1204 | 1.4% |
Other values (394) | 28104 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 66794 | |
Space Separator | 12948 | 14.7% |
Decimal Number | 5579 | 6.3% |
Close Punctuation | 924 | 1.1% |
Open Punctuation | 923 | 1.0% |
Other Punctuation | 470 | 0.5% |
Dash Punctuation | 194 | 0.2% |
Math Symbol | 49 | 0.1% |
Uppercase Letter | 35 | < 0.1% |
Lowercase Letter | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
서 | 10533 | |
구 | 9892 | |
울 | 9149 | |
시 | 7977 | |
동 | 2324 | 3.5% |
강 | 2298 | 3.4% |
로 | 2210 | 3.3% |
남 | 1279 | 1.9% |
포 | 1204 | 1.8% |
중 | 927 | 1.4% |
Other values (356) | 19001 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 16 | |
A | 3 | 8.6% |
K | 2 | 5.7% |
S | 2 | 5.7% |
D | 2 | 5.7% |
J | 2 | 5.7% |
M | 2 | 5.7% |
H | 1 | 2.9% |
P | 1 | 2.9% |
F | 1 | 2.9% |
Other values (3) | 3 | 8.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1180 | |
2 | 782 | |
3 | 666 | |
4 | 533 | |
5 | 512 | |
6 | 442 | 7.9% |
0 | 424 | 7.6% |
7 | 379 | 6.8% |
8 | 353 | 6.3% |
9 | 308 | 5.5% |
Other Punctuation
Value | Count | Frequency (%) |
, | 448 | |
. | 10 | 2.1% |
? | 8 | 1.7% |
? | 2 | 0.4% |
: | 1 | 0.2% |
/ | 1 | 0.2% |
Close Punctuation
Value | Count | Frequency (%) |
) | 917 | |
] | 7 | 0.8% |
Open Punctuation
Value | Count | Frequency (%) |
( | 916 | |
[ | 7 | 0.8% |
Lowercase Letter
Value | Count | Frequency (%) |
s | 1 | |
k | 1 |
Space Separator
Value | Count | Frequency (%) |
12948 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 194 |
Math Symbol
Value | Count | Frequency (%) |
~ | 49 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 66794 | |
Common | 21087 | 24.0% |
Latin | 37 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
서 | 10533 | |
구 | 9892 | |
울 | 9149 | |
시 | 7977 | |
동 | 2324 | 3.5% |
강 | 2298 | 3.4% |
로 | 2210 | 3.3% |
남 | 1279 | 1.9% |
포 | 1204 | 1.8% |
중 | 927 | 1.4% |
Other values (356) | 19001 |
Common
Value | Count | Frequency (%) |
12948 | ||
1 | 1180 | 5.6% |
) | 917 | 4.3% |
( | 916 | 4.3% |
2 | 782 | 3.7% |
3 | 666 | 3.2% |
4 | 533 | 2.5% |
5 | 512 | 2.4% |
, | 448 | 2.1% |
6 | 442 | 2.1% |
Other values (13) | 1743 | 8.3% |
Latin
Value | Count | Frequency (%) |
B | 16 | |
A | 3 | 8.1% |
K | 2 | 5.4% |
S | 2 | 5.4% |
D | 2 | 5.4% |
J | 2 | 5.4% |
M | 2 | 5.4% |
H | 1 | 2.7% |
P | 1 | 2.7% |
F | 1 | 2.7% |
Other values (5) | 5 | 13.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 66794 | |
ASCII | 21116 | 24.0% |
None | 8 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
12948 | ||
1 | 1180 | 5.6% |
) | 917 | 4.3% |
( | 916 | 4.3% |
2 | 782 | 3.7% |
3 | 666 | 3.2% |
4 | 533 | 2.5% |
5 | 512 | 2.4% |
, | 448 | 2.1% |
6 | 442 | 2.1% |
Other values (27) | 1772 | 8.4% |
Hangul
Value | Count | Frequency (%) |
서 | 10533 | |
구 | 9892 | |
울 | 9149 | |
시 | 7977 | |
동 | 2324 | 3.5% |
강 | 2298 | 3.4% |
로 | 2210 | 3.3% |
남 | 1279 | 1.9% |
포 | 1204 | 1.8% |
중 | 927 | 1.4% |
Other values (356) | 19001 |
None
Value | Count | Frequency (%) |
? | 8 |
주소2
Text
MISSING
 
Distinct | 7124 |
---|---|
Distinct (%) | 84.3% |
Missing | 1552 |
Missing (%) | 15.5% |
Memory size | 156.2 KiB |
Length
Max length | 67 |
---|---|
Median length | 48 |
Mean length | 14.332031 |
Min length | 3 |
Characters and Unicode
Total characters | 121077 |
---|---|
Distinct characters | 543 |
Distinct categories | 12 ? |
Distinct scripts | 3 ? |
Distinct blocks | 5 ? |
Unique
Unique | 6059 ? |
---|---|
Unique (%) | 71.7% |
Sample
1st row | 화양동 110-37 |
---|---|
2nd row | 수송동 80 |
3rd row | 양산로 지하200 (영등포동5가) |
4th row | 봉래동 2가 122-11 |
5th row | 상도4동 255-4 |
Value | Count | Frequency (%) |
영등포구 | 174 | 0.7% |
여의도동 | 174 | 0.7% |
강남구 | 174 | 0.7% |
지하 | 161 | 0.7% |
테헤란로 | 131 | 0.6% |
지하1층 | 127 | 0.5% |
남부순환로 | 122 | 0.5% |
도봉로 | 114 | 0.5% |
강남대로 | 97 | 0.4% |
서초동 | 97 | 0.4% |
Other values (6544) | 21944 |
Most occurring characters
Value | Count | Frequency (%) |
17216 | 14.2% | |
로 | 7265 | 6.0% |
1 | 6903 | 5.7% |
동 | 6265 | 5.2% |
2 | 4830 | 4.0% |
( | 4371 | 3.6% |
) | 4368 | 3.6% |
3 | 3809 | 3.1% |
4 | 3074 | 2.5% |
5 | 2957 | 2.4% |
Other values (533) | 60019 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 57613 | |
Decimal Number | 33151 | |
Space Separator | 17216 | 14.2% |
Open Punctuation | 4418 | 3.6% |
Close Punctuation | 4415 | 3.6% |
Other Punctuation | 1985 | 1.6% |
Dash Punctuation | 1812 | 1.5% |
Uppercase Letter | 235 | 0.2% |
Math Symbol | 214 | 0.2% |
Lowercase Letter | 14 | < 0.1% |
Other values (2) | 4 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
로 | 7265 | 12.6% |
동 | 6265 | 10.9% |
길 | 2653 | 4.6% |
대 | 1438 | 2.5% |
층 | 1261 | 2.2% |
지 | 1236 | 2.1% |
가 | 1159 | 2.0% |
구 | 1132 | 2.0% |
산 | 840 | 1.5% |
하 | 750 | 1.3% |
Other values (472) | 33614 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 73 | |
S | 25 | 10.6% |
C | 19 | 8.1% |
A | 11 | 4.7% |
D | 11 | 4.7% |
T | 10 | 4.3% |
L | 9 | 3.8% |
K | 9 | 3.8% |
I | 7 | 3.0% |
G | 6 | 2.6% |
Other values (14) | 55 |
Lowercase Letter
Value | Count | Frequency (%) |
c | 2 | |
k | 2 | |
s | 2 | |
v | 1 | |
n | 1 | |
b | 1 | |
w | 1 | |
p | 1 | |
r | 1 | |
e | 1 |
Decimal Number
Value | Count | Frequency (%) |
1 | 6903 | |
2 | 4830 | |
3 | 3809 | |
4 | 3074 | |
5 | 2957 | |
6 | 2638 | 8.0% |
0 | 2510 | 7.6% |
7 | 2393 | 7.2% |
8 | 2076 | 6.3% |
9 | 1961 | 5.9% |
Other Punctuation
Value | Count | Frequency (%) |
, | 1894 | |
. | 44 | 2.2% |
? | 22 | 1.1% |
? | 12 | 0.6% |
: | 7 | 0.4% |
/ | 6 | 0.3% |
Open Punctuation
Value | Count | Frequency (%) |
( | 4371 | |
[ | 47 | 1.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 4368 | |
] | 47 | 1.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 2 | |
Ⅲ | 1 |
Space Separator
Value | Count | Frequency (%) |
17216 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1812 |
Math Symbol
Value | Count | Frequency (%) |
~ | 214 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 63211 | |
Hangul | 57614 | |
Latin | 252 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
로 | 7265 | 12.6% |
동 | 6265 | 10.9% |
길 | 2653 | 4.6% |
대 | 1438 | 2.5% |
층 | 1261 | 2.2% |
지 | 1236 | 2.1% |
가 | 1159 | 2.0% |
구 | 1132 | 2.0% |
산 | 840 | 1.5% |
하 | 750 | 1.3% |
Other values (473) | 33615 |
Latin
Value | Count | Frequency (%) |
B | 73 | |
S | 25 | 9.9% |
C | 19 | 7.5% |
A | 11 | 4.4% |
D | 11 | 4.4% |
T | 10 | 4.0% |
L | 9 | 3.6% |
K | 9 | 3.6% |
I | 7 | 2.8% |
G | 6 | 2.4% |
Other values (27) | 72 |
Common
Value | Count | Frequency (%) |
17216 | ||
1 | 6903 | |
2 | 4830 | 7.6% |
( | 4371 | 6.9% |
) | 4368 | 6.9% |
3 | 3809 | 6.0% |
4 | 3074 | 4.9% |
5 | 2957 | 4.7% |
6 | 2638 | 4.2% |
0 | 2510 | 4.0% |
Other values (13) | 10535 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 63448 | |
Hangul | 57610 | |
None | 13 | < 0.1% |
Compat Jamo | 3 | < 0.1% |
Number Forms | 3 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
17216 | ||
1 | 6903 | |
2 | 4830 | 7.6% |
( | 4371 | 6.9% |
) | 4368 | 6.9% |
3 | 3809 | 6.0% |
4 | 3074 | 4.8% |
5 | 2957 | 4.7% |
6 | 2638 | 4.2% |
0 | 2510 | 4.0% |
Other values (47) | 10772 |
Hangul
Value | Count | Frequency (%) |
로 | 7265 | 12.6% |
동 | 6265 | 10.9% |
길 | 2653 | 4.6% |
대 | 1438 | 2.5% |
층 | 1261 | 2.2% |
지 | 1236 | 2.1% |
가 | 1159 | 2.0% |
구 | 1132 | 2.0% |
산 | 840 | 1.5% |
하 | 750 | 1.3% |
Other values (470) | 33611 |
None
Value | Count | Frequency (%) |
? | 12 | |
㈜ | 1 | 7.7% |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 2 | |
ㅏ | 1 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 2 | |
Ⅲ | 1 |
우편번호1
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 9 |
---|---|
Distinct (%) | 13.0% |
Missing | 9931 |
Missing (%) | 99.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 103.91304 |
Minimum | 2 |
---|---|
Maximum | 156 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 11 |
Q1 | 11 |
median | 151 |
Q3 | 151 |
95-th percentile | 156 |
Maximum | 156 |
Range | 154 |
Interquartile range (IQR) | 140 |
Descriptive statistics
Standard deviation | 63.24991 |
---|---|
Coefficient of variation (CV) | 0.60868114 |
Kurtosis | -1.3308725 |
Mean | 103.91304 |
Median Absolute Deviation (MAD) | 5 |
Skewness | -0.77216362 |
Sum | 7170 |
Variance | 4000.5512 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
151 | 28 | 0.3% |
11 | 20 | 0.2% |
156 | 7 | 0.1% |
121 | 7 | 0.1% |
152 | 2 | < 0.1% |
111 | 2 | < 0.1% |
2 | 1 | < 0.1% |
132 | 1 | < 0.1% |
123 | 1 | < 0.1% |
(Missing) | 9931 |
Value | Count | Frequency (%) |
2 | 1 | < 0.1% |
11 | 20 | |
111 | 2 | < 0.1% |
121 | 7 | 0.1% |
123 | 1 | < 0.1% |
132 | 1 | < 0.1% |
151 | 28 | |
152 | 2 | < 0.1% |
156 | 7 | 0.1% |
Value | Count | Frequency (%) |
156 | 7 | 0.1% |
152 | 2 | < 0.1% |
151 | 28 | |
132 | 1 | < 0.1% |
123 | 1 | < 0.1% |
121 | 7 | 0.1% |
111 | 2 | < 0.1% |
11 | 20 | |
2 | 1 | < 0.1% |
우편번호2
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 41 |
---|---|
Distinct (%) | 59.4% |
Missing | 9931 |
Missing (%) | 99.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 561.76812 |
Minimum | 15 |
---|---|
Maximum | 907 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 15 |
---|---|
5-th percentile | 111 |
Q1 | 111 |
median | 807 |
Q3 | 829 |
95-th percentile | 895 |
Maximum | 907 |
Range | 892 |
Interquartile range (IQR) | 718 |
Descriptive statistics
Standard deviation | 354.71493 |
---|---|
Coefficient of variation (CV) | 0.63142588 |
Kurtosis | -1.7355962 |
Mean | 561.76812 |
Median Absolute Deviation (MAD) | 83 |
Skewness | -0.50060228 |
Sum | 38762 |
Variance | 125822.68 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
111 | 21 | 0.2% |
807 | 3 | < 0.1% |
829 | 3 | < 0.1% |
809 | 2 | < 0.1% |
895 | 2 | < 0.1% |
890 | 2 | < 0.1% |
826 | 2 | < 0.1% |
801 | 1 | < 0.1% |
899 | 1 | < 0.1% |
888 | 1 | < 0.1% |
Other values (31) | 31 | 0.3% |
(Missing) | 9931 |
Value | Count | Frequency (%) |
15 | 1 | < 0.1% |
21 | 1 | < 0.1% |
111 | 21 | |
123 | 1 | < 0.1% |
212 | 1 | < 0.1% |
222 | 1 | < 0.1% |
701 | 1 | < 0.1% |
725 | 1 | < 0.1% |
742 | 1 | < 0.1% |
783 | 1 | < 0.1% |
Value | Count | Frequency (%) |
907 | 1 | |
904 | 1 | |
899 | 1 | |
895 | 2 | |
893 | 1 | |
890 | 2 | |
888 | 1 | |
883 | 1 | |
882 | 1 | |
872 | 1 |
도로명주소1
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
도로명주소2
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
연면적
Real number (ℝ)
MISSING
 
Distinct | 3942 |
---|---|
Distinct (%) | 59.4% |
Missing | 3359 |
Missing (%) | 33.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8601.7775 |
Minimum | 0 |
---|---|
Maximum | 604089.5 |
Zeros | 88 |
Zeros (%) | 0.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 359.56 |
Q1 | 771.33 |
median | 2887.07 |
Q3 | 7793 |
95-th percentile | 28900 |
Maximum | 604089.5 |
Range | 604089.5 |
Interquartile range (IQR) | 7021.67 |
Descriptive statistics
Standard deviation | 25245.342 |
---|---|
Coefficient of variation (CV) | 2.9348983 |
Kurtosis | 161.10838 |
Mean | 8601.7775 |
Median Absolute Deviation (MAD) | 2368.93 |
Skewness | 10.678689 |
Sum | 57124404 |
Variance | 6.3732732 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2251.3 | 171 | 1.7% |
0.0 | 88 | 0.9% |
353.88 | 86 | 0.9% |
11043.4 | 55 | 0.5% |
2700.0 | 53 | 0.5% |
606.0 | 52 | 0.5% |
499.0 | 40 | 0.4% |
9345.69 | 39 | 0.4% |
14133.0 | 27 | 0.3% |
579.0 | 27 | 0.3% |
Other values (3932) | 6003 | |
(Missing) | 3359 |
Value | Count | Frequency (%) |
0.0 | 88 | |
4.358 | 4 | < 0.1% |
130.0 | 2 | < 0.1% |
189.0 | 5 | 0.1% |
194.0 | 2 | < 0.1% |
283.0 | 1 | < 0.1% |
300.0 | 2 | < 0.1% |
300.99 | 1 | < 0.1% |
301.0 | 1 | < 0.1% |
301.2 | 1 | < 0.1% |
Value | Count | Frequency (%) |
604089.5 | 1 | |
521000.0 | 1 | |
439000.0 | 1 | |
438913.3 | 1 | |
426635.0 | 2 | |
418000.0 | 1 | |
341000.0 | 1 | |
315000.0 | 1 | |
305934.0 | 1 | |
283843.2 | 1 |
준공일자
Text
MISSING
 
Distinct | 936 |
---|---|
Distinct (%) | 88.8% |
Missing | 8946 |
Missing (%) | 89.5% |
Memory size | 156.2 KiB |
Length
Max length | 10 |
---|---|
Median length | 9 |
Mean length | 9.1413662 |
Min length | 4 |
Characters and Unicode
Total characters | 9635 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 853 ? |
---|---|
Unique (%) | 80.9% |
Sample
1st row | 1999-06-28 |
---|---|
2nd row | 2007-11-16 |
3rd row | 1991-12-1 |
4th row | 1995-12-11 |
5th row | 1994-10-20 |
Value | Count | Frequency (%) |
2000-12-15 | 8 | 0.8% |
2009-7-22 | 7 | 0.7% |
2013 | 6 | 0.6% |
1905-7-2 | 6 | 0.6% |
1905-6-29 | 5 | 0.5% |
1996-11-23 | 4 | 0.4% |
1984-3-31 | 4 | 0.4% |
2009-05-31 | 4 | 0.4% |
2008-7-18 | 3 | 0.3% |
2012-12-15 | 3 | 0.3% |
Other values (925) | 1004 |
Most occurring characters
Value | Count | Frequency (%) |
- | 2078 | |
1 | 1712 | |
0 | 1625 | |
2 | 1307 | |
9 | 986 | |
8 | 388 | 4.0% |
3 | 362 | 3.8% |
7 | 332 | 3.4% |
5 | 306 | 3.2% |
6 | 283 | 2.9% |
Other values (4) | 256 | 2.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 7539 | |
Dash Punctuation | 2078 | 21.6% |
Other Punctuation | 16 | 0.2% |
Space Separator | 2 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 1712 | |
0 | 1625 | |
2 | 1307 | |
9 | 986 | |
8 | 388 | 5.1% |
3 | 362 | 4.8% |
7 | 332 | 4.4% |
5 | 306 | 4.1% |
6 | 283 | 3.8% |
4 | 238 | 3.2% |
Other Punctuation
Value | Count | Frequency (%) |
. | 15 | |
, | 1 | 6.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2078 |
Space Separator
Value | Count | Frequency (%) |
2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 9635 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 2078 | |
1 | 1712 | |
0 | 1625 | |
2 | 1307 | |
9 | 986 | |
8 | 388 | 4.0% |
3 | 362 | 3.8% |
7 | 332 | 3.4% |
5 | 306 | 3.2% |
6 | 283 | 2.9% |
Other values (4) | 256 | 2.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 9635 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 2078 | |
1 | 1712 | |
0 | 1625 | |
2 | 1307 | |
9 | 986 | |
8 | 388 | 4.0% |
3 | 362 | 3.8% |
7 | 332 | 3.4% |
5 | 306 | 3.2% |
6 | 283 | 2.9% |
Other values (4) | 256 | 2.7% |
부서명
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
new | 235 |
관리부서 | 13 |
O | 3 |
○ | 1 |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 3.9755 |
Min length | 1 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9747 | |
new | 235 | 2.4% |
관리부서 | 13 | 0.1% |
O | 3 | < 0.1% |
○ | 1 | < 0.1% |
제노피플관리 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9747 | |
new | 235 | 2.4% |
관리부서 | 13 | 0.1% |
o | 3 | < 0.1% |
○ | 1 | < 0.1% |
제노피플관리 | 1 | < 0.1% |
상주인원수
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 68 |
---|---|
Distinct (%) | 1.0% |
Missing | 3359 |
Missing (%) | 33.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.1584099 |
Minimum | 0 |
---|---|
Maximum | 850 |
Zeros | 5425 |
Zeros (%) | 54.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 12 |
Maximum | 850 |
Range | 850 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 37.505296 |
---|---|
Coefficient of variation (CV) | 9.0191435 |
Kurtosis | 277.38464 |
Mean | 4.1584099 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 15.797117 |
Sum | 27616 |
Variance | 1406.6472 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5425 | |
3 | 744 | 7.4% |
12 | 311 | 3.1% |
80 | 27 | 0.3% |
10 | 8 | 0.1% |
20 | 6 | 0.1% |
30 | 5 | 0.1% |
500 | 5 | 0.1% |
6 | 4 | < 0.1% |
50 | 4 | < 0.1% |
Other values (58) | 102 | 1.0% |
(Missing) | 3359 |
Value | Count | Frequency (%) |
0 | 5425 | |
1 | 2 | < 0.1% |
2 | 3 | < 0.1% |
3 | 744 | 7.4% |
4 | 3 | < 0.1% |
5 | 4 | < 0.1% |
6 | 4 | < 0.1% |
7 | 1 | < 0.1% |
8 | 2 | < 0.1% |
9 | 2 | < 0.1% |
Value | Count | Frequency (%) |
850 | 1 | < 0.1% |
800 | 2 | < 0.1% |
780 | 1 | < 0.1% |
755 | 1 | < 0.1% |
700 | 2 | < 0.1% |
660 | 2 | < 0.1% |
600 | 2 | < 0.1% |
508 | 2 | < 0.1% |
500 | 5 | |
400 | 3 |
일일사용자수
Real number (ℝ)
HIGH CORRELATION
  MISSING
  SKEWED
  ZEROS
 
Distinct | 71 |
---|---|
Distinct (%) | 1.1% |
Missing | 3359 |
Missing (%) | 33.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 59.167896 |
Minimum | 0 |
---|---|
Maximum | 27979 |
Zeros | 5381 |
Zeros (%) | 53.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 150 |
Maximum | 27979 |
Range | 27979 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 582.11288 |
---|---|
Coefficient of variation (CV) | 9.838323 |
Kurtosis | 1588.4473 |
Mean | 59.167896 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 37.500864 |
Sum | 392934 |
Variance | 338855.4 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5381 | |
150 | 531 | 5.3% |
45 | 321 | 3.2% |
700 | 198 | 2.0% |
100 | 41 | 0.4% |
200 | 16 | 0.2% |
250 | 14 | 0.1% |
800 | 12 | 0.1% |
500 | 11 | 0.1% |
300 | 10 | 0.1% |
Other values (61) | 106 | 1.1% |
(Missing) | 3359 |
Value | Count | Frequency (%) |
0 | 5381 | |
15 | 1 | < 0.1% |
19 | 1 | < 0.1% |
20 | 2 | < 0.1% |
25 | 1 | < 0.1% |
30 | 6 | 0.1% |
40 | 2 | < 0.1% |
44 | 1 | < 0.1% |
45 | 321 | 3.2% |
48 | 2 | < 0.1% |
Value | Count | Frequency (%) |
27979 | 1 | < 0.1% |
25703 | 1 | < 0.1% |
18442 | 1 | < 0.1% |
15332 | 1 | < 0.1% |
6500 | 1 | < 0.1% |
5000 | 3 | < 0.1% |
2000 | 1 | < 0.1% |
1000 | 4 | < 0.1% |
850 | 3 | < 0.1% |
800 | 12 |
침상수
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 63 |
---|---|
Distinct (%) | 0.9% |
Missing | 3359 |
Missing (%) | 33.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 71.128143 |
Minimum | 0 |
---|---|
Maximum | 2091 |
Zeros | 4491 |
Zeros (%) | 44.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 106 |
95-th percentile | 406 |
Maximum | 2091 |
Range | 2091 |
Interquartile range (IQR) | 106 |
Descriptive statistics
Standard deviation | 138.17978 |
---|---|
Coefficient of variation (CV) | 1.9426879 |
Kurtosis | 17.695505 |
Mean | 71.128143 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.1298053 |
Sum | 472362 |
Variance | 19093.652 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 4491 | |
171 | 348 | 3.5% |
156 | 244 | 2.4% |
103 | 234 | 2.3% |
178 | 175 | 1.8% |
636 | 165 | 1.7% |
406 | 149 | 1.5% |
106 | 142 | 1.4% |
190 | 116 | 1.2% |
367 | 100 | 1.0% |
Other values (53) | 477 | 4.8% |
(Missing) | 3359 |
Value | Count | Frequency (%) |
0 | 4491 | |
30 | 2 | < 0.1% |
31 | 6 | 0.1% |
35 | 56 | 0.6% |
48 | 1 | < 0.1% |
60 | 1 | < 0.1% |
65 | 2 | < 0.1% |
79 | 3 | < 0.1% |
80 | 2 | < 0.1% |
85 | 2 | < 0.1% |
Value | Count | Frequency (%) |
2091 | 1 | < 0.1% |
1965 | 1 | < 0.1% |
684 | 1 | < 0.1% |
636 | 165 | |
500 | 1 | < 0.1% |
477 | 20 | 0.2% |
406 | 149 | |
367 | 100 | |
305 | 1 | < 0.1% |
297 | 44 | 0.4% |
최근교육이수일
Text
MISSING
 
Distinct | 463 |
---|---|
Distinct (%) | 13.1% |
Missing | 6479 |
Missing (%) | 64.8% |
Memory size | 156.2 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.6836126 |
Min length | 1 |
Characters and Unicode
Total characters | 34096 |
---|---|
Distinct characters | 24 |
Distinct categories | 5 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 174 ? |
---|---|
Unique (%) | 4.9% |
Sample
1st row | 2013-04-04 |
---|---|
2nd row | 2018-10-17 |
3rd row | 2014-05-30 |
4th row | 2015-12-02 |
5th row | 2016-05-25 |
Value | Count | Frequency (%) |
2016-07-20 | 68 | 1.9% |
2016-10-26 | 65 | 1.8% |
2016-11-23 | 62 | 1.8% |
2016-11-15 | 61 | 1.7% |
2012-11-14 | 52 | 1.5% |
2015-12-02 | 49 | 1.4% |
2015-05-28 | 46 | 1.3% |
2016-08-24 | 45 | 1.3% |
2014-11-21 | 44 | 1.2% |
2015-11-27 | 43 | 1.2% |
Other values (452) | 2988 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 7290 | |
0 | 6364 | |
- | 6202 | |
2 | 5915 | |
6 | 1292 | 3.8% |
4 | 1279 | 3.8% |
5 | 1257 | 3.7% |
7 | 1104 | 3.2% |
3 | 1001 | 2.9% |
8 | 942 | 2.8% |
Other values (14) | 1450 | 4.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 27032 | |
Dash Punctuation | 6202 | 18.2% |
Other Punctuation | 818 | 2.4% |
Space Separator | 27 | 0.1% |
Other Letter | 17 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 7290 | |
0 | 6364 | |
2 | 5915 | |
6 | 1292 | 4.8% |
4 | 1279 | 4.7% |
5 | 1257 | 4.7% |
7 | 1104 | 4.1% |
3 | 1001 | 3.7% |
8 | 942 | 3.5% |
9 | 588 | 2.2% |
Other Letter
Value | Count | Frequency (%) |
업 | 3 | |
영 | 2 | |
구 | 2 | |
면 | 2 | |
제 | 2 | |
폐 | 2 | |
휴 | 1 | 5.9% |
노 | 1 | 5.9% |
강 | 1 | 5.9% |
래 | 1 | 5.9% |
Other Punctuation
Value | Count | Frequency (%) |
. | 788 | |
/ | 30 | 3.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6202 |
Space Separator
Value | Count | Frequency (%) |
27 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 34079 | |
Hangul | 17 | < 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 7290 | |
0 | 6364 | |
- | 6202 | |
2 | 5915 | |
6 | 1292 | 3.8% |
4 | 1279 | 3.8% |
5 | 1257 | 3.7% |
7 | 1104 | 3.2% |
3 | 1001 | 2.9% |
8 | 942 | 2.8% |
Other values (4) | 1433 | 4.2% |
Hangul
Value | Count | Frequency (%) |
업 | 3 | |
영 | 2 | |
구 | 2 | |
면 | 2 | |
제 | 2 | |
폐 | 2 | |
휴 | 1 | 5.9% |
노 | 1 | 5.9% |
강 | 1 | 5.9% |
래 | 1 | 5.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 34079 | |
Hangul | 17 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 7290 | |
0 | 6364 | |
- | 6202 | |
2 | 5915 | |
6 | 1292 | 3.8% |
4 | 1279 | 3.8% |
5 | 1257 | 3.7% |
7 | 1104 | 3.2% |
3 | 1001 | 2.9% |
8 | 942 | 2.8% |
Other values (4) | 1433 | 4.2% |
Hangul
Value | Count | Frequency (%) |
업 | 3 | |
영 | 2 | |
구 | 2 | |
면 | 2 | |
제 | 2 | |
폐 | 2 | |
휴 | 1 | 5.9% |
노 | 1 | 5.9% |
강 | 1 | 5.9% |
래 | 1 | 5.9% |
교육종류코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 15 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
보수 | |
신규 | 575 |
실내공기질관리자교육 | 43 |
보수교육 | 41 |
Other values (10) | 75 |
Length
Max length | 10 |
---|---|
Median length | 4 |
Mean length | 3.5328 |
Min length | 1 |
Unique
Unique | 4 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 보수 |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 7279 | |
보수 | 1987 | 19.9% |
신규 | 575 | 5.8% |
실내공기질관리자교육 | 43 | 0.4% |
보수교육 | 41 | 0.4% |
환경보전협회사이버 | 41 | 0.4% |
교육면제 | 9 | 0.1% |
신규교육 | 8 | 0.1% |
- | 7 | 0.1% |
사이버 | 3 | < 0.1% |
Other values (5) | 7 | 0.1% |
Length
Value | Count | Frequency (%) |
na | 7279 | |
보수 | 1987 | 19.9% |
신규 | 576 | 5.8% |
실내공기질관리자교육 | 43 | 0.4% |
보수교육 | 41 | 0.4% |
환경보전협회사이버 | 41 | 0.4% |
교육면제 | 9 | 0.1% |
신규교육 | 8 | 0.1% |
7 | 0.1% | |
사이버 | 3 | < 0.1% |
Other values (5) | 7 | 0.1% |
교육종류코드.1
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
보수 | |
신규 | 575 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.4876 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 보수 |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 7438 | |
보수 | 1987 | 19.9% |
신규 | 575 | 5.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 7438 | |
보수 | 1987 | 19.9% |
신규 | 575 | 5.8% |
인덱스 | 연도 | 지역구 | 시설군 | 우편번호1 | 우편번호2 | 연면적 | 부서명 | 상주인원수 | 일일사용자수 | 침상수 | 교육종류코드 | 교육종류코드.1 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
인덱스 | 1.000 | 0.884 | 0.309 | 0.406 | 0.664 | 0.585 | 0.034 | 0.643 | 0.164 | 0.000 | 0.330 | 0.370 | 0.215 |
연도 | 0.884 | 1.000 | 0.305 | 0.393 | 0.712 | 0.862 | 0.000 | 0.632 | 0.100 | 0.000 | 0.295 | 0.329 | 0.155 |
지역구 | 0.309 | 0.305 | 1.000 | 0.405 | 0.851 | 0.606 | 0.187 | 0.571 | 0.190 | 0.173 | 0.663 | 0.765 | 0.450 |
시설군 | 0.406 | 0.393 | 0.405 | 1.000 | 0.809 | 0.709 | 0.369 | 0.484 | 0.000 | 0.000 | 0.123 | 0.266 | 0.185 |
우편번호1 | 0.664 | 0.712 | 0.851 | 0.809 | 1.000 | 0.805 | NaN | 0.752 | 1.000 | 1.000 | 0.000 | 0.514 | 0.514 |
우편번호2 | 0.585 | 0.862 | 0.606 | 0.709 | 0.805 | 1.000 | NaN | 0.618 | 0.864 | 0.864 | 0.000 | 0.494 | 0.494 |
연면적 | 0.034 | 0.000 | 0.187 | 0.369 | NaN | NaN | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.042 |
부서명 | 0.643 | 0.632 | 0.571 | 0.484 | 0.752 | 0.618 | 0.000 | 1.000 | NaN | NaN | NaN | 0.000 | 0.000 |
상주인원수 | 0.164 | 0.100 | 0.190 | 0.000 | 1.000 | 0.864 | 0.000 | NaN | 1.000 | 0.555 | 0.318 | 0.000 | 0.000 |
일일사용자수 | 0.000 | 0.000 | 0.173 | 0.000 | 1.000 | 0.864 | 0.000 | NaN | 0.555 | 1.000 | 0.016 | 0.000 | 0.000 |
침상수 | 0.330 | 0.295 | 0.663 | 0.123 | 0.000 | 0.000 | 0.000 | NaN | 0.318 | 0.016 | 1.000 | 0.000 | 0.026 |
교육종류코드 | 0.370 | 0.329 | 0.765 | 0.266 | 0.514 | 0.494 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 |
교육종류코드.1 | 0.215 | 0.155 | 0.450 | 0.185 | 0.514 | 0.494 | 0.042 | 0.000 | 0.000 | 0.000 | 0.026 | 1.000 | 1.000 |
교육종류코드 | 교육종류코드.1 | 지역구 | 시설군 | 부서명 | |
---|---|---|---|---|---|
교육종류코드 | 1.000 | 0.999 | 0.359 | 0.088 | 0.000 |
교육종류코드.1 | 0.999 | 1.000 | 0.356 | 0.146 | 0.000 |
지역구 | 0.359 | 0.356 | 1.000 | 0.116 | 0.272 |
시설군 | 0.088 | 0.146 | 0.116 | 1.000 | 0.222 |
부서명 | 0.000 | 0.000 | 0.272 | 0.222 | 1.000 |
인덱스 | 연도 | 우편번호1 | 우편번호2 | 연면적 | 상주인원수 | 일일사용자수 | 침상수 | 지역구 | 시설군 | 부서명 | 교육종류코드 | 교육종류코드.1 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
인덱스 | 1.000 | 0.984 | -0.688 | -0.515 | -0.006 | -0.081 | -0.086 | -0.391 | 0.123 | 0.169 | 0.571 | 0.217 | 0.143 |
연도 | 0.984 | 1.000 | -0.834 | -0.737 | 0.003 | -0.092 | -0.091 | -0.506 | 0.125 | 0.169 | 0.491 | 0.176 | 0.179 |
우편번호1 | -0.688 | -0.834 | 1.000 | 0.675 | 0.077 | -0.360 | 0.152 | 0.335 | 0.585 | 0.605 | 0.396 | 0.763 | 0.763 |
우편번호2 | -0.515 | -0.737 | 0.675 | 1.000 | 0.153 | -0.405 | -0.208 | 0.335 | 0.297 | 0.457 | 0.597 | 0.564 | 0.564 |
연면적 | -0.006 | 0.003 | 0.077 | 0.153 | 1.000 | -0.218 | -0.213 | -0.140 | 0.067 | 0.145 | 0.000 | 0.000 | 0.042 |
상주인원수 | -0.081 | -0.092 | -0.360 | -0.405 | -0.218 | 1.000 | 0.960 | 0.332 | 0.073 | 0.000 | 1.000 | 0.000 | 0.000 |
일일사용자수 | -0.086 | -0.091 | 0.152 | -0.208 | -0.213 | 0.960 | 1.000 | 0.260 | 0.078 | 0.000 | 1.000 | 0.000 | 0.000 |
침상수 | -0.391 | -0.506 | 0.335 | 0.335 | -0.140 | 0.332 | 0.260 | 1.000 | 0.351 | 0.061 | 1.000 | 0.000 | 0.017 |
지역구 | 0.123 | 0.125 | 0.585 | 0.297 | 0.067 | 0.073 | 0.078 | 0.351 | 1.000 | 0.116 | 0.272 | 0.359 | 0.356 |
시설군 | 0.169 | 0.169 | 0.605 | 0.457 | 0.145 | 0.000 | 0.000 | 0.061 | 0.116 | 1.000 | 0.222 | 0.088 | 0.146 |
부서명 | 0.571 | 0.491 | 0.396 | 0.597 | 0.000 | 1.000 | 1.000 | 1.000 | 0.272 | 0.222 | 1.000 | 0.000 | 0.000 |
교육종류코드 | 0.217 | 0.176 | 0.763 | 0.564 | 0.000 | 0.000 | 0.000 | 0.000 | 0.359 | 0.088 | 0.000 | 1.000 | 0.999 |
교육종류코드.1 | 0.143 | 0.179 | 0.763 | 0.564 | 0.042 | 0.000 | 0.000 | 0.017 | 0.356 | 0.146 | 0.000 | 0.999 | 1.000 |
인덱스 | 연도 | 지역구 | 시설군 | 건물명 | 주소1 | 주소2 | 우편번호1 | 우편번호2 | 도로명주소1 | 도로명주소2 | 연면적 | 준공일자 | 부서명 | 상주인원수 | 일일사용자수 | 침상수 | 최근교육이수일 | 교육종류코드 | 교육종류코드.1 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
27396 | 1774 | 2014 | 광진구 | 실내주차장 | 화양타워 | 서울시 광진구 | 화양동 110-37 | <NA> | <NA> | <NA> | <NA> | 4030.0 | 1999-06-28 | <NA> | 0 | 0 | 0 | 2013-04-04 | 보수 | 보수 |
27726 | 939 | 2014 | 종로구 | 실내주차장 | 코리안리빌딩 | 서울시 종로구 | 수송동 80 | <NA> | <NA> | <NA> | <NA> | 6027.14 | <NA> | <NA> | 0 | 0 | 0 | <NA> | <NA> | <NA> |
28176 | 8550 | 2014 | 영등포구 | 지하역사 | 영등포시장역 (5호선) | 서울시 영등포구 | 양산로 지하200 (영등포동5가) | <NA> | <NA> | <NA> | <NA> | 14029.0 | <NA> | <NA> | 0 | 0 | 171 | <NA> | <NA> | <NA> |
31112 | 6363 | 2014 | 중구 | 대규모점포 | 롯데마트서울역점 | 서울시 중구 | 봉래동 2가 122-11 | <NA> | <NA> | <NA> | <NA> | 26069.0 | <NA> | <NA> | 0 | 0 | 0 | <NA> | <NA> | <NA> |
21368 | 45542 | 2016 | 동작구 | 의료기관 | 서울요양병원 | 서울시 동작구 | 상도4동 255-4 | <NA> | <NA> | <NA> | <NA> | 2271.0 | <NA> | <NA> | 0 | 110 | 0 | <NA> | <NA> | <NA> |
8187 | 58218 | 2023 | 강동구 | PC영업시설 | 힐러PC | 서울시 강동구 | 양재대로89길 17, 지층 (성내동) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
5961 | 70598 | 2024 | 관악구 | 의료기관 | 척편한병원 | 서울시 관악구 | 신림로 318, 4,5층 (신림동, 청암두산위브) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
11471 | 52877 | 2019 | 서대문구 | 영화상영관 | 필름포럼(사단법인 필레마) | 서울특별시 성산로 527 (대신동, 하늬솔빌딩 지하1층) | <NA> | <NA> | <NA> | <NA> | <NA> | 8898.97 | <NA> | <NA> | 0 | 0 | 0 | 2018-10-17 | 보수 | 보수 |
9485 | 56585 | 2023 | 성동구 | 실내주차장 | (주)신세계이마트성수점 | 서울시 성동구 | 뚝섬로 377(성수2가1동) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
8472 | 58163 | 2023 | 동작구 | 실내주차장 | 롯데타워(지하5층) | 서울시 동작구 | 보라매로5길 51 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
인덱스 | 연도 | 지역구 | 시설군 | 건물명 | 주소1 | 주소2 | 우편번호1 | 우편번호2 | 도로명주소1 | 도로명주소2 | 연면적 | 준공일자 | 부서명 | 상주인원수 | 일일사용자수 | 침상수 | 최근교육이수일 | 교육종류코드 | 교육종류코드.1 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
30080 | 7242 | 2014 | 노원구 | 보육시설 | 한국성서대학교어린이집 | 서울시 노원구 | 동일로214길 32 (상계동) | <NA> | <NA> | <NA> | <NA> | 1190.0 | <NA> | <NA> | 0 | 0 | 253 | <NA> | <NA> | <NA> |
16480 | 50758 | 2018 | 종로구 | 어린이집 | 상록수어린이집 | 서울시 종로구 | 송월1길 73-7 | <NA> | <NA> | <NA> | <NA> | 0.0 | <NA> | <NA> | 0 | 0 | 0 | <NA> | <NA> | <NA> |
1715 | 68283 | 2024 | 송파구 | 실내주차장 | 문정역skv1 | 서울시 송파구 | 법원로 128 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
20313 | 48855 | 2017 | 노원구 | 장례식장 | 한국원자력의학원 장례식장 | 서울시 노원구 | 노원로 75 (공릉동) | <NA> | <NA> | <NA> | <NA> | 3297.0 | <NA> | <NA> | 0 | 0 | 0 | 2014-11-07 | 보수 | 보수 |
25953 | 28006 | 2015 | 중랑구 | 목욕장 | 두산사우나 | 서울시 중랑구 | 면목7동 1522 두산(아) 402동 지하1층 | <NA> | <NA> | <NA> | <NA> | 1320.0 | <NA> | <NA> | 0 | 0 | 133 | <NA> | <NA> | <NA> |
30882 | 6595 | 2014 | 성동구 | 실내주차장 | 성동종합행정마을(성동구청) | 서울시 성동구 | 고산자로 270 | <NA> | <NA> | <NA> | <NA> | 9503.29 | <NA> | <NA> | 0 | 0 | 135 | <NA> | <NA> | <NA> |
26721 | 5321 | 2014 | 성동구 | 목욕장 | 월드사우나 | 서울시 성동구 | 독서당로 272 | <NA> | <NA> | <NA> | <NA> | 1420.0 | 2009-3-20 | <NA> | 0 | 0 | 135 | 2010-4-21 | 보수 | 보수 |
26481 | 4789 | 2014 | 송파구 | 장례식장 | 서울아산병원장례식장 | 서울시 송파구 | 올림픽로43길 88 | <NA> | <NA> | <NA> | <NA> | 16200.0 | 1994-12-31 | <NA> | 0 | 0 | 106 | <NA> | <NA> | <NA> |
15781 | 54191 | 2019 | 강남구 | 실내주차장 | 엔씨소프트 R&D센타(엔씨타워1) | 서울특별시 강남구 테헤란로 509 | <NA> | <NA> | <NA> | <NA> | <NA> | 30902.0 | <NA> | <NA> | 0 | 0 | 0 | <NA> | <NA> | <NA> |
14855 | 53901 | 2019 | 강남구 | 학원 | 비전21닿을관학원 | 서울특별시 강남구 도곡로 505 , 지하1층 앞쪽 일부 및 6층 (대치동) | <NA> | <NA> | <NA> | <NA> | <NA> | 1082.37 | <NA> | <NA> | 0 | 0 | 0 | <NA> | 신규 | 신규 |