Overview

Dataset statistics

Number of variables5
Number of observations89
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.9 KiB
Average record size in memory44.5 B

Variable types

Numeric3
Text2

Dataset

Description인천광역시 계양구 지진옥외대피장소에 대한 현황 데이터로서 연번, 대피장소명, 주소, 대피가능인원, 면적의 정보를 포함하고 있습니다.
Author인천광역시 계양구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15117236&srcSe=7661IVAWM27C61E190

Alerts

대피가능인원 is highly overall correlated with 면적(m2)High correlation
면적(m2) is highly overall correlated with 대피가능인원High correlation
연번 has unique valuesUnique
대피장소명 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2024-01-28 06:25:40.653299
Analysis finished2024-01-28 06:25:41.879899
Duration1.23 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct89
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean45
Minimum1
Maximum89
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size933.0 B
2024-01-28T15:25:41.944603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.4
Q123
median45
Q367
95-th percentile84.6
Maximum89
Range88
Interquartile range (IQR)44

Descriptive statistics

Standard deviation25.836021
Coefficient of variation (CV)0.57413381
Kurtosis-1.2
Mean45
Median Absolute Deviation (MAD)22
Skewness0
Sum4005
Variance667.5
MonotonicityStrictly increasing
2024-01-28T15:25:42.054482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
68 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
62 1
 
1.1%
61 1
 
1.1%
60 1
 
1.1%
59 1
 
1.1%
Other values (79) 79
88.8%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
89 1
1.1%
88 1
1.1%
87 1
1.1%
86 1
1.1%
85 1
1.1%
84 1
1.1%
83 1
1.1%
82 1
1.1%
81 1
1.1%
80 1
1.1%

대피장소명
Text

UNIQUE 

Distinct89
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size844.0 B
2024-01-28T15:25:42.223884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length7.9101124
Min length4

Characters and Unicode

Total characters704
Distinct characters89
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)100.0%

Sample

1st row효성서초등학교 운동장
2nd row북인천여자중학교 운동장
3rd row작동초등학교 운동장
4th row효성남초등학교 운동장
5th row성지초등학교 운동장
ValueCountFrequency (%)
운동장 53
37.1%
계양초등학교 2
 
1.4%
고향골공원 1
 
0.7%
계산공원 1
 
0.7%
계산1공원 1
 
0.7%
부일공원 1
 
0.7%
살나리공원 1
 
0.7%
계양교통공원 1
 
0.7%
작전공원2 1
 
0.7%
까치공원 1
 
0.7%
Other values (80) 80
55.9%
2024-01-28T15:25:42.498596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
61
 
8.7%
57
 
8.1%
56
 
8.0%
56
 
8.0%
55
 
7.8%
54
 
7.7%
38
 
5.4%
38
 
5.4%
37
 
5.3%
28
 
4.0%
Other values (79) 224
31.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 644
91.5%
Space Separator 54
 
7.7%
Decimal Number 6
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
61
 
9.5%
57
 
8.9%
56
 
8.7%
56
 
8.7%
55
 
8.5%
38
 
5.9%
38
 
5.9%
37
 
5.7%
28
 
4.3%
16
 
2.5%
Other values (76) 202
31.4%
Decimal Number
ValueCountFrequency (%)
1 4
66.7%
2 2
33.3%
Space Separator
ValueCountFrequency (%)
54
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 644
91.5%
Common 60
 
8.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
61
 
9.5%
57
 
8.9%
56
 
8.7%
56
 
8.7%
55
 
8.5%
38
 
5.9%
38
 
5.9%
37
 
5.7%
28
 
4.3%
16
 
2.5%
Other values (76) 202
31.4%
Common
ValueCountFrequency (%)
54
90.0%
1 4
 
6.7%
2 2
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 644
91.5%
ASCII 60
 
8.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
61
 
9.5%
57
 
8.9%
56
 
8.7%
56
 
8.7%
55
 
8.5%
38
 
5.9%
38
 
5.9%
37
 
5.7%
28
 
4.3%
16
 
2.5%
Other values (76) 202
31.4%
ASCII
ValueCountFrequency (%)
54
90.0%
1 4
 
6.7%
2 2
 
3.3%

주소
Text

UNIQUE 

Distinct89
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size844.0 B
2024-01-28T15:25:42.725413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length26
Mean length22.280899
Min length16

Characters and Unicode

Total characters1983
Distinct characters81
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)100.0%

Sample

1st row인천광역시 계양구 봉오대로457 (효성동)
2nd row인천광역시 계양구 아나지로85번길12 (효성동)
3rd row인천광역시 계양구 까치말로35 (작전동)
4th row인천광역시 계양구 새벌로125 (효성동)
5th row인천광역시 계양구 아나지로247번길8 (작전동)
ValueCountFrequency (%)
인천광역시 89
24.4%
계양구 89
24.4%
계산동 20
 
5.5%
작전동 13
 
3.6%
효성동 12
 
3.3%
동양동 8
 
2.2%
병방동 7
 
1.9%
용종동 6
 
1.6%
귤현동 6
 
1.6%
서운동 5
 
1.4%
Other values (101) 110
30.1%
2024-01-28T15:25:43.044050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
276
 
13.9%
117
 
5.9%
102
 
5.1%
98
 
4.9%
90
 
4.5%
89
 
4.5%
89
 
4.5%
89
 
4.5%
89
 
4.5%
89
 
4.5%
Other values (71) 855
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1262
63.6%
Decimal Number 299
 
15.1%
Space Separator 276
 
13.9%
Open Punctuation 65
 
3.3%
Close Punctuation 65
 
3.3%
Dash Punctuation 16
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
117
 
9.3%
102
 
8.1%
98
 
7.8%
90
 
7.1%
89
 
7.1%
89
 
7.1%
89
 
7.1%
89
 
7.1%
89
 
7.1%
63
 
5.0%
Other values (57) 347
27.5%
Decimal Number
ValueCountFrequency (%)
1 63
21.1%
5 40
13.4%
2 34
11.4%
4 27
9.0%
6 26
8.7%
9 24
 
8.0%
7 23
 
7.7%
3 23
 
7.7%
0 20
 
6.7%
8 19
 
6.4%
Space Separator
ValueCountFrequency (%)
276
100.0%
Open Punctuation
ValueCountFrequency (%)
( 65
100.0%
Close Punctuation
ValueCountFrequency (%)
) 65
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1262
63.6%
Common 721
36.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
117
 
9.3%
102
 
8.1%
98
 
7.8%
90
 
7.1%
89
 
7.1%
89
 
7.1%
89
 
7.1%
89
 
7.1%
89
 
7.1%
63
 
5.0%
Other values (57) 347
27.5%
Common
ValueCountFrequency (%)
276
38.3%
( 65
 
9.0%
) 65
 
9.0%
1 63
 
8.7%
5 40
 
5.5%
2 34
 
4.7%
4 27
 
3.7%
6 26
 
3.6%
9 24
 
3.3%
7 23
 
3.2%
Other values (4) 78
 
10.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1262
63.6%
ASCII 721
36.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
276
38.3%
( 65
 
9.0%
) 65
 
9.0%
1 63
 
8.7%
5 40
 
5.5%
2 34
 
4.7%
4 27
 
3.7%
6 26
 
3.6%
9 24
 
3.3%
7 23
 
3.2%
Other values (4) 78
 
10.8%
Hangul
ValueCountFrequency (%)
117
 
9.3%
102
 
8.1%
98
 
7.8%
90
 
7.1%
89
 
7.1%
89
 
7.1%
89
 
7.1%
89
 
7.1%
89
 
7.1%
63
 
5.0%
Other values (57) 347
27.5%

대피가능인원
Real number (ℝ)

HIGH CORRELATION 

Distinct87
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4693.4157
Minimum531
Maximum19353
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size933.0 B
2024-01-28T15:25:43.159856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum531
5-th percentile1100.6
Q12023
median4072
Q35334
95-th percentile13028.2
Maximum19353
Range18822
Interquartile range (IQR)3311

Descriptive statistics

Standard deviation3794.0897
Coefficient of variation (CV)0.80838559
Kurtosis5.5079312
Mean4693.4157
Median Absolute Deviation (MAD)1496
Skewness2.1745414
Sum417714
Variance14395116
MonotonicityNot monotonic
2024-01-28T15:25:43.265998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4212 2
 
2.2%
1357 2
 
2.2%
2644 1
 
1.1%
14461 1
 
1.1%
1229 1
 
1.1%
1260 1
 
1.1%
18904 1
 
1.1%
1015 1
 
1.1%
993 1
 
1.1%
1742 1
 
1.1%
Other values (77) 77
86.5%
ValueCountFrequency (%)
531 1
1.1%
711 1
1.1%
763 1
1.1%
993 1
1.1%
1015 1
1.1%
1229 1
1.1%
1260 1
1.1%
1276 1
1.1%
1287 1
1.1%
1338 1
1.1%
ValueCountFrequency (%)
19353 1
1.1%
18904 1
1.1%
17896 1
1.1%
15539 1
1.1%
14461 1
1.1%
10879 1
1.1%
9889 1
1.1%
9228 1
1.1%
8036 1
1.1%
8009 1
1.1%

면적(m2)
Real number (ℝ)

HIGH CORRELATION 

Distinct87
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3872.4944
Minimum438.2
Maximum15967
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size933.0 B
2024-01-28T15:25:43.393080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum438.2
5-th percentile908.46
Q11669.5
median3360
Q34400.9
95-th percentile10748.64
Maximum15967
Range15528.8
Interquartile range (IQR)2731.4

Descriptive statistics

Standard deviation3130.1117
Coefficient of variation (CV)0.80829341
Kurtosis5.5079327
Mean3872.4944
Median Absolute Deviation (MAD)1234.1
Skewness2.1745307
Sum344652
Variance9797599.2
MonotonicityNot monotonic
2024-01-28T15:25:43.500578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3475.5 2
 
2.2%
1120.0 2
 
2.2%
2181.9 1
 
1.1%
11930.8 1
 
1.1%
1014.3 1
 
1.1%
1039.5 1
 
1.1%
15596.0 1
 
1.1%
837.9 1
 
1.1%
819.7 1
 
1.1%
1437.8 1
 
1.1%
Other values (77) 77
86.5%
ValueCountFrequency (%)
438.2 1
1.1%
586.6 1
1.1%
630.0 1
1.1%
819.7 1
1.1%
837.9 1
1.1%
1014.3 1
1.1%
1039.5 1
1.1%
1053.5 1
1.1%
1061.9 1
1.1%
1104.6 1
1.1%
ValueCountFrequency (%)
15967.0 1
1.1%
15596.0 1
1.1%
14764.4 1
1.1%
12819.8 1
1.1%
11930.8 1
1.1%
8975.4 1
1.1%
8158.5 1
1.1%
7613.9 1
1.1%
6629.7 1
1.1%
6608.0 1
1.1%

Interactions

2024-01-28T15:25:41.535769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T15:25:40.952123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T15:25:41.337027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T15:25:41.598985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T15:25:41.007101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T15:25:41.398397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T15:25:41.669562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T15:25:41.070464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T15:25:41.466938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T15:25:43.574469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번대피장소명주소대피가능인원면적(m2)
연번1.0001.0001.0000.5380.538
대피장소명1.0001.0001.0001.0001.000
주소1.0001.0001.0001.0001.000
대피가능인원0.5381.0001.0001.0001.000
면적(m2)0.5381.0001.0001.0001.000
2024-01-28T15:25:43.649842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번대피가능인원면적(m2)
연번1.000-0.304-0.304
대피가능인원-0.3041.0001.000
면적(m2)-0.3041.0001.000

Missing values

2024-01-28T15:25:41.779864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T15:25:41.852737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번대피장소명주소대피가능인원면적(m2)
01효성서초등학교 운동장인천광역시 계양구 봉오대로457 (효성동)26442181.9
12북인천여자중학교 운동장인천광역시 계양구 아나지로85번길12 (효성동)80366629.7
23작동초등학교 운동장인천광역시 계양구 까치말로35 (작전동)42123475.5
34효성남초등학교 운동장인천광역시 계양구 새벌로125 (효성동)27502269.4
45성지초등학교 운동장인천광역시 계양구 아나지로247번길8 (작전동)41053387.3
56안남고등학교 운동장인천광역시 계양구 아나지로308번길9 (작전동)52134301.5
67화전초등학교 운동장인천광역시 계양구 효서로303번길8 (작전동)41553427.9
78서운초등학교 운동장인천광역시 계양구 서운로 12 (서운동)34212822.4
89서운중학교 운동장인천광역시 계양구 아나지로467 (서운동)40093307.5
910서운고등학교 운동장인천광역시 계양구 아나지로481 (서운동)40723360.0
연번대피장소명주소대피가능인원면적(m2)
7980낙원공원인천광역시 계양구 임학동 25763630.0
8081샘터공원인천광역시 계양구 귤현동 509-513571120.0
8182안골공원인천광역시 계양구 귤현동 486-717731463.0
8283양지말공원인천광역시 계양구 동양동 59276006270.6
8384동양공원1인천광역시 계양구 동양동 615-3711586.6
8485동산공원인천광역시 계양구 장기동 13141113392.2
8586장기공원인천광역시 계양구 장기동 12414281178.8
8687중앙공원인천광역시 계양구 장기동 13620231669.5
8788장터공원인천광역시 계양구 장기동 14618691542.1
8889계양초등학교 상야분교 운동장인천광역시 계양구 벌말로572-1 (상야동)14421190.0