Overview

Dataset statistics

Number of variables: 5
Number of observations: 351
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 14.2 KiB
Average record size in memory: 41.4 B

Variable types

Numeric: 1
Text: 2
Categorical: 2

Dataset

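Description: Data on buildings subject to asbestos surveys in Gyeyang-gu, Incheon Metropolitan City. It provides the institution name, the building location, and the category (multi-use facility, public building, daycare center, etc.).
Author: 인천광역시 계양구 (Gyeyang-gu, Incheon Metropolitan City)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=3073073&srcSe=7661IVAWM27C61E190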

Alerts

데이터기준일 has constant value "2023-04-28" (Constant)
연번 is highly overall correlated with 구분 (High correlation)
구분 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)

Reproduction

Analysis started: 2024-01-28 08:03:27.149681
Analysis finished: 2024-01-28 08:03:27.630058
Duration: 0.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
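
The reported duration can be checked against the two timestamps. The sketch below uses only the Python standard library; the variable names are illustrative and are not part of the report.

```python
from datetime import datetime

# Timestamps copied from the "Analysis started" / "Analysis finished" fields.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-01-28 08:03:27.149681", FMT)
finished = datetime.strptime("2024-01-28 08:03:27.630058", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```

The difference is 0.480377 s, which rounds to the 0.48 seconds shown above.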

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 351
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 176
Minimum: 1
Maximum: 351
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 18.5
Q1: 88.5
Median: 176
Q3: 263.5
95-th percentile: 333.5
Maximum: 351
Range: 350
Interquartile range (IQR): 175

Descriptive statistics

Standard deviation: 101.46921
Coefficient of variation (CV): 0.57652959
Kurtosis: -1.2
Mean: 176
Median Absolute Deviation (MAD): 88
Skewness: 0
Sum: 61776
Variance: 10296
Monotonicity: Strictly increasing
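
These figures are what one would expect if 연번 is simply the sequence 1, 2, ..., 351, which the distinct count, minimum, maximum, and strict monotonicity all suggest. The sketch below recomputes them under that assumption with the standard library. The `quantile` helper is a hypothetical name, and its linear-interpolation convention is an assumption chosen because it reproduces the reported percentiles.

```python
import statistics

# Assumption: 연번 is exactly the integers 1..351 (not read from the dataset).
values = list(range(1, 352))

def quantile(sorted_vals, q):
    """Quantile by linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_vals) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (pos - lo) * (sorted_vals[hi] - sorted_vals[lo])

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, divisor n - 1
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)
q1, q3 = quantile(values, 0.25), quantile(values, 0.75)
p5, p95 = quantile(values, 0.05), quantile(values, 0.95)

print(total, mean, variance, round(std, 5), round(cv, 8))
print(median, mad, p5, q1, q3, p95, q3 - q1)
```

For the sequence 1..n the sample variance is n(n+1)/12, which gives 351 × 352 / 12 = 10296, matching the reported value.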
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
241 | 1 | 0.3%
240 | 1 | 0.3%
239 | 1 | 0.3%
238 | 1 | 0.3%
237 | 1 | 0.3%
236 | 1 | 0.3%
235 | 1 | 0.3%
234 | 1 | 0.3%
Other values (341) | 341 | 97.2%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
351 | 1 | 0.3%
350 | 1 | 0.3%
349 | 1 | 0.3%
348 | 1 | 0.3%
347 | 1 | 0.3%
346 | 1 | 0.3%
345 | 1 | 0.3%
344 | 1 | 0.3%
343 | 1 | 0.3%
342 | 1 | 0.3%

기관(상호)명
Text

Distinct: 348
Distinct (%): 99.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

Length

Max length: 18
Median length: 16
Mean length: 7.5925926
Min length: 4

Characters and Unicode

Total characters: 2665
Distinct characters: 345
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 345
Unique (%): 98.3%
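
The ratios for this text variable follow from the raw counts and the 351 observations. The sketch below recomputes them. It assumes the usual profiling definitions, namely that "distinct" counts different values and "unique" counts values that occur exactly once; the variable names are illustrative.

```python
# Counts copied from the report for this text variable.
n_rows = 351
total_chars = 2665
distinct_values = 348
unique_values = 345

mean_length = total_chars / n_rows
distinct_pct = 100 * distinct_values / n_rows
unique_pct = 100 * unique_values / n_rows

# Values that occur more than once, and the rows they account for
# (valid only under the definitions assumed above).
repeated_values = distinct_values - unique_values
rows_with_repeats = n_rows - unique_values

print(f"{mean_length:.7f} {distinct_pct:.1f}% {unique_pct:.1f}%")
print(repeated_values, rows_with_repeats)
```

Under those definitions, three names account for the six non-unique rows, so each of them appears exactly twice.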

Sample

1st row: 함춘요양병원
2nd row: 인천지일이삼공학원
3rd row: 연세소망병원(큰사랑요양병원)
4th row: 인천서부보호관찰소
5th row: 천지연보석불가마사우나

Value | Count | Frequency (%)
어린이집 | 14 | 3.5%
인천교통공사 | 7 | 1.8%
구립 | 3 | 0.8%
부평농협 | 3 | 0.8%
pc방 | 2 | 0.5%
노블pc방 | 2 | 0.5%
한국도로공사 | 2 | 0.5%
계양역 | 2 | 0.5%
계양새마을금고 | 2 | 0.5%
대산월드프라자 | 2 | 0.5%
Other values (359) | 361 | 90.2%

Most occurring characters

ValueCountFrequency (%)
223
 
8.4%
197
 
7.4%
195
 
7.3%
191
 
7.2%
60
 
2.3%
50
 
1.9%
44
 
1.7%
43
 
1.6%
42
 
1.6%
40
 
1.5%
Other values (335) 1580
59.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2538
95.2%
Space Separator 50
 
1.9%
Uppercase Letter 34
 
1.3%
Decimal Number 18
 
0.7%
Lowercase Letter 11
 
0.4%
Open Punctuation 7
 
0.3%
Close Punctuation 7
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
223
 
8.8%
197
 
7.8%
195
 
7.7%
191
 
7.5%
60
 
2.4%
44
 
1.7%
43
 
1.7%
42
 
1.7%
40
 
1.6%
32
 
1.3%
Other values (302) 1471
58.0%
Uppercase Letter
ValueCountFrequency (%)
C 9
26.5%
P 8
23.5%
B 2
 
5.9%
S 2
 
5.9%
G 1
 
2.9%
K 1
 
2.9%
V 1
 
2.9%
L 1
 
2.9%
X 1
 
2.9%
F 1
 
2.9%
Other values (7) 7
20.6%
Lowercase Letter
ValueCountFrequency (%)
i 3
27.3%
e 2
18.2%
d 1
 
9.1%
s 1
 
9.1%
c 1
 
9.1%
n 1
 
9.1%
h 1
 
9.1%
p 1
 
9.1%
Decimal Number
ValueCountFrequency (%)
1 8
44.4%
2 4
22.2%
9 3
 
16.7%
4 2
 
11.1%
3 1
 
5.6%
Space Separator
ValueCountFrequency (%)
50
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2538
95.2%
Common 82
 
3.1%
Latin 45
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
223
 
8.8%
197
 
7.8%
195
 
7.7%
191
 
7.5%
60
 
2.4%
44
 
1.7%
43
 
1.7%
42
 
1.7%
40
 
1.6%
32
 
1.3%
Other values (302) 1471
58.0%
Latin
ValueCountFrequency (%)
C 9
20.0%
P 8
17.8%
i 3
 
6.7%
B 2
 
4.4%
S 2
 
4.4%
e 2
 
4.4%
G 1
 
2.2%
d 1
 
2.2%
K 1
 
2.2%
s 1
 
2.2%
Other values (15) 15
33.3%
Common
ValueCountFrequency (%)
50
61.0%
1 8
 
9.8%
( 7
 
8.5%
) 7
 
8.5%
2 4
 
4.9%
9 3
 
3.7%
4 2
 
2.4%
3 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2538
95.2%
ASCII 127
 
4.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
223
 
8.8%
197
 
7.8%
195
 
7.7%
191
 
7.5%
60
 
2.4%
44
 
1.7%
43
 
1.7%
42
 
1.7%
40
 
1.6%
32
 
1.3%
Other values (302) 1471
58.0%
ASCII
ValueCountFrequency (%)
50
39.4%
C 9
 
7.1%
P 8
 
6.3%
1 8
 
6.3%
( 7
 
5.5%
) 7
 
5.5%
2 4
 
3.1%
i 3
 
2.4%
9 3
 
2.4%
4 2
 
1.6%
Other values (23) 26
20.5%

건축물 소재지
Text

Distinct: 333
Distinct (%): 94.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

Length

Max length: 50
Median length: 45
Mean length: 31.048433
Min length: 21

Characters and Unicode

Total characters: 10898
Distinct characters: 193
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 317
Unique (%): 90.3%

Sample

1st row: 인천광역시 계양구 오조산로57번길 11-1 (계산동)
2nd row: 인천광역시 계양구 장제로 708 (작전동)
3rd row: 인천광역시 계양구 오조산로57번길 11-1 (계산동)
4th row: 인천광역시 계양구 경명대로 1022 (계산동)
5th row: 인천광역시 계양구 장제로 718 (작전동)

Value | Count | Frequency (%)
인천광역시 | 352 | 17.2%
계양구 | 352 | 17.2%
계산동 | 79 | 3.9%
작전동 | 60 | 2.9%
장제로 | 34 | 1.7%
효성동 | 26 | 1.3%
효서로 | 23 | 1.1%
아나지로 | 17 | 0.8%
도두리로 | 16 | 0.8%
계산새로 | 14 | 0.7%
Other values (543) | 1076 | 52.5%

Most occurring characters

ValueCountFrequency (%)
1711
 
15.7%
508
 
4.7%
1 484
 
4.4%
466
 
4.3%
413
 
3.8%
353
 
3.2%
353
 
3.2%
353
 
3.2%
352
 
3.2%
352
 
3.2%
Other values (183) 5553
51.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6508
59.7%
Decimal Number 1855
 
17.0%
Space Separator 1711
 
15.7%
Open Punctuation 311
 
2.9%
Close Punctuation 311
 
2.9%
Other Punctuation 139
 
1.3%
Dash Punctuation 38
 
0.3%
Uppercase Letter 17
 
0.2%
Math Symbol 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
508
 
7.8%
466
 
7.2%
413
 
6.3%
353
 
5.4%
353
 
5.4%
353
 
5.4%
352
 
5.4%
352
 
5.4%
352
 
5.4%
345
 
5.3%
Other values (159) 2661
40.9%
Decimal Number
ValueCountFrequency (%)
1 484
26.1%
0 282
15.2%
2 213
11.5%
3 160
 
8.6%
4 149
 
8.0%
5 138
 
7.4%
7 129
 
7.0%
6 121
 
6.5%
8 117
 
6.3%
9 62
 
3.3%
Uppercase Letter
ValueCountFrequency (%)
R 3
17.6%
K 3
17.6%
A 3
17.6%
P 3
17.6%
I 3
17.6%
D 1
 
5.9%
B 1
 
5.9%
Other Punctuation
ValueCountFrequency (%)
, 138
99.3%
/ 1
 
0.7%
Space Separator
ValueCountFrequency (%)
1711
100.0%
Open Punctuation
ValueCountFrequency (%)
( 311
100.0%
Close Punctuation
ValueCountFrequency (%)
) 311
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6508
59.7%
Common 4373
40.1%
Latin 17
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
508
 
7.8%
466
 
7.2%
413
 
6.3%
353
 
5.4%
353
 
5.4%
353
 
5.4%
352
 
5.4%
352
 
5.4%
352
 
5.4%
345
 
5.3%
Other values (159) 2661
40.9%
Common
ValueCountFrequency (%)
1711
39.1%
1 484
 
11.1%
( 311
 
7.1%
) 311
 
7.1%
0 282
 
6.4%
2 213
 
4.9%
3 160
 
3.7%
4 149
 
3.4%
5 138
 
3.2%
, 138
 
3.2%
Other values (7) 476
 
10.9%
Latin
ValueCountFrequency (%)
R 3
17.6%
K 3
17.6%
A 3
17.6%
P 3
17.6%
I 3
17.6%
D 1
 
5.9%
B 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6508
59.7%
ASCII 4390
40.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1711
39.0%
1 484
 
11.0%
( 311
 
7.1%
) 311
 
7.1%
0 282
 
6.4%
2 213
 
4.9%
3 160
 
3.6%
4 149
 
3.4%
5 138
 
3.1%
, 138
 
3.1%
Other values (14) 493
 
11.2%
Hangul
ValueCountFrequency (%)
508
 
7.8%
466
 
7.2%
413
 
6.3%
353
 
5.4%
353
 
5.4%
353
 
5.4%
352
 
5.4%
352
 
5.4%
352
 
5.4%
345
 
5.3%
Other values (159) 2661
40.9%

구분
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB
어린이집: 158
다중이용시설: 138
공공건축물: 55

Length

Max length: 6
Median length: 5
Mean length: 4.9430199
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 다중이용시설
2nd row: 다중이용시설
3rd row: 다중이용시설
4th row: 공공건축물
5th row: 다중이용시설

Common Values

Value | Count | Frequency (%)
어린이집 | 158 | 45.0%
다중이용시설 | 138 | 39.3%
공공건축물 | 55 | 15.7%
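
The frequency percentages and the mean length reported for 구분 can be recomputed from the three category counts alone. The sketch below does so with the standard library. The dictionary is copied from the table above, and the NFC normalisation step is an assumption made so that each Hangul syllable counts as one character.

```python
import unicodedata

# Category counts copied from the Common Values table.
counts = {"어린이집": 158, "다중이용시설": 138, "공공건축물": 55}

n_rows = sum(counts.values())

# Share of each category, rounded to one decimal as in the report.
shares = {name: round(100 * c / n_rows, 1) for name, c in counts.items()}

# Mean string length, weighting each category's length by its row count.
lengths = {name: len(unicodedata.normalize("NFC", name)) for name in counts}
mean_length = sum(lengths[name] * c for name, c in counts.items()) / n_rows

print(n_rows, shares, round(mean_length, 7))
```

The weighted sum is 158 × 4 + 138 × 6 + 55 × 5 = 1735 characters over 351 rows, which gives the reported mean length of 4.9430199.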

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values]

데이터기준일
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB
2023-04-28: 351

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-04-28
2nd row: 2023-04-28
3rd row: 2023-04-28
4th row: 2023-04-28
5th row: 2023-04-28

Common Values

Value | Count | Frequency (%)
2023-04-28 | 351 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values]

Interactions


Correlations

 | 연번 | 구분
연번 | 1.000 | 0.848
구분 | 0.848 | 1.000

 | 연번 | 구분
연번 | 1.000 | 0.761
구분 | 0.761 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

 | 연번 | 기관(상호)명 | 건축물 소재지 | 구분 | 데이터기준일
0 | 1 | 함춘요양병원 | 인천광역시 계양구 오조산로57번길 11-1 (계산동) | 다중이용시설 | 2023-04-28
1 | 2 | 인천지일이삼공학원 | 인천광역시 계양구 장제로 708 (작전동) | 다중이용시설 | 2023-04-28
2 | 3 | 연세소망병원(큰사랑요양병원) | 인천광역시 계양구 오조산로57번길 11-1 (계산동) | 다중이용시설 | 2023-04-28
3 | 4 | 인천서부보호관찰소 | 인천광역시 계양구 경명대로 1022 (계산동) | 공공건축물 | 2023-04-28
4 | 5 | 천지연보석불가마사우나 | 인천광역시 계양구 장제로 718 (작전동) | 다중이용시설 | 2023-04-28
5 | 6 | 성베드로한방병원 | 인천광역시 계양구 장제로 785 (계산동) | 다중이용시설 | 2023-04-28
6 | 7 | 신명스카이홈지하주차장 | 인천광역시 계양구 계산새로87번길 16 (용종동) | 다중이용시설 | 2023-04-28
7 | 8 | 우림카이저펠리스 | 인천광역시 계양구 아나지로 332 (작전동) | 다중이용시설 | 2023-04-28
8 | 9 | 메트로몰 계양점 | 인천광역시 계양구 장제로 738 (작전동) | 다중이용시설 | 2023-04-28
9 | 10 | 고운빛요양원 | 인천광역시 계양구 도두리로 25 (계산동) | 다중이용시설 | 2023-04-28

 | 연번 | 기관(상호)명 | 건축물 소재지 | 구분 | 데이터기준일
341 | 342 | 예진어린이집 | 인천광역시 계양구 하느재로 15 삼환아파트 102동 101호 | 어린이집 | 2023-04-28
342 | 343 | 해피아이어린이집 | 인천광역시 계양구 경명대로1142번길 3 107동 103호(계산동, 주공아파트) | 어린이집 | 2023-04-28
343 | 344 | 진주어린이집 | 인천광역시 계양구 경명대로1142번길 3 103동 111호(계산동, 주공아파트) | 어린이집 | 2023-04-28
344 | 345 | 포도나무어린이집 | 인천광역시 계양구 경명대로1142번길 3 102동 110호(계산동, 주공아파트) | 어린이집 | 2023-04-28
345 | 346 | 앰코어린이집 | 인천광역시 계양구 아나지로 110 D동 1층 | 어린이집 | 2023-04-28
346 | 347 | 신나는어린이집 | 인천광역시 계양구 봉오대로691번길 4 102동 101호(작전동, 코오롱아파트) | 어린이집 | 2023-04-28
347 | 348 | 꿈초롱어린이집 | 인천광역시 계양구 경명대로 1126 14동 103호(계산동, 삼보3차아파트) | 어린이집 | 2023-04-28
348 | 349 | 엄지어린이집 | 인천광역시 계양구 경명대로1017번길 5-18 | 어린이집 | 2023-04-28
349 | 350 | 솔빛어린이집 | 인천광역시 계양구 향교로6번길 14-3 | 어린이집 | 2023-04-28
350 | 351 | 잼잼어린이집 | 인천광역시 계양구 주부토로413번길 17 B동 106호(작전동, 공작아파트) | 어린이집 | 2023-04-28