Overview

Dataset statistics

Number of variables: 7
Number of observations: 3624
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 212.5 KiB
Average record size in memory: 60.0 B
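
The average record size can be checked from the two figures above. This is a minimal sketch that assumes the profiler simply divides the total in-memory size by the number of observations; the variable names are illustrative.

```python
# Figures copied from the dataset statistics above.
total_size_kib = 212.5
n_observations = 3624

# 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")  # 60.0 B
```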

Variable types

Categorical: 4
Numeric: 2
Unsupported: 1

Dataset

Description: File download (파일 다운로드)
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15473/F/1/datasetView.do

Alerts

측정지점 has constant value "성수동" (Constant)
년도 is highly overall correlated with [month] (High correlation)
[month] is highly overall correlated with 년도 and 1 other field (High correlation)
비고 is highly overall correlated with [month] (High correlation)
비고 is highly imbalanced (55.9%) (Imbalance)
소음도 is an unsupported type; check if it needs cleaning or further analysis (Unsupported)
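
The 55.9% imbalance figure for 비고 is consistent with one minus the normalized Shannon entropy of its value counts. That the profiler uses exactly this formula is an assumption here, not something the report states; the function name `imbalance_score` is illustrative.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k): 0 for a uniform split, 1 when one class has every row."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # convention chosen for this sketch
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Value counts of 비고 as listed in this report:
# <NA>, 장비 및 시스템 교체, 통신장애.
remarks_counts = [2946, 677, 1]
score = imbalance_score(remarks_counts)
print(f"{score:.1%}")  # 55.9%
```

A perfectly balanced three-class column would score 0%, so 55.9% reflects how strongly the single dominant value outweighs the other two.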

Reproduction

Analysis started: 2023-12-11 05:47:07.223321
Analysis finished: 2023-12-11 05:47:07.966160
Duration: 0.74 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
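
A report of this kind can be regenerated along the following lines. This is a sketch under stated assumptions, not the exact procedure used for this report: it assumes `pandas` and `ydata-profiling` are installed, and the file names, encoding, and report title are hypothetical.

```python
def build_profile(csv_path: str, html_path: str, encoding: str = "utf-8") -> None:
    """Profile a CSV file and write the resulting HTML report to html_path."""
    # Imported lazily so this module loads even when the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="Profiling Report")
    profile.to_file(html_path)


# Hypothetical usage, with the CSV downloaded from the dataset URL:
# build_profile("noise.csv", "noise_report.html")
```

Files published by Korean public data portals are sometimes encoded as CP949 rather than UTF-8, so the `encoding` argument may need adjusting.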

Variables

년도 (year)
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 28.4 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2018
2nd row: 2018
3rd row: 2018
4th row: 2018
5th row: 2018

Common Values

Value  Count  Frequency (%)
2019   2160   59.6%
2018   1464   40.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the value counts listed under Common Values]


[month]
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 28.4 KiB

Length

Max length: 2
Median length: 1
Mean length: 1.4039735
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 11
2nd row: 11
3rd row: 11
4th row: 11
5th row: 11

Common Values

Value  Count  Frequency (%)
12     744    20.5%
1      744    20.5%
3      744    20.5%
11     720    19.9%
2      672    18.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the value counts listed under Common Values]


[day]
Real number (ℝ)

Distinct: 31
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15.622517
Minimum: 1
Maximum: 31
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 32.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 8
Median: 16
Q3: 23
95-th percentile: 29
Maximum: 31
Range: 30
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 8.752171
Coefficient of variation (CV): 0.56022799
Kurtosis: -1.1850814
Mean: 15.622517
Median Absolute Deviation (MAD): 8
Skewness: 0.013961409
Sum: 56616
Variance: 76.600498
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=31)]
Common values

Value              Count  Frequency (%)
1                  120    3.3%
16                 120    3.3%
28                 120    3.3%
27                 120    3.3%
26                 120    3.3%
25                 120    3.3%
24                 120    3.3%
23                 120    3.3%
22                 120    3.3%
21                 120    3.3%
Other values (21)  2424   66.9%

10 smallest values

Value  Count  Frequency (%)
1      120    3.3%
2      120    3.3%
3      120    3.3%
4      120    3.3%
5      120    3.3%
6      120    3.3%
7      120    3.3%
8      120    3.3%
9      120    3.3%
10     120    3.3%

10 largest values

Value  Count  Frequency (%)
31     72     2.0%
30     96     2.6%
29     96     2.6%
28     120    3.3%
27     120    3.3%
26     120    3.3%
25     120    3.3%
24     120    3.3%
23     120    3.3%
22     120    3.3%
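
The descriptive statistics for this variable can be cross-checked from its value counts. The sketch below assumes every day from 1 to 28 occurs 120 times; the tables list this explicitly only for some of those days, and the rest is inferred from the totals. Days 29 and 30 occur 96 times and day 31 occurs 72 times, as listed. The match with the reported variance suggests it uses the sample (n - 1) denominator.

```python
import statistics

# Assumed value counts (see the note above about the inferred days).
day_counts = {day: 120 for day in range(1, 29)}
day_counts.update({29: 96, 30: 96, 31: 72})

# Expand the counts back into one value per observation.
days = [day for day, count in day_counts.items() for _ in range(count)]

mean = statistics.mean(days)
variance = statistics.variance(days)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(days)      # sample standard deviation
cv = std_dev / mean                   # coefficient of variation

print(len(days), sum(days))
print(round(mean, 6), round(variance, 6), round(std_dev, 6), round(cv, 6))
```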

시간 (hour)
Real number (ℝ)

Distinct: 24
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.5
Minimum: 1
Maximum: 24
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 32.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 6.75
Median: 12.5
Q3: 18.25
95-th percentile: 23
Maximum: 24
Range: 23
Interquartile range (IQR): 11.5

Descriptive statistics

Standard deviation: 6.9231418
Coefficient of variation (CV): 0.55385134
Kurtosis: -1.2041795
Mean: 12.5
Median Absolute Deviation (MAD): 6
Skewness: 0
Sum: 45300
Variance: 47.929892
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=24)]
Common values

Value              Count  Frequency (%)
1                  151    4.2%
14                 151    4.2%
24                 151    4.2%
23                 151    4.2%
22                 151    4.2%
21                 151    4.2%
20                 151    4.2%
19                 151    4.2%
18                 151    4.2%
17                 151    4.2%
Other values (14)  2114   58.3%

10 smallest values

Value  Count  Frequency (%)
1      151    4.2%
2      151    4.2%
3      151    4.2%
4      151    4.2%
5      151    4.2%
6      151    4.2%
7      151    4.2%
8      151    4.2%
9      151    4.2%
10     151    4.2%

10 largest values

Value  Count  Frequency (%)
24     151    4.2%
23     151    4.2%
22     151    4.2%
21     151    4.2%
20     151    4.2%
19     151    4.2%
18     151    4.2%
17     151    4.2%
16     151    4.2%
15     151    4.2%

측정지점 (measurement site)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 28.4 KiB

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 성수동
2nd row: 성수동
3rd row: 성수동
4th row: 성수동
5th row: 성수동

Common Values

Value   Count  Frequency (%)
성수동  3624   100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the value counts listed under Common Values]

소음도 (noise level)
Unsupported

REJECTED  UNSUPPORTED 

Missing: 0
Missing (%): 0.0%
Memory size: 28.4 KiB
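
The report does not say why 소음도 was rejected as unsupported. One common cause is a column that mixes numeric and non-numeric cells, and the sketch below shows a cleaning step under that assumption. The helper `to_float_or_none` is hypothetical, and the raw cells are invented for illustration (only the numeric readings follow the style of the sample rows).

```python
def to_float_or_none(raw):
    """Convert one raw cell to float, returning None when it cannot be parsed."""
    if raw is None:
        return None
    try:
        return float(str(raw).strip())
    except ValueError:
        return None


# Hypothetical raw cells: readings as strings or floats, plus unparseable entries.
raw_values = ["70.360396", 75.1, " 68.94844 ", "", "-", None]
cleaned = [to_float_or_none(value) for value in raw_values]
print(cleaned)  # [70.360396, 75.1, 68.94844, None, None, None]
```

Once every cell is either a float or a missing marker, the column can be profiled as a numeric variable.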

비고 (remarks)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 28.4 KiB

Length

Max length: 11
Median length: 4
Mean length: 5.3076711
Min length: 4

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value                                                    Count  Frequency (%)
<NA>                                                     2946   81.3%
장비 및 시스템 교체 (equipment and system replacement)   677    18.7%
통신장애 (communication failure)                         1      < 0.1%

Length

[Figure: Histogram of lengths of the category]
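
The length statistics above indicate that `<NA>` is counted as a four-character string rather than as a missing value, which would explain why 비고 reports 0 missing cells. This interpretation is an assumption; the sketch below only checks that the reported mean length follows from the listed value counts.

```python
# Value counts of 비고 as listed in this report.
value_counts = {"<NA>": 2946, "장비 및 시스템 교체": 677, "통신장애": 1}

total_rows = sum(value_counts.values())
total_chars = sum(len(value) * count for value, count in value_counts.items())
mean_length = total_chars / total_rows

# Share of rows holding the "<NA>" placeholder string.
placeholder_share = value_counts["<NA>"] / total_rows

print(f"mean length = {mean_length:.7f}")
print(f"share of '<NA>' strings = {placeholder_share:.1%}")
```

If the placeholder is meant to mark missing data, roughly four out of five 비고 cells are effectively empty.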

Common Values (Plot)

[Figure: plot of word counts in the column values]

Value     Count  Frequency (%)
na        2946   52.1%
장비      677    12.0%
및        677    12.0%
시스템    677    12.0%
교체      677    12.0%
통신장애  1      < 0.1%

Interactions

[Figures: four interaction plots]

Correlations

[Figure: correlation heatmap]

          년도    [month]  [day]  시간   비고
년도      1.000  1.000    0.000  0.000  0.705
[month]   1.000  1.000    0.069  0.000  1.000
[day]     0.000  0.069    1.000  0.000  0.085
시간      0.000  0.000    0.000  1.000  0.000
비고      0.705  1.000    0.085  0.000  1.000

[Figure: correlation heatmap]

          년도    [month]  비고
년도      1.000  1.000    0.498
[month]   1.000  1.000    0.999
비고      0.498  0.999    1.000

[Figure: correlation heatmap]

          [day]  시간   년도    [month]  비고
[day]     1.000  0.000  0.000  0.050    0.021
시간      0.000  1.000  0.000  0.000    0.000
년도      0.000  0.000  1.000  1.000    0.498
[month]   0.050  0.000  1.000  1.000    0.999
비고      0.021  0.000  0.498  0.999    1.000
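
The 1.000 between 년도 and the month column is what one would expect when each month falls in exactly one year. The sketch below computes the uncorrected Cramér's V on a contingency table inferred from the value counts (months 11 and 12 in 2018; months 1, 2 and 3 in 2019). Both the table and the choice of statistic are assumptions; the report does not name the measure behind each matrix.

```python
import math

# Assumed contingency table: rows are years, columns are months 11, 12, 1, 2, 3.
table = {
    2018: [720, 744, 0, 0, 0],
    2019: [0, 0, 744, 672, 744],
}


def cramers_v(rows):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in rows)
    row_totals = [sum(row) for row in rows]
    col_totals = [sum(col) for col in zip(*rows)]
    chi2 = 0.0
    for i, row in enumerate(rows):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(rows), len(rows[0])) - 1
    return math.sqrt(chi2 / (n * k))


v = cramers_v(list(table.values()))
print(round(v, 3))  # 1.0
```

A value of 1 here says only that the month determines the year within this extract (November 2018 through March 2019), not that the two fields carry independent information.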

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

First rows

Index  년도   [month]  [day]  시간  측정지점  소음도      비고
0      2018  11       1      1     성수동    70.360396  <NA>
1      2018  11       1      2     성수동    68.94844   <NA>
2      2018  11       1      3     성수동    68.233438  <NA>
3      2018  11       1      4     성수동    67.761103  <NA>
4      2018  11       1      5     성수동    68.158605  <NA>
5      2018  11       1      6     성수동    69.201015  <NA>
6      2018  11       1      7     성수동    71.002103  <NA>
7      2018  11       1      8     성수동    73.009199  <NA>
8      2018  11       1      9     성수동    72.758199  <NA>
9      2018  11       1      10    성수동    71.920365  <NA>

Last rows

Index  년도   [month]  [day]  시간  측정지점  소음도  비고
3614   2019  3        31     15    성수동    75.1    <NA>
3615   2019  3        31     16    성수동    74.6    <NA>
3616   2019  3        31     17    성수동    74.9    <NA>
3617   2019  3        31     18    성수동    74.3    <NA>
3618   2019  3        31     19    성수동    74.4    <NA>
3619   2019  3        31     20    성수동    73.6    <NA>
3620   2019  3        31     21    성수동    73.8    <NA>
3621   2019  3        31     22    성수동    73.6    <NA>
3622   2019  3        31     23    성수동    72.7    <NA>
3623   2019  3        31     24    성수동    71.3    <NA>