Overview

Dataset statistics

Number of variables7
Number of observations4844
Missing cells0
Missing cells (%)0.0%
Duplicate rows662
Duplicate rows (%)13.7%
Total size in memory274.5 KiB
Average record size in memory58.0 B

Variable types

DateTime2
Categorical4
Numeric1

Dataset

Description부산광역시사하구_U-옥외광고물통합관리시스템_행정처분정보_20221021
Author부산광역시 사하구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15093349

Alerts

시도 has constant value ""Constant
구군 has constant value ""Constant
Dataset has 662 (13.7%) duplicate rowsDuplicates
전수조사건수 is highly imbalanced (70.2%)Imbalance

Reproduction

Analysis started2023-12-10 17:05:45.466552
Analysis finished2023-12-10 17:05:46.123598
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct15
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size38.0 KiB
Minimum2021-03-31 00:00:00
Maximum2021-10-01 00:00:00
2023-12-11T02:05:46.188050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:05:46.327306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
Distinct45
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size38.0 KiB
Minimum2020-09-12 00:00:00
Maximum2020-11-04 00:00:00
2023-12-11T02:05:46.471084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:05:46.631680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)

시도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size38.0 KiB
부산광역시
4844 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
부산광역시 4844
100.0%

Length

2023-12-11T02:05:47.064526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:05:47.161109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 4844
100.0%

구군
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size38.0 KiB
사하구
4844 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사하구
2nd row사하구
3rd row사하구
4th row사하구
5th row사하구

Common Values

ValueCountFrequency (%)
사하구 4844
100.0%

Length

2023-12-11T02:05:47.280393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:05:47.406033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사하구 4844
100.0%

읍면동
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size38.0 KiB
괴정동
1098 
하단동
752 
다대동
704 
장림동
700 
당리동
665 
Other values (2)
925 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row괴정동
2nd row괴정동
3rd row괴정동
4th row괴정동
5th row괴정동

Common Values

ValueCountFrequency (%)
괴정동 1098
22.7%
하단동 752
15.5%
다대동 704
14.5%
장림동 700
14.5%
당리동 665
13.7%
신평동 601
12.4%
감천동 324
 
6.7%

Length

2023-12-11T02:05:47.515540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:05:47.627700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
괴정동 1098
22.7%
하단동 752
15.5%
다대동 704
14.5%
장림동 700
14.5%
당리동 665
13.7%
신평동 601
12.4%
감천동 324
 
6.7%

전수조사건수
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size38.0 KiB
1
4201 
2
620 
3
 
22
4
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 4201
86.7%
2 620
 
12.8%
3 22
 
0.5%
4 1
 
< 0.1%

Length

2023-12-11T02:05:47.749902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:05:47.873426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 4201
86.7%
2 620
 
12.8%
3 22
 
0.5%
4 1
 
< 0.1%

계고장발송횟수
Real number (ℝ)

Distinct8
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.9731627
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size42.7 KiB
2023-12-11T02:05:47.982468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile4
Maximum8
Range7
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.1287645
Coefficient of variation (CV)0.5720585
Kurtosis1.6436499
Mean1.9731627
Median Absolute Deviation (MAD)1
Skewness1.2764531
Sum9558
Variance1.2741093
MonotonicityNot monotonic
2023-12-11T02:05:48.119068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1 2115
43.7%
2 1501
31.0%
3 673
 
13.9%
4 409
 
8.4%
5 109
 
2.3%
6 21
 
0.4%
7 13
 
0.3%
8 3
 
0.1%
ValueCountFrequency (%)
1 2115
43.7%
2 1501
31.0%
3 673
 
13.9%
4 409
 
8.4%
5 109
 
2.3%
6 21
 
0.4%
7 13
 
0.3%
8 3
 
0.1%
ValueCountFrequency (%)
8 3
 
0.1%
7 13
 
0.3%
6 21
 
0.4%
5 109
 
2.3%
4 409
 
8.4%
3 673
 
13.9%
2 1501
31.0%
1 2115
43.7%

Interactions

2023-12-11T02:05:45.746416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:05:48.202281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계고장발송일자전수조사일읍면동전수조사건수계고장발송횟수
계고장발송일자1.0000.8560.6400.0210.360
전수조사일0.8561.0000.9720.1210.445
읍면동0.6400.9721.0000.0510.265
전수조사건수0.0210.1210.0511.0000.000
계고장발송횟수0.3600.4450.2650.0001.000
2023-12-11T02:05:48.295763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
읍면동전수조사건수
읍면동1.0000.035
전수조사건수0.0351.000
2023-12-11T02:05:48.368639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계고장발송횟수읍면동전수조사건수
계고장발송횟수1.0000.1450.000
읍면동0.1451.0000.035
전수조사건수0.0000.0351.000

Missing values

2023-12-11T02:05:45.928326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:05:46.059980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

계고장발송일자전수조사일시도구군읍면동전수조사건수계고장발송횟수
02021-05-012020-09-19부산광역시사하구괴정동14
12021-05-012020-09-19부산광역시사하구괴정동13
22021-05-012020-09-19부산광역시사하구괴정동14
32021-05-012020-09-19부산광역시사하구괴정동11
42021-05-012020-09-19부산광역시사하구괴정동14
52021-05-012020-09-19부산광역시사하구괴정동13
62021-05-012020-09-19부산광역시사하구괴정동11
72021-05-012020-09-19부산광역시사하구괴정동14
82021-05-012020-09-19부산광역시사하구괴정동11
92021-05-012020-09-17부산광역시사하구괴정동11
계고장발송일자전수조사일시도구군읍면동전수조사건수계고장발송횟수
48342021-08-012020-10-24부산광역시사하구장림동11
48352021-08-012020-10-23부산광역시사하구장림동11
48362021-08-012020-10-21부산광역시사하구장림동11
48372021-08-012020-10-21부산광역시사하구장림동11
48382021-08-012020-10-30부산광역시사하구장림동11
48392021-08-012020-10-21부산광역시사하구장림동21
48402021-08-012020-10-20부산광역시사하구장림동11
48412021-08-012020-10-29부산광역시사하구장림동11
48422021-08-012020-10-29부산광역시사하구장림동21
48432021-08-012020-10-24부산광역시사하구장림동11

Duplicate rows

Most frequently occurring

계고장발송일자전수조사일시도구군읍면동전수조사건수계고장발송횟수# duplicates
892021-03-312020-11-01부산광역시사하구감천동1149
2482021-05-312020-09-28부산광역시사하구당리동1144
3242021-06-302020-09-25부산광역시사하구괴정동1131
3252021-06-302020-09-25부산광역시사하구괴정동1231
1992021-05-012020-10-31부산광역시사하구감천동1130
2212021-05-312020-09-20부산광역시사하구괴정동1129
3222021-06-302020-09-23부산광역시사하구괴정동1128
3232021-06-302020-09-23부산광역시사하구괴정동1228
4062021-07-012020-10-18부산광역시사하구장림동1328
4722021-08-082020-09-13부산광역시사하구괴정동1128