Overview

Dataset statistics

Number of variables6
Number of observations122
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.2 KiB
Average record size in memory52.1 B

Variable types

DateTime1
Categorical2
Numeric3

Dataset

Description한국지역난방공사 구역전기사업(강남, 중앙, 삼송)의 공급지역별 배전공급 현황에 대한 정보입니다. (구분, 단위, 강남, 중앙, 삼송)
Author공공데이터포털
URLhttps://www.data.go.kr/data/15069235/fileData.do

Alerts

단위 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 강남 and 3 other fieldsHigh correlation
강남 is highly overall correlated with 중앙 and 2 other fieldsHigh correlation
중앙 is highly overall correlated with 강남 and 2 other fieldsHigh correlation
삼송 is highly overall correlated with 강남 and 2 other fieldsHigh correlation
강남 has 48 (39.3%) zerosZeros
중앙 has 49 (40.2%) zerosZeros
삼송 has 3 (2.5%) zerosZeros

Reproduction

Analysis started2024-04-19 06:18:29.011032
Analysis finished2024-04-19 06:18:30.212462
Duration1.2 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct8
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum2014-12-31 00:00:00
Maximum2023-06-16 00:00:00
2024-04-19T15:18:30.259490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:18:30.372178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)

구분
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)17.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
전력구 긍장
 
8
관로 연장
 
8
저압접속함
 
8
핸드홀
 
8
특고압맨홀
 
8
Other values (16)
82 

Length

Max length13
Median length12
Mean length6.6967213
Min length3

Unique

Unique1 ?
Unique (%)0.8%

Sample

1st row전력구 긍장
2nd row덕트뱅크 긍장
3rd row관로 연장
4th row지중케이블 연장(특고압)
5th row지중케이블 연장(저압)

Common Values

ValueCountFrequency (%)
전력구 긍장 8
 
6.6%
관로 연장 8
 
6.6%
저압접속함 8
 
6.6%
핸드홀 8
 
6.6%
특고압맨홀 8
 
6.6%
덕트뱅크 긍장 8
 
6.6%
저압분전함 8
 
6.6%
지중케이블연장 6
 
4.9%
개폐기 6
 
4.9%
광케이블 연장 5
 
4.1%
Other values (11) 49
40.2%

Length

2024-04-19T15:18:30.497185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
긍장 16
 
9.4%
연장 13
 
7.6%
가공케이블 11
 
6.4%
연장(저압 10
 
5.8%
연장(특고압 10
 
5.8%
지중케이블 10
 
5.8%
저압분전함 8
 
4.7%
전력구 8
 
4.7%
덕트뱅크 8
 
4.7%
특고압맨홀 8
 
4.7%
Other values (12) 69
40.4%

단위
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
km
61 
37 
개소
24 

Length

Max length2
Median length2
Mean length1.6967213
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowkm
2nd rowkm
3rd rowkm
4th rowkm
5th rowkm

Common Values

ValueCountFrequency (%)
km 61
50.0%
37
30.3%
개소 24
 
19.7%

Length

2024-04-19T15:18:30.637104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:18:30.738244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
km 61
50.0%
37
30.3%
개소 24
 
19.7%

강남
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct25
Distinct (%)20.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.304148
Minimum0
Maximum67.236
Zeros48
Zeros (%)39.3%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2024-04-19T15:18:30.836922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median4
Q318.5
95-th percentile63.4605
Maximum67.236
Range67.236
Interquartile range (IQR)18.5

Descriptive statistics

Standard deviation17.458868
Coefficient of variation (CV)1.5444657
Kurtosis3.3561907
Mean11.304148
Median Absolute Deviation (MAD)4
Skewness1.9518574
Sum1379.106
Variance304.81209
MonotonicityNot monotonic
2024-04-19T15:18:30.969356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
0.0 48
39.3%
5.4 8
 
6.6%
0.617 8
 
6.6%
23.0 8
 
6.6%
4.0 8
 
6.6%
14.0 8
 
6.6%
5.3 7
 
5.7%
24.0 6
 
4.9%
33.5 3
 
2.5%
1.0 2
 
1.6%
Other values (15) 16
 
13.1%
ValueCountFrequency (%)
0.0 48
39.3%
0.617 8
 
6.6%
1.0 2
 
1.6%
4.0 8
 
6.6%
4.8 1
 
0.8%
5.3 7
 
5.7%
5.4 8
 
6.6%
14.0 8
 
6.6%
17.0 1
 
0.8%
19.0 1
 
0.8%
ValueCountFrequency (%)
67.236 1
0.8%
66.831 1
0.8%
66.5 1
0.8%
66.4 2
1.6%
65.6 1
0.8%
64.19 1
0.8%
49.6 1
0.8%
33.715 1
0.8%
33.708 1
0.8%
33.6 1
0.8%

중앙
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct14
Distinct (%)11.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.2967213
Minimum0
Maximum14.3
Zeros49
Zeros (%)40.2%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2024-04-19T15:18:31.107414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2.9
Q311
95-th percentile14.19
Maximum14.3
Range14.3
Interquartile range (IQR)11

Descriptive statistics

Standard deviation5.6604753
Coefficient of variation (CV)1.0686753
Kurtosis-1.4058891
Mean5.2967213
Median Absolute Deviation (MAD)2.9
Skewness0.51656887
Sum646.2
Variance32.040981
MonotonicityNot monotonic
2024-04-19T15:18:31.217048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
0.0 49
40.2%
14.0 17
 
13.9%
2.6 8
 
6.6%
7.0 8
 
6.6%
11.0 8
 
6.6%
5.0 7
 
5.7%
2.9 6
 
4.9%
10.6 4
 
3.3%
14.3 4
 
3.3%
1.0 3
 
2.5%
Other values (4) 8
 
6.6%
ValueCountFrequency (%)
0.0 49
40.2%
1.0 3
 
2.5%
2.6 8
 
6.6%
2.9 6
 
4.9%
4.0 1
 
0.8%
5.0 7
 
5.7%
7.0 8
 
6.6%
10.3 1
 
0.8%
10.5 3
 
2.5%
10.6 4
 
3.3%
ValueCountFrequency (%)
14.3 4
 
3.3%
14.2 3
 
2.5%
14.0 17
13.9%
11.0 8
6.6%
10.6 4
 
3.3%
10.5 3
 
2.5%
10.3 1
 
0.8%
7.0 8
6.6%
5.0 7
5.7%
4.0 1
 
0.8%

삼송
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct73
Distinct (%)59.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean116.837
Minimum0
Maximum482.2
Zeros3
Zeros (%)2.5%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2024-04-19T15:18:31.351883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.539
Q15.364
median72.925
Q3179.295
95-th percentile379.5
Maximum482.2
Range482.2
Interquartile range (IQR)173.931

Descriptive statistics

Standard deviation121.55844
Coefficient of variation (CV)1.0404105
Kurtosis1.0983509
Mean116.837
Median Absolute Deviation (MAD)72.316
Skewness1.2175276
Sum14254.114
Variance14776.455
MonotonicityNot monotonic
2024-04-19T15:18:31.520353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.807 8
 
6.6%
0.539 8
 
6.6%
65.0 5
 
4.1%
169.0 5
 
4.1%
51.0 5
 
4.1%
3.0 4
 
3.3%
72.95 3
 
2.5%
0.618 3
 
2.5%
5.364 3
 
2.5%
0.0 3
 
2.5%
Other values (63) 75
61.5%
ValueCountFrequency (%)
0.0 3
 
2.5%
0.539 8
6.6%
0.6 1
 
0.8%
0.618 3
 
2.5%
0.807 8
6.6%
1.3 1
 
0.8%
3.0 4
3.3%
4.0 1
 
0.8%
5.3 1
 
0.8%
5.364 3
 
2.5%
ValueCountFrequency (%)
482.2 1
0.8%
464.0 1
0.8%
463.9 1
0.8%
463.7 1
0.8%
451.0 1
0.8%
409.9 1
0.8%
380.0 1
0.8%
370.0 1
0.8%
300.8 1
0.8%
298.7 1
0.8%

Interactions

2024-04-19T15:18:29.787730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:18:29.257220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:18:29.516024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:18:29.870932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:18:29.347304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:18:29.601680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:18:29.960840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:18:29.433990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:18:29.693488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-19T15:18:31.951761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준일구분단위강남중앙삼송
기준일1.0000.0000.0000.0000.0000.000
구분0.0001.0001.0000.9650.9910.928
단위0.0001.0001.0000.5770.5830.551
강남0.0000.9650.5771.0000.8700.861
중앙0.0000.9910.5830.8701.0000.733
삼송0.0000.9280.5510.8610.7331.000
2024-04-19T15:18:32.051001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단위구분
단위1.0000.921
구분0.9211.000
2024-04-19T15:18:32.154730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
강남중앙삼송구분단위
강남1.0000.8600.6710.6950.459
중앙0.8601.0000.6720.8050.466
삼송0.6710.6721.0000.6530.382
구분0.6950.8050.6531.0000.921
단위0.4590.4660.3820.9211.000

Missing values

2024-04-19T15:18:30.069067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T15:18:30.170262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준일구분단위강남중앙삼송
02023-06-16전력구 긍장km0.00.00.807
12023-06-16덕트뱅크 긍장km0.6170.00.539
22023-06-16관로 연장km33.71510.6298.7
32023-06-16지중케이블 연장(특고압)km67.23614.3482.2
42023-06-16지중케이블 연장(저압)km5.30.0184.9
52023-06-16광케이블 연장km5.42.672.95
62023-06-16가공케이블 연장(특고압)km0.00.00.618
72023-06-16가공케이블 연장(저압)km0.00.05.364
82023-06-16저압분전함0.00.051.0
92023-06-16변압기(삼상)4.07.0171.0
기준일구분단위강남중앙삼송
1122014-12-31지중케이블연장km4.82.698.0
1132014-12-31광케이블연장km5.42.965.0
1142014-12-31광케이블연장km0.00.00.0
1152014-12-31저압분전함0.00.044.0
1162014-12-31변압기4.07.0115.0
1172014-12-31개폐기17.014.0133.0
1182014-12-31개폐기0.04.015.0
1192014-12-31특고압맨홀개소14.014.0159.0
1202014-12-31핸드홀개소0.00.0189.0
1212014-12-31저압접속함개소23.011.044.0