Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory752.0 KiB
Average record size in memory77.0 B

Variable types

Numeric4
DateTime3
Categorical1

Dataset

Description부산광역시상수도사업본부_수용가정보시스템_요금계산관련정보_추징계산이력_20211216
Author부산광역시 상수도사업본부
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15083669

Alerts

사용순번 has constant value ""Constant
추징금액(하) is highly overall correlated with 추징금액(물)High correlation
추징금액(물) is highly overall correlated with 추징금액(하)High correlation
추징금액(상) is highly skewed (γ1 = -81.47998014)Skewed
추징금액(물) is highly skewed (γ1 = 68.84261153)Skewed
연번 has unique valuesUnique
추징금액(하) has 9991 (99.9%) zerosZeros
추징금액(물) has 9993 (99.9%) zerosZeros

Reproduction

Analysis started2023-12-10 17:14:07.564530
Analysis finished2023-12-10 17:14:14.057619
Duration6.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49783.889
Minimum5
Maximum99980
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T02:14:14.240712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile4718.8
Q124922.5
median50086
Q374334
95-th percentile94835.35
Maximum99980
Range99975
Interquartile range (IQR)49411.5

Descriptive statistics

Standard deviation28814.541
Coefficient of variation (CV)0.57879248
Kurtosis-1.1955431
Mean49783.889
Median Absolute Deviation (MAD)24634
Skewness0.0019640628
Sum4.9783889 × 108
Variance8.3027775 × 108
MonotonicityNot monotonic
2023-12-11T02:14:14.540414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
64921 1
 
< 0.1%
70830 1
 
< 0.1%
1978 1
 
< 0.1%
2817 1
 
< 0.1%
55621 1
 
< 0.1%
93311 1
 
< 0.1%
36959 1
 
< 0.1%
29742 1
 
< 0.1%
98762 1
 
< 0.1%
93906 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
5 1
< 0.1%
33 1
< 0.1%
67 1
< 0.1%
68 1
< 0.1%
71 1
< 0.1%
90 1
< 0.1%
91 1
< 0.1%
93 1
< 0.1%
98 1
< 0.1%
106 1
< 0.1%
ValueCountFrequency (%)
99980 1
< 0.1%
99967 1
< 0.1%
99962 1
< 0.1%
99953 1
< 0.1%
99946 1
< 0.1%
99917 1
< 0.1%
99882 1
< 0.1%
99874 1
< 0.1%
99857 1
< 0.1%
99835 1
< 0.1%
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2021-06-01 00:00:00
Maximum2021-08-01 00:00:00
2023-12-11T02:14:14.834270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:15.119079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=3)
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2021-06-01 00:00:00
Maximum2021-11-01 00:00:00
2023-12-11T02:14:15.365526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:15.661148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=5)
Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2021-05-01 00:00:00
Maximum2021-10-01 00:00:00
2023-12-11T02:14:15.918565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:16.144711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)

추징금액(상)
Real number (ℝ)

SKEWED 

Distinct896
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-3079.658
Minimum-5792950
Maximum155990
Zeros1
Zeros (%)< 0.1%
Negative9997
Negative (%)> 99.9%
Memory size166.0 KiB
2023-12-11T02:14:16.434594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-5792950
5-th percentile-7462.5
Q1-1230
median-430
Q3-140
95-th percentile-20
Maximum155990
Range5948940
Interquartile range (IQR)1090

Descriptive statistics

Standard deviation62605.478
Coefficient of variation (CV)-20.328711
Kurtosis7368.0752
Mean-3079.658
Median Absolute Deviation (MAD)350
Skewness-81.47998
Sum-30796580
Variance3.9194459 × 109
MonotonicityNot monotonic
2023-12-11T02:14:16.838297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-10 292
 
2.9%
-20 273
 
2.7%
-30 254
 
2.5%
-40 246
 
2.5%
-60 225
 
2.2%
-80 224
 
2.2%
-70 217
 
2.2%
-90 210
 
2.1%
-140 203
 
2.0%
-120 187
 
1.9%
Other values (886) 7669
76.7%
ValueCountFrequency (%)
-5792950 1
< 0.1%
-1607080 1
< 0.1%
-881220 1
< 0.1%
-761460 1
< 0.1%
-623760 1
< 0.1%
-463250 1
< 0.1%
-323480 1
< 0.1%
-312740 1
< 0.1%
-299040 1
< 0.1%
-280110 1
< 0.1%
ValueCountFrequency (%)
155990 1
 
< 0.1%
94630 1
 
< 0.1%
0 1
 
< 0.1%
-10 292
2.9%
-20 273
2.7%
-30 254
2.5%
-40 246
2.5%
-60 225
2.2%
-70 217
2.2%
-80 224
2.2%

추징금액(하)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.292
Minimum-254880
Maximum212900
Zeros9991
Zeros (%)99.9%
Negative7
Negative (%)0.1%
Memory size166.0 KiB
2023-12-11T02:14:17.655511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-254880
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum212900
Range467780
Interquartile range (IQR)0

Descriptive statistics

Standard deviation3439.6543
Coefficient of variation (CV)1500.7218
Kurtosis4526.3986
Mean2.292
Median Absolute Deviation (MAD)0
Skewness-15.29478
Sum22920
Variance11831222
MonotonicityNot monotonic
2023-12-11T02:14:17.916164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
0 9991
99.9%
-1360 1
 
< 0.1%
88320 1
 
< 0.1%
212900 1
 
< 0.1%
-4790 1
 
< 0.1%
-1200 1
 
< 0.1%
-1800 1
 
< 0.1%
-13400 1
 
< 0.1%
-870 1
 
< 0.1%
-254880 1
 
< 0.1%
ValueCountFrequency (%)
-254880 1
 
< 0.1%
-13400 1
 
< 0.1%
-4790 1
 
< 0.1%
-1800 1
 
< 0.1%
-1360 1
 
< 0.1%
-1200 1
 
< 0.1%
-870 1
 
< 0.1%
0 9991
99.9%
88320 1
 
< 0.1%
212900 1
 
< 0.1%
ValueCountFrequency (%)
212900 1
 
< 0.1%
88320 1
 
< 0.1%
0 9991
99.9%
-870 1
 
< 0.1%
-1200 1
 
< 0.1%
-1360 1
 
< 0.1%
-1800 1
 
< 0.1%
-4790 1
 
< 0.1%
-13400 1
 
< 0.1%
-254880 1
 
< 0.1%

추징금액(물)
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.593
Minimum-3720
Maximum18420
Zeros9993
Zeros (%)99.9%
Negative5
Negative (%)< 0.1%
Memory size166.0 KiB
2023-12-11T02:14:18.162305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-3720
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum18420
Range22140
Interquartile range (IQR)0

Descriptive statistics

Standard deviation235.27877
Coefficient of variation (CV)90.736126
Kurtosis5032.4878
Mean2.593
Median Absolute Deviation (MAD)0
Skewness68.842612
Sum25930
Variance55356.102
MonotonicityNot monotonic
2023-12-11T02:14:18.398580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0 9993
99.9%
-440 1
 
< 0.1%
14050 1
 
< 0.1%
18420 1
 
< 0.1%
-1560 1
 
< 0.1%
-600 1
 
< 0.1%
-3720 1
 
< 0.1%
-220 1
 
< 0.1%
ValueCountFrequency (%)
-3720 1
 
< 0.1%
-1560 1
 
< 0.1%
-600 1
 
< 0.1%
-440 1
 
< 0.1%
-220 1
 
< 0.1%
0 9993
99.9%
14050 1
 
< 0.1%
18420 1
 
< 0.1%
ValueCountFrequency (%)
18420 1
 
< 0.1%
14050 1
 
< 0.1%
0 9993
99.9%
-220 1
 
< 0.1%
-440 1
 
< 0.1%
-600 1
 
< 0.1%
-1560 1
 
< 0.1%
-3720 1
 
< 0.1%

사용순번
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 10000
100.0%

Length

2023-12-11T02:14:18.662267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:14:18.890313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 10000
100.0%

Interactions

2023-12-11T02:14:12.685558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:09.097563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:10.728999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:11.824667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:12.908394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:09.471040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:11.040413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:12.040458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:13.127824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:10.153144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:11.334165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:12.254200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:13.337131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:10.465348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:11.619067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:14:12.461324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:14:19.040091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번추징발생년월고지년월계산년월추징금액(상)추징금액(하)추징금액(물)
연번1.0000.7660.6200.6410.0160.0000.000
추징발생년월0.7661.0000.7730.9930.0000.6960.327
고지년월0.6200.7731.0000.8350.0000.4750.708
계산년월0.6410.9930.8351.0000.0000.8730.873
추징금액(상)0.0160.0000.0000.0001.0000.0000.000
추징금액(하)0.0000.6960.4750.8730.0001.0001.000
추징금액(물)0.0000.3270.7080.8730.0001.0001.000
2023-12-11T02:14:19.244237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번추징금액(상)추징금액(하)추징금액(물)
연번1.0000.0430.0210.010
추징금액(상)0.0431.0000.0350.040
추징금액(하)0.0210.0351.0000.882
추징금액(물)0.0100.0400.8821.000

Missing values

2023-12-11T02:14:13.608967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:14:13.925661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번추징발생년월고지년월계산년월추징금액(상)추징금액(하)추징금액(물)사용순번
64920649212021-072021-072021-07-40001
96636966372021-082021-082021-08-870001
91980919812021-082021-092021-08-160001
96883968842021-082021-082021-08-680001
17306173072021-072021-082021-07-70001
45457454582021-072021-082021-07-920001
79347793482021-072021-082021-07-910001
67255672562021-072021-082021-07-560001
32019320202021-072021-072021-07-14630001
32360323612021-072021-082021-07-760001
연번추징발생년월고지년월계산년월추징금액(상)추징금액(하)추징금액(물)사용순번
65780657812021-072021-082021-07-700001
40457404582021-072021-082021-07-120001
84894848952021-072021-082021-07-820001
34276342772021-072021-082021-07-1200001
95636956372021-082021-092021-08-10060001
36650366512021-072021-072021-07-620001
96869968702021-082021-082021-08-540001
49453494542021-072021-082021-07-340001
99049990502021-082021-092021-08-30001
52930529312021-072021-072021-07-600001