Overview

Dataset statistics

Number of variables8
Number of observations141
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.5 KiB
Average record size in memory68.9 B

Variable types

Categorical6
Numeric2

Dataset

Description부산광역시연제구_세원유형별과세현황_20211021
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15079129

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
부과건수 is highly overall correlated with 부과금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세목명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세원 유형명 is highly overall correlated with 부과건수 and 2 other fieldsHigh correlation
부과건수 has 40 (28.4%) zerosZeros
부과금액 has 40 (28.4%) zerosZeros

Reproduction

Analysis started2023-12-10 16:24:55.435419
Analysis finished2023-12-10 16:24:56.201898
Duration0.77 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
부산광역시
141 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
부산광역시 141
100.0%

Length

2023-12-11T01:24:56.277923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:24:56.365421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 141
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
연제구
141 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row연제구
2nd row연제구
3rd row연제구
4th row연제구
5th row연제구

Common Values

ValueCountFrequency (%)
연제구 141
100.0%

Length

2023-12-11T01:24:56.458424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:24:56.578264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
연제구 141
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
26470
141 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row26470
2nd row26470
3rd row26470
4th row26470
5th row26470

Common Values

ValueCountFrequency (%)
26470 141
100.0%

Length

2023-12-11T01:24:56.707210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:24:56.859840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
26470 141
100.0%

과세년도
Categorical

Distinct3
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2017
47 
2018
47 
2019
47 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2017 47
33.3%
2018 47
33.3%
2019 47
33.3%

Length

2023-12-11T01:24:57.118759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:24:57.382536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 47
33.3%
2018 47
33.3%
2019 47
33.3%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)9.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
취득세
27 
주민세
27 
자동차세
21 
재산세
15 
지방소득세
12 
Other values (8)
39 

Length

Max length7
Median length3
Mean length3.6808511
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지방소득세
2nd row지방소득세
3rd row지방소득세
4th row지방소득세
5th row지방소비세

Common Values

ValueCountFrequency (%)
취득세 27
19.1%
주민세 27
19.1%
자동차세 21
14.9%
재산세 15
10.6%
지방소득세 12
8.5%
레저세 12
8.5%
등록면허세 6
 
4.3%
지역자원시설세 6
 
4.3%
지방소비세 3
 
2.1%
담배소비세 3
 
2.1%
Other values (3) 9
 
6.4%

Length

2023-12-11T01:24:57.589425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 27
19.1%
주민세 27
19.1%
자동차세 21
14.9%
재산세 15
10.6%
지방소득세 12
8.5%
레저세 12
8.5%
등록면허세 6
 
4.3%
지역자원시설세 6
 
4.3%
지방소비세 3
 
2.1%
담배소비세 3
 
2.1%
Other values (3) 9
 
6.4%

세원 유형명
Categorical

HIGH CORRELATION 

Distinct47
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
지방소득세(특별징수)
 
3
항공기
 
3
지방소득세(양도소득)
 
3
지방소득세(종합소득)
 
3
지방소비세
 
3
Other values (42)
126 

Length

Max length11
Median length8
Mean length6.0425532
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지방소득세(특별징수)
2nd row지방소득세(법인소득)
3rd row지방소득세(양도소득)
4th row지방소득세(종합소득)
5th row지방소비세

Common Values

ValueCountFrequency (%)
지방소득세(특별징수) 3
 
2.1%
항공기 3
 
2.1%
지방소득세(양도소득) 3
 
2.1%
지방소득세(종합소득) 3
 
2.1%
지방소비세 3
 
2.1%
담배소비세 3
 
2.1%
교육세 3
 
2.1%
도시계획세 3
 
2.1%
건축물 3
 
2.1%
주택(개별) 3
 
2.1%
Other values (37) 111
78.7%

Length

2023-12-11T01:24:57.761693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
지방소득세(특별징수 3
 
2.1%
특수 3
 
2.1%
승합 3
 
2.1%
기타승용 3
 
2.1%
승용 3
 
2.1%
주민세(재산분 3
 
2.1%
주민세(종업원분 3
 
2.1%
주민세(특별징수 3
 
2.1%
주민세(법인세분 3
 
2.1%
주민세(양도소득 3
 
2.1%
Other values (37) 111
78.7%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct101
Distinct (%)71.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24839.511
Minimum0
Maximum419696
Zeros40
Zeros (%)28.4%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-11T01:24:57.926689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1092
Q314856
95-th percentile125347
Maximum419696
Range419696
Interquartile range (IQR)14856

Descriptive statistics

Standard deviation67620.639
Coefficient of variation (CV)2.7223016
Kurtosis21.400104
Mean24839.511
Median Absolute Deviation (MAD)1092
Skewness4.3666955
Sum3502371
Variance4.5725509 × 109
MonotonicityNot monotonic
2023-12-11T01:24:58.086651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 40
28.4%
13 2
 
1.4%
30392 1
 
0.7%
125347 1
 
0.7%
829 1
 
0.7%
647 1
 
0.7%
419696 1
 
0.7%
24476 1
 
0.7%
2343 1
 
0.7%
2272 1
 
0.7%
Other values (91) 91
64.5%
ValueCountFrequency (%)
0 40
28.4%
1 1
 
0.7%
3 1
 
0.7%
12 1
 
0.7%
13 2
 
1.4%
14 1
 
0.7%
23 1
 
0.7%
40 1
 
0.7%
43 1
 
0.7%
45 1
 
0.7%
ValueCountFrequency (%)
419696 1
0.7%
409382 1
0.7%
403373 1
0.7%
174679 1
0.7%
167351 1
0.7%
166359 1
0.7%
144056 1
0.7%
125347 1
0.7%
122663 1
0.7%
102149 1
0.7%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct102
Distinct (%)72.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.0812385 × 109
Minimum0
Maximum2.7431499 × 1010
Zeros40
Zeros (%)28.4%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-11T01:24:58.269734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2.86705 × 108
Q36.42165 × 109
95-th percentile1.6505745 × 1010
Maximum2.7431499 × 1010
Range2.7431499 × 1010
Interquartile range (IQR)6.42165 × 109

Descriptive statistics

Standard deviation6.1612444 × 109
Coefficient of variation (CV)1.5096507
Kurtosis1.5694805
Mean4.0812385 × 109
Median Absolute Deviation (MAD)2.86705 × 108
Skewness1.5323999
Sum5.7545462 × 1011
Variance3.7960932 × 1019
MonotonicityNot monotonic
2023-12-11T01:24:58.435241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 40
28.4%
14967105000 1
 
0.7%
4429070000 1
 
0.7%
5018039000 1
 
0.7%
5559187000 1
 
0.7%
16505745000 1
 
0.7%
7903052000 1
 
0.7%
4597845000 1
 
0.7%
14504636000 1
 
0.7%
16615890000 1
 
0.7%
Other values (92) 92
65.2%
ValueCountFrequency (%)
0 40
28.4%
438000 1
 
0.7%
1280000 1
 
0.7%
2335000 1
 
0.7%
4552000 1
 
0.7%
5676000 1
 
0.7%
6176000 1
 
0.7%
6424000 1
 
0.7%
7079000 1
 
0.7%
7747000 1
 
0.7%
ValueCountFrequency (%)
27431499000 1
0.7%
24059980000 1
0.7%
21077284000 1
0.7%
19058442000 1
0.7%
17107148000 1
0.7%
16615890000 1
0.7%
16534550000 1
0.7%
16505745000 1
0.7%
16109175000 1
0.7%
15985050000 1
0.7%

Interactions

2023-12-11T01:24:55.855956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:24:55.706413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:24:55.926787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:24:55.785439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:24:58.558799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명부과건수부과금액
과세년도1.0000.0000.0000.0000.000
세목명0.0001.0001.0000.8330.614
세원 유형명0.0001.0001.0000.9380.907
부과건수0.0000.8330.9381.0000.591
부과금액0.0000.6140.9070.5911.000
2023-12-11T01:24:58.697977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세원 유형명세목명
과세년도1.0000.0000.000
세원 유형명0.0001.0000.857
세목명0.0000.8571.000
2023-12-11T01:24:58.893864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과건수부과금액과세년도세목명세원 유형명
부과건수1.0000.8520.0000.5930.616
부과금액0.8521.0000.0000.3040.507
과세년도0.0000.0001.0000.0000.000
세목명0.5930.3040.0001.0000.857
세원 유형명0.6160.5070.0000.8571.000

Missing values

2023-12-11T01:24:56.030280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:24:56.154594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
0부산광역시연제구264702017지방소득세지방소득세(특별징수)3039214967105000
1부산광역시연제구264702017지방소득세지방소득세(법인소득)211715299197000
2부산광역시연제구264702017지방소득세지방소득세(양도소득)40566169909000
3부산광역시연제구264702017지방소득세지방소득세(종합소득)204637474538000
4부산광역시연제구264702017지방소비세지방소비세00
5부산광역시연제구264702017담배소비세담배소비세00
6부산광역시연제구264702017교육세교육세40938217107148000
7부산광역시연제구264702017도시계획세도시계획세00
8부산광역시연제구264702017취득세건축물175410403223000
9부산광역시연제구264702017취득세주택(개별)161712093598000
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
131부산광역시연제구264702019주민세주민세(개인균등)78220787619000
132부산광역시연제구264702019등록면허세등록면허세(면허)20673792659000
133부산광역시연제구264702019등록면허세등록면허세(등록)473765330451000
134부산광역시연제구264702019지역자원시설세지역자원시설세(소방)1440564570176000
135부산광역시연제구264702019지역자원시설세지역자원시설세(특자)48528771000
136부산광역시연제구264702019레저세소싸움00
137부산광역시연제구264702019레저세경정00
138부산광역시연제구264702019레저세경륜00
139부산광역시연제구264702019레저세경마126421650000
140부산광역시연제구264702019체납체납1746798017609000