Overview

Dataset statistics

Number of variables: 8
Number of observations: 65
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.4 KiB
Average record size in memory: 69.0 B
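
The two memory figures are consistent with each other. As a quick check (a sketch only, not part of the profiler; the variable names are illustrative), multiplying the average record size by the number of observations recovers the reported total:

```python
# Values copied from the dataset statistics above.
n_observations = 65
avg_record_bytes = 69.0

# Total size in bytes, then converted to KiB (1 KiB = 1024 bytes).
total_bytes = n_observations * avg_record_bytes
total_kib = total_bytes / 1024

print(f"{total_bytes:.0f} B = {total_kib:.1f} KiB")
```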

Variable types

Categorical: 6
Boolean: 1
Numeric: 1

Dataset

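Description: Daegu Metropolitan City Dong-gu_Local Tax Taxpayer Status_20220714 (대구광역시 동구_지방세 납세자 현황_20220714)
Author: Daegu Metropolitan City Dong-gu (대구광역시 동구)
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=15078475&dataSetDetailId=150784751a6c3c6dac877&provdMethod=FILE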

Alerts

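시도명 (province/metropolitan city name) has constant value "대구광역시" (Constant)
시군구명 (district name) has constant value "동구" (Constant)
자치단체코드 (local government code) has constant value "27140" (Constant)
납세자수 (number of taxpayers) is highly overall correlated with 납세자유형 (taxpayer type) (High correlation)
납세자유형 (taxpayer type) is highly overall correlated with 납세자수 (number of taxpayers) (High correlation)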
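
The constant-value alerts can be illustrated with a small check that counts distinct values per column. This is a minimal sketch, not ydata-profiling's own implementation: `constant_columns` is a hypothetical helper, and the three rows are copied from the Sample section of this report (only five of the eight columns are included).

```python
from collections import Counter

# Three rows copied from the Sample section (a subset of the columns).
rows = [
    {"시도명": "대구광역시", "시군구명": "동구", "자치단체코드": "27140",
     "과세년도": "2020", "세목명": "등록세"},
    {"시도명": "대구광역시", "시군구명": "동구", "자치단체코드": "27140",
     "과세년도": "2020", "세목명": "재산세"},
    {"시도명": "대구광역시", "시군구명": "동구", "자치단체코드": "27140",
     "과세년도": "2021", "세목명": "지방소득세"},
]


def constant_columns(records):
    """Return {column: value} for columns that hold exactly one distinct value."""
    result = {}
    for column in records[0]:
        counts = Counter(record[column] for record in records)
        if len(counts) == 1:
            result[column] = next(iter(counts))
    return result


constants = constant_columns(rows)
print(constants)
```

Columns flagged this way carry no information within this dataset, so they are usually candidates for dropping before modelling.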

Reproduction

Analysis started: 2024-04-19 05:52:12.792908
Analysis finished: 2024-04-19 05:52:13.282276
Duration: 0.49 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
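
The reported duration follows directly from the two timestamps. A short sketch using only the standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-04-19 05:52:12.792908")
finished = datetime.fromisoformat("2024-04-19 05:52:13.282276")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```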

Variables

시도명 (province/metropolitan city name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 652.0 B

대구광역시: 65

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대구광역시
2nd row: 대구광역시
3rd row: 대구광역시
4th row: 대구광역시
5th row: 대구광역시

Common Values

Value | Count | Frequency (%)
대구광역시 (Daegu Metropolitan City) | 65 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

시군구명 (city/county/district name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 652.0 B

동구: 65

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 동구
2nd row: 동구
3rd row: 동구
4th row: 동구
5th row: 동구

Common Values

Value | Count | Frequency (%)
동구 (Dong-gu) | 65 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

자치단체코드 (local government code)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 652.0 B

27140: 65

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 27140
2nd row: 27140
3rd row: 27140
4th row: 27140
5th row: 27140

Common Values

Value | Count | Frequency (%)
27140 | 65 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

과세년도 (tax year)
Categorical

Distinct: 2
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 652.0 B

2021: 33
2020: 32

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020
2nd row: 2020
3rd row: 2020
4th row: 2020
5th row: 2020

Common Values

Value | Count | Frequency (%)
2021 | 33 | 50.8%
2020 | 32 | 49.2%
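
The percentages for 과세년도 are simple ratios over the 65 observations. The sketch below recomputes them from the counts in the table above; the names are illustrative, and rounding to one decimal place is assumed to match the report's display.

```python
# Counts copied from the Common Values table for 과세년도.
counts = {"2021": 33, "2020": 32}

n_observations = sum(counts.values())

# Share of each value, formatted to one decimal place.
frequencies = {
    value: f"{100 * count / n_observations:.1f}%"
    for value, count in counts.items()
}

# Distinct (%) is the number of distinct values over the number of observations.
distinct_pct = f"{100 * len(counts) / n_observations:.1f}%"

print(frequencies, distinct_pct)
```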

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

세목명 (tax item name)
Categorical

Distinct: 9
Distinct (%): 13.8%
Missing: 0
Missing (%): 0.0%
Memory size: 652.0 B

재산세: 8
주민세: 8
취득세: 8
자동차세: 8
등록면허세: 8
Other values (4): 25

Length

Max length: 7
Median length: 5
Mean length: 4.1692308
Min length: 3
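
The minimum, maximum, and mean length can be recomputed from the value counts of 세목명. This is a sketch that assumes length is measured in characters (Python's `len` on a `str`); the counts are copied from the Common Values table of this variable.

```python
# Value counts copied from the Common Values table for 세목명.
value_counts = {
    "재산세": 8, "주민세": 8, "취득세": 8, "자동차세": 8, "등록면허세": 8,
    "지방소득세": 8, "지역자원시설세": 8, "등록세": 7, "지방소비세": 2,
}

# One string length per observation (65 observations in total).
lengths = [
    len(value)
    for value, count in value_counts.items()
    for _ in range(count)
]

mean_length = sum(lengths) / len(lengths)
print(len(lengths), min(lengths), max(lengths), round(mean_length, 7))
```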

Unique

Unique: 0
Unique (%): 0.0%

Sample

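1st row: 등록세
2nd row: 등록세
3rd row: 등록세
4th row: 재산세
5th row: 재산세

Common Values

Value | Count | Frequency (%)
재산세 (property tax) | 8 | 12.3%
주민세 (resident tax) | 8 | 12.3%
취득세 (acquisition tax) | 8 | 12.3%
자동차세 (automobile tax) | 8 | 12.3%
등록면허세 (registration and license tax) | 8 | 12.3%
지방소득세 (local income tax) | 8 | 12.3%
지역자원시설세 (regional resource facility tax) | 8 | 12.3%
등록세 (registration tax) | 7 | 10.8%
지방소비세 (local consumption tax) | 2 | 3.1%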

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

납세자유형 (taxpayer type)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 652.0 B

법인: 33
개인: 32

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 개인
2nd row: 개인
3rd row: 법인
4th row: 개인
5th row: 개인

Common Values

Value | Count | Frequency (%)
법인 (corporation) | 33 | 50.8%
개인 (individual) | 32 | 49.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

관내_관외 (within/outside jurisdiction)
Boolean

Distinct: 2
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 197.0 B

True: 34
False: 31

Value | Count | Frequency (%)
True | 34 | 52.3%
False | 31 | 47.7%

납세자수 (number of taxpayers)
Real number (ℝ)

HIGH CORRELATION

Distinct: 63
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17661.708
Minimum: 1
Maximum: 129311
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 717.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.2
Q1: 119
median: 2252
Q3: 17052
95-th percentile: 100922.2
Maximum: 129311
Range: 129310
Interquartile range (IQR): 16933
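
Range and IQR are derived from the other quantile statistics. A minimal sketch of that arithmetic, with illustrative variable names:

```python
# Quantile statistics copied from the list above.
minimum, q1, q3, maximum = 1, 119, 17052, 129311

# Range spans the full data; the IQR spans the middle 50%.
value_range = maximum - minimum
iqr = q3 - q1

print(value_range, iqr)
```

The IQR is roughly an eighth of the range, which is in line with the long right tail indicated by the positive skewness reported for this variable.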

Descriptive statistics

Standard deviation: 32334.094
Coefficient of variation (CV): 1.8307456
Kurtosis: 4.3055091
Mean: 17661.708
Median Absolute Deviation (MAD): 2212
Skewness: 2.2682927
Sum: 1148011
Variance: 1.0454936 × 10^9
Monotonicity: Not monotonic
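
Several of these statistics are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. The sketch below checks them against the reported values; because the inputs are rounded as displayed, the results agree only up to that rounding.

```python
# Values copied from this report.
n_observations = 65
total = 1148011
std = 32334.094

mean = total / n_observations   # Mean = Sum / n
cv = std / mean                 # Coefficient of variation
variance = std ** 2             # Variance = standard deviation squared

print(f"mean={mean:.3f} cv={cv:.4f} variance={variance:.4e}")
```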
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
1 | 3 | 4.6%
93 | 1 | 1.5%
72 | 1 | 1.5%
49566 | 1 | 1.5%
96791 | 1 | 1.5%
991 | 1 | 1.5%
1378 | 1 | 1.5%
12171 | 1 | 1.5%
129311 | 1 | 1.5%
1706 | 1 | 1.5%
Other values (53) | 53 | 81.5%

Minimum 10 values

Value | Count | Frequency (%)
1 | 3 | 4.6%
3 | 1 | 1.5%
4 | 1 | 1.5%
20 | 1 | 1.5%
21 | 1 | 1.5%
39 | 1 | 1.5%
40 | 1 | 1.5%
43 | 1 | 1.5%
50 | 1 | 1.5%
72 | 1 | 1.5%

Maximum 10 values

Value | Count | Frequency (%)
129311 | 1 | 1.5%
122727 | 1 | 1.5%
109890 | 1 | 1.5%
101955 | 1 | 1.5%
96791 | 1 | 1.5%
90522 | 1 | 1.5%
51238 | 1 | 1.5%
49566 | 1 | 1.5%
49198 | 1 | 1.5%
44179 | 1 | 1.5%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

Variable | 과세년도 | 세목명 | 납세자유형 | 관내_관외 | 납세자수
과세년도 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000
세목명 | 0.000 | 1.000 | 0.000 | 0.000 | 0.425
납세자유형 | 0.000 | 0.000 | 1.000 | 0.000 | 0.538
관내_관외 | 0.000 | 0.000 | 0.000 | 1.000 | 0.346
납세자수 | 0.000 | 0.425 | 0.538 | 0.346 | 1.000

[Figure: correlation heatmap]

Variable | 납세자유형 | 관내_관외 | 세목명 | 과세년도
납세자유형 | 1.000 | 0.000 | 0.000 | 0.000
관내_관외 | 0.000 | 1.000 | 0.000 | 0.000
세목명 | 0.000 | 0.000 | 1.000 | 0.000
과세년도 | 0.000 | 0.000 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 납세자수 | 과세년도 | 세목명 | 납세자유형 | 관내_관외
납세자수 | 1.000 | 0.000 | 0.232 | 0.553 | 0.353
과세년도 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000
세목명 | 0.232 | 0.000 | 1.000 | 0.000 | 0.000
납세자유형 | 0.553 | 0.000 | 0.000 | 1.000 | 0.000
관내_관외 | 0.353 | 0.000 | 0.000 | 0.000 | 1.000
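
The high-correlation alert for 납세자수 and 납세자유형 is consistent with flagging any pair whose coefficient exceeds a cutoff. The sketch below applies that idea to the first matrix of this section. The cutoff of 0.5 is an assumption (the report does not state the threshold it used), and `high_correlations` is a hypothetical helper rather than the profiler's own code.

```python
# First correlation matrix of this section, in the report's row/column order.
labels = ["과세년도", "세목명", "납세자유형", "관내_관외", "납세자수"]
matrix = [
    [1.000, 0.000, 0.000, 0.000, 0.000],
    [0.000, 1.000, 0.000, 0.000, 0.425],
    [0.000, 0.000, 1.000, 0.000, 0.538],
    [0.000, 0.000, 0.000, 1.000, 0.346],
    [0.000, 0.425, 0.538, 0.346, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff; not stated in the report


def high_correlations(names, values, threshold):
    """Return (name_a, name_b, coefficient) for each pair at or above threshold."""
    pairs = []
    for i in range(len(names)):
        # Only the upper triangle: the matrix is symmetric.
        for j in range(i + 1, len(names)):
            if abs(values[i][j]) >= threshold:
                pairs.append((names[i], names[j], values[i][j]))
    return pairs


flagged = high_correlations(labels, matrix, THRESHOLD)
print(flagged)
```

Under this assumed cutoff only the 납세자유형/납세자수 pair is flagged, while 세목명 (0.425) and 관내_관외 (0.346) stay below it, which matches the alerts listed in the Overview.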

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

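First rows

Index | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내_관외 | 납세자수
0 | 대구광역시 | 동구 | 27140 | 2020 | 등록세 | 개인 | N | 93
1 | 대구광역시 | 동구 | 27140 | 2020 | 등록세 | 개인 | Y | 96
2 | 대구광역시 | 동구 | 27140 | 2020 | 등록세 | 법인 | Y | 4
3 | 대구광역시 | 동구 | 27140 | 2020 | 재산세 | 개인 | N | 49198
4 | 대구광역시 | 동구 | 27140 | 2020 | 재산세 | 개인 | Y | 90522
5 | 대구광역시 | 동구 | 27140 | 2020 | 재산세 | 법인 | N | 936
6 | 대구광역시 | 동구 | 27140 | 2020 | 재산세 | 법인 | Y | 1272
7 | 대구광역시 | 동구 | 27140 | 2020 | 주민세 | 개인 | N | 20677
8 | 대구광역시 | 동구 | 27140 | 2020 | 주민세 | 개인 | Y | 122727
9 | 대구광역시 | 동구 | 27140 | 2020 | 주민세 | 법인 | N | 1800

Last rows

Index | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내_관외 | 납세자수
55 | 대구광역시 | 동구 | 27140 | 2021 | 등록면허세 | 법인 | Y | 3166
56 | 대구광역시 | 동구 | 27140 | 2021 | 지방소득세 | 개인 | N | 12029
57 | 대구광역시 | 동구 | 27140 | 2021 | 지방소득세 | 개인 | Y | 51238
58 | 대구광역시 | 동구 | 27140 | 2021 | 지방소득세 | 법인 | N | 1896
59 | 대구광역시 | 동구 | 27140 | 2021 | 지방소득세 | 법인 | Y | 3300
60 | 대구광역시 | 동구 | 27140 | 2021 | 지방소비세 | 법인 | Y | 1
61 | 대구광역시 | 동구 | 27140 | 2021 | 지역자원시설세 | 개인 | N | 43
62 | 대구광역시 | 동구 | 27140 | 2021 | 지역자원시설세 | 개인 | Y | 111
63 | 대구광역시 | 동구 | 27140 | 2021 | 지역자원시설세 | 법인 | N | 20
64 | 대구광역시 | 동구 | 27140 | 2021 | 지역자원시설세 | 법인 | Y | 39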
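
The sample rows already hint at why 납세자수 correlates with 납세자유형: individual (개인) rows carry far larger counts than corporate (법인) rows. The sketch below sums 납세자수 by 납세자유형 for sample rows 0 to 9 only. It covers 10 of the 65 observations, so it illustrates the pattern rather than summarising the full dataset.

```python
from collections import defaultdict

# (납세자유형, 납세자수) pairs copied from sample rows 0-9.
first_rows = [
    ("개인", 93), ("개인", 96), ("법인", 4), ("개인", 49198), ("개인", 90522),
    ("법인", 936), ("법인", 1272), ("개인", 20677), ("개인", 122727), ("법인", 1800),
]

# Sum the taxpayer counts per taxpayer type.
totals = defaultdict(int)
for taxpayer_type, taxpayer_count in first_rows:
    totals[taxpayer_type] += taxpayer_count

print(dict(totals))
```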