Overview

Dataset statistics

Number of variables9
Number of observations4781
Missing cells4
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory350.3 KiB
Average record size in memory75.0 B

Variable types

Categorical7
Numeric2

Dataset

Description서울시에서 제공하는 서초구 행정동별, 연료별 자동차 등록 현황으로 용도, 차종종별, 연료 등에 대한 정보를 제공합니다.
Author서울특별시 서초구
URLhttps://www.data.go.kr/data/15104433/fileData.do

Alerts

사용본거지시군구 has constant value ""Constant
시군구코드 has constant value ""Constant
행정동사용본거지코드 is highly overall correlated with 행정동사용본거지High correlation
행정동사용본거지 is highly overall correlated with 행정동사용본거지코드High correlation

Reproduction

Analysis started2023-12-12 19:43:50.128431
Analysis finished2023-12-12 19:43:51.577201
Duration1.45 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연월
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size37.5 KiB
2020-12
720 
2021-12
718 
2019-12
695 
2018-12
679 
2017-12
671 
Other values (2)
1298 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2015-12
2nd row2015-12
3rd row2015-12
4th row2015-12
5th row2015-12

Common Values

ValueCountFrequency (%)
2020-12 720
15.1%
2021-12 718
15.0%
2019-12 695
14.5%
2018-12 679
14.2%
2017-12 671
14.0%
2016-12 653
13.7%
2015-12 645
13.5%

Length

2023-12-13T04:43:51.652427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:51.788304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-12 720
15.1%
2021-12 718
15.0%
2019-12 695
14.5%
2018-12 679
14.2%
2017-12 671
14.0%
2016-12 653
13.7%
2015-12 645
13.5%

사용본거지시군구
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size37.5 KiB
서울특별시 서초구
4781 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시 서초구
2nd row서울특별시 서초구
3rd row서울특별시 서초구
4th row서울특별시 서초구
5th row서울특별시 서초구

Common Values

ValueCountFrequency (%)
서울특별시 서초구 4781
100.0%

Length

2023-12-13T04:43:51.927989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:52.038842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 4781
50.0%
서초구 4781
50.0%

시군구코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size37.5 KiB
11650
4781 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row11650
2nd row11650
3rd row11650
4th row11650
5th row11650

Common Values

ValueCountFrequency (%)
11650 4781
100.0%

Length

2023-12-13T04:43:52.143120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:52.271671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
11650 4781
100.0%

행정동사용본거지
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size37.5 KiB
<NA>
405 
서울특별시 서초구 서초1동
343 
서울특별시 서초구 서초3동
 
307
서울특별시 서초구 양재1동
 
302
서울특별시 서초구 서초2동
 
288
Other values (16)
3136 

Length

Max length16
Median length14
Mean length13.057101
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 405
 
8.5%
서울특별시 서초구 서초1동 343
 
7.2%
서울특별시 서초구 서초3동 307
 
6.4%
서울특별시 서초구 양재1동 302
 
6.3%
서울특별시 서초구 서초2동 288
 
6.0%
서울특별시 서초구 양재2동 280
 
5.9%
서울특별시 서초구 내곡동 272
 
5.7%
서울특별시 서초구 방배1동 252
 
5.3%
서울특별시 서초구 반포4동 240
 
5.0%
서울특별시 서초구 반포본동 227
 
4.7%
Other values (11) 1865
39.0%

Length

2023-12-13T04:43:52.763678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시 4369
32.3%
서초구 4369
32.3%
na 405
 
3.0%
서초1동 343
 
2.5%
서초3동 307
 
2.3%
양재1동 302
 
2.2%
서초2동 288
 
2.1%
양재2동 280
 
2.1%
내곡동 272
 
2.0%
방배1동 252
 
1.9%
Other values (16) 2353
17.4%

행정동사용본거지코드
Real number (ℝ)

HIGH CORRELATION 

Distinct46
Distinct (%)1.0%
Missing4
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean1.0989023 × 109
Minimum0
Maximum4.117159 × 109
Zeros15
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size42.1 KiB
2023-12-13T04:43:52.952456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile65100
Q11.165053 × 109
median1.165057 × 109
Q31.165062 × 109
95-th percentile1.165066 × 109
Maximum4.117159 × 109
Range4.117159 × 109
Interquartile range (IQR)9000

Descriptive statistics

Standard deviation3.0087015 × 108
Coefficient of variation (CV)0.27379153
Kurtosis22.642481
Mean1.0989023 × 109
Median Absolute Deviation (MAD)4000
Skewness-1.4588325
Sum5.2494563 × 1012
Variance9.0522845 × 1016
MonotonicityNot monotonic
2023-12-13T04:43:53.143581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)
ValueCountFrequency (%)
1165051000 343
 
7.2%
1165053000 307
 
6.4%
1165065100 302
 
6.3%
1165052000 288
 
6.0%
1165065200 280
 
5.9%
1165066000 272
 
5.7%
1165060000 252
 
5.3%
1165058100 240
 
5.0%
1165055000 227
 
4.7%
1165059000 226
 
4.7%
Other values (36) 2040
42.7%
ValueCountFrequency (%)
0 15
0.3%
51000 24
0.5%
52000 17
0.4%
53000 18
0.4%
53100 9
 
0.2%
54000 11
0.2%
55000 7
 
0.1%
56000 21
0.4%
57000 7
 
0.1%
58000 2
 
< 0.1%
ValueCountFrequency (%)
4117159000 7
 
0.1%
1165066000 272
5.7%
1165065200 280
5.9%
1165065100 302
6.3%
1165062100 217
4.5%
1165062000 218
4.6%
1165061000 208
4.4%
1165060000 252
5.3%
1165059000 226
4.7%
1165058100 240
5.0%

용도
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size37.5 KiB
자가용
3058 
영업용
1167 
관용
556 

Length

Max length3
Median length3
Mean length2.8837063
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자가용
2nd row자가용
3rd row자가용
4th row자가용
5th row자가용

Common Values

ValueCountFrequency (%)
자가용 3058
64.0%
영업용 1167
 
24.4%
관용 556
 
11.6%

Length

2023-12-13T04:43:53.317892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:53.454592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자가용 3058
64.0%
영업용 1167
 
24.4%
관용 556
 
11.6%

차종종별
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size37.5 KiB
승용
2274 
화물
1204 
승합
973 
특수
330 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row승용
2nd row화물
3rd row승합
4th row화물
5th row승용

Common Values

ValueCountFrequency (%)
승용 2274
47.6%
화물 1204
25.2%
승합 973
20.4%
특수 330
 
6.9%

Length

2023-12-13T04:43:53.584067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:53.694335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
승용 2274
47.6%
화물 1204
25.2%
승합 973
20.4%
특수 330
 
6.9%

연료
Categorical

Distinct13
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size37.5 KiB
경유
1346 
엘피지
716 
휘발유
653 
휘발유(무연)
591 
기타연료
407 
Other values (8)
1068 

Length

Max length13
Median length12
Mean length4.1886635
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row휘발유(무연)
2nd row휘발유(무연)
3rd row휘발유
4th row경유
5th row휘발유

Common Values

ValueCountFrequency (%)
경유 1346
28.2%
엘피지 716
15.0%
휘발유 653
13.7%
휘발유(무연) 591
12.4%
기타연료 407
 
8.5%
전기 254
 
5.3%
하이브리드(휘발유+전기) 234
 
4.9%
CNG 217
 
4.5%
하이브리드(LPG+전기) 134
 
2.8%
휘발유(유연) 106
 
2.2%
Other values (3) 123
 
2.6%

Length

2023-12-13T04:43:53.825285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경유 1346
28.2%
엘피지 716
15.0%
휘발유 653
13.7%
휘발유(무연 591
12.4%
기타연료 407
 
8.5%
전기 254
 
5.3%
하이브리드(휘발유+전기 234
 
4.9%
cng 217
 
4.5%
하이브리드(lpg+전기 134
 
2.8%
휘발유(유연 106
 
2.2%
Other values (3) 123
 
2.6%

건수
Real number (ℝ)

Distinct836
Distinct (%)17.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean263.92658
Minimum1
Maximum6975
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size42.1 KiB
2023-12-13T04:43:53.966527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median7
Q369
95-th percentile2058
Maximum6975
Range6974
Interquartile range (IQR)67

Descriptive statistics

Standard deviation762.95159
Coefficient of variation (CV)2.890772
Kurtosis17.073539
Mean263.92658
Median Absolute Deviation (MAD)6
Skewness3.9187969
Sum1261833
Variance582095.13
MonotonicityNot monotonic
2023-12-13T04:43:54.131308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 921
19.3%
2 471
 
9.9%
3 329
 
6.9%
4 287
 
6.0%
5 206
 
4.3%
6 153
 
3.2%
7 115
 
2.4%
8 93
 
1.9%
12 63
 
1.3%
9 61
 
1.3%
Other values (826) 2082
43.5%
ValueCountFrequency (%)
1 921
19.3%
2 471
9.9%
3 329
 
6.9%
4 287
 
6.0%
5 206
 
4.3%
6 153
 
3.2%
7 115
 
2.4%
8 93
 
1.9%
9 61
 
1.3%
10 47
 
1.0%
ValueCountFrequency (%)
6975 1
< 0.1%
6913 1
< 0.1%
6342 1
< 0.1%
6339 1
< 0.1%
6236 1
< 0.1%
6127 1
< 0.1%
6002 1
< 0.1%
4922 1
< 0.1%
4728 1
< 0.1%
4680 1
< 0.1%

Interactions

2023-12-13T04:43:50.989478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:50.740333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:51.124023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:50.867233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:43:54.266705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연월행정동사용본거지행정동사용본거지코드용도차종종별연료건수
연월1.0000.0000.0840.0000.0000.1460.000
행정동사용본거지0.0001.0001.0000.3560.0880.0200.321
행정동사용본거지코드0.0841.0001.0000.3760.0670.2850.066
용도0.0000.3560.3761.0000.0540.3500.239
차종종별0.0000.0880.0670.0541.0000.5300.268
연료0.1460.0200.2850.3500.5301.0000.273
건수0.0000.3210.0660.2390.2680.2731.000
2023-12-13T04:43:54.410711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연료용도연월행정동사용본거지차종종별
연료1.0000.2110.0680.0070.337
용도0.2111.0000.0000.2010.051
연월0.0680.0001.0000.0000.000
행정동사용본거지0.0070.2010.0001.0000.042
차종종별0.3370.0510.0000.0421.000
2023-12-13T04:43:54.553842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동사용본거지코드건수연월행정동사용본거지용도차종종별연료
행정동사용본거지코드1.0000.1650.0560.9980.1350.0630.167
건수0.1651.0000.0000.1070.1470.1630.116
연월0.0560.0001.0000.0000.0000.0000.068
행정동사용본거지0.9980.1070.0001.0000.2010.0420.007
용도0.1350.1470.0000.2011.0000.0510.211
차종종별0.0630.1630.0000.0420.0511.0000.337
연료0.1670.1160.0680.0070.2110.3371.000

Missing values

2023-12-13T04:43:51.313532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:43:51.501243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연월사용본거지시군구시군구코드행정동사용본거지행정동사용본거지코드용도차종종별연료건수
02015-12서울특별시 서초구11650<NA>51000자가용승용휘발유(무연)2
12015-12서울특별시 서초구11650<NA>51000자가용화물휘발유(무연)1
22015-12서울특별시 서초구11650<NA>52000자가용승합휘발유1
32015-12서울특별시 서초구11650<NA>52000자가용화물경유2
42015-12서울특별시 서초구11650<NA>53100자가용승용휘발유6
52015-12서울특별시 서초구11650<NA>58100자가용승용휘발유(무연)3
62015-12서울특별시 서초구11650<NA>59000자가용승합경유1
72015-12서울특별시 서초구11650<NA>61000자가용승용휘발유(무연)2
82015-12서울특별시 서초구11650<NA>62000자가용화물경유2
92015-12서울특별시 서초구11650<NA>62100자가용승용휘발유(무연)1
연월사용본거지시군구시군구코드행정동사용본거지행정동사용본거지코드용도차종종별연료건수
47712021-12서울특별시 서초구11650서울특별시 서초구 양재1동1165065100자가용특수기타연료7
47722021-12서울특별시 서초구11650서울특별시 서초구 양재1동1165065100영업용화물경유754
47732021-12서울특별시 서초구11650서울특별시 서초구 양재1동1165065100영업용화물기타연료13
47742021-12서울특별시 서초구11650서울특별시 서초구 양재2동1165065200관용승용휘발유(무연)7
47752021-12서울특별시 서초구11650서울특별시 서초구 양재2동1165065200관용화물엘피지11
47762021-12서울특별시 서초구11650서울특별시 서초구 양재2동1165065200영업용승용전기3
47772021-12서울특별시 서초구11650서울특별시 서초구 내곡동1165066000관용승합엘피지1
47782021-12서울특별시 서초구11650서울특별시 서초구 내곡동1165066000자가용승용휘발유1611
47792021-12서울특별시 서초구11650서울특별시 서초구 내곡동1165066000영업용승용전기44
47802021-12서울특별시 서초구11650서울특별시 서초구 내곡동1165066000영업용승합경유54