Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)< 0.1%
Total size in memory488.3 KiB
Average record size in memory50.0 B

Variable types

Categorical1
Text2
Numeric2

Dataset

Description대전광역시 2020년 횡단보도 통계 및 현황입니다. 2022년 공공데이터 기업매칭지원사업으로 수행되었습니다.
Author대전광역시
URLhttps://www.data.go.kr/data/15111036/fileData.do

Alerts

Dataset has 2 (< 0.1%) duplicate rowsDuplicates
경도 is highly overall correlated with 구명High correlation
구명 is highly overall correlated with 경도High correlation

Reproduction

Analysis started2023-12-12 20:15:39.839198
Analysis finished2023-12-12 20:15:41.059947
Duration1.22 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
유성구
2888 
서구
2413 
동구
1761 
중구
1503 
대덕구
1435 

Length

Max length3
Median length2
Mean length2.4323
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중구
2nd row중구
3rd row유성구
4th row유성구
5th row서구

Common Values

ValueCountFrequency (%)
유성구 2888
28.9%
서구 2413
24.1%
동구 1761
17.6%
중구 1503
15.0%
대덕구 1435
14.3%

Length

2023-12-13T05:15:41.141016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:15:41.293625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유성구 2888
28.9%
서구 2413
24.1%
동구 1761
17.6%
중구 1503
15.0%
대덕구 1435
14.3%

동명
Text

Distinct154
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T05:15:41.734358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.9444
Min length2

Characters and Unicode

Total characters29444
Distinct characters119
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)0.1%

Sample

1st row태평동
2nd row유천동
3rd row어은동
4th row추목동
5th row정림동
ValueCountFrequency (%)
둔산동 393
 
3.9%
관저동 347
 
3.5%
봉명동 242
 
2.4%
오정동 232
 
2.3%
지족동 218
 
2.2%
탄방동 188
 
1.9%
학하동 187
 
1.9%
월평동 187
 
1.9%
가양동 187
 
1.9%
도마동 185
 
1.8%
Other values (144) 7634
76.3%
2023-12-13T05:15:42.343333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10000
34.0%
698
 
2.4%
696
 
2.4%
632
 
2.1%
569
 
1.9%
547
 
1.9%
540
 
1.8%
513
 
1.7%
493
 
1.7%
478
 
1.6%
Other values (109) 14278
48.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29444
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10000
34.0%
698
 
2.4%
696
 
2.4%
632
 
2.1%
569
 
1.9%
547
 
1.9%
540
 
1.8%
513
 
1.7%
493
 
1.7%
478
 
1.6%
Other values (109) 14278
48.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29444
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10000
34.0%
698
 
2.4%
696
 
2.4%
632
 
2.1%
569
 
1.9%
547
 
1.9%
540
 
1.8%
513
 
1.7%
493
 
1.7%
478
 
1.6%
Other values (109) 14278
48.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29444
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10000
34.0%
698
 
2.4%
696
 
2.4%
632
 
2.1%
569
 
1.9%
547
 
1.9%
540
 
1.8%
513
 
1.7%
493
 
1.7%
478
 
1.6%
Other values (109) 14278
48.5%

지번
Text

Distinct3106
Distinct (%)31.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T05:15:42.739999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length5.9819
Min length2

Characters and Unicode

Total characters59819
Distinct characters16
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1811 ?
Unique (%)18.1%

Sample

1st row531번지
2nd row460번지
3rd row59-12번지
4th row557-2번지
5th row704번지
ValueCountFrequency (%)
번지 1416
 
12.4%
630번지 128
 
1.1%
260-1번지 72
 
0.6%
255-1번지 61
 
0.5%
458번지 57
 
0.5%
531번지 52
 
0.5%
385-1 49
 
0.4%
465번지 49
 
0.4%
457 44
 
0.4%
292번지 44
 
0.4%
Other values (3097) 9446
82.7%
2023-12-13T05:15:43.296487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10000
16.7%
10000
16.7%
1 5585
9.3%
5 3944
 
6.6%
2 3823
 
6.4%
4 3652
 
6.1%
3 3473
 
5.8%
6 3422
 
5.7%
- 3328
 
5.6%
7 3128
 
5.2%
Other values (6) 9464
15.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 34961
58.4%
Other Letter 20112
33.6%
Dash Punctuation 3328
 
5.6%
Space Separator 1418
 
2.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 5585
16.0%
5 3944
11.3%
2 3823
10.9%
4 3652
10.4%
3 3473
9.9%
6 3422
9.8%
7 3128
8.9%
0 2946
8.4%
9 2516
7.2%
8 2472
7.1%
Other Letter
ValueCountFrequency (%)
10000
49.7%
10000
49.7%
98
 
0.5%
14
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 3328
100.0%
Space Separator
ValueCountFrequency (%)
1418
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 39707
66.4%
Hangul 20112
33.6%

Most frequent character per script

Common
ValueCountFrequency (%)
1 5585
14.1%
5 3944
9.9%
2 3823
9.6%
4 3652
9.2%
3 3473
8.7%
6 3422
8.6%
- 3328
8.4%
7 3128
7.9%
0 2946
7.4%
9 2516
6.3%
Other values (2) 3890
9.8%
Hangul
ValueCountFrequency (%)
10000
49.7%
10000
49.7%
98
 
0.5%
14
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 39707
66.4%
Hangul 20112
33.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10000
49.7%
10000
49.7%
98
 
0.5%
14
 
0.1%
ASCII
ValueCountFrequency (%)
1 5585
14.1%
5 3944
9.9%
2 3823
9.6%
4 3652
9.2%
3 3473
8.7%
6 3422
8.6%
- 3328
8.4%
7 3128
7.9%
0 2946
7.4%
9 2516
6.3%
Other values (2) 3890
9.8%

경도
Real number (ℝ)

HIGH CORRELATION 

Distinct7701
Distinct (%)77.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.38963
Minimum127.27019
Maximum127.53756
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T05:15:43.501812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum127.27019
5-th percentile127.30999
Q1127.34888
median127.39244
Q3127.42714
95-th percentile127.45756
Maximum127.53756
Range0.26737
Interquartile range (IQR)0.078255

Descriptive statistics

Standard deviation0.046221192
Coefficient of variation (CV)0.00036283324
Kurtosis-0.88044243
Mean127.38963
Median Absolute Deviation (MAD)0.037642
Skewness-0.19975885
Sum1273896.3
Variance0.0021363985
MonotonicityNot monotonic
2023-12-13T05:15:43.706360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.389244 7
 
0.1%
127.37665 6
 
0.1%
127.340576 6
 
0.1%
127.376656 6
 
0.1%
127.38029 6
 
0.1%
127.38374 5
 
0.1%
127.37901 5
 
0.1%
127.37417 5
 
0.1%
127.4354 5
 
0.1%
127.43011 5
 
0.1%
Other values (7691) 9944
99.4%
ValueCountFrequency (%)
127.27019 1
< 0.1%
127.273476 1
< 0.1%
127.27948 1
< 0.1%
127.279884 1
< 0.1%
127.28087 1
< 0.1%
127.28109 1
< 0.1%
127.281586 1
< 0.1%
127.28401 1
< 0.1%
127.28446 1
< 0.1%
127.28618 1
< 0.1%
ValueCountFrequency (%)
127.53756 1
< 0.1%
127.53566 1
< 0.1%
127.534424 1
< 0.1%
127.5336 1
< 0.1%
127.53199 1
< 0.1%
127.53137 1
< 0.1%
127.53055 1
< 0.1%
127.5191 1
< 0.1%
127.515686 1
< 0.1%
127.51479 1
< 0.1%

위도
Real number (ℝ)

Distinct8440
Distinct (%)84.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.347559
Minimum36.200626
Maximum36.474888
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T05:15:43.887022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.200626
5-th percentile36.29581
Q136.321462
median36.343235
Q336.364611
95-th percentile36.432522
Maximum36.474888
Range0.274262
Interquartile range (IQR)0.04314825

Descriptive statistics

Standard deviation0.040161238
Coefficient of variation (CV)0.0011049226
Kurtosis0.71670706
Mean36.347559
Median Absolute Deviation (MAD)0.0216235
Skewness0.55776881
Sum363475.59
Variance0.001612925
MonotonicityNot monotonic
2023-12-13T05:15:44.027058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
36.348747 7
 
0.1%
36.338062 5
 
0.1%
36.359684 5
 
0.1%
36.355495 5
 
0.1%
36.303043 4
 
< 0.1%
36.305943 4
 
< 0.1%
36.356174 4
 
< 0.1%
36.32147 4
 
< 0.1%
36.35584 4
 
< 0.1%
36.353565 4
 
< 0.1%
Other values (8430) 9954
99.5%
ValueCountFrequency (%)
36.200626 1
< 0.1%
36.21362 1
< 0.1%
36.21651 1
< 0.1%
36.216564 1
< 0.1%
36.21792 1
< 0.1%
36.21805 1
< 0.1%
36.21817 1
< 0.1%
36.2182 1
< 0.1%
36.21828 1
< 0.1%
36.21832 1
< 0.1%
ValueCountFrequency (%)
36.474888 1
< 0.1%
36.4727 1
< 0.1%
36.472233 1
< 0.1%
36.471462 1
< 0.1%
36.471428 1
< 0.1%
36.47035 1
< 0.1%
36.470116 1
< 0.1%
36.469463 1
< 0.1%
36.469036 1
< 0.1%
36.46709 1
< 0.1%

Interactions

2023-12-13T05:15:40.605597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:15:40.337368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:15:40.734299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:15:40.470425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:15:44.123935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구명경도위도
구명1.0000.9060.792
경도0.9061.0000.630
위도0.7920.6301.000
2023-12-13T05:15:44.216367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
경도위도구명
경도1.000-0.0780.600
위도-0.0781.0000.448
구명0.6000.4481.000

Missing values

2023-12-13T05:15:40.874103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:15:41.004676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구명동명지번경도위도
11666중구태평동531번지127.396836.33008
11864중구유천동460번지127.39587436.32204
9762유성구어은동59-12번지127.3574936.361904
10170유성구추목동557-2번지127.3442436.41292
5881서구정림동704번지127.3664836.30145
1042대덕구송촌동568번지127.4458936.359684
10235유성구덕명동563 번지127.3016736.35842
11449중구용두동143-26번지127.4058936.33189
3850동구용운동757번지127.455336.33272
9595유성구관평동673번지127.38740536.42176
구명동명지번경도위도
10252유성구도룡동4-25번지127.3962336.376595
7794유성구관평동1292번지127.3882336.426685
5336서구변동254-416번지127.3681136.331303
8406유성구용계동684 번지127.33510636.328815
9951유성구지족동1112 번지127.30521436.38917
2120동구홍도동842-1번지127.4277336.345535
9201유성구관평동1286번지127.3883736.427555
8318유성구상대동497번지127.3390536.345505
493대덕구덕암동342번지127.4269936.439724
2543동구용운동77번지127.4594736.34296

Duplicate rows

Most frequently occurring

구명동명지번경도위도# duplicates
0동구신흥동152-2번지127.44086536.325832
1서구관저동1506번지127.34057636.2990342