Overview

Dataset statistics

Number of variables5
Number of observations385
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.3 KiB
Average record size in memory43.3 B

Variable types

Numeric3
Text2

Dataset

Description대구광역시_도시철도역 출구 인근 버스 정류소 정보_20170124
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15050940&dataSetDetailId=150509402c77f07f0986a&provdMethod=FILE

Reproduction

Analysis started2024-04-19 05:42:37.345964
Analysis finished2024-04-19 05:42:38.518949
Duration1.17 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

도시철도역_ID
Real number (ℝ)

Distinct87
Distinct (%)22.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean212.65195
Minimum117
Maximum341
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.5 KiB
2024-04-19T14:42:38.590652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum117
5-th percentile120
Q1132
median224
Q3244
95-th percentile334
Maximum341
Range224
Interquartile range (IQR)112

Descriptive statistics

Standard deviation78.147553
Coefficient of variation (CV)0.36749042
Kurtosis-1.330334
Mean212.65195
Median Absolute Deviation (MAD)90
Skewness0.26541783
Sum81871
Variance6107.04
MonotonicityIncreasing
2024-04-19T14:42:38.727581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
131 14
 
3.6%
230 11
 
2.9%
130 11
 
2.9%
120 9
 
2.3%
118 8
 
2.1%
221 8
 
2.1%
243 8
 
2.1%
134 8
 
2.1%
314 8
 
2.1%
312 8
 
2.1%
Other values (77) 292
75.8%
ValueCountFrequency (%)
117 5
1.3%
118 8
2.1%
119 4
1.0%
120 9
2.3%
121 8
2.1%
122 4
1.0%
123 7
1.8%
124 4
1.0%
125 5
1.3%
126 5
1.3%
ValueCountFrequency (%)
341 4
1.0%
340 4
1.0%
339 1
 
0.3%
338 2
0.5%
337 2
0.5%
336 4
1.0%
335 2
0.5%
334 4
1.0%
333 4
1.0%
332 4
1.0%
Distinct84
Distinct (%)21.8%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
2024-04-19T14:42:38.936451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length2
Mean length2.9532468
Min length2

Characters and Unicode

Total characters1137
Distinct characters117
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)1.3%

Sample

1st row대곡
2nd row대곡
3rd row대곡
4th row대곡
5th row대곡
ValueCountFrequency (%)
반월당 22
 
5.7%
중앙로 14
 
3.6%
서문시장 10
 
2.6%
명덕 10
 
2.6%
상인 9
 
2.3%
진천 8
 
2.1%
성서공단 8
 
2.1%
내당 8
 
2.1%
신천 8
 
2.1%
월촌 8
 
2.1%
Other values (74) 280
72.7%
2024-04-19T14:42:39.271498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
79
 
6.9%
45
 
4.0%
40
 
3.5%
37
 
3.3%
32
 
2.8%
30
 
2.6%
24
 
2.1%
24
 
2.1%
23
 
2.0%
21
 
1.8%
Other values (107) 782
68.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1097
96.5%
Open Punctuation 13
 
1.1%
Close Punctuation 13
 
1.1%
Other Punctuation 8
 
0.7%
Uppercase Letter 6
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
79
 
7.2%
45
 
4.1%
40
 
3.6%
37
 
3.4%
32
 
2.9%
30
 
2.7%
24
 
2.2%
24
 
2.2%
23
 
2.1%
21
 
1.9%
Other values (100) 742
67.6%
Uppercase Letter
ValueCountFrequency (%)
T 2
33.3%
B 2
33.3%
C 2
33.3%
Other Punctuation
ValueCountFrequency (%)
, 4
50.0%
. 4
50.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1097
96.5%
Common 34
 
3.0%
Latin 6
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
79
 
7.2%
45
 
4.1%
40
 
3.6%
37
 
3.4%
32
 
2.9%
30
 
2.7%
24
 
2.2%
24
 
2.2%
23
 
2.1%
21
 
1.9%
Other values (100) 742
67.6%
Common
ValueCountFrequency (%)
( 13
38.2%
) 13
38.2%
, 4
 
11.8%
. 4
 
11.8%
Latin
ValueCountFrequency (%)
T 2
33.3%
B 2
33.3%
C 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1097
96.5%
ASCII 40
 
3.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
79
 
7.2%
45
 
4.1%
40
 
3.6%
37
 
3.4%
32
 
2.9%
30
 
2.7%
24
 
2.2%
24
 
2.2%
23
 
2.1%
21
 
1.9%
Other values (100) 742
67.6%
ASCII
ValueCountFrequency (%)
( 13
32.5%
) 13
32.5%
, 4
 
10.0%
. 4
 
10.0%
T 2
 
5.0%
B 2
 
5.0%
C 2
 
5.0%
Distinct14
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.1272727
Minimum1
Maximum19
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.5 KiB
2024-04-19T14:42:39.382659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q34
95-th percentile8
Maximum19
Range18
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.7626431
Coefficient of variation (CV)0.88340332
Kurtosis13.551165
Mean3.1272727
Median Absolute Deviation (MAD)1
Skewness3.1648013
Sum1204
Variance7.632197
MonotonicityNot monotonic
2024-04-19T14:42:39.501897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
1 114
29.6%
3 76
19.7%
4 73
19.0%
2 70
18.2%
5 15
 
3.9%
6 9
 
2.3%
7 7
 
1.8%
8 7
 
1.8%
18 4
 
1.0%
9 2
 
0.5%
Other values (4) 8
 
2.1%
ValueCountFrequency (%)
1 114
29.6%
2 70
18.2%
3 76
19.7%
4 73
19.0%
5 15
 
3.9%
6 9
 
2.3%
7 7
 
1.8%
8 7
 
1.8%
9 2
 
0.5%
10 2
 
0.5%
ValueCountFrequency (%)
19 2
 
0.5%
18 4
 
1.0%
13 2
 
0.5%
11 2
 
0.5%
10 2
 
0.5%
9 2
 
0.5%
8 7
1.8%
7 7
1.8%
6 9
2.3%
5 15
3.9%

정류소_ID
Real number (ℝ)

Distinct314
Distinct (%)81.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.0263896 × 109
Minimum3.6000907 × 109
Maximum7.1210058 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.5 KiB
2024-04-19T14:42:39.624980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.6000907 × 109
5-th percentile7.0010018 × 109
Q17.0110145 × 109
median7.041001 × 109
Q37.0510083 × 109
95-th percentile7.1110101 × 109
Maximum7.1210058 × 109
Range3.5209151 × 109
Interquartile range (IQR)39993800

Descriptive statistics

Standard deviation1.7765777 × 108
Coefficient of variation (CV)0.02528436
Kurtosis362.92899
Mean7.0263896 × 109
Median Absolute Deviation (MAD)20015600
Skewness-18.768448
Sum2.70516 × 1012
Variance3.1562282 × 1016
MonotonicityNot monotonic
2024-04-19T14:42:39.758037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7121005000 4
 
1.0%
7001005300 4
 
1.0%
7021045200 4
 
1.0%
7021003800 4
 
1.0%
7121004900 4
 
1.0%
7001003200 3
 
0.8%
7061026200 3
 
0.8%
7001002100 3
 
0.8%
7121005700 3
 
0.8%
7031009000 2
 
0.5%
Other values (304) 351
91.2%
ValueCountFrequency (%)
3600090700 1
0.3%
7001000500 1
0.3%
7001000600 1
0.3%
7001000800 1
0.3%
7001000900 1
0.3%
7001001000 1
0.3%
7001001100 2
0.5%
7001001200 2
0.5%
7001001300 2
0.5%
7001001400 2
0.5%
ValueCountFrequency (%)
7121005800 2
0.5%
7121005700 3
0.8%
7121005300 1
 
0.3%
7121005200 1
 
0.3%
7121005100 1
 
0.3%
7121005000 4
1.0%
7121004900 4
1.0%
7111042100 1
 
0.3%
7111042000 1
 
0.3%
7111032700 1
 
0.3%
Distinct314
Distinct (%)81.6%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
2024-04-19T14:42:39.987340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length7.7922078
Min length4

Characters and Unicode

Total characters3000
Distinct characters226
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique257 ?
Unique (%)66.8%

Sample

1st row대곡역(1번출구)
2nd row대곡역(한라하우젠트앞)
3rd row대곡역(2번출구)
4th row유천교1
5th row유천교2
ValueCountFrequency (%)
영남대 5
 
1.3%
임당역1번출구 4
 
1.0%
동산의료원앞1 4
 
1.0%
임당역2번출구 4
 
1.0%
칠곡경대병원역 4
 
1.0%
경북농업기술원앞 4
 
1.0%
3
 
0.8%
신매광장건너 3
 
0.8%
남산초등학교앞 3
 
0.8%
명덕역(7번출구 3
 
0.8%
Other values (305) 353
90.5%
2024-04-19T14:42:40.344843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
149
 
5.0%
138
 
4.6%
112
 
3.7%
( 97
 
3.2%
) 97
 
3.2%
95
 
3.2%
94
 
3.1%
1 91
 
3.0%
86
 
2.9%
86
 
2.9%
Other values (216) 1955
65.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2569
85.6%
Decimal Number 210
 
7.0%
Open Punctuation 97
 
3.2%
Close Punctuation 97
 
3.2%
Uppercase Letter 15
 
0.5%
Other Punctuation 7
 
0.2%
Space Separator 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
149
 
5.8%
138
 
5.4%
112
 
4.4%
95
 
3.7%
94
 
3.7%
86
 
3.3%
86
 
3.3%
85
 
3.3%
50
 
1.9%
48
 
1.9%
Other values (197) 1626
63.3%
Decimal Number
ValueCountFrequency (%)
1 91
43.3%
2 73
34.8%
4 18
 
8.6%
3 12
 
5.7%
5 5
 
2.4%
7 4
 
1.9%
8 3
 
1.4%
9 2
 
1.0%
6 2
 
1.0%
Uppercase Letter
ValueCountFrequency (%)
C 5
33.3%
G 3
20.0%
B 2
 
13.3%
T 2
 
13.3%
V 2
 
13.3%
N 1
 
6.7%
Open Punctuation
ValueCountFrequency (%)
( 97
100.0%
Close Punctuation
ValueCountFrequency (%)
) 97
100.0%
Other Punctuation
ValueCountFrequency (%)
. 7
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2569
85.6%
Common 416
 
13.9%
Latin 15
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
149
 
5.8%
138
 
5.4%
112
 
4.4%
95
 
3.7%
94
 
3.7%
86
 
3.3%
86
 
3.3%
85
 
3.3%
50
 
1.9%
48
 
1.9%
Other values (197) 1626
63.3%
Common
ValueCountFrequency (%)
( 97
23.3%
) 97
23.3%
1 91
21.9%
2 73
17.5%
4 18
 
4.3%
3 12
 
2.9%
. 7
 
1.7%
5 5
 
1.2%
5
 
1.2%
7 4
 
1.0%
Other values (3) 7
 
1.7%
Latin
ValueCountFrequency (%)
C 5
33.3%
G 3
20.0%
B 2
 
13.3%
T 2
 
13.3%
V 2
 
13.3%
N 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2569
85.6%
ASCII 431
 
14.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
149
 
5.8%
138
 
5.4%
112
 
4.4%
95
 
3.7%
94
 
3.7%
86
 
3.3%
86
 
3.3%
85
 
3.3%
50
 
1.9%
48
 
1.9%
Other values (197) 1626
63.3%
ASCII
ValueCountFrequency (%)
( 97
22.5%
) 97
22.5%
1 91
21.1%
2 73
16.9%
4 18
 
4.2%
3 12
 
2.8%
. 7
 
1.6%
5 5
 
1.2%
5
 
1.2%
C 5
 
1.2%
Other values (9) 21
 
4.9%

Interactions

2024-04-19T14:42:38.094278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:42:37.588692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:42:37.833191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:42:38.181555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:42:37.663749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:42:37.915998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:42:38.277858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:42:37.745343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:42:37.999772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-19T14:42:40.750898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도시철도역_ID도시철도역 명도시철도역 출구번호정류소_ID
도시철도역_ID1.0000.9990.121NaN
도시철도역 명0.9991.0000.000NaN
도시철도역 출구번호0.1210.0001.000NaN
정류소_IDNaNNaNNaN1.000
2024-04-19T14:42:40.839879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도시철도역_ID도시철도역 출구번호정류소_ID
도시철도역_ID1.000-0.1900.125
도시철도역 출구번호-0.1901.000-0.079
정류소_ID0.125-0.0791.000

Missing values

2024-04-19T14:42:38.398424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T14:42:38.484200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

도시철도역_ID도시철도역 명도시철도역 출구번호정류소_ID정류소 명
0117대곡17111000100대곡역(1번출구)
1117대곡27041033900대곡역(한라하우젠트앞)
2117대곡37111000200대곡역(2번출구)
3117대곡47041053200유천교1
4117대곡47041053300유천교2
5118진천17041019800진천역(1번출구)
6118진천27041019900진천역(2번출구)
7118진천37041018600월배시장앞
8118진천37041020200진천청구타운앞
9118진천37041020300진천청구타운건너
도시철도역_ID도시철도역 명도시철도역 출구번호정류소_ID정류소 명
375338수성못(TBC)17061012900TBC건너
376339지산17061020300동아스포츠센터(지산중학교앞)
377340범물17061020900동아백화점수성점앞
378340범물27061020800동아백화점수성점건너
379340범물37061020800동아백화점수성점건너
380340범물47061020900동아백화점수성점앞
381341용지17061021400범물1동주민센터건너
382341용지27061021300범물1동주민센터앞
383341용지37061021300범물1동주민센터앞
384341용지47061021400범물1동주민센터건너