Overview

Dataset statistics

Number of variables12
Number of observations8075
Missing cells15
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory812.4 KiB
Average record size in memory103.0 B

Variable types

Numeric7
Text5

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15262/S/1/datasetView.do

Alerts

ROUTE_ID is highly overall correlated with ROUTE_TY and 1 other fieldsHigh correlation
DSTNC is highly overall correlated with FIRCAR_TM and 1 other fieldsHigh correlation
ROUTE_TY is highly overall correlated with ROUTE_IDHigh correlation
FIRCAR_TM is highly overall correlated with ROUTE_ID and 1 other fieldsHigh correlation
LSTCAR_TM is highly overall correlated with DSTNCHigh correlation

Reproduction

Analysis started2024-05-11 06:16:44.024039
Analysis finished2024-05-11 06:16:56.088175
Duration12.06 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

STDR_DE
Real number (ℝ)

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20230654
Minimum20230101
Maximum20231201
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size71.1 KiB
2024-05-11T15:16:56.195534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20230101
5-th percentile20230101
Q120230401
median20230701
Q320231001
95-th percentile20231201
Maximum20231201
Range1100
Interquartile range (IQR)600

Descriptive statistics

Standard deviation345.08993
Coefficient of variation (CV)1.7057775 × 10-5
Kurtosis-1.2154818
Mean20230654
Median Absolute Deviation (MAD)300
Skewness-0.0087972035
Sum1.6336253 × 1011
Variance119087.06
MonotonicityIncreasing
2024-05-11T15:16:56.411159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
20231201 682
8.4%
20231001 678
8.4%
20231101 678
8.4%
20230901 677
8.4%
20230701 675
8.4%
20230601 674
8.3%
20230501 673
8.3%
20230801 673
8.3%
20230401 669
8.3%
20230301 667
8.3%
Other values (2) 1329
16.5%
ValueCountFrequency (%)
20230101 663
8.2%
20230201 666
8.2%
20230301 667
8.3%
20230401 669
8.3%
20230501 673
8.3%
20230601 674
8.3%
20230701 675
8.4%
20230801 673
8.3%
20230901 677
8.4%
20231001 678
8.4%
ValueCountFrequency (%)
20231201 682
8.4%
20231101 678
8.4%
20231001 678
8.4%
20230901 677
8.4%
20230801 673
8.3%
20230701 675
8.4%
20230601 674
8.3%
20230501 673
8.3%
20230401 669
8.3%
20230301 667
8.3%

ROUTE_ID
Real number (ℝ)

HIGH CORRELATION 

Distinct694
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.0641136 × 108
Minimum1.0000002 × 108
Maximum1.249 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size71.1 KiB
2024-05-11T15:16:56.630594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.0000002 × 108
5-th percentile1.0010004 × 108
Q11.0010024 × 108
median1.0010059 × 108
Q31.1290001 × 108
95-th percentile1.2190002 × 108
Maximum1.249 × 108
Range24899986
Interquartile range (IQR)12799776

Descriptive statistics

Standard deviation8114419.6
Coefficient of variation (CV)0.076255204
Kurtosis-0.80504771
Mean1.0641136 × 108
Median Absolute Deviation (MAD)561
Skewness0.85247929
Sum8.592717 × 1011
Variance6.5843805 × 1013
MonotonicityNot monotonic
2024-05-11T15:16:56.865893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100100124 12
 
0.1%
120900002 12
 
0.1%
108900001 12
 
0.1%
108900012 12
 
0.1%
115900006 12
 
0.1%
115900003 12
 
0.1%
115900004 12
 
0.1%
115900001 12
 
0.1%
115900005 12
 
0.1%
115900008 12
 
0.1%
Other values (684) 7955
98.5%
ValueCountFrequency (%)
100000017 12
0.1%
100000018 12
0.1%
100000020 11
0.1%
100100001 12
0.1%
100100006 12
0.1%
100100007 12
0.1%
100100008 12
0.1%
100100009 12
0.1%
100100010 12
0.1%
100100011 12
0.1%
ValueCountFrequency (%)
124900003 12
0.1%
124900002 12
0.1%
124900001 12
0.1%
124000039 12
0.1%
124000038 12
0.1%
124000036 12
0.1%
124000016 9
0.1%
124000015 9
0.1%
124000014 8
0.1%
124000013 12
0.1%
Distinct699
Distinct (%)8.7%
Missing0
Missing (%)0.0%
Memory size63.2 KiB
2024-05-11T15:16:57.236283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length4
Mean length3.9611146
Min length2

Characters and Unicode

Total characters31986
Distinct characters78
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.1%

Sample

1st row0017
2nd row01
3rd row0411
4th row100
5th row101
ValueCountFrequency (%)
0017 12
 
0.1%
관악08 12
 
0.1%
강북10 12
 
0.1%
강서05-1 12
 
0.1%
강북11 12
 
0.1%
강북12 12
 
0.1%
강서01 12
 
0.1%
강서02 12
 
0.1%
강서03 12
 
0.1%
강서04 12
 
0.1%
Other values (689) 7955
98.5%
2024-05-11T15:16:57.810628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 5346
16.7%
0 4462
13.9%
2 2913
 
9.1%
6 2558
 
8.0%
3 2190
 
6.8%
7 1990
 
6.2%
5 1931
 
6.0%
4 1844
 
5.8%
8 701
 
2.2%
564
 
1.8%
Other values (68) 7487
23.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24430
76.4%
Other Letter 6703
 
21.0%
Uppercase Letter 663
 
2.1%
Dash Punctuation 168
 
0.5%
Close Punctuation 11
 
< 0.1%
Open Punctuation 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
564
 
8.4%
486
 
7.3%
384
 
5.7%
373
 
5.6%
372
 
5.5%
355
 
5.3%
311
 
4.6%
292
 
4.4%
252
 
3.8%
252
 
3.8%
Other values (48) 3062
45.7%
Decimal Number
ValueCountFrequency (%)
1 5346
21.9%
0 4462
18.3%
2 2913
11.9%
6 2558
10.5%
3 2190
9.0%
7 1990
 
8.1%
5 1931
 
7.9%
4 1844
 
7.5%
8 701
 
2.9%
9 495
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
N 209
31.5%
A 89
13.4%
B 77
 
11.6%
R 72
 
10.9%
U 72
 
10.9%
O 72
 
10.9%
T 72
 
10.9%
Dash Punctuation
ValueCountFrequency (%)
- 168
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 24620
77.0%
Hangul 6703
 
21.0%
Latin 663
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
564
 
8.4%
486
 
7.3%
384
 
5.7%
373
 
5.6%
372
 
5.5%
355
 
5.3%
311
 
4.6%
292
 
4.4%
252
 
3.8%
252
 
3.8%
Other values (48) 3062
45.7%
Common
ValueCountFrequency (%)
1 5346
21.7%
0 4462
18.1%
2 2913
11.8%
6 2558
10.4%
3 2190
8.9%
7 1990
 
8.1%
5 1931
 
7.8%
4 1844
 
7.5%
8 701
 
2.8%
9 495
 
2.0%
Other values (3) 190
 
0.8%
Latin
ValueCountFrequency (%)
N 209
31.5%
A 89
13.4%
B 77
 
11.6%
R 72
 
10.9%
U 72
 
10.9%
O 72
 
10.9%
T 72
 
10.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25283
79.0%
Hangul 6703
 
21.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 5346
21.1%
0 4462
17.6%
2 2913
11.5%
6 2558
10.1%
3 2190
8.7%
7 1990
 
7.9%
5 1931
 
7.6%
4 1844
 
7.3%
8 701
 
2.8%
9 495
 
2.0%
Other values (10) 853
 
3.4%
Hangul
ValueCountFrequency (%)
564
 
8.4%
486
 
7.3%
384
 
5.7%
373
 
5.6%
372
 
5.5%
355
 
5.3%
311
 
4.6%
292
 
4.4%
252
 
3.8%
252
 
3.8%
Other values (48) 3062
45.7%
Distinct699
Distinct (%)8.7%
Missing0
Missing (%)0.0%
Memory size63.2 KiB
2024-05-11T15:16:58.330193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length4
Mean length3.8579567
Min length2

Characters and Unicode

Total characters31153
Distinct characters59
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.1%

Sample

1st row0017
2nd row01
3rd row0411
4th row100
5th row101
ValueCountFrequency (%)
0017 12
 
0.1%
관악08 12
 
0.1%
강북10 12
 
0.1%
강서5-1 12
 
0.1%
강북11 12
 
0.1%
강북12 12
 
0.1%
강서01 12
 
0.1%
강서02 12
 
0.1%
강서03 12
 
0.1%
강서04 12
 
0.1%
Other values (689) 7955
98.5%
2024-05-11T15:16:59.040470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 5346
17.2%
0 4318
13.9%
2 2913
9.4%
6 2558
 
8.2%
3 2190
 
7.0%
7 1990
 
6.4%
5 1931
 
6.2%
4 1844
 
5.9%
8 701
 
2.3%
552
 
1.8%
Other values (49) 6810
21.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24286
78.0%
Other Letter 6036
 
19.4%
Uppercase Letter 663
 
2.1%
Dash Punctuation 168
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
552
 
9.1%
486
 
8.1%
384
 
6.4%
373
 
6.2%
355
 
5.9%
292
 
4.8%
276
 
4.6%
252
 
4.2%
240
 
4.0%
216
 
3.6%
Other values (31) 2610
43.2%
Decimal Number
ValueCountFrequency (%)
1 5346
22.0%
0 4318
17.8%
2 2913
12.0%
6 2558
10.5%
3 2190
9.0%
7 1990
 
8.2%
5 1931
 
8.0%
4 1844
 
7.6%
8 701
 
2.9%
9 495
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
N 209
31.5%
A 89
13.4%
B 77
 
11.6%
U 72
 
10.9%
O 72
 
10.9%
R 72
 
10.9%
T 72
 
10.9%
Dash Punctuation
ValueCountFrequency (%)
- 168
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 24454
78.5%
Hangul 6036
 
19.4%
Latin 663
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
552
 
9.1%
486
 
8.1%
384
 
6.4%
373
 
6.2%
355
 
5.9%
292
 
4.8%
276
 
4.6%
252
 
4.2%
240
 
4.0%
216
 
3.6%
Other values (31) 2610
43.2%
Common
ValueCountFrequency (%)
1 5346
21.9%
0 4318
17.7%
2 2913
11.9%
6 2558
10.5%
3 2190
9.0%
7 1990
 
8.1%
5 1931
 
7.9%
4 1844
 
7.5%
8 701
 
2.9%
9 495
 
2.0%
Latin
ValueCountFrequency (%)
N 209
31.5%
A 89
13.4%
B 77
 
11.6%
U 72
 
10.9%
O 72
 
10.9%
R 72
 
10.9%
T 72
 
10.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25117
80.6%
Hangul 6036
 
19.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 5346
21.3%
0 4318
17.2%
2 2913
11.6%
6 2558
10.2%
3 2190
8.7%
7 1990
 
7.9%
5 1931
 
7.7%
4 1844
 
7.3%
8 701
 
2.8%
9 495
 
2.0%
Other values (8) 831
 
3.3%
Hangul
ValueCountFrequency (%)
552
 
9.1%
486
 
8.1%
384
 
6.4%
373
 
6.2%
355
 
5.9%
292
 
4.8%
276
 
4.6%
252
 
4.2%
240
 
4.0%
216
 
3.6%
Other values (31) 2610
43.2%

DSTNC
Real number (ℝ)

HIGH CORRELATION 

Distinct488
Distinct (%)6.1%
Missing14
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean32.454641
Minimum1.2
Maximum220
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size71.1 KiB
2024-05-11T15:16:59.257301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.2
5-th percentile4
Q18.72
median23.4
Q345
95-th percentile85.9
Maximum220
Range218.8
Interquartile range (IQR)36.28

Descriptive statistics

Standard deviation34.832349
Coefficient of variation (CV)1.0732625
Kurtosis7.9392709
Mean32.454641
Median Absolute Deviation (MAD)15.8
Skewness2.5627973
Sum261616.86
Variance1213.2925
MonotonicityNot monotonic
2024-05-11T15:16:59.450206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7.0 116
 
1.4%
12.0 82
 
1.0%
13.0 79
 
1.0%
7.2 72
 
0.9%
5.5 69
 
0.9%
39.0 66
 
0.8%
7.5 63
 
0.8%
13.3 60
 
0.7%
4.8 57
 
0.7%
7.8 57
 
0.7%
Other values (478) 7340
90.9%
ValueCountFrequency (%)
1.2 12
 
0.1%
1.6 12
 
0.1%
1.8 6
 
0.1%
1.9 3
 
< 0.1%
2.0 9
 
0.1%
2.1 24
0.3%
2.4 3
 
< 0.1%
2.5 10
 
0.1%
2.6 44
0.5%
2.7 9
 
0.1%
ValueCountFrequency (%)
220.0 7
 
0.1%
204.4 11
0.1%
204.0 1
 
< 0.1%
201.4 12
0.1%
196.0 12
0.1%
193.0 10
0.1%
190.0 6
 
0.1%
188.0 13
0.2%
187.6 12
0.1%
184.0 24
0.3%

ROUTE_TY
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0022291
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size71.1 KiB
2024-05-11T15:16:59.601666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile4
Maximum13
Range12
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.2833589
Coefficient of variation (CV)0.42746869
Kurtosis11.401701
Mean3.0022291
Median Absolute Deviation (MAD)1
Skewness2.1511805
Sum24243
Variance1.6470101
MonotonicityNot monotonic
2024-05-11T15:16:59.763814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
2 2970
36.8%
4 2745
34.0%
3 1716
21.3%
1 406
 
5.0%
6 123
 
1.5%
10 72
 
0.9%
5 31
 
0.4%
13 12
 
0.1%
ValueCountFrequency (%)
1 406
 
5.0%
2 2970
36.8%
3 1716
21.3%
4 2745
34.0%
5 31
 
0.4%
6 123
 
1.5%
10 72
 
0.9%
13 12
 
0.1%
ValueCountFrequency (%)
13 12
 
0.1%
10 72
 
0.9%
6 123
 
1.5%
5 31
 
0.4%
4 2745
34.0%
3 1716
21.3%
2 2970
36.8%
1 406
 
5.0%
Distinct461
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size63.2 KiB
2024-05-11T15:17:00.075378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length5.2144892
Min length2

Characters and Unicode

Total characters42107
Distinct characters316
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.1%

Sample

1st row청암동
2nd row예장주차장
3rd row용산차고지
4th row하계동
5th row우이동
ValueCountFrequency (%)
양천공영차고지 216
 
2.7%
복정역환승센터 168
 
2.1%
은평차고지 156
 
1.9%
중랑공영차고지 138
 
1.7%
인천공항 137
 
1.7%
장지공영차고지 120
 
1.5%
우이동 120
 
1.5%
진관공영차고지 108
 
1.3%
강동공영차고지 96
 
1.2%
정릉 84
 
1.0%
Other values (451) 6732
83.4%
2024-05-11T15:17:00.616773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2734
 
6.5%
2219
 
5.3%
1846
 
4.4%
1770
 
4.2%
1358
 
3.2%
1290
 
3.1%
1037
 
2.5%
734
 
1.7%
723
 
1.7%
706
 
1.7%
Other values (306) 27690
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40633
96.5%
Decimal Number 815
 
1.9%
Uppercase Letter 289
 
0.7%
Other Punctuation 262
 
0.6%
Open Punctuation 48
 
0.1%
Close Punctuation 48
 
0.1%
Lowercase Letter 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2734
 
6.7%
2219
 
5.5%
1846
 
4.5%
1770
 
4.4%
1358
 
3.3%
1290
 
3.2%
1037
 
2.6%
734
 
1.8%
723
 
1.8%
706
 
1.7%
Other values (282) 26216
64.5%
Uppercase Letter
ValueCountFrequency (%)
T 87
30.1%
L 41
14.2%
H 32
 
11.1%
A 30
 
10.4%
P 30
 
10.4%
K 21
 
7.3%
C 18
 
6.2%
E 12
 
4.2%
S 9
 
3.1%
G 9
 
3.1%
Decimal Number
ValueCountFrequency (%)
1 225
27.6%
2 159
19.5%
7 128
15.7%
4 72
 
8.8%
3 69
 
8.5%
5 63
 
7.7%
6 51
 
6.3%
8 24
 
2.9%
0 24
 
2.9%
Other Punctuation
ValueCountFrequency (%)
. 259
98.9%
, 3
 
1.1%
Open Punctuation
ValueCountFrequency (%)
( 48
100.0%
Close Punctuation
ValueCountFrequency (%)
) 48
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40633
96.5%
Common 1173
 
2.8%
Latin 301
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2734
 
6.7%
2219
 
5.5%
1846
 
4.5%
1770
 
4.4%
1358
 
3.3%
1290
 
3.2%
1037
 
2.6%
734
 
1.8%
723
 
1.8%
706
 
1.7%
Other values (282) 26216
64.5%
Common
ValueCountFrequency (%)
. 259
22.1%
1 225
19.2%
2 159
13.6%
7 128
10.9%
4 72
 
6.1%
3 69
 
5.9%
5 63
 
5.4%
6 51
 
4.3%
( 48
 
4.1%
) 48
 
4.1%
Other values (3) 51
 
4.3%
Latin
ValueCountFrequency (%)
T 87
28.9%
L 41
13.6%
H 32
 
10.6%
A 30
 
10.0%
P 30
 
10.0%
K 21
 
7.0%
C 18
 
6.0%
e 12
 
4.0%
E 12
 
4.0%
S 9
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40633
96.5%
ASCII 1474
 
3.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2734
 
6.7%
2219
 
5.5%
1846
 
4.5%
1770
 
4.4%
1358
 
3.3%
1290
 
3.2%
1037
 
2.6%
734
 
1.8%
723
 
1.8%
706
 
1.7%
Other values (282) 26216
64.5%
ASCII
ValueCountFrequency (%)
. 259
17.6%
1 225
15.3%
2 159
10.8%
7 128
8.7%
T 87
 
5.9%
4 72
 
4.9%
3 69
 
4.7%
5 63
 
4.3%
6 51
 
3.5%
( 48
 
3.3%
Other values (14) 313
21.2%
Distinct455
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size63.2 KiB
2024-05-11T15:17:01.063386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length4.5648297
Min length2

Characters and Unicode

Total characters36861
Distinct characters305
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row이촌동
2nd row예장주차장
3rd rowAT센터.양재꽃시장
4th row용산구청
5th row서소문
ValueCountFrequency (%)
인천공항 170
 
2.1%
서울역 155
 
1.9%
강남역 145
 
1.8%
여의도 143
 
1.8%
석계역 106
 
1.3%
홍대입구역 96
 
1.2%
양재역 96
 
1.2%
대방역 87
 
1.1%
구로디지털단지역 84
 
1.0%
수유역 84
 
1.0%
Other values (445) 6909
85.6%
2024-05-11T15:17:01.682767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3691
 
10.0%
1386
 
3.8%
1212
 
3.3%
912
 
2.5%
658
 
1.8%
652
 
1.8%
618
 
1.7%
548
 
1.5%
518
 
1.4%
495
 
1.3%
Other values (295) 26171
71.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35832
97.2%
Decimal Number 420
 
1.1%
Other Punctuation 254
 
0.7%
Uppercase Letter 225
 
0.6%
Open Punctuation 65
 
0.2%
Close Punctuation 65
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3691
 
10.3%
1386
 
3.9%
1212
 
3.4%
912
 
2.5%
658
 
1.8%
652
 
1.8%
618
 
1.7%
548
 
1.5%
518
 
1.4%
495
 
1.4%
Other values (271) 25142
70.2%
Uppercase Letter
ValueCountFrequency (%)
A 48
21.3%
T 33
14.7%
C 30
13.3%
Y 24
10.7%
M 24
10.7%
D 14
 
6.2%
G 12
 
5.3%
S 12
 
5.3%
H 9
 
4.0%
L 9
 
4.0%
Other values (2) 10
 
4.4%
Decimal Number
ValueCountFrequency (%)
2 90
21.4%
7 72
17.1%
1 66
15.7%
3 54
12.9%
5 39
9.3%
6 36
 
8.6%
4 27
 
6.4%
9 24
 
5.7%
8 12
 
2.9%
Other Punctuation
ValueCountFrequency (%)
. 254
100.0%
Open Punctuation
ValueCountFrequency (%)
( 65
100.0%
Close Punctuation
ValueCountFrequency (%)
) 65
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35832
97.2%
Common 804
 
2.2%
Latin 225
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3691
 
10.3%
1386
 
3.9%
1212
 
3.4%
912
 
2.5%
658
 
1.8%
652
 
1.8%
618
 
1.7%
548
 
1.5%
518
 
1.4%
495
 
1.4%
Other values (271) 25142
70.2%
Common
ValueCountFrequency (%)
. 254
31.6%
2 90
 
11.2%
7 72
 
9.0%
1 66
 
8.2%
( 65
 
8.1%
) 65
 
8.1%
3 54
 
6.7%
5 39
 
4.9%
6 36
 
4.5%
4 27
 
3.4%
Other values (2) 36
 
4.5%
Latin
ValueCountFrequency (%)
A 48
21.3%
T 33
14.7%
C 30
13.3%
Y 24
10.7%
M 24
10.7%
D 14
 
6.2%
G 12
 
5.3%
S 12
 
5.3%
H 9
 
4.0%
L 9
 
4.0%
Other values (2) 10
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35832
97.2%
ASCII 1029
 
2.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3691
 
10.3%
1386
 
3.9%
1212
 
3.4%
912
 
2.5%
658
 
1.8%
652
 
1.8%
618
 
1.7%
548
 
1.5%
518
 
1.4%
495
 
1.4%
Other values (271) 25142
70.2%
ASCII
ValueCountFrequency (%)
. 254
24.7%
2 90
 
8.7%
7 72
 
7.0%
1 66
 
6.4%
( 65
 
6.3%
) 65
 
6.3%
3 54
 
5.2%
A 48
 
4.7%
5 39
 
3.8%
6 36
 
3.5%
Other values (14) 240
23.3%

CARALC
Real number (ℝ)

Distinct60
Distinct (%)0.7%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean16.03567
Minimum0
Maximum340
Zeros20
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size71.1 KiB
2024-05-11T15:17:02.251824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile6
Q19
median12
Q316
95-th percentile35
Maximum340
Range340
Interquartile range (IQR)7

Descriptive statistics

Standard deviation19.711218
Coefficient of variation (CV)1.2292107
Kurtosis90.071241
Mean16.03567
Median Absolute Deviation (MAD)3
Skewness8.3367337
Sum129472
Variance388.53211
MonotonicityNot monotonic
2024-05-11T15:17:02.471864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11 841
 
10.4%
8 723
 
9.0%
10 686
 
8.5%
12 676
 
8.4%
9 598
 
7.4%
15 556
 
6.9%
14 538
 
6.7%
13 463
 
5.7%
7 431
 
5.3%
16 289
 
3.6%
Other values (50) 2273
28.1%
ValueCountFrequency (%)
0 20
 
0.2%
4 34
 
0.4%
5 91
 
1.1%
6 278
 
3.4%
7 431
5.3%
8 723
9.0%
9 598
7.4%
10 686
8.5%
11 841
10.4%
12 676
8.4%
ValueCountFrequency (%)
340 2
 
< 0.1%
300 3
 
< 0.1%
250 1
 
< 0.1%
245 7
0.1%
240 7
0.1%
220 3
 
< 0.1%
210 2
 
< 0.1%
200 11
0.1%
190 4
 
< 0.1%
170 2
 
< 0.1%

FIRCAR_TM
Real number (ℝ)

HIGH CORRELATION 

Distinct62
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.020878576
Minimum-1.235
Maximum0.234
Zeros33
Zeros (%)0.4%
Negative180
Negative (%)2.2%
Memory size71.1 KiB
2024-05-11T15:17:02.691119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-1.235
5-th percentile0.04
Q10.041
median0.044
Q30.055
95-th percentile0.06
Maximum0.234
Range1.469
Interquartile range (IQR)0.014

Descriptive statistics

Standard deviation0.18973109
Coefficient of variation (CV)9.0873577
Kurtosis39.546511
Mean0.020878576
Median Absolute Deviation (MAD)0.006
Skewness-6.4284335
Sum168.5945
Variance0.035997885
MonotonicityNot monotonic
2024-05-11T15:17:02.946962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.06 1457
18.0%
0.043 1230
15.2%
0.04 1229
15.2%
0.05 498
 
6.2%
0.042 487
 
6.0%
0.053 451
 
5.6%
0.055 361
 
4.5%
0.041 348
 
4.3%
0.052 186
 
2.3%
0.054 165
 
2.0%
Other values (52) 1663
20.6%
ValueCountFrequency (%)
-1.235 27
0.3%
-1.234 48
0.6%
-1.2335 9
 
0.1%
-1.233 55
0.7%
-1.232 7
 
0.1%
-1.231 15
 
0.2%
-1.23 14
 
0.2%
-1.224 5
 
0.1%
0.0 33
0.4%
0.032 4
 
< 0.1%
ValueCountFrequency (%)
0.234 2
 
< 0.1%
0.224 2
 
< 0.1%
0.193 12
 
0.1%
0.133 3
 
< 0.1%
0.103 12
 
0.1%
0.101 12
 
0.1%
0.1 27
0.3%
0.093 12
 
0.1%
0.09 44
0.5%
0.08 12
 
0.1%

LSTCAR_TM
Real number (ℝ)

HIGH CORRELATION 

Distinct120
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.28638781
Minimum0
Maximum1.03
Zeros7
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size71.1 KiB
2024-05-11T15:17:03.228512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.1705
Q10.224
median0.23
Q30.233
95-th percentile1
Maximum1.03
Range1.03
Interquartile range (IQR)0.009

Descriptive statistics

Standard deviation0.2236323
Coefficient of variation (CV)0.78087227
Kurtosis6.1263739
Mean0.28638781
Median Absolute Deviation (MAD)0.005
Skewness2.7620593
Sum2312.5816
Variance0.050011406
MonotonicityNot monotonic
2024-05-11T15:17:03.502160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.23 1016
 
12.6%
0.233 881
 
10.9%
0.223 621
 
7.7%
0.225 514
 
6.4%
0.224 476
 
5.9%
0.231 439
 
5.4%
0.234 434
 
5.4%
0.232 406
 
5.0%
1.0 356
 
4.4%
0.235 323
 
4.0%
Other values (110) 2609
32.3%
ValueCountFrequency (%)
0.0 7
 
0.1%
0.0115 2
 
< 0.1%
0.012 1
 
< 0.1%
0.024 7
 
0.1%
0.025 2
 
< 0.1%
0.03 7
 
0.1%
0.031 24
0.3%
0.032 19
0.2%
0.0325 24
0.3%
0.033 24
0.3%
ValueCountFrequency (%)
1.03 2
 
< 0.1%
1.012 2
 
< 0.1%
1.01 23
 
0.3%
1.003 45
 
0.6%
1.0025 3
 
< 0.1%
1.002 40
 
0.5%
1.0016 3
 
< 0.1%
1.0015 16
 
0.2%
1.001 120
1.5%
1.0008 25
 
0.3%
Distinct259
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size63.2 KiB
2024-05-11T15:17:03.938487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length5
Mean length6.0272446
Min length4

Characters and Unicode

Total characters48670
Distinct characters170
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row 보광교통
2nd row 북부운수
3rd row 대원여객
4th row 한성여객
5th row 동아운수, 한성운수
ValueCountFrequency (%)
선진운수 241
 
2.6%
공항리무진 214
 
2.4%
대진여객 192
 
2.1%
한남여객 180
 
2.0%
범일운수 180
 
2.0%
한성여객 167
 
1.8%
흥안운수 167
 
1.8%
한성운수 162
 
1.8%
북부운수 152
 
1.7%
대원여객 144
 
1.6%
Other values (198) 7307
80.2%
2024-05-11T15:17:04.552694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9106
18.7%
4497
 
9.2%
4425
 
9.1%
1968
 
4.0%
1968
 
4.0%
1223
 
2.5%
1098
 
2.3%
1026
 
2.1%
995
 
2.0%
, 994
 
2.0%
Other values (160) 21370
43.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 38198
78.5%
Space Separator 9106
 
18.7%
Other Punctuation 994
 
2.0%
Uppercase Letter 288
 
0.6%
Decimal Number 84
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4497
 
11.8%
4425
 
11.6%
1968
 
5.2%
1968
 
5.2%
1223
 
3.2%
1098
 
2.9%
1026
 
2.7%
995
 
2.6%
827
 
2.2%
720
 
1.9%
Other values (154) 19451
50.9%
Uppercase Letter
ValueCountFrequency (%)
B 96
33.3%
R 96
33.3%
T 96
33.3%
Space Separator
ValueCountFrequency (%)
9106
100.0%
Other Punctuation
ValueCountFrequency (%)
, 994
100.0%
Decimal Number
ValueCountFrequency (%)
3 84
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 38198
78.5%
Common 10184
 
20.9%
Latin 288
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4497
 
11.8%
4425
 
11.6%
1968
 
5.2%
1968
 
5.2%
1223
 
3.2%
1098
 
2.9%
1026
 
2.7%
995
 
2.6%
827
 
2.2%
720
 
1.9%
Other values (154) 19451
50.9%
Common
ValueCountFrequency (%)
9106
89.4%
, 994
 
9.8%
3 84
 
0.8%
Latin
ValueCountFrequency (%)
B 96
33.3%
R 96
33.3%
T 96
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 38198
78.5%
ASCII 10472
 
21.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9106
87.0%
, 994
 
9.5%
B 96
 
0.9%
R 96
 
0.9%
T 96
 
0.9%
3 84
 
0.8%
Hangul
ValueCountFrequency (%)
4497
 
11.8%
4425
 
11.6%
1968
 
5.2%
1968
 
5.2%
1223
 
3.2%
1098
 
2.9%
1026
 
2.7%
995
 
2.6%
827
 
2.2%
720
 
1.9%
Other values (154) 19451
50.9%

Interactions

2024-05-11T15:16:54.388235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:46.509687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:47.983181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:49.184501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:50.703167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:51.874976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:53.103088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:54.557916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:46.664949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:48.143042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:49.386729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:50.863466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:52.052150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:53.265814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:54.725410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:46.841690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:48.327867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:49.545840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:51.018534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:52.232746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:53.410374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:54.873096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:47.052807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:48.512133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:49.693007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:51.162706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:52.409599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:53.570518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:55.030187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:47.270048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:48.686381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:49.871481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:51.323626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:52.577342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:53.778317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:55.192864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:47.483216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:48.852225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:50.063664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:51.503777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:52.752470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:54.017830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:55.357790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:47.779189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:49.018145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:50.553629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:51.699512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:52.927454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:16:54.214429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T15:17:04.695751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
STDR_DEROUTE_IDDSTNCROUTE_TYCARALCFIRCAR_TMLSTCAR_TM
STDR_DE1.0000.0000.0000.0000.0000.0000.000
ROUTE_ID0.0001.0000.6260.5510.3110.1940.418
DSTNC0.0000.6261.0000.6510.7790.6030.644
ROUTE_TY0.0000.5510.6511.0000.2620.6630.449
CARALC0.0000.3110.7790.2621.0000.4320.567
FIRCAR_TM0.0000.1940.6030.6630.4321.0000.592
LSTCAR_TM0.0000.4180.6440.4490.5670.5921.000
2024-05-11T15:17:04.848161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
STDR_DEROUTE_IDDSTNCROUTE_TYCARALCFIRCAR_TMLSTCAR_TM
STDR_DE1.0000.0120.016-0.0050.032-0.014-0.024
ROUTE_ID0.0121.000-0.493-0.5870.2000.5730.230
DSTNC0.016-0.4931.0000.3430.124-0.799-0.595
ROUTE_TY-0.005-0.5870.3431.000-0.092-0.366-0.202
CARALC0.0320.2000.124-0.0921.0000.037-0.211
FIRCAR_TM-0.0140.573-0.799-0.3660.0371.0000.421
LSTCAR_TM-0.0240.230-0.595-0.202-0.2110.4211.000

Missing values

2024-05-11T15:16:55.540327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T15:16:55.808859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-05-11T15:16:55.992730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

STDR_DEROUTE_IDROUTE_NMROUTE_ABRVDSTNCROUTE_TYSSTTN_NMESTTN_NMCARALCFIRCAR_TMLSTCAR_TMGROUP_NM
0202301011001001240017001712.24청암동이촌동120.05150.233보광교통
120230101100100001010116.05예장주차장예장주차장90.0630.23북부운수
2202301011040000120411041144.34용산차고지AT센터.양재꽃시장140.0420.223대원여객
32023010110010054910010057.093하계동용산구청100.040.223한성여객
42023010110010000610110137.813우이동서소문100.040.23동아운수, 한성운수
5202301011001001291014101412.64성북생태체험관종로구민회관숭인동80.050.234대진여객
6202301011001001301017101723.954월계동상왕십리140.0430.232한성여객
72023010110010000710210230.23상계주공7단지동대문110.040.231삼화상운, 흥안운수
8202301011001001311020102023.24정릉교보문고90.0430.232대진여객
92023010110010000810310330.423삼화상운서울역90.0430.23삼화상운
STDR_DEROUTE_IDROUTE_NMROUTE_ABRVDSTNCROUTE_TYSSTTN_NMESTTN_NMCARALCFIRCAR_TMLSTCAR_TMGROUP_NM
806520231201100900010종로03종로037.22낙산공원종로5가90.060.234종로운수
806620231201100900011종로05종로055.02서대문3번출구종로문화센터110.060.233나경운수
806720231201100900004종로07종로075.82명륜새마을금고명륜새마을금고180.060.22와룡운수
806820231201100900005종로08종로086.82명륜3가종로5가70.0550.234와룡운수
806920231201100900003종로09종로096.42수성동계곡남대문100.060.233인왕교통
807020231201100900007종로11종로118.62삼청동서울역100.060.23삼청교통
807120231201100900009종로12종로125.42서울대병원종로3가90.060.233은수교통
807220231201100900002종로13종로137.52평창동주민센터부암슈퍼150.0550.223약수교통
807320231201106900001중랑01중랑013.62중화1동동아약국신이문역250.060.235금창운수, 금창운수 월계점
807420231201106900002중랑02중랑027.12진로아파트한신아파트80.060.2315중랑운수