Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 8075 |
Missing cells | 15 |
Missing cells (%) | < 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 812.4 KiB |
Average record size in memory | 103.0 B |
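The derived rows in this table can be checked against the raw counts. The following is a minimal sketch, assuming the usual definitions (missing share = missing cells divided by rows × columns, average record size = total size divided by rows); the variable names are illustrative and not part of the report.

```python
# Raw figures copied from the dataset statistics table.
n_variables = 12
n_observations = 8075
missing_cells = 15
total_size_kib = 812.4

# Share of missing cells over all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size in bytes (1 KiB = 1024 bytes).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.3f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

The missing share comes out well below 0.1%, which is why the report shows it as "< 0.1%".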
Variable types
Numeric | 7 |
---|---|
Text | 5 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government |
URL | https://data.seoul.go.kr/dataList/OA-15262/S/1/datasetView.do |
ROUTE_ID is highly overall correlated with ROUTE_TY and 1 other fields | High correlation |
DSTNC is highly overall correlated with FIRCAR_TM and 1 other fields | High correlation |
ROUTE_TY is highly overall correlated with ROUTE_ID | High correlation |
FIRCAR_TM is highly overall correlated with ROUTE_ID and 1 other fields | High correlation |
LSTCAR_TM is highly overall correlated with DSTNC | High correlation |
Reproduction
Analysis started | 2024-05-11 06:16:44.024039 |
---|---|
Analysis finished | 2024-05-11 06:16:56.088175 |
Duration | 12.06 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
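A report of this kind might be regenerated roughly as follows. This is a sketch, not the configuration actually used (that lives in config.json, which is not reproduced here): the function name `build_report`, the file paths, and the report title are hypothetical, and the `dtype` override is an assumption made so that route names such as "0017" keep their leading zeros.

```python
def build_report(csv_path, html_path="report.html"):
    """Profile a CSV file and write an HTML report (paths are placeholders)."""
    # Imported lazily so the function can be defined without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: read the route name columns as text to preserve leading zeros.
    df = pd.read_csv(csv_path, dtype={"ROUTE_NM": str, "ROUTE_ABRV": str})

    # Build the profile with default settings and save it as HTML.
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
    return report
```

Depending on how the source file was exported, `pd.read_csv` may also need an explicit `encoding` argument; the report does not say which encoding was used.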
STDR_DE
Real number (ℝ)
Distinct | 12 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20230654 |
Minimum | 20230101 |
---|---|
Maximum | 20231201 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 71.1 KiB |
Quantile statistics
Minimum | 20230101 |
---|---|
5-th percentile | 20230101 |
Q1 | 20230401 |
median | 20230701 |
Q3 | 20231001 |
95-th percentile | 20231201 |
Maximum | 20231201 |
Range | 1100 |
Interquartile range (IQR) | 600 |
Descriptive statistics
Standard deviation | 345.08993 |
---|---|
Coefficient of variation (CV) | 1.7057775 × 10⁻⁵ |
Kurtosis | -1.2154818 |
Mean | 20230654 |
Median Absolute Deviation (MAD) | 300 |
Skewness | -0.0087972035 |
Sum | 1.6336253 × 10¹¹ |
Variance | 119087.06 |
Monotonicity | Increasing |
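STDR_DE is profiled as a real number, but its values look like dates encoded as YYYYMMDD integers, so statistics such as the range and standard deviation describe the integer encoding rather than elapsed time. The sketch below assumes that encoding (the report itself does not state it) and compares the numeric range with the calendar span; `to_date` is an illustrative helper.

```python
from datetime import datetime

# Minimum and maximum STDR_DE values as listed in the report.
raw_min, raw_max = 20230101, 20231201


def to_date(yyyymmdd):
    """Parse an integer such as 20230101 into a date (assumed YYYYMMDD)."""
    return datetime.strptime(str(yyyymmdd), "%Y%m%d").date()


# Range of the raw integers, as the report computes it.
numeric_range = raw_max - raw_min

# Actual number of days between the first and last reference dates.
calendar_days = (to_date(raw_max) - to_date(raw_min)).days

print(numeric_range)
print(calendar_days)
```

Converting the column to a date type before profiling would make these statistics meaningful.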
Value | Count | Frequency (%) |
20231201 | 682 | |
20231001 | 678 | |
20231101 | 678 | |
20230901 | 677 | |
20230701 | 675 | |
20230601 | 674 | |
20230501 | 673 | |
20230801 | 673 | |
20230401 | 669 | |
20230301 | 667 | |
Other values (2) | 1329 |
Value | Count | Frequency (%) |
20230101 | 663 | |
20230201 | 666 | |
20230301 | 667 | |
20230401 | 669 | |
20230501 | 673 | |
20230601 | 674 | |
20230701 | 675 | |
20230801 | 673 | |
20230901 | 677 | |
20231001 | 678 |
Value | Count | Frequency (%) |
20231201 | 682 | |
20231101 | 678 | |
20231001 | 678 | |
20230901 | 677 | |
20230801 | 673 | |
20230701 | 675 | |
20230601 | 674 | |
20230501 | 673 | |
20230401 | 669 | |
20230301 | 667 |
ROUTE_ID
Real number (ℝ)
HIGH CORRELATION
Distinct | 694 |
---|---|
Distinct (%) | 8.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.0641136 × 10⁸ |
Minimum | 1.0000002 × 10⁸ |
---|---|
Maximum | 1.249 × 10⁸ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 71.1 KiB |
Quantile statistics
Minimum | 1.0000002 × 10⁸ |
---|---|
5-th percentile | 1.0010004 × 10⁸ |
Q1 | 1.0010024 × 10⁸ |
median | 1.0010059 × 10⁸ |
Q3 | 1.1290001 × 10⁸ |
95-th percentile | 1.2190002 × 10⁸ |
Maximum | 1.249 × 10⁸ |
Range | 24899986 |
Interquartile range (IQR) | 12799776 |
Descriptive statistics
Standard deviation | 8114419.6 |
---|---|
Coefficient of variation (CV) | 0.076255204 |
Kurtosis | -0.80504771 |
Mean | 1.0641136 × 10⁸ |
Median Absolute Deviation (MAD) | 561 |
Skewness | 0.85247929 |
Sum | 8.592717 × 10¹¹ |
Variance | 6.5843805 × 10¹³ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
100100124 | 12 | 0.1% |
120900002 | 12 | 0.1% |
108900001 | 12 | 0.1% |
108900012 | 12 | 0.1% |
115900006 | 12 | 0.1% |
115900003 | 12 | 0.1% |
115900004 | 12 | 0.1% |
115900001 | 12 | 0.1% |
115900005 | 12 | 0.1% |
115900008 | 12 | 0.1% |
Other values (684) | 7955 |
Value | Count | Frequency (%) |
100000017 | 12 | |
100000018 | 12 | |
100000020 | 11 | |
100100001 | 12 | |
100100006 | 12 | |
100100007 | 12 | |
100100008 | 12 | |
100100009 | 12 | |
100100010 | 12 | |
100100011 | 12 |
Value | Count | Frequency (%) |
124900003 | 12 | |
124900002 | 12 | |
124900001 | 12 | |
124000039 | 12 | |
124000038 | 12 | |
124000036 | 12 | |
124000016 | 9 | |
124000015 | 9 | |
124000014 | 8 | |
124000013 | 12 |
ROUTE_NM
Text
Distinct | 699 |
---|---|
Distinct (%) | 8.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 63.2 KiB |
Value | Count | Frequency (%) |
0017 | 12 | 0.1% |
관악08 | 12 | 0.1% |
강북10 | 12 | 0.1% |
강서05-1 | 12 | 0.1% |
강북11 | 12 | 0.1% |
강북12 | 12 | 0.1% |
강서01 | 12 | 0.1% |
강서02 | 12 | 0.1% |
강서03 | 12 | 0.1% |
강서04 | 12 | 0.1% |
Other values (689) | 7955 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 5346 | |
0 | 4462 | |
2 | 2913 | 9.1% |
6 | 2558 | 8.0% |
3 | 2190 | 6.8% |
7 | 1990 | 6.2% |
5 | 1931 | 6.0% |
4 | 1844 | 5.8% |
8 | 701 | 2.2% |
서 | 564 | 1.8% |
Other values (68) | 7487 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 24430 | |
Other Letter | 6703 | 21.0% |
Uppercase Letter | 663 | 2.1% |
Dash Punctuation | 168 | 0.5% |
Close Punctuation | 11 | < 0.1% |
Open Punctuation | 11 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
서 | 564 | 8.4% |
동 | 486 | 7.3% |
강 | 384 | 5.7% |
성 | 373 | 5.6% |
포 | 372 | 5.5% |
북 | 355 | 5.3% |
대 | 311 | 4.6% |
로 | 292 | 4.4% |
작 | 252 | 3.8% |
문 | 252 | 3.8% |
Other values (48) | 3062 |
Decimal Number
Value | Count | Frequency (%) |
1 | 5346 | |
0 | 4462 | |
2 | 2913 | |
6 | 2558 | |
3 | 2190 | |
7 | 1990 | 8.1% |
5 | 1931 | 7.9% |
4 | 1844 | 7.5% |
8 | 701 | 2.9% |
9 | 495 | 2.0% |
Uppercase Letter
Value | Count | Frequency (%) |
N | 209 | |
A | 89 | |
B | 77 | 11.6% |
R | 72 | 10.9% |
U | 72 | 10.9% |
O | 72 | 10.9% |
T | 72 | 10.9% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 168 |
Close Punctuation
Value | Count | Frequency (%) |
) | 11 |
Open Punctuation
Value | Count | Frequency (%) |
( | 11 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 24620 | |
Hangul | 6703 | 21.0% |
Latin | 663 | 2.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
서 | 564 | 8.4% |
동 | 486 | 7.3% |
강 | 384 | 5.7% |
성 | 373 | 5.6% |
포 | 372 | 5.5% |
북 | 355 | 5.3% |
대 | 311 | 4.6% |
로 | 292 | 4.4% |
작 | 252 | 3.8% |
문 | 252 | 3.8% |
Other values (48) | 3062 |
Common
Value | Count | Frequency (%) |
1 | 5346 | |
0 | 4462 | |
2 | 2913 | |
6 | 2558 | |
3 | 2190 | |
7 | 1990 | 8.1% |
5 | 1931 | 7.8% |
4 | 1844 | 7.5% |
8 | 701 | 2.8% |
9 | 495 | 2.0% |
Other values (3) | 190 | 0.8% |
Latin
Value | Count | Frequency (%) |
N | 209 | |
A | 89 | |
B | 77 | 11.6% |
R | 72 | 10.9% |
U | 72 | 10.9% |
O | 72 | 10.9% |
T | 72 | 10.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 25283 | |
Hangul | 6703 | 21.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 5346 | |
0 | 4462 | |
2 | 2913 | |
6 | 2558 | |
3 | 2190 | |
7 | 1990 | 7.9% |
5 | 1931 | 7.6% |
4 | 1844 | 7.3% |
8 | 701 | 2.8% |
9 | 495 | 2.0% |
Other values (10) | 853 | 3.4% |
Hangul
Value | Count | Frequency (%) |
서 | 564 | 8.4% |
동 | 486 | 7.3% |
강 | 384 | 5.7% |
성 | 373 | 5.6% |
포 | 372 | 5.5% |
북 | 355 | 5.3% |
대 | 311 | 4.6% |
로 | 292 | 4.4% |
작 | 252 | 3.8% |
문 | 252 | 3.8% |
Other values (48) | 3062 |
ROUTE_ABRV
Text
Distinct | 699 |
---|---|
Distinct (%) | 8.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 63.2 KiB |
Value | Count | Frequency (%) |
0017 | 12 | 0.1% |
관악08 | 12 | 0.1% |
강북10 | 12 | 0.1% |
강서5-1 | 12 | 0.1% |
강북11 | 12 | 0.1% |
강북12 | 12 | 0.1% |
강서01 | 12 | 0.1% |
강서02 | 12 | 0.1% |
강서03 | 12 | 0.1% |
강서04 | 12 | 0.1% |
Other values (689) | 7955 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 5346 | |
0 | 4318 | |
2 | 2913 | |
6 | 2558 | 8.2% |
3 | 2190 | 7.0% |
7 | 1990 | 6.4% |
5 | 1931 | 6.2% |
4 | 1844 | 5.9% |
8 | 701 | 2.3% |
서 | 552 | 1.8% |
Other values (49) | 6810 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 24286 | |
Other Letter | 6036 | 19.4% |
Uppercase Letter | 663 | 2.1% |
Dash Punctuation | 168 | 0.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
서 | 552 | 9.1% |
동 | 486 | 8.1% |
강 | 384 | 6.4% |
성 | 373 | 6.2% |
북 | 355 | 5.9% |
로 | 292 | 4.8% |
대 | 276 | 4.6% |
작 | 252 | 4.2% |
초 | 240 | 4.0% |
포 | 216 | 3.6% |
Other values (31) | 2610 |
Decimal Number
Value | Count | Frequency (%) |
1 | 5346 | |
0 | 4318 | |
2 | 2913 | |
6 | 2558 | |
3 | 2190 | |
7 | 1990 | 8.2% |
5 | 1931 | 8.0% |
4 | 1844 | 7.6% |
8 | 701 | 2.9% |
9 | 495 | 2.0% |
Uppercase Letter
Value | Count | Frequency (%) |
N | 209 | |
A | 89 | |
B | 77 | 11.6% |
U | 72 | 10.9% |
O | 72 | 10.9% |
R | 72 | 10.9% |
T | 72 | 10.9% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 168 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 24454 | |
Hangul | 6036 | 19.4% |
Latin | 663 | 2.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
서 | 552 | 9.1% |
동 | 486 | 8.1% |
강 | 384 | 6.4% |
성 | 373 | 6.2% |
북 | 355 | 5.9% |
로 | 292 | 4.8% |
대 | 276 | 4.6% |
작 | 252 | 4.2% |
초 | 240 | 4.0% |
포 | 216 | 3.6% |
Other values (31) | 2610 |
Common
Value | Count | Frequency (%) |
1 | 5346 | |
0 | 4318 | |
2 | 2913 | |
6 | 2558 | |
3 | 2190 | |
7 | 1990 | 8.1% |
5 | 1931 | 7.9% |
4 | 1844 | 7.5% |
8 | 701 | 2.9% |
9 | 495 | 2.0% |
Latin
Value | Count | Frequency (%) |
N | 209 | |
A | 89 | |
B | 77 | 11.6% |
U | 72 | 10.9% |
O | 72 | 10.9% |
R | 72 | 10.9% |
T | 72 | 10.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 25117 | |
Hangul | 6036 | 19.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 5346 | |
0 | 4318 | |
2 | 2913 | |
6 | 2558 | |
3 | 2190 | |
7 | 1990 | 7.9% |
5 | 1931 | 7.7% |
4 | 1844 | 7.3% |
8 | 701 | 2.8% |
9 | 495 | 2.0% |
Other values (8) | 831 | 3.3% |
Hangul
Value | Count | Frequency (%) |
서 | 552 | 9.1% |
동 | 486 | 8.1% |
강 | 384 | 6.4% |
성 | 373 | 6.2% |
북 | 355 | 5.9% |
로 | 292 | 4.8% |
대 | 276 | 4.6% |
작 | 252 | 4.2% |
초 | 240 | 4.0% |
포 | 216 | 3.6% |
Other values (31) | 2610 |
DSTNC
Real number (ℝ)
HIGH CORRELATION
Distinct | 488 |
---|---|
Distinct (%) | 6.1% |
Missing | 14 |
Missing (%) | 0.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 32.454641 |
Minimum | 1.2 |
---|---|
Maximum | 220 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 71.1 KiB |
Quantile statistics
Minimum | 1.2 |
---|---|
5-th percentile | 4 |
Q1 | 8.72 |
median | 23.4 |
Q3 | 45 |
95-th percentile | 85.9 |
Maximum | 220 |
Range | 218.8 |
Interquartile range (IQR) | 36.28 |
Descriptive statistics
Standard deviation | 34.832349 |
---|---|
Coefficient of variation (CV) | 1.0732625 |
Kurtosis | 7.9392709 |
Mean | 32.454641 |
Median Absolute Deviation (MAD) | 15.8 |
Skewness | 2.5627973 |
Sum | 261616.86 |
Variance | 1213.2925 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7.0 | 116 | 1.4% |
12.0 | 82 | 1.0% |
13.0 | 79 | 1.0% |
7.2 | 72 | 0.9% |
5.5 | 69 | 0.9% |
39.0 | 66 | 0.8% |
7.5 | 63 | 0.8% |
13.3 | 60 | 0.7% |
4.8 | 57 | 0.7% |
7.8 | 57 | 0.7% |
Other values (478) | 7340 |
Value | Count | Frequency (%) |
1.2 | 12 | 0.1% |
1.6 | 12 | 0.1% |
1.8 | 6 | 0.1% |
1.9 | 3 | < 0.1% |
2.0 | 9 | 0.1% |
2.1 | 24 | |
2.4 | 3 | < 0.1% |
2.5 | 10 | 0.1% |
2.6 | 44 | |
2.7 | 9 | 0.1% |
Value | Count | Frequency (%) |
220.0 | 7 | 0.1% |
204.4 | 11 | |
204.0 | 1 | < 0.1% |
201.4 | 12 | |
196.0 | 12 | |
193.0 | 10 | |
190.0 | 6 | 0.1% |
188.0 | 13 | |
187.6 | 12 | |
184.0 | 24 |
ROUTE_TY
Real number (ℝ)
HIGH CORRELATION
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.0022291 |
Minimum | 1 |
---|---|
Maximum | 13 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 71.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 3 |
Q3 | 4 |
95-th percentile | 4 |
Maximum | 13 |
Range | 12 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.2833589 |
---|---|
Coefficient of variation (CV) | 0.42746869 |
Kurtosis | 11.401701 |
Mean | 3.0022291 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 2.1511805 |
Sum | 24243 |
Variance | 1.6470101 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 2970 | |
4 | 2745 | |
3 | 1716 | |
1 | 406 | 5.0% |
6 | 123 | 1.5% |
10 | 72 | 0.9% |
5 | 31 | 0.4% |
13 | 12 | 0.1% |
Value | Count | Frequency (%) |
1 | 406 | 5.0% |
2 | 2970 | |
3 | 1716 | |
4 | 2745 | |
5 | 31 | 0.4% |
6 | 123 | 1.5% |
10 | 72 | 0.9% |
13 | 12 | 0.1% |
Value | Count | Frequency (%) |
13 | 12 | 0.1% |
10 | 72 | 0.9% |
6 | 123 | 1.5% |
5 | 31 | 0.4% |
4 | 2745 | |
3 | 1716 | |
2 | 2970 | |
1 | 406 | 5.0% |
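Several Frequency (%) cells in the tables above are blank, most likely because the original page drew the larger values inside bars. They can be recomputed from the counts. The sketch below does this for ROUTE_TY, assuming each percentage is the count divided by the total number of rows, rounded to one decimal place; the cells that the report does show serve as a check.

```python
# Counts per ROUTE_TY value, copied from the frequency table.
counts = {2: 2970, 4: 2745, 3: 1716, 1: 406, 6: 123, 10: 72, 5: 31, 13: 12}

# The counts should add up to the number of observations.
total = sum(counts.values())

# Percentage share of each value, rounded to one decimal like the report.
freq_pct = {value: round(100 * n / total, 1) for value, n in counts.items()}

for value, pct in freq_pct.items():
    print(f"{value:>2} | {counts[value]:>4} | {pct}%")
```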
SSTTN_NM
Text
Distinct | 461 |
---|---|
Distinct (%) | 5.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 63.2 KiB |
Value | Count | Frequency (%) |
양천공영차고지 | 216 | 2.7% |
복정역환승센터 | 168 | 2.1% |
은평차고지 | 156 | 1.9% |
중랑공영차고지 | 138 | 1.7% |
인천공항 | 137 | 1.7% |
장지공영차고지 | 120 | 1.5% |
우이동 | 120 | 1.5% |
진관공영차고지 | 108 | 1.3% |
강동공영차고지 | 96 | 1.2% |
정릉 | 84 | 1.0% |
Other values (451) | 6732 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 2734 | 6.5% |
지 | 2219 | 5.3% |
고 | 1846 | 4.4% |
차 | 1770 | 4.2% |
공 | 1358 | 3.2% |
역 | 1290 | 3.1% |
영 | 1037 | 2.5% |
아 | 734 | 1.7% |
파 | 723 | 1.7% |
산 | 706 | 1.7% |
Other values (306) | 27690 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 40633 | |
Decimal Number | 815 | 1.9% |
Uppercase Letter | 289 | 0.7% |
Other Punctuation | 262 | 0.6% |
Open Punctuation | 48 | 0.1% |
Close Punctuation | 48 | 0.1% |
Lowercase Letter | 12 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 2734 | 6.7% |
지 | 2219 | 5.5% |
고 | 1846 | 4.5% |
차 | 1770 | 4.4% |
공 | 1358 | 3.3% |
역 | 1290 | 3.2% |
영 | 1037 | 2.6% |
아 | 734 | 1.8% |
파 | 723 | 1.8% |
산 | 706 | 1.7% |
Other values (282) | 26216 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 87 | |
L | 41 | |
H | 32 | 11.1% |
A | 30 | 10.4% |
P | 30 | 10.4% |
K | 21 | 7.3% |
C | 18 | 6.2% |
E | 12 | 4.2% |
S | 9 | 3.1% |
G | 9 | 3.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 225 | |
2 | 159 | |
7 | 128 | |
4 | 72 | 8.8% |
3 | 69 | 8.5% |
5 | 63 | 7.7% |
6 | 51 | 6.3% |
8 | 24 | 2.9% |
0 | 24 | 2.9% |
Other Punctuation
Value | Count | Frequency (%) |
. | 259 | |
, | 3 | 1.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 48 |
Close Punctuation
Value | Count | Frequency (%) |
) | 48 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 12 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 40633 | |
Common | 1173 | 2.8% |
Latin | 301 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 2734 | 6.7% |
지 | 2219 | 5.5% |
고 | 1846 | 4.5% |
차 | 1770 | 4.4% |
공 | 1358 | 3.3% |
역 | 1290 | 3.2% |
영 | 1037 | 2.6% |
아 | 734 | 1.8% |
파 | 723 | 1.8% |
산 | 706 | 1.7% |
Other values (282) | 26216 |
Common
Value | Count | Frequency (%) |
. | 259 | |
1 | 225 | |
2 | 159 | |
7 | 128 | |
4 | 72 | 6.1% |
3 | 69 | 5.9% |
5 | 63 | 5.4% |
6 | 51 | 4.3% |
( | 48 | 4.1% |
) | 48 | 4.1% |
Other values (3) | 51 | 4.3% |
Latin
Value | Count | Frequency (%) |
T | 87 | |
L | 41 | |
H | 32 | 10.6% |
A | 30 | 10.0% |
P | 30 | 10.0% |
K | 21 | 7.0% |
C | 18 | 6.0% |
e | 12 | 4.0% |
E | 12 | 4.0% |
S | 9 | 3.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 40633 | |
ASCII | 1474 | 3.5% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 2734 | 6.7% |
지 | 2219 | 5.5% |
고 | 1846 | 4.5% |
차 | 1770 | 4.4% |
공 | 1358 | 3.3% |
역 | 1290 | 3.2% |
영 | 1037 | 2.6% |
아 | 734 | 1.8% |
파 | 723 | 1.8% |
산 | 706 | 1.7% |
Other values (282) | 26216 |
ASCII
Value | Count | Frequency (%) |
. | 259 | |
1 | 225 | |
2 | 159 | |
7 | 128 | |
T | 87 | 5.9% |
4 | 72 | 4.9% |
3 | 69 | 4.7% |
5 | 63 | 4.3% |
6 | 51 | 3.5% |
( | 48 | 3.3% |
Other values (14) | 313 |
ESTTN_NM
Text
Distinct | 455 |
---|---|
Distinct (%) | 5.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 63.2 KiB |
Value | Count | Frequency (%) |
인천공항 | 170 | 2.1% |
서울역 | 155 | 1.9% |
강남역 | 145 | 1.8% |
여의도 | 143 | 1.8% |
석계역 | 106 | 1.3% |
홍대입구역 | 96 | 1.2% |
양재역 | 96 | 1.2% |
대방역 | 87 | 1.1% |
구로디지털단지역 | 84 | 1.0% |
수유역 | 84 | 1.0% |
Other values (445) | 6909 |
Most occurring characters
Value | Count | Frequency (%) |
역 | 3691 | 10.0% |
동 | 1386 | 3.8% |
대 | 1212 | 3.3% |
구 | 912 | 2.5% |
서 | 658 | 1.8% |
지 | 652 | 1.8% |
신 | 618 | 1.7% |
문 | 548 | 1.5% |
리 | 518 | 1.4% |
아 | 495 | 1.3% |
Other values (295) | 26171 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 35832 | |
Decimal Number | 420 | 1.1% |
Other Punctuation | 254 | 0.7% |
Uppercase Letter | 225 | 0.6% |
Open Punctuation | 65 | 0.2% |
Close Punctuation | 65 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
역 | 3691 | 10.3% |
동 | 1386 | 3.9% |
대 | 1212 | 3.4% |
구 | 912 | 2.5% |
서 | 658 | 1.8% |
지 | 652 | 1.8% |
신 | 618 | 1.7% |
문 | 548 | 1.5% |
리 | 518 | 1.4% |
아 | 495 | 1.4% |
Other values (271) | 25142 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 48 | |
T | 33 | |
C | 30 | |
Y | 24 | |
M | 24 | |
D | 14 | 6.2% |
G | 12 | 5.3% |
S | 12 | 5.3% |
H | 9 | 4.0% |
L | 9 | 4.0% |
Other values (2) | 10 | 4.4% |
Decimal Number
Value | Count | Frequency (%) |
2 | 90 | |
7 | 72 | |
1 | 66 | |
3 | 54 | |
5 | 39 | |
6 | 36 | 8.6% |
4 | 27 | 6.4% |
9 | 24 | 5.7% |
8 | 12 | 2.9% |
Other Punctuation
Value | Count | Frequency (%) |
. | 254 |
Open Punctuation
Value | Count | Frequency (%) |
( | 65 |
Close Punctuation
Value | Count | Frequency (%) |
) | 65 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 35832 | |
Common | 804 | 2.2% |
Latin | 225 | 0.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
역 | 3691 | 10.3% |
동 | 1386 | 3.9% |
대 | 1212 | 3.4% |
구 | 912 | 2.5% |
서 | 658 | 1.8% |
지 | 652 | 1.8% |
신 | 618 | 1.7% |
문 | 548 | 1.5% |
리 | 518 | 1.4% |
아 | 495 | 1.4% |
Other values (271) | 25142 |
Common
Value | Count | Frequency (%) |
. | 254 | |
2 | 90 | 11.2% |
7 | 72 | 9.0% |
1 | 66 | 8.2% |
( | 65 | 8.1% |
) | 65 | 8.1% |
3 | 54 | 6.7% |
5 | 39 | 4.9% |
6 | 36 | 4.5% |
4 | 27 | 3.4% |
Other values (2) | 36 | 4.5% |
Latin
Value | Count | Frequency (%) |
A | 48 | |
T | 33 | |
C | 30 | |
Y | 24 | |
M | 24 | |
D | 14 | 6.2% |
G | 12 | 5.3% |
S | 12 | 5.3% |
H | 9 | 4.0% |
L | 9 | 4.0% |
Other values (2) | 10 | 4.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 35832 | |
ASCII | 1029 | 2.8% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
역 | 3691 | 10.3% |
동 | 1386 | 3.9% |
대 | 1212 | 3.4% |
구 | 912 | 2.5% |
서 | 658 | 1.8% |
지 | 652 | 1.8% |
신 | 618 | 1.7% |
문 | 548 | 1.5% |
리 | 518 | 1.4% |
아 | 495 | 1.4% |
Other values (271) | 25142 |
ASCII
Value | Count | Frequency (%) |
. | 254 | |
2 | 90 | 8.7% |
7 | 72 | 7.0% |
1 | 66 | 6.4% |
( | 65 | 6.3% |
) | 65 | 6.3% |
3 | 54 | 5.2% |
A | 48 | 4.7% |
5 | 39 | 3.8% |
6 | 36 | 3.5% |
Other values (14) | 240 |
CARALC
Real number (ℝ)
Distinct | 60 |
---|---|
Distinct (%) | 0.7% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.03567 |
Minimum | 0 |
---|---|
Maximum | 340 |
Zeros | 20 |
Zeros (%) | 0.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 71.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 6 |
Q1 | 9 |
median | 12 |
Q3 | 16 |
95-th percentile | 35 |
Maximum | 340 |
Range | 340 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 19.711218 |
---|---|
Coefficient of variation (CV) | 1.2292107 |
Kurtosis | 90.071241 |
Mean | 16.03567 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 8.3367337 |
Sum | 129472 |
Variance | 388.53211 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11 | 841 | 10.4% |
8 | 723 | 9.0% |
10 | 686 | 8.5% |
12 | 676 | 8.4% |
9 | 598 | 7.4% |
15 | 556 | 6.9% |
14 | 538 | 6.7% |
13 | 463 | 5.7% |
7 | 431 | 5.3% |
16 | 289 | 3.6% |
Other values (50) | 2273 |
Value | Count | Frequency (%) |
0 | 20 | 0.2% |
4 | 34 | 0.4% |
5 | 91 | 1.1% |
6 | 278 | 3.4% |
7 | 431 | |
8 | 723 | |
9 | 598 | |
10 | 686 | |
11 | 841 | |
12 | 676 |
Value | Count | Frequency (%) |
340 | 2 | < 0.1% |
300 | 3 | < 0.1% |
250 | 1 | < 0.1% |
245 | 7 | |
240 | 7 | |
220 | 3 | < 0.1% |
210 | 2 | < 0.1% |
200 | 11 | |
190 | 4 | < 0.1% |
170 | 2 | < 0.1% |
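The skewness and kurtosis of CARALC suggest a small number of very large values. One common way to flag them is Tukey's 1.5 × IQR rule; this rule is not something the report applies, it is swapped in here as an illustration using the quartiles and extreme values listed above.

```python
# Quartiles of CARALC taken from the quantile statistics table.
q1, q3 = 9, 16
iqr = q3 - q1

# Tukey's rule: values more than 1.5 * IQR beyond the quartiles are flagged.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The ten largest CARALC values and their counts, from the extreme-values table.
largest = {340: 2, 300: 3, 250: 1, 245: 7, 240: 7,
           220: 3, 210: 2, 200: 11, 190: 4, 170: 2}

# Number of rows among those ten values that exceed the upper fence.
flagged_rows = sum(n for value, n in largest.items() if value > upper_fence)

print(iqr, lower_fence, upper_fence, flagged_rows)
```

The reported 95-th percentile (35) already lies above the upper fence, so under this rule more than 5% of rows would be flagged; whether those rows are errors or legitimately long headways cannot be decided from the profile alone.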
FIRCAR_TM
Real number (ℝ)
HIGH CORRELATION
Distinct | 62 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.020878576 |
Minimum | -1.235 |
---|---|
Maximum | 0.234 |
Zeros | 33 |
Zeros (%) | 0.4% |
Negative | 180 |
Negative (%) | 2.2% |
Memory size | 71.1 KiB |
Quantile statistics
Minimum | -1.235 |
---|---|
5-th percentile | 0.04 |
Q1 | 0.041 |
median | 0.044 |
Q3 | 0.055 |
95-th percentile | 0.06 |
Maximum | 0.234 |
Range | 1.469 |
Interquartile range (IQR) | 0.014 |
Descriptive statistics
Standard deviation | 0.18973109 |
---|---|
Coefficient of variation (CV) | 9.0873577 |
Kurtosis | 39.546511 |
Mean | 0.020878576 |
Median Absolute Deviation (MAD) | 0.006 |
Skewness | -6.4284335 |
Sum | 168.5945 |
Variance | 0.035997885 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.06 | 1457 | |
0.043 | 1230 | |
0.04 | 1229 | |
0.05 | 498 | 6.2% |
0.042 | 487 | 6.0% |
0.053 | 451 | 5.6% |
0.055 | 361 | 4.5% |
0.041 | 348 | 4.3% |
0.052 | 186 | 2.3% |
0.054 | 165 | 2.0% |
Other values (52) | 1663 |
Value | Count | Frequency (%) |
-1.235 | 27 | |
-1.234 | 48 | |
-1.2335 | 9 | 0.1% |
-1.233 | 55 | |
-1.232 | 7 | 0.1% |
-1.231 | 15 | 0.2% |
-1.23 | 14 | 0.2% |
-1.224 | 5 | 0.1% |
0.0 | 33 | |
0.032 | 4 | < 0.1% |
Value | Count | Frequency (%) |
0.234 | 2 | < 0.1% |
0.224 | 2 | < 0.1% |
0.193 | 12 | 0.1% |
0.133 | 3 | < 0.1% |
0.103 | 12 | 0.1% |
0.101 | 12 | 0.1% |
0.1 | 27 | |
0.093 | 12 | 0.1% |
0.09 | 44 | |
0.08 | 12 | 0.1% |
LSTCAR_TM
Real number (ℝ)
HIGH CORRELATION
Distinct | 120 |
---|---|
Distinct (%) | 1.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.28638781 |
Minimum | 0 |
---|---|
Maximum | 1.03 |
Zeros | 7 |
Zeros (%) | 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 71.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.1705 |
Q1 | 0.224 |
median | 0.23 |
Q3 | 0.233 |
95-th percentile | 1 |
Maximum | 1.03 |
Range | 1.03 |
Interquartile range (IQR) | 0.009 |
Descriptive statistics
Standard deviation | 0.2236323 |
---|---|
Coefficient of variation (CV) | 0.78087227 |
Kurtosis | 6.1263739 |
Mean | 0.28638781 |
Median Absolute Deviation (MAD) | 0.005 |
Skewness | 2.7620593 |
Sum | 2312.5816 |
Variance | 0.050011406 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.23 | 1016 | 12.6% |
0.233 | 881 | 10.9% |
0.223 | 621 | 7.7% |
0.225 | 514 | 6.4% |
0.224 | 476 | 5.9% |
0.231 | 439 | 5.4% |
0.234 | 434 | 5.4% |
0.232 | 406 | 5.0% |
1.0 | 356 | 4.4% |
0.235 | 323 | 4.0% |
Other values (110) | 2609 |
Value | Count | Frequency (%) |
0.0 | 7 | 0.1% |
0.0115 | 2 | < 0.1% |
0.012 | 1 | < 0.1% |
0.024 | 7 | 0.1% |
0.025 | 2 | < 0.1% |
0.03 | 7 | 0.1% |
0.031 | 24 | |
0.032 | 19 | |
0.0325 | 24 | |
0.033 | 24 |
Value | Count | Frequency (%) |
1.03 | 2 | < 0.1% |
1.012 | 2 | < 0.1% |
1.01 | 23 | 0.3% |
1.003 | 45 | 0.6% |
1.0025 | 3 | < 0.1% |
1.002 | 40 | 0.5% |
1.0016 | 3 | < 0.1% |
1.0015 | 16 | 0.2% |
1.001 | 120 | |
1.0008 | 25 | 0.3% |
GROUP_NM
Text
Distinct | 259 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 63.2 KiB |
Value | Count | Frequency (%) |
선진운수 | 241 | 2.6% |
공항리무진 | 214 | 2.4% |
대진여객 | 192 | 2.1% |
한남여객 | 180 | 2.0% |
범일운수 | 180 | 2.0% |
한성여객 | 167 | 1.8% |
흥안운수 | 167 | 1.8% |
한성운수 | 162 | 1.8% |
북부운수 | 152 | 1.7% |
대원여객 | 144 | 1.6% |
Other values (198) | 7307 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 9106 | |
운 | 4497 | 9.2% |
수 | 4425 | 9.1% |
교 | 1968 | 4.0% |
통 | 1968 | 4.0% |
진 | 1223 | 2.5% |
성 | 1098 | 2.3% |
객 | 1026 | 2.1% |
여 | 995 | 2.0% |
, | 994 | 2.0% |
Other values (160) | 21370 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 38198 | |
Space Separator | 9106 | 18.7% |
Other Punctuation | 994 | 2.0% |
Uppercase Letter | 288 | 0.6% |
Decimal Number | 84 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
운 | 4497 | 11.8% |
수 | 4425 | 11.6% |
교 | 1968 | 5.2% |
통 | 1968 | 5.2% |
진 | 1223 | 3.2% |
성 | 1098 | 2.9% |
객 | 1026 | 2.7% |
여 | 995 | 2.6% |
한 | 827 | 2.2% |
신 | 720 | 1.9% |
Other values (154) | 19451 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 96 | |
R | 96 | |
T | 96 |
Space Separator
Value | Count | Frequency (%) |
(space) | 9106 |
Other Punctuation
Value | Count | Frequency (%) |
, | 994 |
Decimal Number
Value | Count | Frequency (%) |
3 | 84 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 38198 | |
Common | 10184 | 20.9% |
Latin | 288 | 0.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
운 | 4497 | 11.8% |
수 | 4425 | 11.6% |
교 | 1968 | 5.2% |
통 | 1968 | 5.2% |
진 | 1223 | 3.2% |
성 | 1098 | 2.9% |
객 | 1026 | 2.7% |
여 | 995 | 2.6% |
한 | 827 | 2.2% |
신 | 720 | 1.9% |
Other values (154) | 19451 |
Common
Value | Count | Frequency (%) |
(space) | 9106 | |
, | 994 | 9.8% |
3 | 84 | 0.8% |
Latin
Value | Count | Frequency (%) |
B | 96 | |
R | 96 | |
T | 96 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 38198 | |
ASCII | 10472 | 21.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 9106 | |
, | 994 | 9.5% |
B | 96 | 0.9% |
R | 96 | 0.9% |
T | 96 | 0.9% |
3 | 84 | 0.8% |
Hangul
Value | Count | Frequency (%) |
운 | 4497 | 11.8% |
수 | 4425 | 11.6% |
교 | 1968 | 5.2% |
통 | 1968 | 5.2% |
진 | 1223 | 3.2% |
성 | 1098 | 2.9% |
객 | 1026 | 2.7% |
여 | 995 | 2.6% |
한 | 827 | 2.2% |
신 | 720 | 1.9% |
Other values (154) | 19451 |
Variable | STDR_DE | ROUTE_ID | DSTNC | ROUTE_TY | CARALC | FIRCAR_TM | LSTCAR_TM |
---|---|---|---|---|---|---|---|
STDR_DE | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
ROUTE_ID | 0.000 | 1.000 | 0.626 | 0.551 | 0.311 | 0.194 | 0.418 |
DSTNC | 0.000 | 0.626 | 1.000 | 0.651 | 0.779 | 0.603 | 0.644 |
ROUTE_TY | 0.000 | 0.551 | 0.651 | 1.000 | 0.262 | 0.663 | 0.449 |
CARALC | 0.000 | 0.311 | 0.779 | 0.262 | 1.000 | 0.432 | 0.567 |
FIRCAR_TM | 0.000 | 0.194 | 0.603 | 0.663 | 0.432 | 1.000 | 0.592 |
LSTCAR_TM | 0.000 | 0.418 | 0.644 | 0.449 | 0.567 | 0.592 | 1.000 |
Variable | STDR_DE | ROUTE_ID | DSTNC | ROUTE_TY | CARALC | FIRCAR_TM | LSTCAR_TM |
---|---|---|---|---|---|---|---|
STDR_DE | 1.000 | 0.012 | 0.016 | -0.005 | 0.032 | -0.014 | -0.024 |
ROUTE_ID | 0.012 | 1.000 | -0.493 | -0.587 | 0.200 | 0.573 | 0.230 |
DSTNC | 0.016 | -0.493 | 1.000 | 0.343 | 0.124 | -0.799 | -0.595 |
ROUTE_TY | -0.005 | -0.587 | 0.343 | 1.000 | -0.092 | -0.366 | -0.202 |
CARALC | 0.032 | 0.200 | 0.124 | -0.092 | 1.000 | 0.037 | -0.211 |
FIRCAR_TM | -0.014 | 0.573 | -0.799 | -0.366 | 0.037 | 1.000 | 0.421 |
LSTCAR_TM | -0.024 | 0.230 | -0.595 | -0.202 | -0.211 | 0.421 | 1.000 |
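The "High correlation" alerts near the top of the report can be related to the signed matrix directly above. The matrix is not labelled in this export, and the cut-off used by the profiler is not stated, so the sketch below makes an assumption: a pair is flagged when the absolute coefficient is at least 0.5. That value was chosen because it reproduces the five alerts listed.

```python
cols = ["STDR_DE", "ROUTE_ID", "DSTNC", "ROUTE_TY", "CARALC", "FIRCAR_TM", "LSTCAR_TM"]

# Signed correlation matrix copied from the table above (row order = cols).
matrix = [
    [1.000, 0.012, 0.016, -0.005, 0.032, -0.014, -0.024],
    [0.012, 1.000, -0.493, -0.587, 0.200, 0.573, 0.230],
    [0.016, -0.493, 1.000, 0.343, 0.124, -0.799, -0.595],
    [-0.005, -0.587, 0.343, 1.000, -0.092, -0.366, -0.202],
    [0.032, 0.200, 0.124, -0.092, 1.000, 0.037, -0.211],
    [-0.014, 0.573, -0.799, -0.366, 0.037, 1.000, 0.421],
    [-0.024, 0.230, -0.595, -0.202, -0.211, 0.421, 1.000],
]

THRESHOLD = 0.5  # assumption: not stated in the report

# For each variable, list the other variables whose |r| meets the threshold.
high = {}
for i, name in enumerate(cols):
    partners = [cols[j] for j, r in enumerate(matrix[i])
                if j != i and abs(r) >= THRESHOLD]
    if partners:
        high[name] = partners

for name, partners in high.items():
    print(f"{name} is highly correlated with {', '.join(partners)}")
```

Under this assumption STDR_DE and CARALC raise no alert, which matches their absence from the alert list.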
Row | STDR_DE | ROUTE_ID | ROUTE_NM | ROUTE_ABRV | DSTNC | ROUTE_TY | SSTTN_NM | ESTTN_NM | CARALC | FIRCAR_TM | LSTCAR_TM | GROUP_NM |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 20230101 | 100100124 | 0017 | 0017 | 12.2 | 4 | 청암동 | 이촌동 | 12 | 0.0515 | 0.233 | 보광교통 |
1 | 20230101 | 100100001 | 01 | 01 | 16.0 | 5 | 예장주차장 | 예장주차장 | 9 | 0.063 | 0.23 | 북부운수 |
2 | 20230101 | 104000012 | 0411 | 0411 | 44.3 | 4 | 용산차고지 | AT센터.양재꽃시장 | 14 | 0.042 | 0.223 | 대원여객 |
3 | 20230101 | 100100549 | 100 | 100 | 57.09 | 3 | 하계동 | 용산구청 | 10 | 0.04 | 0.223 | 한성여객 |
4 | 20230101 | 100100006 | 101 | 101 | 37.81 | 3 | 우이동 | 서소문 | 10 | 0.04 | 0.23 | 동아운수, 한성운수 |
5 | 20230101 | 100100129 | 1014 | 1014 | 12.6 | 4 | 성북생태체험관 | 종로구민회관숭인동 | 8 | 0.05 | 0.234 | 대진여객 |
6 | 20230101 | 100100130 | 1017 | 1017 | 23.95 | 4 | 월계동 | 상왕십리 | 14 | 0.043 | 0.232 | 한성여객 |
7 | 20230101 | 100100007 | 102 | 102 | 30.2 | 3 | 상계주공7단지 | 동대문 | 11 | 0.04 | 0.231 | 삼화상운, 흥안운수 |
8 | 20230101 | 100100131 | 1020 | 1020 | 23.2 | 4 | 정릉 | 교보문고 | 9 | 0.043 | 0.232 | 대진여객 |
9 | 20230101 | 100100008 | 103 | 103 | 30.42 | 3 | 삼화상운 | 서울역 | 9 | 0.043 | 0.23 | 삼화상운 |
Row | STDR_DE | ROUTE_ID | ROUTE_NM | ROUTE_ABRV | DSTNC | ROUTE_TY | SSTTN_NM | ESTTN_NM | CARALC | FIRCAR_TM | LSTCAR_TM | GROUP_NM |
---|---|---|---|---|---|---|---|---|---|---|---|---|
8065 | 20231201 | 100900010 | 종로03 | 종로03 | 7.2 | 2 | 낙산공원 | 종로5가 | 9 | 0.06 | 0.234 | 종로운수 |
8066 | 20231201 | 100900011 | 종로05 | 종로05 | 5.0 | 2 | 서대문3번출구 | 종로문화센터 | 11 | 0.06 | 0.233 | 나경운수 |
8067 | 20231201 | 100900004 | 종로07 | 종로07 | 5.8 | 2 | 명륜새마을금고 | 명륜새마을금고 | 18 | 0.06 | 0.22 | 와룡운수 |
8068 | 20231201 | 100900005 | 종로08 | 종로08 | 6.8 | 2 | 명륜3가 | 종로5가 | 7 | 0.055 | 0.234 | 와룡운수 |
8069 | 20231201 | 100900003 | 종로09 | 종로09 | 6.4 | 2 | 수성동계곡 | 남대문 | 10 | 0.06 | 0.233 | 인왕교통 |
8070 | 20231201 | 100900007 | 종로11 | 종로11 | 8.6 | 2 | 삼청동 | 서울역 | 10 | 0.06 | 0.23 | 삼청교통 |
8071 | 20231201 | 100900009 | 종로12 | 종로12 | 5.4 | 2 | 서울대병원 | 종로3가 | 9 | 0.06 | 0.233 | 은수교통 |
8072 | 20231201 | 100900002 | 종로13 | 종로13 | 7.5 | 2 | 평창동주민센터 | 부암슈퍼 | 15 | 0.055 | 0.223 | 약수교통 |
8073 | 20231201 | 106900001 | 중랑01 | 중랑01 | 3.6 | 2 | 중화1동동아약국 | 신이문역 | 25 | 0.06 | 0.235 | 금창운수, 금창운수 월계점 |
8074 | 20231201 | 106900002 | 중랑02 | 중랑02 | 7.1 | 2 | 진로아파트 | 한신아파트 | 8 | 0.06 | 0.2315 | 중랑운수 |