Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 7752 |
Missing cells | 6 |
Missing cells (%) | < 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 779.9 KiB |
Average record size in memory | 103.0 B |
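The size and missing-value figures above are linked by simple arithmetic. As a quick consistency check, the sketch below recomputes the average record size and the share of missing cells from the reported totals. The variable names are illustrative and are not part of the report.

```python
# Values copied from the dataset statistics table above.
n_variables = 12
n_observations = 7752
missing_cells = 6
total_size_kib = 779.9

# Average record size: total bytes divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

# Share of missing cells over all cells (rows x columns), in percent.
missing_pct = 100 * missing_cells / (n_observations * n_variables)

print(f"average record size: {avg_record_bytes:.1f} B")
print(f"missing cells below 0.1%: {missing_pct < 0.1}")
```

Both results agree with the table: about 103.0 B per record and a missing-cell share well under 0.1%.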
Variable types
Numeric | 7 |
---|---|
Text | 5 |
Dataset
Description | File download |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15262/S/1/datasetView.do |
ROUTE_ID is highly overall correlated with DSTNC and 2 other fields | High correlation |
DSTNC is highly overall correlated with ROUTE_ID and 2 other fields | High correlation |
ROUTE_TY is highly overall correlated with ROUTE_ID | High correlation |
FIRCAR_TM is highly overall correlated with ROUTE_ID and 1 other field | High correlation |
LSTCAR_TM is highly overall correlated with DSTNC | High correlation |
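The report does not state which coefficient or cut-off produced these alerts. The sketch below assumes a threshold of 0.5 on the absolute coefficient and applies it to the second correlation matrix shown near the end of this report. Under that assumption it flags the same five variables with the same partner counts. The function name `high_correlations` and the constant `THRESHOLD` are hypothetical.

```python
# Coefficients copied from the second correlation matrix in this report.
names = ["STDR_DE", "ROUTE_ID", "DSTNC", "ROUTE_TY", "CARALC", "FIRCAR_TM", "LSTCAR_TM"]
matrix = [
    [1.000, 0.004, 0.029, -0.029, 0.043, -0.004, -0.028],
    [0.004, 1.000, -0.571, -0.612, 0.051, 0.645, 0.299],
    [0.029, -0.571, 1.000, 0.480, 0.156, -0.751, -0.527],
    [-0.029, -0.612, 0.480, 1.000, 0.116, -0.485, -0.251],
    [0.043, 0.051, 0.156, 0.116, 1.000, 0.029, -0.211],
    [-0.004, 0.645, -0.751, -0.485, 0.029, 1.000, 0.374],
    [-0.028, 0.299, -0.527, -0.251, -0.211, 0.374, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off; the report itself does not state one


def high_correlations(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    flagged = {}
    for i, name in enumerate(names):
        partners = [
            names[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) >= threshold
        ]
        if partners:
            flagged[name] = partners
    return flagged


alerts = high_correlations(names, matrix, THRESHOLD)
for name, partners in alerts.items():
    print(f"{name}: {', '.join(partners)}")
```

STDR_DE and CARALC have no coefficient at or above 0.5 in that matrix, which is consistent with their absence from the alert list.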
Reproduction
Analysis started | 2024-05-11 06:17:08.754299 |
---|---|
Analysis finished | 2024-05-11 06:17:19.989163 |
Duration | 11.23 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
STDR_DE
Real number (ℝ)
Distinct | 12 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20220656 |
Minimum | 20220101 |
---|---|
Maximum | 20221201 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 68.3 KiB |
Quantile statistics
Minimum | 20220101 |
---|---|
5-th percentile | 20220101 |
Q1 | 20220401 |
median | 20220701 |
Q3 | 20221001 |
95-th percentile | 20221201 |
Maximum | 20221201 |
Range | 1100 |
Interquartile range (IQR) | 600 |
Descriptive statistics
Standard deviation | 345.20055 |
---|---|
Coefficient of variation (CV) | 1.7071679 × 10⁻⁵ |
Kurtosis | -1.2156553 |
Mean | 20220656 |
Median Absolute Deviation (MAD) | 300 |
Skewness | -0.020482124 |
Sum | 1.5675053 × 10¹¹ |
Variance | 119163.42 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
20221201 | 659 | |
20221001 | 658 | |
20221101 | 658 | |
20220701 | 653 | |
20220901 | 653 | |
20220801 | 652 | |
20220601 | 642 | |
20220501 | 640 | |
20220401 | 635 | |
20220101 | 634 | |
Other values (2) | 1268 |
Value | Count | Frequency (%) |
20220101 | 634 | |
20220201 | 634 | |
20220301 | 634 | |
20220401 | 635 | |
20220501 | 640 | |
20220601 | 642 | |
20220701 | 653 | |
20220801 | 652 | |
20220901 | 653 | |
20221001 | 658 |
Value | Count | Frequency (%) |
20221201 | 659 | |
20221101 | 658 | |
20221001 | 658 | |
20220901 | 653 | |
20220801 | 652 | |
20220701 | 653 | |
20220601 | 642 | |
20220501 | 640 | |
20220401 | 635 | |
20220301 | 634 |
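The twelve distinct STDR_DE values look like first-of-month dates written as integers, which would explain why the tool treats the column as a real number and reports a range of 1100. The sketch below assumes a YYYYMMDD encoding (an interpretation, not something the report states) and converts the values to calendar dates. The helper `parse_stdr_de` is hypothetical.

```python
from datetime import date, datetime

# Distinct STDR_DE values and their row counts, copied from the tables above.
counts = {
    20220101: 634, 20220201: 634, 20220301: 634, 20220401: 635,
    20220501: 640, 20220601: 642, 20220701: 653, 20220801: 652,
    20220901: 653, 20221001: 658, 20221101: 658, 20221201: 659,
}


def parse_stdr_de(value: int) -> date:
    """Interpret an integer such as 20220101 as a YYYYMMDD date (assumed encoding)."""
    return datetime.strptime(str(value), "%Y%m%d").date()


snapshots = {parse_stdr_de(value): n for value, n in counts.items()}
for day, n in sorted(snapshots.items()):
    print(day.isoformat(), n)

# The per-snapshot counts should add up to the number of observations.
total_rows = sum(counts.values())
print("total rows:", total_rows)
```

Read this way, the column holds one snapshot per month of 2022, and the counts sum to the 7752 observations reported in the dataset statistics.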
ROUTE_ID
Real number (ℝ)
HIGH CORRELATION
Distinct | 665 |
---|---|
Distinct (%) | 8.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.0616031 × 10⁸ |
Minimum | 1.0000002 × 10⁸ |
---|---|
Maximum | 1.249 × 10⁸ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 68.3 KiB |
Quantile statistics
Minimum | 1.0000002 × 10⁸ |
---|---|
5-th percentile | 1.0010004 × 10⁸ |
Q1 | 1.0010022 × 10⁸ |
median | 1.0010058 × 10⁸ |
Q3 | 1.129 × 10⁸ |
95-th percentile | 1.2190001 × 10⁸ |
Maximum | 1.249 × 10⁸ |
Range | 24899986 |
Interquartile range (IQR) | 12799781 |
Descriptive statistics
Standard deviation | 7935197.1 |
---|---|
Coefficient of variation (CV) | 0.074747304 |
Kurtosis | -0.72670248 |
Mean | 1.0616031 × 10⁸ |
Median Absolute Deviation (MAD) | 547.5 |
Skewness | 0.89415443 |
Sum | 8.2295474 × 10¹¹ |
Variance | 6.2967353 × 10¹³ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
100100124 | 12 | 0.2% |
120900007 | 12 | 0.2% |
120900005 | 12 | 0.2% |
120900008 | 12 | 0.2% |
120900003 | 12 | 0.2% |
120900009 | 12 | 0.2% |
120900010 | 12 | 0.2% |
120900004 | 12 | 0.2% |
120900006 | 12 | 0.2% |
120900002 | 12 | 0.2% |
Other values (655) | 7632 |
Value | Count | Frequency (%) |
100000017 | 12 | |
100000018 | 12 | |
100100001 | 12 | |
100100006 | 12 | |
100100007 | 12 | |
100100008 | 12 | |
100100009 | 12 | |
100100010 | 12 | |
100100011 | 12 | |
100100012 | 12 |
Value | Count | Frequency (%) |
124900003 | 12 | |
124900002 | 12 | |
124900001 | 12 | |
124000039 | 12 | |
124000038 | 12 | |
124000036 | 12 | |
124000013 | 6 | |
124000010 | 12 | |
124000008 | 12 | |
124000006 | 3 | < 0.1% |
ROUTE_NM
Text
Distinct | 675 |
---|---|
Distinct (%) | 8.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 60.7 KiB |
Value | Count | Frequency (%) |
0017 | 12 | 0.2% |
관악07 | 12 | 0.2% |
구로01 | 12 | 0.2% |
관악01 | 12 | 0.2% |
관악02 | 12 | 0.2% |
관악03 | 12 | 0.2% |
관악04 | 12 | 0.2% |
관악05 | 12 | 0.2% |
관악06 | 12 | 0.2% |
관악10 | 12 | 0.2% |
Other values (665) | 7632 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 5137 | |
0 | 4043 | |
2 | 2859 | 9.4% |
6 | 2284 | 7.5% |
3 | 2136 | 7.0% |
5 | 1928 | 6.3% |
7 | 1892 | 6.2% |
4 | 1823 | 6.0% |
8 | 630 | 2.1% |
서 | 552 | 1.8% |
Other values (55) | 7282 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 23222 | |
Other Letter | 6584 | 21.5% |
Uppercase Letter | 592 | 1.9% |
Dash Punctuation | 168 | 0.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
서 | 552 | 8.4% |
동 | 492 | 7.5% |
성 | 384 | 5.8% |
강 | 384 | 5.8% |
포 | 372 | 5.7% |
북 | 360 | 5.5% |
대 | 300 | 4.6% |
로 | 298 | 4.5% |
초 | 252 | 3.8% |
작 | 252 | 3.8% |
Other values (37) | 2938 |
Decimal Number
Value | Count | Frequency (%) |
1 | 5137 | |
0 | 4043 | |
2 | 2859 | |
6 | 2284 | |
3 | 2136 | |
5 | 1928 | 8.3% |
7 | 1892 | 8.1% |
4 | 1823 | 7.9% |
8 | 630 | 2.7% |
9 | 490 | 2.1% |
Uppercase Letter
Value | Count | Frequency (%) |
N | 148 | |
A | 78 | |
B | 78 | |
R | 72 | |
T | 72 | |
U | 72 | |
O | 72 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 168 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 23390 | |
Hangul | 6584 | 21.5% |
Latin | 592 | 1.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
서 | 552 | 8.4% |
동 | 492 | 7.5% |
성 | 384 | 5.8% |
강 | 384 | 5.8% |
포 | 372 | 5.7% |
북 | 360 | 5.5% |
대 | 300 | 4.6% |
로 | 298 | 4.5% |
초 | 252 | 3.8% |
작 | 252 | 3.8% |
Other values (37) | 2938 |
Common
Value | Count | Frequency (%) |
1 | 5137 | |
0 | 4043 | |
2 | 2859 | |
6 | 2284 | |
3 | 2136 | |
5 | 1928 | 8.2% |
7 | 1892 | 8.1% |
4 | 1823 | 7.8% |
8 | 630 | 2.7% |
9 | 490 | 2.1% |
Latin
Value | Count | Frequency (%) |
N | 148 | |
A | 78 | |
B | 78 | |
R | 72 | |
T | 72 | |
U | 72 | |
O | 72 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 23982 | |
Hangul | 6584 | 21.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 5137 | |
0 | 4043 | |
2 | 2859 | |
6 | 2284 | |
3 | 2136 | |
5 | 1928 | 8.0% |
7 | 1892 | 7.9% |
4 | 1823 | 7.6% |
8 | 630 | 2.6% |
9 | 490 | 2.0% |
Other values (8) | 760 | 3.2% |
Hangul
Value | Count | Frequency (%) |
서 | 552 | 8.4% |
동 | 492 | 7.5% |
성 | 384 | 5.8% |
강 | 384 | 5.8% |
포 | 372 | 5.7% |
북 | 360 | 5.5% |
대 | 300 | 4.6% |
로 | 298 | 4.5% |
초 | 252 | 3.8% |
작 | 252 | 3.8% |
Other values (37) | 2938 |
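The category labels in these tables (Decimal Number, Other Letter, Uppercase Letter, Dash Punctuation) match the names of Unicode general categories. As an illustration only, and not the profiling tool's own code, the sketch below uses Python's standard `unicodedata` module to classify the characters of a few route names that appear in this report. The `LABELS` mapping is a hypothetical helper covering just the categories seen here.

```python
import unicodedata
from collections import Counter

# A few route names that appear in this report.
samples = ["0017", "관악07", "강서5-1", "01A"]

# Readable labels for the Unicode general-category codes used here.
LABELS = {
    "Nd": "Decimal Number",
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Pd": "Dash Punctuation",
}

# Count characters per category; unknown codes fall back to the raw code.
category_counts = Counter(
    LABELS.get(unicodedata.category(ch), unicodedata.category(ch))
    for name in samples
    for ch in name
)

for label, n in category_counts.most_common():
    print(label, n)
```

Hangul syllables fall under Other Letter, which is why that category dominates the non-digit characters in the route name columns.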
ROUTE_ABRV
Text
Distinct | 682 |
---|---|
Distinct (%) | 8.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 60.7 KiB |
Value | Count | Frequency (%) |
0017 | 12 | 0.2% |
관악02 | 12 | 0.2% |
강서04 | 12 | 0.2% |
광진02 | 12 | 0.2% |
강서05 | 12 | 0.2% |
강서5-1 | 12 | 0.2% |
강서06 | 12 | 0.2% |
강서07 | 12 | 0.2% |
관악01 | 12 | 0.2% |
관악03 | 12 | 0.2% |
Other values (672) | 7632 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 5137 | |
0 | 3915 | |
2 | 2859 | |
6 | 2284 | 7.7% |
3 | 2136 | 7.2% |
5 | 1928 | 6.5% |
7 | 1892 | 6.3% |
4 | 1823 | 6.1% |
8 | 630 | 2.1% |
서 | 540 | 1.8% |
Other values (43) | 6679 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 23094 | |
Other Letter | 5969 | 20.0% |
Uppercase Letter | 592 | 2.0% |
Dash Punctuation | 168 | 0.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
서 | 540 | 9.0% |
동 | 492 | 8.2% |
성 | 384 | 6.4% |
강 | 384 | 6.4% |
북 | 360 | 6.0% |
로 | 298 | 5.0% |
대 | 271 | 4.5% |
작 | 252 | 4.2% |
초 | 240 | 4.0% |
포 | 216 | 3.6% |
Other values (25) | 2532 |
Decimal Number
Value | Count | Frequency (%) |
1 | 5137 | |
0 | 3915 | |
2 | 2859 | |
6 | 2284 | |
3 | 2136 | |
5 | 1928 | 8.3% |
7 | 1892 | 8.2% |
4 | 1823 | 7.9% |
8 | 630 | 2.7% |
9 | 490 | 2.1% |
Uppercase Letter
Value | Count | Frequency (%) |
N | 148 | |
A | 78 | |
B | 78 | |
U | 72 | |
T | 72 | |
R | 72 | |
O | 72 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 168 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 23262 | |
Hangul | 5969 | 20.0% |
Latin | 592 | 2.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
서 | 540 | 9.0% |
동 | 492 | 8.2% |
성 | 384 | 6.4% |
강 | 384 | 6.4% |
북 | 360 | 6.0% |
로 | 298 | 5.0% |
대 | 271 | 4.5% |
작 | 252 | 4.2% |
초 | 240 | 4.0% |
포 | 216 | 3.6% |
Other values (25) | 2532 |
Common
Value | Count | Frequency (%) |
1 | 5137 | |
0 | 3915 | |
2 | 2859 | |
6 | 2284 | |
3 | 2136 | |
5 | 1928 | 8.3% |
7 | 1892 | 8.1% |
4 | 1823 | 7.8% |
8 | 630 | 2.7% |
9 | 490 | 2.1% |
Latin
Value | Count | Frequency (%) |
N | 148 | |
A | 78 | |
B | 78 | |
U | 72 | |
T | 72 | |
R | 72 | |
O | 72 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 23854 | |
Hangul | 5969 | 20.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 5137 | |
0 | 3915 | |
2 | 2859 | |
6 | 2284 | |
3 | 2136 | |
5 | 1928 | 8.1% |
7 | 1892 | 7.9% |
4 | 1823 | 7.6% |
8 | 630 | 2.6% |
9 | 490 | 2.1% |
Other values (8) | 760 | 3.2% |
Hangul
Value | Count | Frequency (%) |
서 | 540 | 9.0% |
동 | 492 | 8.2% |
성 | 384 | 6.4% |
강 | 384 | 6.4% |
북 | 360 | 6.0% |
로 | 298 | 5.0% |
대 | 271 | 4.5% |
작 | 252 | 4.2% |
초 | 240 | 4.0% |
포 | 216 | 3.6% |
Other values (25) | 2532 |
DSTNC
Real number (ℝ)
HIGH CORRELATION
Distinct | 464 |
---|---|
Distinct (%) | 6.0% |
Missing | 6 |
Missing (%) | 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28.348538 |
Minimum | 1.2 |
---|---|
Maximum | 206 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 68.3 KiB |
Quantile statistics
Minimum | 1.2 |
---|---|
5-th percentile | 4 |
Q1 | 8.31 |
median | 20.7 |
Q3 | 42.6 |
95-th percentile | 66.57 |
Maximum | 206 |
Range | 204.8 |
Interquartile range (IQR) | 34.29 |
Descriptive statistics
Standard deviation | 26.932035 |
---|---|
Coefficient of variation (CV) | 0.95003261 |
Kurtosis | 10.85367 |
Mean | 28.348538 |
Median Absolute Deviation (MAD) | 14.1 |
Skewness | 2.5658775 |
Sum | 219587.77 |
Variance | 725.33452 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7.2 | 108 | 1.4% |
7.0 | 107 | 1.4% |
12.0 | 71 | 0.9% |
8.5 | 60 | 0.8% |
5.7 | 60 | 0.8% |
7.7 | 60 | 0.8% |
13.3 | 60 | 0.8% |
5.5 | 59 | 0.8% |
39.0 | 58 | 0.7% |
4.2 | 56 | 0.7% |
Other values (454) | 7047 |
Value | Count | Frequency (%) |
1.2 | 12 | 0.2% |
1.6 | 12 | 0.2% |
1.8 | 12 | 0.2% |
1.9 | 12 | 0.2% |
2.1 | 24 | |
2.4 | 12 | 0.2% |
2.5 | 2 | < 0.1% |
2.6 | 24 | |
2.8 | 12 | 0.2% |
2.9 | 36 |
Value | Count | Frequency (%) |
206.0 | 10 | |
204.4 | 3 | < 0.1% |
201.4 | 7 | |
190.0 | 2 | < 0.1% |
187.6 | 7 | |
184.0 | 8 | |
178.7 | 3 | < 0.1% |
168.0 | 16 | |
167.0 | 2 | < 0.1% |
161.4 | 7 |
ROUTE_TY
Real number (ℝ)
HIGH CORRELATION
Distinct | 7 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.0425697 |
Minimum | 1 |
---|---|
Maximum | 10 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 68.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 2 |
median | 3 |
Q3 | 4 |
95-th percentile | 4 |
Maximum | 10 |
Range | 9 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.1956115 |
---|---|
Coefficient of variation (CV) | 0.39296108 |
Kurtosis | 8.8210991 |
Mean | 3.0425697 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.8969029 |
Sum | 23586 |
Variance | 1.4294868 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 2962 | |
4 | 2716 | |
3 | 1680 | |
1 | 173 | 2.2% |
6 | 120 | 1.5% |
10 | 72 | 0.9% |
5 | 29 | 0.4% |
Value | Count | Frequency (%) |
1 | 173 | 2.2% |
2 | 2962 | |
3 | 1680 | |
4 | 2716 | |
5 | 29 | 0.4% |
6 | 120 | 1.5% |
10 | 72 | 0.9% |
Value | Count | Frequency (%) |
10 | 72 | 0.9% |
6 | 120 | 1.5% |
5 | 29 | 0.4% |
4 | 2716 | |
3 | 1680 | |
2 | 2962 | |
1 | 173 | 2.2% |
SSTTN_NM
Text
Distinct | 376 |
---|---|
Distinct (%) | 4.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 60.7 KiB |
Value | Count | Frequency (%) |
양천공영차고지 | 216 | 2.8% |
중랑공영차고지 | 156 | 2.0% |
은평차고지 | 156 | 2.0% |
복정역환승센터 | 156 | 2.0% |
우이동 | 120 | 1.5% |
장지공영차고지 | 120 | 1.5% |
진관공영차고지 | 104 | 1.3% |
강동공영차고지 | 92 | 1.2% |
강동차고지 | 84 | 1.1% |
구산동 | 84 | 1.1% |
Other values (366) | 6464 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 2679 | 6.7% |
지 | 2171 | 5.4% |
고 | 1824 | 4.6% |
차 | 1740 | 4.3% |
공 | 1259 | 3.1% |
역 | 1146 | 2.9% |
영 | 1046 | 2.6% |
아 | 693 | 1.7% |
파 | 690 | 1.7% |
산 | 664 | 1.7% |
Other values (279) | 26089 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 38710 | |
Decimal Number | 751 | 1.9% |
Uppercase Letter | 208 | 0.5% |
Other Punctuation | 204 | 0.5% |
Open Punctuation | 58 | 0.1% |
Close Punctuation | 58 | 0.1% |
Lowercase Letter | 12 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 2679 | 6.9% |
지 | 2171 | 5.6% |
고 | 1824 | 4.7% |
차 | 1740 | 4.5% |
공 | 1259 | 3.3% |
역 | 1146 | 3.0% |
영 | 1046 | 2.7% |
아 | 693 | 1.8% |
파 | 690 | 1.8% |
산 | 664 | 1.7% |
Other values (258) | 24798 |
Decimal Number
Value | Count | Frequency (%) |
1 | 220 | |
2 | 121 | |
7 | 108 | |
3 | 65 | 8.7% |
5 | 62 | 8.3% |
4 | 62 | 8.3% |
6 | 60 | 8.0% |
0 | 29 | 3.9% |
8 | 24 | 3.2% |
Uppercase Letter
Value | Count | Frequency (%) |
T | 60 | |
P | 48 | |
A | 48 | |
L | 14 | 6.7% |
H | 14 | 6.7% |
E | 12 | 5.8% |
K | 12 | 5.8% |
Other Punctuation
Value | Count | Frequency (%) |
. | 192 | |
, | 12 | 5.9% |
Open Punctuation
Value | Count | Frequency (%) |
( | 58 |
Close Punctuation
Value | Count | Frequency (%) |
) | 58 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 12 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 38710 | |
Common | 1071 | 2.7% |
Latin | 220 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 2679 | 6.9% |
지 | 2171 | 5.6% |
고 | 1824 | 4.7% |
차 | 1740 | 4.5% |
공 | 1259 | 3.3% |
역 | 1146 | 3.0% |
영 | 1046 | 2.7% |
아 | 693 | 1.8% |
파 | 690 | 1.8% |
산 | 664 | 1.7% |
Other values (258) | 24798 |
Common
Value | Count | Frequency (%) |
1 | 220 | |
. | 192 | |
2 | 121 | |
7 | 108 | |
3 | 65 | 6.1% |
5 | 62 | 5.8% |
4 | 62 | 5.8% |
6 | 60 | 5.6% |
( | 58 | 5.4% |
) | 58 | 5.4% |
Other values (3) | 65 | 6.1% |
Latin
Value | Count | Frequency (%) |
T | 60 | |
P | 48 | |
A | 48 | |
L | 14 | 6.4% |
H | 14 | 6.4% |
e | 12 | 5.5% |
E | 12 | 5.5% |
K | 12 | 5.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 38710 | |
ASCII | 1291 | 3.2% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 2679 | 6.9% |
지 | 2171 | 5.6% |
고 | 1824 | 4.7% |
차 | 1740 | 4.5% |
공 | 1259 | 3.3% |
역 | 1146 | 3.0% |
영 | 1046 | 2.7% |
아 | 693 | 1.8% |
파 | 690 | 1.8% |
산 | 664 | 1.7% |
Other values (258) | 24798 |
ASCII
Value | Count | Frequency (%) |
1 | 220 | |
. | 192 | |
2 | 121 | |
7 | 108 | 8.4% |
3 | 65 | 5.0% |
5 | 62 | 4.8% |
4 | 62 | 4.8% |
6 | 60 | 4.6% |
T | 60 | 4.6% |
( | 58 | 4.5% |
Other values (11) | 283 |
ESTTN_NM
Text
Distinct | 403 |
---|---|
Distinct (%) | 5.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 60.7 KiB |
Value | Count | Frequency (%) |
서울역 | 162 | 2.1% |
여의도 | 144 | 1.9% |
강남역 | 138 | 1.8% |
석계역 | 115 | 1.5% |
홍제역 | 108 | 1.4% |
양재역 | 96 | 1.2% |
사당역 | 90 | 1.2% |
구로디지털단지역 | 90 | 1.2% |
대방역 | 86 | 1.1% |
수유역 | 84 | 1.1% |
Other values (393) | 6639 |
Most occurring characters
Value | Count | Frequency (%) |
역 | 3679 | 10.4% |
동 | 1358 | 3.9% |
대 | 1173 | 3.3% |
구 | 843 | 2.4% |
신 | 650 | 1.8% |
서 | 614 | 1.7% |
지 | 608 | 1.7% |
리 | 582 | 1.7% |
사 | 530 | 1.5% |
문 | 506 | 1.4% |
Other values (283) | 24667 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 34214 | |
Decimal Number | 390 | 1.1% |
Other Punctuation | 226 | 0.6% |
Uppercase Letter | 192 | 0.5% |
Open Punctuation | 94 | 0.3% |
Close Punctuation | 94 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
역 | 3679 | 10.8% |
동 | 1358 | 4.0% |
대 | 1173 | 3.4% |
구 | 843 | 2.5% |
신 | 650 | 1.9% |
서 | 614 | 1.8% |
지 | 608 | 1.8% |
리 | 582 | 1.7% |
사 | 530 | 1.5% |
문 | 506 | 1.5% |
Other values (263) | 23671 |
Decimal Number
Value | Count | Frequency (%) |
2 | 114 | |
7 | 62 | |
1 | 60 | |
3 | 41 | 10.5% |
5 | 36 | 9.2% |
6 | 36 | 9.2% |
9 | 24 | 6.2% |
8 | 12 | 3.1% |
0 | 5 | 1.3% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 48 | |
A | 42 | |
Y | 24 | |
M | 24 | |
T | 18 | 9.4% |
G | 12 | 6.2% |
S | 12 | 6.2% |
N | 12 | 6.2% |
Other Punctuation
Value | Count | Frequency (%) |
. | 226 |
Open Punctuation
Value | Count | Frequency (%) |
( | 94 |
Close Punctuation
Value | Count | Frequency (%) |
) | 94 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 34214 | |
Common | 804 | 2.3% |
Latin | 192 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
역 | 3679 | 10.8% |
동 | 1358 | 4.0% |
대 | 1173 | 3.4% |
구 | 843 | 2.5% |
신 | 650 | 1.9% |
서 | 614 | 1.8% |
지 | 608 | 1.8% |
리 | 582 | 1.7% |
사 | 530 | 1.5% |
문 | 506 | 1.5% |
Other values (263) | 23671 |
Common
Value | Count | Frequency (%) |
. | 226 | |
2 | 114 | |
( | 94 | |
) | 94 | |
7 | 62 | 7.7% |
1 | 60 | 7.5% |
3 | 41 | 5.1% |
5 | 36 | 4.5% |
6 | 36 | 4.5% |
9 | 24 | 3.0% |
Other values (2) | 17 | 2.1% |
Latin
Value | Count | Frequency (%) |
C | 48 | |
A | 42 | |
Y | 24 | |
M | 24 | |
T | 18 | 9.4% |
G | 12 | 6.2% |
S | 12 | 6.2% |
N | 12 | 6.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 34214 | |
ASCII | 996 | 2.8% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
역 | 3679 | 10.8% |
동 | 1358 | 4.0% |
대 | 1173 | 3.4% |
구 | 843 | 2.5% |
신 | 650 | 1.9% |
서 | 614 | 1.8% |
지 | 608 | 1.8% |
리 | 582 | 1.7% |
사 | 530 | 1.5% |
문 | 506 | 1.5% |
Other values (263) | 23671 |
ASCII
Value | Count | Frequency (%) |
. | 226 | |
2 | 114 | |
( | 94 | |
) | 94 | |
7 | 62 | 6.2% |
1 | 60 | 6.0% |
C | 48 | 4.8% |
A | 42 | 4.2% |
3 | 41 | 4.1% |
5 | 36 | 3.6% |
Other values (10) | 179 |
CARALC
Real number (ℝ)
Distinct | 73 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.858746 |
Minimum | 0 |
---|---|
Maximum | 630 |
Zeros | 12 |
Zeros (%) | 0.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 68.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 6 |
Q1 | 9 |
median | 11 |
Q3 | 15 |
95-th percentile | 29 |
Maximum | 630 |
Range | 630 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 33.593183 |
---|---|
Coefficient of variation (CV) | 1.9926264 |
Kurtosis | 124.98313 |
Mean | 16.858746 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 10.017133 |
Sum | 130689 |
Variance | 1128.5019 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10 | 797 | |
8 | 691 | 8.9% |
11 | 653 | 8.4% |
12 | 649 | 8.4% |
9 | 629 | 8.1% |
7 | 563 | 7.3% |
13 | 542 | 7.0% |
15 | 472 | 6.1% |
6 | 397 | 5.1% |
14 | 382 | 4.9% |
Other values (63) | 1977 |
Value | Count | Frequency (%) |
0 | 12 | 0.2% |
4 | 36 | 0.5% |
5 | 132 | 1.7% |
6 | 397 | |
7 | 563 | |
8 | 691 | |
9 | 629 | |
10 | 797 | |
11 | 653 | |
12 | 649 |
Value | Count | Frequency (%) |
630 | 5 | 0.1% |
530 | 1 | < 0.1% |
460 | 2 | < 0.1% |
400 | 1 | < 0.1% |
380 | 1 | < 0.1% |
340 | 5 | 0.1% |
330 | 7 | |
325 | 4 | 0.1% |
305 | 2 | < 0.1% |
300 | 13 |
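CARALC has a median of 11 and an interquartile range of 6, yet its maximum is 630, so the upper tail deserves a closer look. One common way to do that, which the report itself does not apply, is Tukey's 1.5 × IQR rule. The sketch below applies it to the quartiles and extreme values listed above. The fence multiplier and all variable names are assumptions made for illustration.

```python
# Quartiles copied from the CARALC quantile statistics above.
q1, q3 = 9, 15
iqr = q3 - q1

# Tukey's rule: values more than 1.5 * IQR beyond the quartiles are flagged.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# Extreme values (value -> count) from the minimum and maximum tables above.
extremes = {
    0: 12, 4: 36,
    630: 5, 530: 1, 460: 2, 400: 1, 380: 1,
    340: 5, 330: 7, 325: 4, 305: 2, 300: 13,
}

flagged = {v: n for v, n in extremes.items() if v < lower_fence or v > upper_fence}

print(f"fences: [{lower_fence}, {upper_fence}]")
print("flagged values:", sorted(flagged))
print("flagged rows among the listed extremes:", sum(flagged.values()))
```

With these inputs the fences are 0 and 24, so every value in the maximum table is flagged while the zeros sit exactly on the lower fence. Whether those large values are data errors or genuinely unusual routes cannot be decided from the report alone.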
FIRCAR_TM
Real number (ℝ)
HIGH CORRELATION
Distinct | 64 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.048647265 |
Minimum | -1.234 |
---|---|
Maximum | 0.235 |
Zeros | 40 |
Zeros (%) | 0.5% |
Negative | 15 |
Negative (%) | 0.2% |
Memory size | 68.3 KiB |
Quantile statistics
Minimum | -1.234 |
---|---|
5-th percentile | 0.04 |
Q1 | 0.042 |
median | 0.05 |
Q3 | 0.055 |
95-th percentile | 0.06 |
Maximum | 0.235 |
Range | 1.469 |
Interquartile range (IQR) | 0.013 |
Descriptive statistics
Standard deviation | 0.060375939 |
---|---|
Coefficient of variation (CV) | 1.2410963 |
Kurtosis | 392.34508 |
Mean | 0.048647265 |
Median Absolute Deviation (MAD) | 0.008 |
Skewness | -18.274532 |
Sum | 377.1136 |
Variance | 0.003645254 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.06 | 1414 | |
0.043 | 1209 | |
0.04 | 1157 | |
0.053 | 518 | 6.7% |
0.05 | 512 | 6.6% |
0.042 | 398 | 5.1% |
0.055 | 357 | 4.6% |
0.041 | 290 | 3.7% |
0.052 | 180 | 2.3% |
0.044 | 176 | 2.3% |
Other values (54) | 1541 |
Value | Count | Frequency (%) |
-1.234 | 15 | 0.2% |
0.0 | 40 | |
0.0001 | 12 | 0.2% |
0.001 | 11 | 0.1% |
0.032 | 1 | < 0.1% |
0.033 | 12 | 0.2% |
0.034 | 1 | < 0.1% |
0.035 | 60 | |
0.0355 | 48 | |
0.0357 | 24 | 0.3% |
Value | Count | Frequency (%) |
0.235 | 10 | |
0.234 | 15 | |
0.233 | 21 | |
0.232 | 7 | 0.1% |
0.231 | 16 | |
0.23 | 1 | < 0.1% |
0.193 | 12 | |
0.133 | 12 | |
0.103 | 12 | |
0.101 | 12 |
LSTCAR_TM
Real number (ℝ)
HIGH CORRELATION
Distinct | 124 |
---|---|
Distinct (%) | 1.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.31196531 |
Minimum | 0.0115 |
---|---|
Maximum | 1.041 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 68.3 KiB |
Quantile statistics
Minimum | 0.0115 |
---|---|
5-th percentile | 0.193 |
Q1 | 0.224 |
median | 0.231 |
Q3 | 0.234 |
95-th percentile | 1.0005 |
Maximum | 1.041 |
Range | 1.0295 |
Interquartile range (IQR) | 0.01 |
Descriptive statistics
Standard deviation | 0.24988329 |
---|---|
Coefficient of variation (CV) | 0.80099704 |
Kurtosis | 3.7383383 |
Mean | 0.31196531 |
Median Absolute Deviation (MAD) | 0.004 |
Skewness | 2.3581501 |
Sum | 2418.3551 |
Variance | 0.062441659 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.23 | 930 | 12.0% |
0.233 | 838 | 10.8% |
0.223 | 607 | 7.8% |
0.231 | 478 | 6.2% |
1.0 | 478 | 6.2% |
0.225 | 464 | 6.0% |
0.234 | 455 | 5.9% |
0.224 | 428 | 5.5% |
0.232 | 404 | 5.2% |
0.235 | 331 | 4.3% |
Other values (114) | 2339 |
Value | Count | Frequency (%) |
0.0115 | 5 | 0.1% |
0.012 | 10 | |
0.013 | 15 | |
0.031 | 13 | |
0.032 | 6 | 0.1% |
0.0325 | 13 | |
0.033 | 6 | 0.1% |
0.0345 | 6 | 0.1% |
0.041 | 5 | 0.1% |
0.044 | 12 |
Value | Count | Frequency (%) |
1.041 | 2 | < 0.1% |
1.035 | 18 | |
1.0345 | 1 | < 0.1% |
1.034 | 5 | 0.1% |
1.0335 | 7 | 0.1% |
1.033 | 13 | |
1.0325 | 1 | < 0.1% |
1.032 | 6 | 0.1% |
1.031 | 1 | < 0.1% |
1.014 | 5 | 0.1% |
GROUP_NM
Text
Distinct | 255 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 60.7 KiB |
Value | Count | Frequency (%) |
선진운수 | 224 | 2.6% |
대진여객 | 188 | 2.2% |
범일운수 | 180 | 2.1% |
한남여객 | 176 | 2.0% |
한성운수 | 156 | 1.8% |
흥안운수 | 156 | 1.8% |
대원여객 | 152 | 1.7% |
한성여객 | 152 | 1.7% |
북부운수 | 151 | 1.7% |
삼화상운 | 144 | 1.7% |
Other values (196) | 7041 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 8720 | |
운 | 4473 | 9.7% |
수 | 4397 | 9.5% |
통 | 1912 | 4.1% |
교 | 1912 | 4.1% |
성 | 1044 | 2.3% |
객 | 1008 | 2.2% |
진 | 985 | 2.1% |
여 | 972 | 2.1% |
, | 920 | 2.0% |
Other values (158) | 19826 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 36169 | |
Space Separator | 8720 | 18.9% |
Other Punctuation | 920 | 2.0% |
Uppercase Letter | 276 | 0.6% |
Decimal Number | 84 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
운 | 4473 | 12.4% |
수 | 4397 | 12.2% |
통 | 1912 | 5.3% |
교 | 1912 | 5.3% |
성 | 1044 | 2.9% |
객 | 1008 | 2.8% |
진 | 985 | 2.7% |
여 | 972 | 2.7% |
한 | 743 | 2.1% |
신 | 716 | 2.0% |
Other values (152) | 18007 |
Uppercase Letter
Value | Count | Frequency (%) |
R | 92 | |
T | 92 | |
B | 92 |
Space Separator
Value | Count | Frequency (%) |
(space) | 8720 |
Other Punctuation
Value | Count | Frequency (%) |
, | 920 |
Decimal Number
Value | Count | Frequency (%) |
3 | 84 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 36169 | |
Common | 9724 | 21.1% |
Latin | 276 | 0.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
운 | 4473 | 12.4% |
수 | 4397 | 12.2% |
통 | 1912 | 5.3% |
교 | 1912 | 5.3% |
성 | 1044 | 2.9% |
객 | 1008 | 2.8% |
진 | 985 | 2.7% |
여 | 972 | 2.7% |
한 | 743 | 2.1% |
신 | 716 | 2.0% |
Other values (152) | 18007 |
Common
Value | Count | Frequency (%) |
(space) | 8720 | |
, | 920 | 9.5% |
3 | 84 | 0.9% |
Latin
Value | Count | Frequency (%) |
R | 92 | |
T | 92 | |
B | 92 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 36169 | |
ASCII | 10000 | 21.7% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 8720 | |
, | 920 | 9.2% |
R | 92 | 0.9% |
T | 92 | 0.9% |
B | 92 | 0.9% |
3 | 84 | 0.8% |
Hangul
Value | Count | Frequency (%) |
운 | 4473 | 12.4% |
수 | 4397 | 12.2% |
통 | 1912 | 5.3% |
교 | 1912 | 5.3% |
성 | 1044 | 2.9% |
객 | 1008 | 2.8% |
진 | 985 | 2.7% |
여 | 972 | 2.7% |
한 | 743 | 2.1% |
신 | 716 | 2.0% |
Other values (152) | 18007 |
Variable | STDR_DE | ROUTE_ID | DSTNC | ROUTE_TY | CARALC | FIRCAR_TM | LSTCAR_TM |
---|---|---|---|---|---|---|---|
STDR_DE | 1.000 | 0.000 | 0.000 | 0.044 | 0.077 | 0.000 | 0.000 |
ROUTE_ID | 0.000 | 1.000 | 0.611 | 0.591 | 0.076 | 0.133 | 0.377 |
DSTNC | 0.000 | 0.611 | 1.000 | 0.784 | 0.825 | 0.296 | 0.586 |
ROUTE_TY | 0.044 | 0.591 | 0.784 | 1.000 | 0.642 | 0.551 | 0.620 |
CARALC | 0.077 | 0.076 | 0.825 | 0.642 | 1.000 | 0.335 | 0.594 |
FIRCAR_TM | 0.000 | 0.133 | 0.296 | 0.551 | 0.335 | 1.000 | 0.356 |
LSTCAR_TM | 0.000 | 0.377 | 0.586 | 0.620 | 0.594 | 0.356 | 1.000 |
Variable | STDR_DE | ROUTE_ID | DSTNC | ROUTE_TY | CARALC | FIRCAR_TM | LSTCAR_TM |
---|---|---|---|---|---|---|---|
STDR_DE | 1.000 | 0.004 | 0.029 | -0.029 | 0.043 | -0.004 | -0.028 |
ROUTE_ID | 0.004 | 1.000 | -0.571 | -0.612 | 0.051 | 0.645 | 0.299 |
DSTNC | 0.029 | -0.571 | 1.000 | 0.480 | 0.156 | -0.751 | -0.527 |
ROUTE_TY | -0.029 | -0.612 | 0.480 | 1.000 | 0.116 | -0.485 | -0.251 |
CARALC | 0.043 | 0.051 | 0.156 | 0.116 | 1.000 | 0.029 | -0.211 |
FIRCAR_TM | -0.004 | 0.645 | -0.751 | -0.485 | 0.029 | 1.000 | 0.374 |
LSTCAR_TM | -0.028 | 0.299 | -0.527 | -0.251 | -0.211 | 0.374 | 1.000 |
Index | STDR_DE | ROUTE_ID | ROUTE_NM | ROUTE_ABRV | DSTNC | ROUTE_TY | SSTTN_NM | ESTTN_NM | CARALC | FIRCAR_TM | LSTCAR_TM | GROUP_NM |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 20220101 | 100100124 | 0017 | 0017 | 12.2 | 4 | 청암동 | 이촌동 | 12 | 0.0515 | 0.233 | 보광교통 |
1 | 20220101 | 104000007 | 01A | 01A | 23.6 | 5 | 서울역환승센터 | 서울역환승센터 | 35 | 0.063 | 0.221 | 대원여객, 메트로버스 |
2 | 20220101 | 104000008 | 01B | 01B | 23.6 | 5 | 서울역환승센터 | 서울역환승센터 | 35 | 0.063 | 0.221 | 대원여객, 북부운수 |
3 | 20220101 | 100100001 | 02 | 02 | 16.5 | 5 | 예장주차장 | 예장주차장 | 12 | 0.063 | 0.23 | 북부운수 |
4 | 20220101 | 106000002 | 04 | 04 | 13.0 | 5 | 예장주차장 | 예장주차장 | 12 | 0.063 | 0.223 | 북부운수 |
5 | 20220101 | 100100549 | 100 | 100 | 57.09 | 3 | 하계동 | 용산구청 | 10 | 0.04 | 0.223 | 한성여객 |
6 | 20220101 | 100100006 | 101 | 101 | 37.81 | 3 | 우이동 | 서소문 | 10 | 0.04 | 0.23 | 동아운수, 한성운수 |
7 | 20220101 | 100100129 | 1014 | 1014 | 12.6 | 4 | 성북생태체험관 | 종로구민회관숭인동 | 8 | 0.05 | 0.234 | 대진여객 |
8 | 20220101 | 100100130 | 1017 | 1017 | 23.95 | 4 | 월계동 | 상왕십리 | 14 | 0.043 | 0.232 | 한성여객 |
9 | 20220101 | 100100007 | 102 | 102 | 30.2 | 3 | 상계주공7단지 | 동대문 | 11 | 0.04 | 0.231 | 삼화상운, 흥안운수 |
Index | STDR_DE | ROUTE_ID | ROUTE_NM | ROUTE_ABRV | DSTNC | ROUTE_TY | SSTTN_NM | ESTTN_NM | CARALC | FIRCAR_TM | LSTCAR_TM | GROUP_NM |
---|---|---|---|---|---|---|---|---|---|---|---|---|
7742 | 20221201 | 100900008 | 종로02 | 종로02 | 7.2 | 2 | 성균관대 | YMCA | 15 | 0.06 | 0.232 | 대산교통 |
7743 | 20221201 | 100900010 | 종로03 | 종로03 | 7.2 | 2 | 낙산공원 | 종로5가 | 7 | 0.06 | 0.233 | 종로운수 |
7744 | 20221201 | 100900011 | 종로05 | 종로05 | 4.9 | 2 | 서대문 | 교남동 | 8 | 0.06 | 0.233 | 나경운수 |
7745 | 20221201 | 100900005 | 종로08 | 종로08 | 6.8 | 2 | 명륜3가종점 | 종로5가 | 6 | 0.055 | 0.234 | 와룡운수 |
7746 | 20221201 | 100900003 | 종로09 | 종로09 | 6.4 | 2 | 수성동계곡 | 남대문 | 10 | 0.06 | 0.233 | 인왕교통 |
7747 | 20221201 | 100900007 | 종로11 | 종로11 | 8.6 | 2 | 삼청동 | 서울역 | 9 | 0.06 | 0.23 | 삼청교통 |
7748 | 20221201 | 100900009 | 종로12 | 종로12 | 5.4 | 2 | 서울대병원 | 종로3가 | 6 | 0.06 | 0.233 | 은수교통 |
7749 | 20221201 | 100900002 | 종로13 | 종로13 | 7.5 | 2 | 평창동주민센터 | 부암동주민센터.무계원 | 13 | 0.055 | 0.223 | 약수교통 |
7750 | 20221201 | 106900001 | 중랑01 | 중랑01 | 3.6 | 2 | 동아약국 | 신이문역 | 16 | 0.06 | 0.2348 | 금창운수, 금창운수 월계점 |
7751 | 20221201 | 106900002 | 중랑02 | 중랑02 | 6.86 | 2 | 진로아파트 | 한신아파트 | 10 | 0.06 | 0.2315 | 중랑운수 |