Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 2000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 101.7 KiB |
Average record size in memory | 52.1 B |
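The average record size appears to be the total in-memory size divided by the number of observations. A minimal sketch of that arithmetic follows, assuming KiB means 1024 bytes; the variable names are illustrative and not part of the report.

```python
KIB = 1024  # bytes per kibibyte (assumed unit convention)

total_size_kib = 101.7   # "Total size in memory" from the table above
n_observations = 2000    # "Number of observations" from the table above

# Average bytes per row = total bytes / number of rows
avg_record_bytes = total_size_kib * KIB / n_observations
print(f"{avg_record_bytes:.1f} B")
```

The result rounds to the 52.1 B shown in the table.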
Variable types
Numeric | 4 |
---|---|
Text | 1 |
Categorical | 1 |
Dataset
Description | Sample data |
---|---|
Author | (주)모토브 / 신재훈 |
URL | https://www.bigdata-transportation.kr/frn/prdt/detail?prdtId=PRDTNUM_000000020253 |
register_at has constant value "2020-09-10 22:00" | Constant |
fine_dust_value_id has unique values | Unique |
fine_dust has 165 (8.2%) zeros | Zeros |
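The three alerts above (Constant, Unique, Zeros) can be illustrated with a small check over the first ten sample rows listed at the end of this report. This is only a sketch: `column_alerts` is a hypothetical helper, and the real profiler may apply different thresholds before it raises an alert.

```python
# First ten sample rows: (fine_dust_value_id, fine_dust, register_at)
rows = [
    (1, 17.741072, "2020-09-10 22:00"),
    (2, 24.716518, "2020-09-10 22:00"),
    (3, 23.32143, "2020-09-10 22:00"),
    (4, 34.482143, "2020-09-10 22:00"),
    (5, 10.765625, "2020-09-10 22:00"),
    (6, 20.53125, "2020-09-10 22:00"),
    (7, 0.0, "2020-09-10 22:00"),
    (8, 20.53125, "2020-09-10 22:00"),
    (9, 27.506697, "2020-09-10 22:00"),
    (10, 0.0, "2020-09-10 22:00"),
]

def column_alerts(values):
    """Return alert labels for one column (hypothetical helper, simplified rules)."""
    alerts = []
    distinct = set(values)
    if len(distinct) == 1:
        alerts.append("Constant")          # every row holds the same value
    elif len(distinct) == len(values):
        alerts.append("Unique")            # no value repeats
    zeros = sum(1 for v in values if v == 0)
    if zeros:
        alerts.append(f"Zeros ({zeros / len(values):.1%})")
    return alerts

columns = {
    "fine_dust_value_id": [r[0] for r in rows],
    "fine_dust": [r[1] for r in rows],
    "register_at": [r[2] for r in rows],
}
for name, values in columns.items():
    print(name, column_alerts(values))
```

On these ten rows the zero share is higher than the 8.2% reported for all 2000 observations, which is expected for such a small sample.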
Reproduction
Analysis started | 2024-01-14 00:46:16.173363 |
---|---|
Analysis finished | 2024-01-14 00:46:22.063083 |
Duration | 5.89 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
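The reported duration can be checked from the two timestamps above. A minimal sketch using only the standard library; the format string is an assumption about how the timestamps are written.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # assumed timestamp layout

started = datetime.strptime("2024-01-14 00:46:16.173363", FMT)
finished = datetime.strptime("2024-01-14 00:46:22.063083", FMT)

# Elapsed wall-clock time between start and finish of the analysis
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```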
fine_dust_value_id
Real number (ℝ)
UNIQUE
 
Distinct | 2000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1000.5 |
Minimum | 1 |
---|---|
Maximum | 2000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.7 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 100.95 |
Q1 | 500.75 |
median | 1000.5 |
Q3 | 1500.25 |
95-th percentile | 1900.05 |
Maximum | 2000 |
Range | 1999 |
Interquartile range (IQR) | 999.5 |
Descriptive statistics
Standard deviation | 577.49459 |
---|---|
Coefficient of variation (CV) | 0.57720599 |
Kurtosis | -1.2 |
Mean | 1000.5 |
Median Absolute Deviation (MAD) | 500 |
Skewness | 0 |
Sum | 2001000 |
Variance | 333500 |
Monotonicity | Strictly increasing |
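The statistics for fine_dust_value_id are consistent with a plain row counter. The sketch below assumes the column holds exactly the integers 1 to 2000 and that quantiles are linearly interpolated; both are assumptions, though they agree with the figures in the tables above. The `quantile` helper is illustrative only.

```python
import statistics

ids = list(range(1, 2001))  # assumption: IDs are exactly 1, 2, ..., 2000

def quantile(sorted_values, q):
    """Linearly interpolated quantile of an already sorted list (illustrative)."""
    pos = (len(sorted_values) - 1) * q
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

mean = statistics.mean(ids)
variance = statistics.variance(ids)   # sample variance (n - 1 denominator)
std = statistics.stdev(ids)
cv = std / mean                       # coefficient of variation
median = statistics.median(ids)
mad = statistics.median(abs(x - median) for x in ids)  # median absolute deviation

print(mean, variance, round(std, 5), round(cv, 8), mad)
print(quantile(ids, 0.05), quantile(ids, 0.25), quantile(ids, 0.75), quantile(ids, 0.95))
```

Under these assumptions the values agree with the Mean, Variance, Standard deviation, CV, MAD and percentile rows reported above.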
Value | Count | Frequency (%) |
1 | 1 | 0.1% |
1331 | 1 | 0.1% |
1344 | 1 | 0.1% |
1343 | 1 | 0.1% |
1342 | 1 | 0.1% |
1341 | 1 | 0.1% |
1340 | 1 | 0.1% |
1339 | 1 | 0.1% |
1338 | 1 | 0.1% |
1337 | 1 | 0.1% |
Other values (1990) | 1990 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
2000 | 1 | |
1999 | 1 | |
1998 | 1 | |
1997 | 1 | |
1996 | 1 | |
1995 | 1 | |
1994 | 1 | |
1993 | 1 | |
1992 | 1 | |
1991 | 1 |
taxi_id
Text
Distinct | 61 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.8 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 20000 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | T_96289981 |
---|---|
2nd row | T_73322493 |
3rd row | T_47791477 |
4th row | T_97388636 |
5th row | T_73102763 |
Value | Count | Frequency (%) |
T_97608367 | 34 | 1.7% |
T_73468981 | 34 | 1.7% |
T_48230939 | 34 | 1.7% |
T_23798578 | 34 | 1.7% |
T_98633779 | 33 | 1.7% |
T_98047829 | 33 | 1.7% |
T_48084452 | 33 | 1.7% |
T_23945065 | 33 | 1.7% |
T_98267560 | 33 | 1.7% |
T_96289981 | 33 | 1.7% |
Other values (51) | 1666 |
Most occurring characters
Value | Count | Frequency (%) |
7 | 2298 | |
4 | 2278 | |
T | 2000 | |
_ | 2000 | |
9 | 1902 | |
2 | 1866 | |
8 | 1738 | |
3 | 1655 | |
5 | 1156 | |
6 | 1075 | |
Other values (2) | 2032 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 16000 | |
Uppercase Letter | 2000 | 10.0% |
Connector Punctuation | 2000 | 10.0% |
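The category names in the table above correspond to Unicode general categories (Decimal Number is `Nd`, Uppercase Letter is `Lu`, Connector Punctuation is `Pc`). A minimal sketch of the tally, using only the five sample taxi_id values shown earlier rather than the full column:

```python
import unicodedata
from collections import Counter

# The five sample values from the "Sample" table; not the full 2000 rows
sample_ids = ["T_96289981", "T_73322493", "T_47791477", "T_97388636", "T_73102763"]

# Count every character by its Unicode general category code
category_counts = Counter(
    unicodedata.category(ch) for taxi_id in sample_ids for ch in taxi_id
)

total = sum(category_counts.values())
for code, count in category_counts.most_common():
    print(code, count, f"{count / total:.1%}")
```

Each ID is one uppercase letter, one underscore and eight digits, so the 80% / 10% / 10% split in the sample matches the 16000 / 2000 / 2000 counts reported for the whole column.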
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
7 | 2298 | |
4 | 2278 | |
9 | 1902 | |
2 | 1866 | |
8 | 1738 | |
3 | 1655 | |
5 | 1156 | |
6 | 1075 | |
1 | 1057 | |
0 | 975 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 2000 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 18000 | |
Latin | 2000 | 10.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
7 | 2298 | |
4 | 2278 | |
_ | 2000 | |
9 | 1902 | |
2 | 1866 | |
8 | 1738 | |
3 | 1655 | |
5 | 1156 | |
6 | 1075 | |
1 | 1057 |
Latin
Value | Count | Frequency (%) |
T | 2000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 20000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
7 | 2298 | |
4 | 2278 | |
T | 2000 | |
_ | 2000 | |
9 | 1902 | |
2 | 1866 | |
8 | 1738 | |
3 | 1655 | |
5 | 1156 | |
6 | 1075 | |
Other values (2) | 2032 |
latitude
Real number (ℝ)
Distinct | 1297 |
---|---|
Distinct (%) | 64.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.517553 |
Minimum | 37.30882 |
---|---|
Maximum | 37.67605 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.7 KiB |
Quantile statistics
Minimum | 37.30882 |
---|---|
5-th percentile | 37.421769 |
Q1 | 37.48688 |
median | 37.516973 |
Q3 | 37.540834 |
95-th percentile | 37.609883 |
Maximum | 37.67605 |
Range | 0.36723 |
Interquartile range (IQR) | 0.05395425 |
Descriptive statistics
Standard deviation | 0.057521084 |
---|---|
Coefficient of variation (CV) | 0.0015331779 |
Kurtosis | 2.2041823 |
Mean | 37.517553 |
Median Absolute Deviation (MAD) | 0.0300925 |
Skewness | -0.39069245 |
Sum | 75035.105 |
Variance | 0.0033086751 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.5649 | 34 | 1.7% |
37.48688 | 33 | 1.7% |
37.473686 | 29 | 1.5% |
37.573574 | 25 | 1.2% |
37.523636 | 23 | 1.1% |
37.53414 | 21 | 1.1% |
37.60681 | 19 | 0.9% |
37.48328 | 19 | 0.9% |
37.523 | 13 | 0.7% |
37.62119 | 13 | 0.7% |
Other values (1287) | 1771 |
Value | Count | Frequency (%) |
37.30882 | 2 | |
37.308823 | 1 | |
37.30883 | 2 | |
37.308846 | 2 | |
37.30886 | 1 | |
37.308876 | 2 | |
37.30889 | 1 | |
37.308907 | 1 | |
37.308914 | 1 | |
37.30892 | 1 |
Value | Count | Frequency (%) |
37.67605 | 1 | 0.1% |
37.676044 | 6 | |
37.67604 | 4 | 0.2% |
37.676037 | 1 | 0.1% |
37.676033 | 3 | 0.1% |
37.67603 | 7 | |
37.676025 | 11 | |
37.621807 | 2 | 0.1% |
37.621803 | 1 | 0.1% |
37.6218 | 1 | 0.1% |
longitude
Real number (ℝ)
Distinct | 976 |
---|---|
Distinct (%) | 48.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 126.97993 |
Minimum | 126.70944 |
---|---|
Maximum | 127.16431 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.7 KiB |
Quantile statistics
Minimum | 126.70944 |
---|---|
5-th percentile | 126.74964 |
Q1 | 126.89201 |
median | 126.97982 |
Q3 | 127.08631 |
95-th percentile | 127.13107 |
Maximum | 127.16431 |
Range | 0.45487 |
Interquartile range (IQR) | 0.194304 |
Descriptive statistics
Standard deviation | 0.11583467 |
---|---|
Coefficient of variation (CV) | 0.00091222818 |
Kurtosis | -0.61849875 |
Mean | 126.97993 |
Median Absolute Deviation (MAD) | 0.09063 |
Skewness | -0.41629414 |
Sum | 253959.87 |
Variance | 0.013417672 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
127.03322 | 34 | 1.7% |
127.10529 | 34 | 1.7% |
126.83376 | 34 | 1.7% |
126.74964 | 30 | 1.5% |
127.06097 | 29 | 1.5% |
126.97982 | 27 | 1.4% |
127.055176 | 22 | 1.1% |
126.90667 | 21 | 1.1% |
127.12796 | 21 | 1.1% |
126.90562 | 20 | 1.0% |
Other values (966) | 1728 |
Value | Count | Frequency (%) |
126.70944 | 2 | 0.1% |
126.70946 | 2 | 0.1% |
126.70947 | 3 | 0.1% |
126.70949 | 14 | |
126.7095 | 5 | 0.2% |
126.70952 | 6 | |
126.70954 | 1 | 0.1% |
126.724655 | 1 | 0.1% |
126.72468 | 1 | 0.1% |
126.72469 | 3 | 0.1% |
Value | Count | Frequency (%) |
127.16431 | 1 | |
127.16403 | 1 | |
127.16375 | 1 | |
127.163475 | 1 | |
127.1632 | 1 | |
127.162926 | 1 | |
127.16263 | 1 | |
127.16235 | 1 | |
127.16208 | 1 | |
127.1618 | 1 |
fine_dust
Real number (ℝ)
ZEROS
 
Distinct | 19 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 18.954154 |
Minimum | 0 |
---|---|
Maximum | 45.64286 |
Zeros | 165 |
Zeros (%) | 8.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.7 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 13.555804 |
median | 18.136162 |
Q3 | 24.716518 |
95-th percentile | 28.901787 |
Maximum | 45.64286 |
Range | 45.64286 |
Interquartile range (IQR) | 11.160714 |
Descriptive statistics
Standard deviation | 8.6012533 |
---|---|
Coefficient of variation (CV) | 0.45379253 |
Kurtosis | 0.92407414 |
Mean | 18.954154 |
Median Absolute Deviation (MAD) | 5.185268 |
Skewness | -0.12731212 |
Sum | 37908.307 |
Variance | 73.981558 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
27.506697 | 245 | |
16.345983 | 231 | |
13.555804 | 182 | |
17.741072 | 165 | |
20.53125 | 165 | |
0.0 | 165 | |
23.32143 | 123 | 6.2% |
26.111609 | 119 | 5.9% |
10.765625 | 100 | 5.0% |
19.136162 | 99 | 5.0% |
Other values (9) | 406 |
Value | Count | Frequency (%) |
0.0 | 165 | |
10.765625 | 100 | |
12.160715 | 66 | 3.3% |
13.555804 | 182 | |
14.950893 | 67 | 3.4% |
16.345983 | 231 | |
17.741072 | 165 | |
18.136162 | 33 | 1.7% |
19.136162 | 99 | |
20.53125 | 165 |
Value | Count | Frequency (%) |
45.64286 | 14 | 0.7% |
44.24777 | 19 | 0.9% |
34.482143 | 66 | 3.3% |
28.901787 | 33 | 1.7% |
27.506697 | 245 | |
26.111609 | 119 | |
24.716518 | 66 | 3.3% |
23.32143 | 123 | |
21.92634 | 42 | 2.1% |
20.53125 | 165 |
register_at
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.8 KiB |
2020-09-10 22:00 |
---|
Length
Max length | 16 |
---|---|
Median length | 16 |
Mean length | 16 |
Min length | 16 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-09-10 22:00 |
---|---|
2nd row | 2020-09-10 22:00 |
3rd row | 2020-09-10 22:00 |
4th row | 2020-09-10 22:00 |
5th row | 2020-09-10 22:00 |
Common Values
Value | Count | Frequency (%) |
2020-09-10 22:00 | 2000 |
Value | Count | Frequency (%) |
2020-09-10 | 2000 | |
22:00 | 2000 |
 | fine_dust_value_id | taxi_id | latitude | longitude | fine_dust |
---|---|---|---|---|---|
fine_dust_value_id | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
taxi_id | 0.000 | 1.000 | 1.000 | 0.999 | 0.999 |
latitude | 0.000 | 1.000 | 1.000 | 0.771 | 0.543 |
longitude | 0.000 | 0.999 | 0.771 | 1.000 | 0.585 |
fine_dust | 0.000 | 0.999 | 0.543 | 0.585 | 1.000 |
 | fine_dust_value_id | latitude | longitude | fine_dust |
---|---|---|---|---|
fine_dust_value_id | 1.000 | 0.013 | -0.006 | 0.009 |
latitude | 0.013 | 1.000 | -0.294 | 0.171 |
longitude | -0.006 | -0.294 | 1.000 | -0.093 |
fine_dust | 0.009 | 0.171 | -0.093 | 1.000 |
 | fine_dust_value_id | taxi_id | latitude | longitude | fine_dust | register_at |
---|---|---|---|---|---|---|
0 | 1 | T_96289981 | 37.610664 | 126.724655 | 17.741072 | 2020-09-10 22:00 |
1 | 2 | T_73322493 | 37.621788 | 127.08755 | 24.716518 | 2020-09-10 22:00 |
2 | 3 | T_47791477 | 37.502796 | 127.04183 | 23.32143 | 2020-09-10 22:00 |
3 | 4 | T_97388636 | 37.527405 | 126.90563 | 34.482143 | 2020-09-10 22:00 |
4 | 5 | T_73102763 | 37.55852 | 126.859764 | 10.765625 | 2020-09-10 22:00 |
5 | 6 | T_74347905 | 37.309208 | 127.1311 | 20.53125 | 2020-09-10 22:00 |
6 | 7 | T_74128174 | 37.483276 | 127.10531 | 0.0 | 2020-09-10 22:00 |
7 | 8 | T_23725334 | 37.522526 | 126.925735 | 20.53125 | 2020-09-10 22:00 |
8 | 9 | T_47059040 | 37.512455 | 126.88669 | 27.506697 | 2020-09-10 22:00 |
9 | 10 | T_72663300 | 37.503887 | 126.947334 | 0.0 | 2020-09-10 22:00 |
 | fine_dust_value_id | taxi_id | latitude | longitude | fine_dust | register_at |
---|---|---|---|---|---|---|
1990 | 1991 | T_96875931 | 37.50764 | 127.033905 | 21.92634 | 2020-09-10 22:00 |
1991 | 1992 | T_48743645 | 37.523792 | 126.881676 | 20.53125 | 2020-09-10 22:00 |
1992 | 1993 | T_98047829 | 37.573574 | 126.97982 | 12.160715 | 2020-09-10 22:00 |
1993 | 1994 | T_48084452 | 37.53414 | 126.90667 | 19.136162 | 2020-09-10 22:00 |
1994 | 1995 | T_98267560 | 37.51667 | 126.93942 | 14.950893 | 2020-09-10 22:00 |
1995 | 1996 | T_48084452 | 37.53414 | 126.90667 | 19.136162 | 2020-09-10 22:00 |
1996 | 1997 | T_74494392 | 37.569824 | 126.81937 | 16.345983 | 2020-09-10 22:00 |
1997 | 1998 | T_23798578 | 37.525528 | 126.834915 | 13.555804 | 2020-09-10 22:00 |
1998 | 1999 | T_97608367 | 37.5649 | 126.83376 | 26.111609 | 2020-09-10 22:00 |
1999 | 2000 | T_73468981 | 37.540596 | 126.97334 | 10.765625 | 2020-09-10 22:00 |