Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 199 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 11.4 KiB |
Average record size in memory | 58.7 B |
Variable types
DateTime | 1 |
---|---|
Text | 1 |
Categorical | 3 |
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | 두잉랩 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=DLADATA202005 |
2020-01-01 has constant value "" | Constant |
1 is highly overall correlated with 12.121212 | High correlation |
12.121212 is highly overall correlated with 1 | High correlation |
[40-49] is highly overall correlated with [25-27] | High correlation |
[25-27] is highly overall correlated with [40-49] | High correlation |
4 is highly imbalanced (66.6%) | Imbalance |
Reproduction
Analysis started | 2023-12-10 06:16:08.866593 |
---|---|
Analysis finished | 2023-12-10 06:16:10.184779 |
Duration | 1.32 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
2020-01-01
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Minimum | 2020-01-01 00:00:00 |
---|---|
Maximum | 2020-01-01 00:00:00 |
배추김치
Text
Distinct | 146 |
---|---|
Distinct (%) | 73.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Value | Count | Frequency (%) |
쌀밥 | 10 | 4.5% |
배추김치 | 9 | 4.1% |
떡국 | 5 | 2.3% |
쌈장 | 4 | 1.8% |
서울우유 | 4 | 1.8% |
아메리카노 | 3 | 1.4% |
떡만둣국 | 3 | 1.4% |
우유 | 3 | 1.4% |
바나나 | 3 | 1.4% |
뚝배기불고기 | 3 | 1.4% |
Other values (152) | 175 |
Most occurring characters
Value | Count | Frequency (%) |
222 | 20.6% | |
치 | 30 | 2.8% |
기 | 21 | 1.9% |
이 | 21 | 1.9% |
김 | 20 | 1.9% |
고 | 16 | 1.5% |
국 | 15 | 1.4% |
나 | 15 | 1.4% |
밥 | 14 | 1.3% |
배 | 14 | 1.3% |
Other values (234) | 692 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 824 | |
Space Separator | 222 | 20.6% |
Open Punctuation | 14 | 1.3% |
Close Punctuation | 11 | 1.0% |
Decimal Number | 6 | 0.6% |
Lowercase Letter | 2 | 0.2% |
Other Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
치 | 30 | 3.6% |
기 | 21 | 2.5% |
이 | 21 | 2.5% |
김 | 20 | 2.4% |
고 | 16 | 1.9% |
국 | 15 | 1.8% |
나 | 15 | 1.8% |
밥 | 14 | 1.7% |
배 | 14 | 1.7% |
추 | 14 | 1.7% |
Other values (223) | 644 |
Decimal Number
Value | Count | Frequency (%) |
0 | 2 | |
1 | 1 | |
5 | 1 | |
7 | 1 | |
2 | 1 |
Lowercase Letter
Value | Count | Frequency (%) |
l | 1 | |
m | 1 |
Space Separator
Value | Count | Frequency (%) |
222 |
Open Punctuation
Value | Count | Frequency (%) |
( | 14 |
Close Punctuation
Value | Count | Frequency (%) |
) | 11 |
Other Punctuation
Value | Count | Frequency (%) |
% | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 824 | |
Common | 254 | 23.5% |
Latin | 2 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
치 | 30 | 3.6% |
기 | 21 | 2.5% |
이 | 21 | 2.5% |
김 | 20 | 2.4% |
고 | 16 | 1.9% |
국 | 15 | 1.8% |
나 | 15 | 1.8% |
밥 | 14 | 1.7% |
배 | 14 | 1.7% |
추 | 14 | 1.7% |
Other values (223) | 644 |
Common
Value | Count | Frequency (%) |
222 | ||
( | 14 | 5.5% |
) | 11 | 4.3% |
0 | 2 | 0.8% |
1 | 1 | 0.4% |
% | 1 | 0.4% |
5 | 1 | 0.4% |
7 | 1 | 0.4% |
2 | 1 | 0.4% |
Latin
Value | Count | Frequency (%) |
l | 1 | |
m | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 824 | |
ASCII | 256 | 23.7% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
222 | ||
( | 14 | 5.5% |
) | 11 | 4.3% |
0 | 2 | 0.8% |
1 | 1 | 0.4% |
% | 1 | 0.4% |
l | 1 | 0.4% |
m | 1 | 0.4% |
5 | 1 | 0.4% |
7 | 1 | 0.4% |
Hangul
Value | Count | Frequency (%) |
치 | 30 | 3.6% |
기 | 21 | 2.5% |
이 | 21 | 2.5% |
김 | 20 | 2.4% |
고 | 16 | 1.9% |
국 | 15 | 1.8% |
나 | 15 | 1.8% |
밥 | 14 | 1.7% |
배 | 14 | 1.7% |
추 | 14 | 1.7% |
Other values (223) | 644 |
4
Categorical
IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
1 | |
---|---|
2 | |
3 | 6 |
4 | 2 |
삶은것) | 2 |
Length
Max length | 5 |
---|---|
Median length | 2 |
Mean length | 2.040201 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.5% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 168 | |
2 | 20 | 10.1% |
3 | 6 | 3.0% |
4 | 2 | 1.0% |
삶은것) | 2 | 1.0% |
튀김) | 1 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 168 | |
2 | 20 | 10.1% |
3 | 6 | 3.0% |
4 | 2 | 1.0% |
삶은것 | 2 | 1.0% |
튀김 | 1 | 0.5% |
1
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 30 |
---|---|
Distinct (%) | 15.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9.4422111 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 4 |
median | 8 |
Q3 | 13.5 |
95-th percentile | 24 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 9.5 |
Descriptive statistics
Standard deviation | 7.0757394 |
---|---|
Coefficient of variation (CV) | 0.74937315 |
Kurtosis | 0.041637885 |
Mean | 9.4422111 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 0.89156146 |
Sum | 1879 |
Variance | 50.066088 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 17 | 8.5% |
2 | 15 | 7.5% |
4 | 15 | 7.5% |
3 | 14 | 7.0% |
5 | 14 | 7.0% |
6 | 11 | 5.5% |
7 | 11 | 5.5% |
8 | 10 | 5.0% |
10 | 10 | 5.0% |
9 | 9 | 4.5% |
Other values (20) | 73 |
Value | Count | Frequency (%) |
1 | 17 | |
2 | 15 | |
3 | 14 | |
4 | 15 | |
5 | 14 | |
6 | 11 | |
7 | 11 | |
8 | 10 | |
9 | 9 | |
10 | 10 |
Value | Count | Frequency (%) |
30 | 1 | 0.5% |
29 | 1 | 0.5% |
28 | 1 | 0.5% |
27 | 2 | |
26 | 2 | |
25 | 2 | |
24 | 2 | |
23 | 2 | |
22 | 3 | |
21 | 3 |
12.121212
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 12.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.2569814 |
Minimum | 2.857143 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 2.857143 |
---|---|
5-th percentile | 2.857143 |
Q1 | 3.030303 |
median | 5.714286 |
Q3 | 8.333333 |
95-th percentile | 20 |
Maximum | 30 |
Range | 27.142857 |
Interquartile range (IQR) | 5.30303 |
Descriptive statistics
Standard deviation | 5.5301561 |
---|---|
Coefficient of variation (CV) | 0.76204634 |
Kurtosis | 3.2651564 |
Mean | 7.2569814 |
Median Absolute Deviation (MAD) | 2.683983 |
Skewness | 1.8911826 |
Sum | 1444.1393 |
Variance | 30.582627 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2.857143 | 40 | |
3.030303 | 23 | |
5.0 | 16 | 8.0% |
4.761905 | 16 | 8.0% |
10.0 | 16 | 8.0% |
8.333333 | 14 | 7.0% |
20.0 | 11 | 5.5% |
7.142857 | 11 | 5.5% |
5.882353 | 10 | 5.0% |
6.25 | 10 | 5.0% |
Other values (15) | 32 |
Value | Count | Frequency (%) |
2.857143 | 40 | |
3.0 | 1 | 0.5% |
3.030303 | 23 | |
4.761905 | 16 | 8.0% |
5.0 | 16 | 8.0% |
5.714286 | 6 | 3.0% |
5.882353 | 10 | 5.0% |
6.060606 | 3 | 1.5% |
6.25 | 10 | 5.0% |
7.142857 | 11 | 5.5% |
Value | Count | Frequency (%) |
30.0 | 1 | 0.5% |
25.0 | 4 | 2.0% |
23.529412 | 1 | 0.5% |
23.0 | 1 | 0.5% |
20.0 | 11 | |
16.666667 | 2 | 1.0% |
14.285714 | 2 | 1.0% |
12.5 | 3 | 1.5% |
11.764706 | 1 | 0.5% |
11.428571 | 1 | 0.5% |
[40-49]
Categorical
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
[40-49] | |
---|---|
[0-19] | |
[50-59] | |
[60-99] | |
[20-29] | 5 |
Other values (2) | 3 |
Length
Max length | 10 |
---|---|
Median length | 8 |
Mean length | 7.6582915 |
Min length | 7 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.5% |
Sample
1st row | [40-49] |
---|---|
2nd row | [40-49] |
3rd row | [40-49] |
4th row | [40-49] |
5th row | [40-49] |
Common Values
Value | Count | Frequency (%) |
[40-49] | 98 | |
[0-19] | 72 | |
[50-59] | 11 | 5.5% |
[60-99] | 10 | 5.0% |
[20-29] | 5 | 2.5% |
2.857143 | 2 | 1.0% |
10.000000 | 1 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
40-49 | 98 | |
0-19 | 72 | |
50-59 | 11 | 5.5% |
60-99 | 10 | 5.0% |
20-29 | 5 | 2.5% |
2.857143 | 2 | 1.0% |
10.000000 | 1 | 0.5% |
[25-27]
Categorical
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
[25-27] | |
---|---|
[27-29] | |
[29-31] | |
[31-33] | |
[35-100] | |
Other values (3) |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 8.0653266 |
Min length | 7 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.5% |
Sample
1st row | [25-27] |
---|---|
2nd row | [25-27] |
3rd row | [25-27] |
4th row | [25-27] |
5th row | [25-27] |
Common Values
Value | Count | Frequency (%) |
[25-27] | 59 | |
[27-29] | 53 | |
[29-31] | 39 | |
[31-33] | 25 | |
[35-100] | 15 | 7.5% |
[33-35] | 5 | 2.5% |
[0-19] | 2 | 1.0% |
[50-59] | 1 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
25-27 | 59 | |
27-29 | 53 | |
29-31 | 39 | |
31-33 | 25 | |
35-100 | 15 | 7.5% |
33-35 | 5 | 2.5% |
0-19 | 2 | 1.0% |
50-59 | 1 | 0.5% |
4 | 1 | 12.121212 | [40-49] | [25-27] | |
---|---|---|---|---|---|
4 | 1.000 | 0.314 | 0.703 | 0.674 | 0.710 |
1 | 0.314 | 1.000 | 0.556 | 0.000 | 0.000 |
12.121212 | 0.703 | 0.556 | 1.000 | 0.656 | 0.739 |
[40-49] | 0.674 | 0.000 | 0.656 | 1.000 | 0.807 |
[25-27] | 0.710 | 0.000 | 0.739 | 0.807 | 1.000 |
4 | [40-49] | [25-27] | |
---|---|---|---|
4 | 1.000 | 0.482 | 0.494 |
[40-49] | 0.482 | 1.000 | 0.602 |
[25-27] | 0.494 | 0.602 | 1.000 |
1 | 12.121212 | 4 | [40-49] | [25-27] | |
---|---|---|---|---|---|
1 | 1.000 | -0.752 | 0.168 | 0.000 | 0.000 |
12.121212 | -0.752 | 1.000 | 0.463 | 0.404 | 0.469 |
4 | 0.168 | 0.463 | 1.000 | 0.482 | 0.494 |
[40-49] | 0.000 | 0.404 | 0.482 | 1.000 | 0.602 |
[25-27] | 0.000 | 0.469 | 0.494 | 0.602 | 1.000 |
2020-01-01 | 배추김치 | 4 | 1 | 12.121212 | [40-49] | [25-27] | |
---|---|---|---|---|---|---|---|
0 | 2020-01-01 | 아메리카노 | 2 | 2 | 6.060606 | [40-49] | [25-27] |
1 | 2020-01-01 | 떡국 | 2 | 3 | 6.060606 | [40-49] | [25-27] |
2 | 2020-01-01 | 깍두기 | 2 | 4 | 6.060606 | [40-49] | [25-27] |
3 | 2020-01-01 | 감자조림 | 1 | 5 | 3.030303 | [40-49] | [25-27] |
4 | 2020-01-01 | 토마토케첩 | 1 | 6 | 3.030303 | [40-49] | [25-27] |
5 | 2020-01-01 | 홈런볼 초코 | 1 | 7 | 3.030303 | [40-49] | [25-27] |
6 | 2020-01-01 | 락토핏 생유산균 골드 | 1 | 8 | 3.030303 | [40-49] | [25-27] |
7 | 2020-01-01 | 쌀밥 | 1 | 9 | 3.030303 | [40-49] | [25-27] |
8 | 2020-01-01 | 빈대떡 | 1 | 10 | 3.030303 | [40-49] | [25-27] |
9 | 2020-01-01 | 할라피뇨 | 1 | 11 | 3.030303 | [40-49] | [25-27] |
2020-01-01 | 배추김치 | 4 | 1 | 12.121212 | [40-49] | [25-27] | |
---|---|---|---|---|---|---|---|
189 | 2020-01-01 | 물김치 | 1 | 6 | 10.0 | [60-99] | [25-27] |
190 | 2020-01-01 | 물냉면 | 1 | 7 | 10.0 | [60-99] | [25-27] |
191 | 2020-01-01 | 치킨버거 | 1 | 8 | 10.0 | [60-99] | [25-27] |
192 | 2020-01-01 | 돼지고기보쌈(사태) | 1 | 9 | 10.0 | [60-99] | [25-27] |
193 | 2020-01-01 | 옥수수통조림(가당) | 1 | 10 | 10.0 | [60-99] | [25-27] |
194 | 2020-01-01 | 배추김치 | 2 | 1 | 16.666667 | [20-29] | [25-27] |
195 | 2020-01-01 | 코다리조림 | 1 | 2 | 8.333333 | [20-29] | [25-27] |
196 | 2020-01-01 | 청포도 | 1 | 3 | 8.333333 | [20-29] | [25-27] |
197 | 2020-01-01 | 쌀밥 | 1 | 4 | 8.333333 | [20-29] | [25-27] |
198 | 2020-01-01 | 오뚜기 미역국 라면 | 1 | 5 | 8.333333 | [20-29] | [25-27] |