Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 3.5 KiB |
Average record size in memory | 36.3 B |
Variable types
Categorical | 3 |
---|---|
Numeric | 1 |
Dataset
Description | Sample |
---|---|
Author | 부산정보산업진흥원 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=9fccd1cb-5ed3-4c27-b534-518e139ab806 |
avg_sales_pc is highly overall correlated with gu_dc | High correlation |
gu_dc is highly overall correlated with avg_sales_pc | High correlation |
Reproduction
Analysis started | 2023-12-10 09:58:55.109740 |
---|---|
Analysis finished | 2023-12-10 09:58:56.041475 |
Duration | 0.93 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
base_year
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2014 | |
---|---|
2015 | |
2018 | 3 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2014 |
---|---|
2nd row | 2018 |
3rd row | 2014 |
4th row | 2014 |
5th row | 2014 |
Common Values
Value | Count | Frequency (%) |
2014 | 61 | |
2015 | 36 | |
2018 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2014 | 61 | |
2015 | 36 | |
2018 | 3 | 3.0% |
base_month
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
6 | |
---|---|
3 | |
9 | |
12 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.19 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 3 |
---|---|
2nd row | 12 |
3rd row | 3 |
4th row | 3 |
5th row | 3 |
Common Values
Value | Count | Frequency (%) |
6 | 31 | |
3 | 30 | |
9 | 20 | |
12 | 19 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
6 | 31 | |
3 | 30 | |
9 | 20 | |
12 | 19 |
gu_dc
Categorical
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 16.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
강서구 | |
---|---|
영도구 | |
기장군 | |
남구 | |
중구 | |
Other values (11) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 2.81 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 강서구 |
---|---|
2nd row | 영도구 |
3rd row | 기장군 |
4th row | 남구 |
5th row | 동구 |
Common Values
Value | Count | Frequency (%) |
강서구 | 7 | 7.0% |
영도구 | 7 | 7.0% |
기장군 | 7 | 7.0% |
남구 | 7 | 7.0% |
중구 | 7 | 7.0% |
동구 | 6 | 6.0% |
동래구 | 6 | 6.0% |
부산진구 | 6 | 6.0% |
사상구 | 6 | 6.0% |
사하구 | 6 | 6.0% |
Other values (6) | 35 |
Length
Value | Count | Frequency (%) |
강서구 | 7 | 7.0% |
영도구 | 7 | 7.0% |
기장군 | 7 | 7.0% |
남구 | 7 | 7.0% |
중구 | 7 | 7.0% |
동구 | 6 | 6.0% |
동래구 | 6 | 6.0% |
부산진구 | 6 | 6.0% |
사상구 | 6 | 6.0% |
사하구 | 6 | 6.0% |
Other values (6) | 35 |
avg_sales_pc
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 10 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 21500 |
Minimum | 14000 |
---|---|
Maximum | 34000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 14000 |
---|---|
5-th percentile | 16000 |
Q1 | 19500 |
median | 22000 |
Q3 | 24000 |
95-th percentile | 28000 |
Maximum | 34000 |
Range | 20000 |
Interquartile range (IQR) | 4500 |
Descriptive statistics
Standard deviation | 3633.4584 |
---|---|
Coefficient of variation (CV) | 0.16899807 |
Kurtosis | 0.76665013 |
Mean | 21500 |
Median Absolute Deviation (MAD) | 2000 |
Skewness | 0.53567382 |
Sum | 2150000 |
Variance | 13202020 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
22000 | 31 | |
20000 | 16 | |
24000 | 16 | |
18000 | 13 | |
16000 | 11 | 11.0% |
28000 | 6 | 6.0% |
26000 | 3 | 3.0% |
30000 | 2 | 2.0% |
14000 | 1 | 1.0% |
34000 | 1 | 1.0% |
Value | Count | Frequency (%) |
14000 | 1 | 1.0% |
16000 | 11 | 11.0% |
18000 | 13 | |
20000 | 16 | |
22000 | 31 | |
24000 | 16 | |
26000 | 3 | 3.0% |
28000 | 6 | 6.0% |
30000 | 2 | 2.0% |
34000 | 1 | 1.0% |
Value | Count | Frequency (%) |
34000 | 1 | 1.0% |
30000 | 2 | 2.0% |
28000 | 6 | 6.0% |
26000 | 3 | 3.0% |
24000 | 16 | |
22000 | 31 | |
20000 | 16 | |
18000 | 13 | |
16000 | 11 | 11.0% |
14000 | 1 | 1.0% |
base_year | base_month | gu_dc | avg_sales_pc | |
---|---|---|---|---|
base_year | 1.000 | 0.365 | 0.000 | 0.824 |
base_month | 0.365 | 1.000 | 0.000 | 0.000 |
gu_dc | 0.000 | 0.000 | 1.000 | 0.917 |
avg_sales_pc | 0.824 | 0.000 | 0.917 | 1.000 |
base_month | base_year | gu_dc | |
---|---|---|---|
base_month | 1.000 | 0.352 | 0.000 |
base_year | 0.352 | 1.000 | 0.000 |
gu_dc | 0.000 | 0.000 | 1.000 |
avg_sales_pc | base_year | base_month | gu_dc | |
---|---|---|---|---|
avg_sales_pc | 1.000 | 0.460 | 0.000 | 0.673 |
base_year | 0.460 | 1.000 | 0.352 | 0.000 |
base_month | 0.000 | 0.352 | 1.000 | 0.000 |
gu_dc | 0.673 | 0.000 | 0.000 | 1.000 |
base_year | base_month | gu_dc | avg_sales_pc | |
---|---|---|---|---|
0 | 2014 | 3 | 강서구 | 22000 |
1 | 2018 | 12 | 영도구 | 16000 |
2 | 2014 | 3 | 기장군 | 20000 |
3 | 2014 | 3 | 남구 | 22000 |
4 | 2014 | 3 | 동구 | 20000 |
5 | 2014 | 3 | 동래구 | 28000 |
6 | 2014 | 3 | 부산진구 | 22000 |
7 | 2018 | 12 | 중구 | 14000 |
8 | 2014 | 3 | 사상구 | 18000 |
9 | 2014 | 3 | 사하구 | 16000 |
base_year | base_month | gu_dc | avg_sales_pc | |
---|---|---|---|---|
90 | 2015 | 6 | 서구 | 24000 |
91 | 2015 | 6 | 수영구 | 30000 |
92 | 2015 | 6 | 연제구 | 24000 |
93 | 2015 | 6 | 영도구 | 16000 |
94 | 2015 | 6 | 중구 | 22000 |
95 | 2015 | 6 | 해운대구 | 24000 |
96 | 2015 | 9 | 강서구 | 24000 |
97 | 2015 | 9 | 금정구 | 24000 |
98 | 2015 | 9 | 기장군 | 20000 |
99 | 2015 | 9 | 남구 | 22000 |