Overview

Dataset statistics

Number of variables: 5
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.2 KiB
Average record size in memory: 43.3 B

Variable types

Categorical: 3
Numeric: 2

Alerts

base_quarter is highly overall correlated with place_nm (High correlation)
place_nm is highly overall correlated with base_quarter (High correlation)
card_utiliiza_price is highly overall correlated with card_utiliiza_cas_co (High correlation)
card_utiliiza_cas_co is highly overall correlated with card_utiliiza_price (High correlation)
base_quarter is highly imbalanced (80.6%) (Imbalance)
card_utiliiza_price has unique values (Unique)
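
The 80.6% in the imbalance alert can be reproduced from the two class counts reported for base_quarter (97 and 3). The sketch below assumes the score is one minus the Shannon entropy of the class distribution, normalized by the maximum entropy for that number of classes. That is our reading of how the profiler defines it, not a quotation of its source, and the function name `imbalance_score` is illustrative rather than part of any library.

```python
import math

def imbalance_score(counts):
    """Return 1 - H(p) / log2(k): 0 for a uniform split, close to 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is not scored as imbalanced
    # Shannon entropy in bits of the empirical class distribution
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

score = imbalance_score([97, 3])  # counts of the two base_quarter categories
print(f"{score:.1%}")  # prints 80.6%
```

Under this definition a 50/50 split scores 0, so the 97/3 split of base_quarter sits near the imbalanced end of the scale.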

Reproduction

Analysis started: 2023-12-10 10:01:23.478302
Analysis finished: 2023-12-10 10:01:25.747591
Duration: 2.27 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
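
A report of this kind is typically produced by passing a pandas DataFrame to `ProfileReport`. The sketch below is a minimal example under stated assumptions: the source file name, the report title, and the helper name `build_profile` are hypothetical, since the report records neither the input file nor the code that generated it.

```python
def build_profile(csv_path, html_path="report.html"):
    """Profile a CSV file and write the HTML report (hypothetical helper)."""
    # Imported lazily so this module still loads where the libraries are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)  # assumption: the data is available as a CSV file
    profile = ProfileReport(df, title="Card usage profile")  # the title is illustrative
    profile.to_file(html_path)  # writes the HTML report to disk
    return profile
```

Calling `build_profile("card_usage.csv")` with a file holding the five columns listed under Variables should yield a report with the same sections, although timings and memory figures will differ between runs.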

Variables

place_nm
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

광안리해수욕장: 16
국립해양박물관: 16
다대포 해수욕장(굼의 낙조분수, 몰운대): 16
동백섬&누리마루APEC하우스: 16
감천문화마을: 15
Other values (3): 21

Length

Max length: 22
Median length: 20
Mean length: 12.27
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: BIFF광장&용두산공원&보수동책방골목
2nd row: 흰여울문화마을
3rd row: BIFF광장&용두산공원&보수동책방골목
4th row: BIFF광장&용두산공원&보수동책방골목
5th row: BIFF광장&용두산공원&보수동책방골목

Common Values

Value  Count  Frequency (%)
광안리해수욕장  16  16.0%
국립해양박물관  16  16.0%
다대포 해수욕장(굼의 낙조분수, 몰운대)  16  16.0%
동백섬&누리마루APEC하우스  16  16.0%
감천문화마을  15  15.0%
BIFF광장&용두산공원&보수동책방골목  14  14.0%
렛츠런파크  4  4.0%
흰여울문화마을  3  3.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
광안리해수욕장  16  10.8%
국립해양박물관  16  10.8%
다대포  16  10.8%
해수욕장(굼의  16  10.8%
낙조분수  16  10.8%
몰운대  16  10.8%
동백섬&누리마루apec하우스  16  10.8%
감천문화마을  15  10.1%
biff광장&용두산공원&보수동책방골목  14  9.5%
렛츠런파크  4  2.7%
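
The percentages in this word table differ from those under Common Values because they are computed over words rather than rows. The sketch below reproduces them from the category counts listed under Common Values, assuming each value is lowercased, split on whitespace, and stripped of leading and trailing punctuation. The tokenization is inferred from the table itself (for example `낙조분수` appears without its trailing comma), and `word_frequencies` is an illustrative name.

```python
import string
from collections import Counter

# Category counts copied from the Common Values table for place_nm.
place_counts = {
    "광안리해수욕장": 16,
    "국립해양박물관": 16,
    "다대포 해수욕장(굼의 낙조분수, 몰운대)": 16,
    "동백섬&누리마루APEC하우스": 16,
    "감천문화마을": 15,
    "BIFF광장&용두산공원&보수동책방골목": 14,
    "렛츠런파크": 4,
    "흰여울문화마을": 3,
}

def word_frequencies(value_counts):
    """Count words across categories, weighting each word by its category count."""
    words = Counter()
    for value, count in value_counts.items():
        for token in value.lower().split():
            token = token.strip(string.punctuation)  # trims edges only, keeps inner '(' and '&'
            if token:
                words[token] += count
    return words

words = word_frequencies(place_counts)
total = sum(words.values())  # 148 words in total across the 100 rows
for word, count in words.most_common(3):
    print(word, count, f"{100 * count / total:.1f}%")
```

The four-word category contributes 64 of the 148 words, which is why each 16-row entry shows 10.8% here instead of 16.0%.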

ctprvn_nm
Categorical

Distinct: 16
Distinct (%): 16.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

경남
제주
서울
울산
대구
Other values (11): 65

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경남
2nd row: 제주
3rd row: 서울
4th row: 울산
5th row: 경북

Common Values

Value  Count  Frequency (%)
경남  7  7.0%
제주  7  7.0%
서울  7  7.0%
울산  7  7.0%
대구  7  7.0%
충북  7  7.0%
경북  6  6.0%
인천  6  6.0%
충남  6  6.0%
전남  6  6.0%
Other values (6)  34  34.0%

Length

Histogram of lengths of the category

card_utiliiza_price
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.0887412 × 10^8
Minimum: 30345
Maximum: 1.4993484 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 30345
5-th percentile: 270048.75
Q1: 4209924.5
Median: 20806614
Q3: 80847803
95-th percentile: 4.5016052 × 10^8
Maximum: 1.4993484 × 10^9
Range: 1.4993181 × 10^9
Interquartile range (IQR): 76637879

Descriptive statistics

Standard deviation: 2.5557042 × 10^8
Coefficient of variation (CV): 2.3473937
Kurtosis: 18.113014
Mean: 1.0887412 × 10^8
Median Absolute Deviation (MAD): 19758424
Skewness: 4.096421
Sum: 1.0887412 × 10^10
Variance: 6.5316238 × 10^16
Monotonicity: Not monotonic
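
Several of these statistics are derived from others, which gives a quick consistency check on the exponents: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, the sum is the mean times the number of observations, and the range is the maximum minus the minimum. The sketch below recomputes them from the rounded values printed in the report, so agreement is only expected to about seven significant digits. The variable names are illustrative.

```python
import math

# Rounded values as printed in the report for card_utiliiza_price.
n = 100
mean = 1.0887412e8
std = 2.5557042e8
minimum, maximum = 30345, 1499348403

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance as the squared standard deviation
total = mean * n                 # sum over all observations
value_range = maximum - minimum  # range

# Compare against the reported figures within the rounding tolerance.
print(math.isclose(cv, 2.3473937, rel_tol=1e-6))
print(math.isclose(variance, 6.5316238e16, rel_tol=1e-6))
print(math.isclose(total, 1.0887412e10, rel_tol=1e-6))
print(math.isclose(value_range, 1.4993181e9, rel_tol=1e-6))
```

A CV above 2 together with a skewness of about 4.1 indicates a strongly right-skewed variable, consistent with the mean (about 1.09 × 10^8) lying far above the median (about 2.08 × 10^7).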
Histogram with fixed size bins (bins=50)

Common Values

Value  Count  Frequency (%)
638753571  1  1.0%
201732369  1  1.0%
5129573  1  1.0%
5694552  1  1.0%
6636721  1  1.0%
6818232  1  1.0%
13806355  1  1.0%
21447740  1  1.0%
25548001  1  1.0%
45944752  1  1.0%
Other values (90)  90  90.0%

Minimum 10 values

Value  Count  Frequency (%)
30345  1  1.0%
109500  1  1.0%
122400  1  1.0%
214200  1  1.0%
216825  1  1.0%
272850  1  1.0%
410475  1  1.0%
493725  1  1.0%
683550  1  1.0%
742875  1  1.0%

Maximum 10 values

Value  Count  Frequency (%)
1499348403  1  1.0%
1408721424  1  1.0%
1250203201  1  1.0%
638753571  1  1.0%
459732499  1  1.0%
449656736  1  1.0%
443864947  1  1.0%
392488808  1  1.0%
339875949  1  1.0%
326921210  1  1.0%

card_utiliiza_cas_co
Real number (ℝ)

HIGH CORRELATION 

Distinct: 94
Distinct (%): 94.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4354.48
Minimum: 5
Maximum: 59552
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 5
5-th percentile: 25
Q1: 292.5
Median: 892.5
Q3: 3825.5
95-th percentile: 17764.8
Maximum: 59552
Range: 59547
Interquartile range (IQR): 3533

Descriptive statistics

Standard deviation: 9878.8669
Coefficient of variation (CV): 2.2686674
Kurtosis: 18.62289
Mean: 4354.48
Median Absolute Deviation (MAD): 787.5
Skewness: 4.1355842
Sum: 435448
Variance: 97592011
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common Values

Value  Count  Frequency (%)
75  2  2.0%
85  2  2.0%
25  2  2.0%
10  2  2.0%
265  2  2.0%
295  2  2.0%
1660  1  1.0%
320  1  1.0%
330  1  1.0%
475  1  1.0%
Other values (84)  84  84.0%

Minimum 10 values

Value  Count  Frequency (%)
5  1  1.0%
10  2  2.0%
20  1  1.0%
25  2  2.0%
35  1  1.0%
45  1  1.0%
60  1  1.0%
75  2  2.0%
80  1  1.0%
85  2  2.0%

Maximum 10 values

Value  Count  Frequency (%)
59552  1  1.0%
56155  1  1.0%
42957  1  1.0%
30244  1  1.0%
19243  1  1.0%
17687  1  1.0%
13759  1  1.0%
13629  1  1.0%
12149  1  1.0%
11883  1  1.0%

base_quarter
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

2020년 1분기: 97
2021년 2분기: 3

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020년 1분기
2nd row: 2021년 2분기
3rd row: 2020년 1분기
4th row: 2020년 1분기
5th row: 2020년 1분기

Common Values

Value  Count  Frequency (%)
2020년 1분기  97  97.0%
2021년 2분기  3  3.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
2020년  97  48.5%
1분기  97  48.5%
2021년  3  1.5%
2분기  3  1.5%

Interactions


Correlations

                      place_nm  ctprvn_nm  card_utiliiza_price  card_utiliiza_cas_co  base_quarter
place_nm                 1.000      0.000                0.185                 0.223         1.000
ctprvn_nm                0.000      1.000                0.000                 0.000         0.000
card_utiliiza_price      0.185      0.000                1.000                 0.983         0.000
card_utiliiza_cas_co     0.223      0.000                0.983                 1.000         0.000
base_quarter             1.000      0.000                0.000                 0.000         1.000

              base_quarter  ctprvn_nm  place_nm
base_quarter         1.000      0.000     0.969
ctprvn_nm            0.000      1.000     0.000
place_nm             0.969      0.000     1.000

                      card_utiliiza_price  card_utiliiza_cas_co  place_nm  ctprvn_nm  base_quarter
card_utiliiza_price                 1.000                 0.968     0.100      0.000         0.000
card_utiliiza_cas_co                0.968                 1.000     0.117      0.000         0.000
place_nm                            0.100                 0.117     1.000      0.000         0.969
ctprvn_nm                           0.000                 0.000     0.000      1.000         0.000
base_quarter                        0.000                 0.000     0.969      0.000         1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

place_nmctprvn_nmcard_utiliiza_pricecard_utiliiza_cas_cobase_quarter
0BIFF광장&용두산공원&보수동책방골목경남638753571302442020년 1분기
1흰여울문화마을제주805180752021년 2분기
2BIFF광장&용두산공원&보수동책방골목서울326921210192432020년 1분기
3BIFF광장&용두산공원&보수동책방골목울산15862700076582020년 1분기
4BIFF광장&용두산공원&보수동책방골목경북11987519961262020년 1분기
5BIFF광장&용두산공원&보수동책방골목인천8485113252312020년 1분기
6BIFF광장&용두산공원&보수동책방골목대구7990185446992020년 1분기
7흰여울문화마을충남32598362852021년 2분기
8BIFF광장&용두산공원&보수동책방골목충북4882958026362020년 1분기
9BIFF광장&용두산공원&보수동책방골목전남5006686724812020년 1분기
place_nmctprvn_nmcard_utiliiza_pricecard_utiliiza_cas_cobase_quarter
90동백섬&누리마루APEC하우스충남392978186912020년 1분기
91동백섬&누리마루APEC하우스강원174655526252020년 1분기
92동백섬&누리마루APEC하우스전북211020295502020년 1분기
93동백섬&누리마루APEC하우스전남151273844902020년 1분기
94동백섬&누리마루APEC하우스세종64522012402020년 1분기
95동백섬&누리마루APEC하우스제주83542822052020년 1분기
96렛츠런파크경남2051120024912020년 1분기
97렛츠런파크울산28239903502020년 1분기
98렛츠런파크대구17389202412020년 1분기
99렛츠런파크서울18836402002020년 1분기