Overview

Dataset statistics

Number of variables7
Number of observations214
Missing cells4
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.2 KiB
Average record size in memory58.6 B

Variable types

Categorical4
Numeric2
Text1

Dataset

Description인천광역시 소재 관내 피서지 물가정보(품목, 상표, 판매가격, 업소명 등)등의 정보에 대한 항목 정보를 알 수 있습니다.
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15053083&srcSe=7661IVAWM27C61E190

Alerts

판매가격 적용일 has constant value ""Constant
판매가격 is highly overall correlated with 이전가격 and 1 other fieldsHigh correlation
이전가격 is highly overall correlated with 판매가격 and 1 other fieldsHigh correlation
품목 is highly overall correlated with 판매가격 and 2 other fieldsHigh correlation
상표_규격 is highly overall correlated with 품목High correlation
업소명 has 4 (1.9%) missing valuesMissing

Reproduction

Analysis started2024-01-28 13:32:48.176266
Analysis finished2024-01-28 13:32:49.016046
Duration0.84 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

피서지
Categorical

Distinct10
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
왕산
26 
을왕
25 
십리포1
24 
선녀바위
23 
십리포2
23 
Other values (5)
93 

Length

Max length5
Median length4
Mean length3.3691589
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row을왕
2nd row을왕
3rd row을왕
4th row을왕
5th row을왕

Common Values

ValueCountFrequency (%)
왕산 26
12.1%
을왕 25
11.7%
십리포1 24
11.2%
선녀바위 23
10.7%
십리포2 23
10.7%
하나개 22
10.3%
동막2 21
9.8%
동막1 20
9.3%
함허동천1 16
7.5%
함허동천2 14
6.5%

Length

2024-01-28T22:32:49.076455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T22:32:49.193103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
왕산 26
12.1%
을왕 25
11.7%
십리포1 24
11.2%
선녀바위 23
10.7%
십리포2 23
10.7%
하나개 22
10.3%
동막2 21
9.8%
동막1 20
9.3%
함허동천1 16
7.5%
함허동천2 14
6.5%

품목
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
빙과류1
 
10
과자류1
 
10
과자류2
 
10
음료류1
 
10
음료류2
 
10
Other values (25)
164 

Length

Max length6
Median length5
Mean length3.4252336
Min length2

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row빙과류1
2nd row빙과류2
3rd row과자류1
4th row과자류2
5th row음료류1

Common Values

ValueCountFrequency (%)
빙과류1 10
 
4.7%
과자류1 10
 
4.7%
과자류2 10
 
4.7%
음료류1 10
 
4.7%
음료류2 10
 
4.7%
주류1 10
 
4.7%
주류2 10
 
4.7%
주류3 10
 
4.7%
주류4 10
 
4.7%
수건 10
 
4.7%
Other values (20) 114
53.3%

Length

2024-01-28T22:32:49.324836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
빙과류1 10
 
4.5%
과자류2 10
 
4.5%
음료류1 10
 
4.5%
음료류2 10
 
4.5%
주류1 10
 
4.5%
주류2 10
 
4.5%
주류3 10
 
4.5%
주류4 10
 
4.5%
수건 10
 
4.5%
빙과류2 10
 
4.5%
Other values (21) 120
54.5%

상표_규격
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
1실(4인기준)
19 
1인
17 
수건
 
10
꼬깔콘
 
10
새우깡
 
10
Other values (25)
148 

Length

Max length8
Median length6
Mean length4.546729
Min length2

Unique

Unique5 ?
Unique (%)2.3%

Sample

1st row월드콘
2nd row메로나
3rd row꼬깔콘
4th row새우깡
5th row사이다(캔)

Common Values

ValueCountFrequency (%)
1실(4인기준) 19
 
8.9%
1인 17
 
7.9%
수건 10
 
4.7%
꼬깔콘 10
 
4.7%
새우깡 10
 
4.7%
참이슬 10
 
4.7%
처음처럼 10
 
4.7%
하이트(캔) 10
 
4.7%
카스(캔) 10
 
4.7%
메로나 10
 
4.7%
Other values (20) 98
45.8%

Length

2024-01-28T22:32:49.445957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1실(4인기준 19
 
8.9%
1인 17
 
7.9%
수건 10
 
4.7%
꼬깔콘 10
 
4.7%
새우깡 10
 
4.7%
참이슬 10
 
4.7%
처음처럼 10
 
4.7%
하이트(캔 10
 
4.7%
카스(캔 10
 
4.7%
메로나 10
 
4.7%
Other values (20) 98
45.8%

판매가격
Real number (ℝ)

HIGH CORRELATION 

Distinct34
Distinct (%)15.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19840.187
Minimum700
Maximum200000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2024-01-28T22:32:49.549660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum700
5-th percentile1000
Q11500
median2000
Q310000
95-th percentile107000
Maximum200000
Range199300
Interquartile range (IQR)8500

Descriptive statistics

Standard deviation36761.068
Coefficient of variation (CV)1.852859
Kurtosis6.9063462
Mean19840.187
Median Absolute Deviation (MAD)1000
Skewness2.5736263
Sum4245800
Variance1.3513761 × 109
MonotonicityNot monotonic
2024-01-28T22:32:49.660721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
2000 34
15.9%
1500 25
 
11.7%
1000 20
 
9.3%
7000 13
 
6.1%
10000 12
 
5.6%
50000 11
 
5.1%
5000 10
 
4.7%
1800 10
 
4.7%
120000 6
 
2.8%
40000 6
 
2.8%
Other values (24) 67
31.3%
ValueCountFrequency (%)
700 1
 
0.5%
800 2
 
0.9%
900 1
 
0.5%
1000 20
9.3%
1100 6
 
2.8%
1200 4
 
1.9%
1300 5
 
2.3%
1400 3
 
1.4%
1500 25
11.7%
1800 10
 
4.7%
ValueCountFrequency (%)
200000 2
 
0.9%
150000 2
 
0.9%
140000 1
 
0.5%
120000 6
2.8%
100000 4
 
1.9%
80000 3
 
1.4%
70000 5
2.3%
60000 3
 
1.4%
50000 11
5.1%
45000 2
 
0.9%

판매가격 적용일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
08-13
214 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row08-13
2nd row08-13
3rd row08-13
4th row08-13
5th row08-13

Common Values

ValueCountFrequency (%)
08-13 214
100.0%

Length

2024-01-28T22:32:49.774642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T22:32:49.852898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
08-13 214
100.0%

이전가격
Real number (ℝ)

HIGH CORRELATION 

Distinct35
Distinct (%)16.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21318.224
Minimum700
Maximum200000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2024-01-28T22:32:49.937634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum700
5-th percentile1000
Q11500
median2000
Q310000
95-th percentile120000
Maximum200000
Range199300
Interquartile range (IQR)8500

Descriptive statistics

Standard deviation39679.589
Coefficient of variation (CV)1.861299
Kurtosis5.6756736
Mean21318.224
Median Absolute Deviation (MAD)1000
Skewness2.4473919
Sum4562100
Variance1.5744698 × 109
MonotonicityNot monotonic
2024-01-28T22:32:50.048538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
2000 34
15.9%
1500 25
 
11.7%
1000 20
 
9.3%
7000 13
 
6.1%
10000 13
 
6.1%
1800 10
 
4.7%
5000 9
 
4.2%
50000 9
 
4.2%
120000 7
 
3.3%
40000 6
 
2.8%
Other values (25) 68
31.8%
ValueCountFrequency (%)
700 1
 
0.5%
800 2
 
0.9%
900 2
 
0.9%
1000 20
9.3%
1100 5
 
2.3%
1200 4
 
1.9%
1300 5
 
2.3%
1400 3
 
1.4%
1500 25
11.7%
1800 10
 
4.7%
ValueCountFrequency (%)
200000 2
 
0.9%
160000 1
 
0.5%
150000 3
 
1.4%
140000 2
 
0.9%
120000 7
3.3%
100000 2
 
0.9%
80000 4
1.9%
70000 5
2.3%
60000 2
 
0.9%
50000 9
4.2%

업소명
Text

MISSING 

Distinct54
Distinct (%)25.7%
Missing4
Missing (%)1.9%
Memory size1.8 KiB
2024-01-28T22:32:50.264329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length5.7428571
Min length3

Characters and Unicode

Total characters1206
Distinct characters134
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)10.5%

Sample

1st row자연소리슈퍼
2nd row자연소리슈퍼
3rd row자연소리슈퍼
4th row자연소리슈퍼
5th row자연소리슈퍼
ValueCountFrequency (%)
하나개편의점 12
 
5.0%
착한할인마트 12
 
5.0%
선녀바위번영회슈퍼 12
 
5.0%
구름계곡 11
 
4.6%
편의점,사랑마트 11
 
4.6%
자연소리슈퍼 10
 
4.2%
해상공원매점 10
 
4.2%
d,c 10
 
4.2%
마트 10
 
4.2%
약수터 10
 
4.2%
Other values (47) 130
54.6%
2024-01-28T22:32:50.565119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
59
 
4.9%
41
 
3.4%
41
 
3.4%
40
 
3.3%
36
 
3.0%
30
 
2.5%
30
 
2.5%
28
 
2.3%
27
 
2.2%
27
 
2.2%
Other values (124) 847
70.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1129
93.6%
Space Separator 28
 
2.3%
Uppercase Letter 24
 
2.0%
Other Punctuation 21
 
1.7%
Lowercase Letter 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
59
 
5.2%
41
 
3.6%
41
 
3.6%
40
 
3.5%
36
 
3.2%
30
 
2.7%
30
 
2.7%
27
 
2.4%
27
 
2.4%
26
 
2.3%
Other values (117) 772
68.4%
Uppercase Letter
ValueCountFrequency (%)
C 12
50.0%
D 10
41.7%
U 2
 
8.3%
Lowercase Letter
ValueCountFrequency (%)
c 2
50.0%
u 2
50.0%
Space Separator
ValueCountFrequency (%)
28
100.0%
Other Punctuation
ValueCountFrequency (%)
, 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1129
93.6%
Common 49
 
4.1%
Latin 28
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
59
 
5.2%
41
 
3.6%
41
 
3.6%
40
 
3.5%
36
 
3.2%
30
 
2.7%
30
 
2.7%
27
 
2.4%
27
 
2.4%
26
 
2.3%
Other values (117) 772
68.4%
Latin
ValueCountFrequency (%)
C 12
42.9%
D 10
35.7%
U 2
 
7.1%
c 2
 
7.1%
u 2
 
7.1%
Common
ValueCountFrequency (%)
28
57.1%
, 21
42.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1129
93.6%
ASCII 77
 
6.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
59
 
5.2%
41
 
3.6%
41
 
3.6%
40
 
3.5%
36
 
3.2%
30
 
2.7%
30
 
2.7%
27
 
2.4%
27
 
2.4%
26
 
2.3%
Other values (117) 772
68.4%
ASCII
ValueCountFrequency (%)
28
36.4%
, 21
27.3%
C 12
15.6%
D 10
 
13.0%
U 2
 
2.6%
c 2
 
2.6%
u 2
 
2.6%

Interactions

2024-01-28T22:32:48.691469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:32:48.513992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:32:48.778369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:32:48.600577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T22:32:50.916028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
피서지품목상표_규격판매가격이전가격업소명
피서지1.0000.0000.0000.0000.0000.998
품목0.0001.0000.9990.8560.8600.864
상표_규격0.0000.9991.0000.7490.7530.333
판매가격0.0000.8560.7491.0000.9930.983
이전가격0.0000.8600.7530.9931.0000.987
업소명0.9980.8640.3330.9830.9871.000
2024-01-28T22:32:50.996083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목상표_규격피서지
품목1.0000.8680.000
상표_규격0.8681.0000.000
피서지0.0000.0001.000
2024-01-28T22:32:51.066574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
판매가격이전가격피서지품목상표_규격
판매가격1.0000.9980.0000.5040.371
이전가격0.9981.0000.0000.5100.375
피서지0.0000.0001.0000.0000.000
품목0.5040.5100.0001.0000.868
상표_규격0.3710.3750.0000.8681.000

Missing values

2024-01-28T22:32:48.885402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T22:32:48.976143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

피서지품목상표_규격판매가격판매가격 적용일이전가격업소명
0을왕빙과류1월드콘120008-131200자연소리슈퍼
1을왕빙과류2메로나70008-13700자연소리슈퍼
2을왕과자류1꼬깔콘140008-131400자연소리슈퍼
3을왕과자류2새우깡110008-131100자연소리슈퍼
4을왕음료류1사이다(캔)80008-13800자연소리슈퍼
5을왕음료류2콜라(캔)90008-13900자연소리슈퍼
6을왕주류1참이슬140008-131400자연소리슈퍼
7을왕주류2처음처럼140008-131400자연소리슈퍼
8을왕주류3하이트(캔)180008-131800자연소리슈퍼
9을왕주류4카스(캔)180008-131800자연소리슈퍼
피서지품목상표_규격판매가격판매가격 적용일이전가격업소명
204십리포2주차료1대(1일기준)1000008-1310000관리소
205십리포2샤워장1인200008-132000관리소
206십리포2텐트자릿세1박2000008-1320000관리소
207십리포2파라솔대여1일1000008-1310000부녀회
208십리포2튜브 대여1인1000008-1310000부녀회
209십리포2수건수건350008-133500부녀회
210십리포2음식1광어회(대)5000008-1350000만수횟집
211십리포2음식2칼국수(1인분)700008-137000만수횟집
212십리포2음식3매운탕(중)4000008-1340000만수횟집
213십리포2음식4조개구이(대)4000008-1340000만수횟집