Overview

Dataset statistics

Number of variables6
Number of observations77
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.8 KiB
Average record size in memory50.7 B

Variable types

Numeric1
Categorical2
Text2
DateTime1

Dataset

Description일반음식점 연면적 200㎡이상 또는 집단급식소 일급식인원 100명 이상의 음식물류폐기물 다량배출사업장 현황에 대한 데이터임(업종, 업소명, 소재지,데이터기준일)
URLhttps://www.data.go.kr/data/15094765/fileData.do

Alerts

처리방법 has constant value ""Constant
데이터기준일 has constant value ""Constant
연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
업소명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 11:00:14.163050
Analysis finished2023-12-12 11:00:15.325967
Duration1.16 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct77
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39
Minimum1
Maximum77
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size825.0 B
2023-12-12T20:00:15.939841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.8
Q120
median39
Q358
95-th percentile73.2
Maximum77
Range76
Interquartile range (IQR)38

Descriptive statistics

Standard deviation22.371857
Coefficient of variation (CV)0.57363737
Kurtosis-1.2
Mean39
Median Absolute Deviation (MAD)19
Skewness0
Sum3003
Variance500.5
MonotonicityStrictly increasing
2023-12-12T20:00:16.211991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.3%
50 1
 
1.3%
57 1
 
1.3%
56 1
 
1.3%
55 1
 
1.3%
54 1
 
1.3%
53 1
 
1.3%
52 1
 
1.3%
51 1
 
1.3%
49 1
 
1.3%
Other values (67) 67
87.0%
ValueCountFrequency (%)
1 1
1.3%
2 1
1.3%
3 1
1.3%
4 1
1.3%
5 1
1.3%
6 1
1.3%
7 1
1.3%
8 1
1.3%
9 1
1.3%
10 1
1.3%
ValueCountFrequency (%)
77 1
1.3%
76 1
1.3%
75 1
1.3%
74 1
1.3%
73 1
1.3%
72 1
1.3%
71 1
1.3%
70 1
1.3%
69 1
1.3%
68 1
1.3%

업종
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size748.0 B
일반음식점
46 
집단급식소
31 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 46
59.7%
집단급식소 31
40.3%

Length

2023-12-12T20:00:16.460456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:00:16.646945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 46
59.7%
집단급식소 31
40.3%

업소명
Text

UNIQUE 

Distinct77
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size748.0 B
2023-12-12T20:00:17.056681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length13
Mean length7.4285714
Min length2

Characters and Unicode

Total characters572
Distinct characters212
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)100.0%

Sample

1st row아구마을해물촌
2nd row본가 장수촌
3rd row쌍바우등뼈해장국24시
4th row장연식당
5th row시골두부
ValueCountFrequency (%)
주식회사 2
 
2.2%
주)후니드 2
 
2.2%
주)디엔피코퍼레이션(증평지점 1
 
1.1%
증평현대장례식장 1
 
1.1%
도안초등학교 1
 
1.1%
주)식미안 1
 
1.1%
죽리초등학교 1
 
1.1%
아이사랑어린이집 1
 
1.1%
증평공장점 1
 
1.1%
현대종합특수강 1
 
1.1%
Other values (77) 77
86.5%
2023-12-12T20:00:17.729685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
3.3%
19
 
3.3%
17
 
3.0%
15
 
2.6%
13
 
2.3%
( 12
 
2.1%
) 12
 
2.1%
12
 
2.1%
12
 
2.1%
12
 
2.1%
Other values (202) 429
75.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 528
92.3%
Open Punctuation 12
 
2.1%
Close Punctuation 12
 
2.1%
Space Separator 12
 
2.1%
Uppercase Letter 4
 
0.7%
Decimal Number 2
 
0.3%
Other Symbol 1
 
0.2%
Dash Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
3.6%
19
 
3.6%
17
 
3.2%
15
 
2.8%
13
 
2.5%
12
 
2.3%
12
 
2.3%
10
 
1.9%
9
 
1.7%
9
 
1.7%
Other values (192) 393
74.4%
Uppercase Letter
ValueCountFrequency (%)
C 2
50.0%
K 1
25.0%
S 1
25.0%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
4 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 529
92.5%
Common 39
 
6.8%
Latin 4
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
3.6%
19
 
3.6%
17
 
3.2%
15
 
2.8%
13
 
2.5%
12
 
2.3%
12
 
2.3%
10
 
1.9%
9
 
1.7%
9
 
1.7%
Other values (193) 394
74.5%
Common
ValueCountFrequency (%)
( 12
30.8%
) 12
30.8%
12
30.8%
2 1
 
2.6%
4 1
 
2.6%
- 1
 
2.6%
Latin
ValueCountFrequency (%)
C 2
50.0%
K 1
25.0%
S 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 528
92.3%
ASCII 43
 
7.5%
None 1
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
19
 
3.6%
19
 
3.6%
17
 
3.2%
15
 
2.8%
13
 
2.5%
12
 
2.3%
12
 
2.3%
10
 
1.9%
9
 
1.7%
9
 
1.7%
Other values (192) 393
74.4%
ASCII
ValueCountFrequency (%)
( 12
27.9%
) 12
27.9%
12
27.9%
C 2
 
4.7%
2 1
 
2.3%
4 1
 
2.3%
K 1
 
2.3%
S 1
 
2.3%
- 1
 
2.3%
None
ValueCountFrequency (%)
1
100.0%
Distinct72
Distinct (%)93.5%
Missing0
Missing (%)0.0%
Memory size748.0 B
2023-12-12T20:00:18.163508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length34
Mean length23.623377
Min length14

Characters and Unicode

Total characters1819
Distinct characters149
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67 ?
Unique (%)87.0%

Sample

1st row충청북도 증평군 증평읍 초중4길 75
2nd row충청북도 증평군 증평읍 초중2길 15-7
3rd row충청북도 증평군 증평읍 삼보로 78
4th row충청북도 증평군 증평읍 초중8길 7_ 장연식당
5th row충청북도 증평군 증평읍 초중5길 10
ValueCountFrequency (%)
증평군 76
18.5%
충청북도 73
17.8%
증평읍 67
16.3%
도안면 9
 
2.2%
초중로 7
 
1.7%
중앙로 6
 
1.5%
광장로 5
 
1.2%
1층 5
 
1.2%
초중2길 3
 
0.7%
증평로 3
 
0.7%
Other values (125) 156
38.0%
2023-12-12T20:00:18.815413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
333
18.3%
161
 
8.9%
160
 
8.8%
82
 
4.5%
77
 
4.2%
74
 
4.1%
74
 
4.1%
73
 
4.0%
1 72
 
4.0%
68
 
3.7%
Other values (139) 645
35.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1168
64.2%
Space Separator 333
 
18.3%
Decimal Number 254
 
14.0%
Connector Punctuation 24
 
1.3%
Dash Punctuation 13
 
0.7%
Open Punctuation 9
 
0.5%
Close Punctuation 9
 
0.5%
Uppercase Letter 6
 
0.3%
Other Punctuation 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
161
13.8%
160
13.7%
82
 
7.0%
77
 
6.6%
74
 
6.3%
74
 
6.3%
73
 
6.2%
68
 
5.8%
59
 
5.1%
26
 
2.2%
Other values (116) 314
26.9%
Decimal Number
ValueCountFrequency (%)
1 72
28.3%
2 30
11.8%
3 24
 
9.4%
0 23
 
9.1%
4 23
 
9.1%
5 20
 
7.9%
8 19
 
7.5%
6 18
 
7.1%
7 14
 
5.5%
9 11
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
P 2
33.3%
J 2
33.3%
H 1
16.7%
N 1
16.7%
Open Punctuation
ValueCountFrequency (%)
( 8
88.9%
[ 1
 
11.1%
Close Punctuation
ValueCountFrequency (%)
) 8
88.9%
] 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 2
66.7%
* 1
33.3%
Space Separator
ValueCountFrequency (%)
333
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 24
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1168
64.2%
Common 645
35.5%
Latin 6
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
161
13.8%
160
13.7%
82
 
7.0%
77
 
6.6%
74
 
6.3%
74
 
6.3%
73
 
6.2%
68
 
5.8%
59
 
5.1%
26
 
2.2%
Other values (116) 314
26.9%
Common
ValueCountFrequency (%)
333
51.6%
1 72
 
11.2%
2 30
 
4.7%
3 24
 
3.7%
_ 24
 
3.7%
0 23
 
3.6%
4 23
 
3.6%
5 20
 
3.1%
8 19
 
2.9%
6 18
 
2.8%
Other values (9) 59
 
9.1%
Latin
ValueCountFrequency (%)
P 2
33.3%
J 2
33.3%
H 1
16.7%
N 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1168
64.2%
ASCII 651
35.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
333
51.2%
1 72
 
11.1%
2 30
 
4.6%
3 24
 
3.7%
_ 24
 
3.7%
0 23
 
3.5%
4 23
 
3.5%
5 20
 
3.1%
8 19
 
2.9%
6 18
 
2.8%
Other values (13) 65
 
10.0%
Hangul
ValueCountFrequency (%)
161
13.8%
160
13.7%
82
 
7.0%
77
 
6.6%
74
 
6.3%
74
 
6.3%
73
 
6.2%
68
 
5.8%
59
 
5.1%
26
 
2.2%
Other values (116) 314
26.9%

처리방법
Categorical

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size748.0 B
위탁
77 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row위탁
2nd row위탁
3rd row위탁
4th row위탁
5th row위탁

Common Values

ValueCountFrequency (%)
위탁 77
100.0%

Length

2023-12-12T20:00:19.034043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:00:19.174968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
위탁 77
100.0%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size748.0 B
Minimum2023-04-24 00:00:00
Maximum2023-04-24 00:00:00
2023-12-12T20:00:19.310958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:00:19.481086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T20:00:14.780725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:00:19.610825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종업소명소재지(도로명)
연번1.0001.0001.0000.666
업종1.0001.0001.0000.682
업소명1.0001.0001.0001.000
소재지(도로명)0.6660.6821.0001.000
2023-12-12T20:00:19.773021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.945
업종0.9451.000

Missing values

2023-12-12T20:00:15.049123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:00:15.257921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종업소명소재지(도로명)처리방법데이터기준일
01일반음식점아구마을해물촌충청북도 증평군 증평읍 초중4길 75위탁2023-04-24
12일반음식점본가 장수촌충청북도 증평군 증평읍 초중2길 15-7위탁2023-04-24
23일반음식점쌍바우등뼈해장국24시충청북도 증평군 증평읍 삼보로 78위탁2023-04-24
34일반음식점장연식당충청북도 증평군 증평읍 초중8길 7_ 장연식당위탁2023-04-24
45일반음식점시골두부충청북도 증평군 증평읍 초중5길 10위탁2023-04-24
56일반음식점미락충청북도 증평군 증평읍 문화로 83위탁2023-04-24
67일반음식점가장맛있는족발충청북도 증평군 증평읍 초중1길 30_ 1층위탁2023-04-24
78일반음식점계룡병원장례식장충청북도 증평군 증평읍 중부로 2465위탁2023-04-24
89일반음식점동궁오리요리전문점충청북도 증평군 증평읍 초중2길 51-16위탁2023-04-24
910일반음식점수영숯불갈매기충청북도 증평군 증평읍 초중2길 16위탁2023-04-24
연번업종업소명소재지(도로명)처리방법데이터기준일
6768집단급식소(주)신세계푸드 두산전자증평충청북도 증평군 증평읍 두산로 40-19_ 두산전자(주)위탁2023-04-24
6869집단급식소증평중학교충청북도 증평군 증평읍 광장로 158_ 증평중학교위탁2023-04-24
6970집단급식소증평여자중학교충청북도 증평군 증평읍 중앙로 132위탁2023-04-24
7071집단급식소증평공업고등학교충청북도 증평군 증평읍 광장로 180위탁2023-04-24
7172집단급식소삼보초등학교충청북도 증평군 증평읍 삼보로 140위탁2023-04-24
7273집단급식소한국교통대학교(생활관)충청북도 증평군 증평읍 대학로 61위탁2023-04-24
7374집단급식소형석고등학교충청북도 증평군 증평읍 미암로 26위탁2023-04-24
7475집단급식소증평초등학교충청북도 증평군 증평읍 광장로 152위탁2023-04-24
7576집단급식소위드씨엠에스증평군 증평읍 증평산단로14위탁2023-04-24
7677집단급식소그린하우스현대종합특수강충청북도 증평군 도안면 증평2산단로155,1층위탁2023-04-24