Overview

Dataset statistics

Number of variables7
Number of observations29
Missing cells2
Missing cells (%)1.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory62.6 B

Variable types

Numeric2
Text3
Categorical2

Dataset

Description인천광역시 동구 관내 음식물 쓰레기 다량 배출(월1ton이상) 사업장 정보로, 업소명, 업종, 주소, 전화번호, 수거형태, 예상배출량 등 항목을 제공합니다.
Author인천광역시 동구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15034374&srcSe=7661IVAWM27C61E190

Alerts

연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
전화번호 has 2 (6.9%) missing valuesMissing
연번 has unique valuesUnique
업소명 has unique valuesUnique

Reproduction

Analysis started2024-03-18 03:04:46.883343
Analysis finished2024-03-18 03:04:49.379553
Duration2.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15
Minimum1
Maximum29
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2024-03-18T12:04:49.448822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.4
Q18
median15
Q322
95-th percentile27.6
Maximum29
Range28
Interquartile range (IQR)14

Descriptive statistics

Standard deviation8.5146932
Coefficient of variation (CV)0.56764621
Kurtosis-1.2
Mean15
Median Absolute Deviation (MAD)7
Skewness0
Sum435
Variance72.5
MonotonicityStrictly increasing
2024-03-18T12:04:49.573969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
1 1
 
3.4%
2 1
 
3.4%
29 1
 
3.4%
28 1
 
3.4%
27 1
 
3.4%
26 1
 
3.4%
25 1
 
3.4%
24 1
 
3.4%
23 1
 
3.4%
22 1
 
3.4%
Other values (19) 19
65.5%
ValueCountFrequency (%)
1 1
3.4%
2 1
3.4%
3 1
3.4%
4 1
3.4%
5 1
3.4%
6 1
3.4%
7 1
3.4%
8 1
3.4%
9 1
3.4%
10 1
3.4%
ValueCountFrequency (%)
29 1
3.4%
28 1
3.4%
27 1
3.4%
26 1
3.4%
25 1
3.4%
24 1
3.4%
23 1
3.4%
22 1
3.4%
21 1
3.4%
20 1
3.4%

업소명
Text

UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size364.0 B
2024-03-18T12:04:49.759416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length21
Mean length9.9310345
Min length4

Characters and Unicode

Total characters288
Distinct characters108
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)100.0%

Sample

1st row동명초등학교
2nd row동산중고교
3rd row재능고등학교
4th row만석초등학교
5th row서림초등학교
ValueCountFrequency (%)
주)현대그린푸드 3
 
6.4%
인천공장 2
 
4.3%
동국제강 2
 
4.3%
나인프라임푸드 2
 
4.3%
동명초등학교 1
 
2.1%
해안식당 1
 
2.1%
두산산업차량 1
 
2.1%
현대제철 1
 
2.1%
인천 1
 
2.1%
주)풀무원푸드앤컬처 1
 
2.1%
Other values (32) 32
68.1%
2024-03-18T12:04:50.062724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18
 
6.2%
13
 
4.5%
12
 
4.2%
11
 
3.8%
9
 
3.1%
8
 
2.8%
7
 
2.4%
7
 
2.4%
7
 
2.4%
7
 
2.4%
Other values (98) 189
65.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 257
89.2%
Space Separator 18
 
6.2%
Open Punctuation 6
 
2.1%
Close Punctuation 6
 
2.1%
Decimal Number 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
5.1%
12
 
4.7%
11
 
4.3%
9
 
3.5%
8
 
3.1%
7
 
2.7%
7
 
2.7%
7
 
2.7%
7
 
2.7%
6
 
2.3%
Other values (94) 170
66.1%
Space Separator
ValueCountFrequency (%)
18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Decimal Number
ValueCountFrequency (%)
8 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 257
89.2%
Common 31
 
10.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
5.1%
12
 
4.7%
11
 
4.3%
9
 
3.5%
8
 
3.1%
7
 
2.7%
7
 
2.7%
7
 
2.7%
7
 
2.7%
6
 
2.3%
Other values (94) 170
66.1%
Common
ValueCountFrequency (%)
18
58.1%
( 6
 
19.4%
) 6
 
19.4%
8 1
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 257
89.2%
ASCII 31
 
10.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18
58.1%
( 6
 
19.4%
) 6
 
19.4%
8 1
 
3.2%
Hangul
ValueCountFrequency (%)
13
 
5.1%
12
 
4.7%
11
 
4.3%
9
 
3.5%
8
 
3.1%
7
 
2.7%
7
 
2.7%
7
 
2.7%
7
 
2.7%
6
 
2.3%
Other values (94) 170
66.1%

업종
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)10.3%
Missing0
Missing (%)0.0%
Memory size364.0 B
집단급식소
23 
일반음식점
대규모점포
 
2

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row집단급식소
2nd row집단급식소
3rd row집단급식소
4th row집단급식소
5th row집단급식소

Common Values

ValueCountFrequency (%)
집단급식소 23
79.3%
일반음식점 4
 
13.8%
대규모점포 2
 
6.9%

Length

2024-03-18T12:04:50.181229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T12:04:50.261489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
집단급식소 23
79.3%
일반음식점 4
 
13.8%
대규모점포 2
 
6.9%

주소
Text

Distinct25
Distinct (%)86.2%
Missing0
Missing (%)0.0%
Memory size364.0 B
2024-03-18T12:04:50.412857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length19
Mean length16.310345
Min length14

Characters and Unicode

Total characters473
Distinct characters52
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)75.9%

Sample

1st row인천광역시 동구 송림로70번길 10
2nd row인천광역시 동구 동산로 58
3rd row인천광역시 동구 재능로 178
4th row인천광역시 동구 제물량로 412
5th row인천광역시 동구 금곡로 105
ValueCountFrequency (%)
인천광역시 29
25.0%
동구 29
25.0%
중봉대로 3
 
2.6%
인중로 3
 
2.6%
화수로 3
 
2.6%
178 3
 
2.6%
재능로 3
 
2.6%
15 2
 
1.7%
489 2
 
1.7%
봉수대로 2
 
1.7%
Other values (35) 37
31.9%
2024-03-18T12:04:50.733126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
88
18.6%
32
 
6.8%
30
 
6.3%
29
 
6.1%
29
 
6.1%
29
 
6.1%
29
 
6.1%
29
 
6.1%
29
 
6.1%
1 20
 
4.2%
Other values (42) 129
27.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 305
64.5%
Space Separator 88
 
18.6%
Decimal Number 77
 
16.3%
Dash Punctuation 2
 
0.4%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
10.5%
30
9.8%
29
9.5%
29
9.5%
29
9.5%
29
9.5%
29
9.5%
29
9.5%
6
 
2.0%
5
 
1.6%
Other values (29) 58
19.0%
Decimal Number
ValueCountFrequency (%)
1 20
26.0%
8 12
15.6%
4 10
13.0%
5 7
 
9.1%
2 6
 
7.8%
0 5
 
6.5%
9 5
 
6.5%
6 4
 
5.2%
7 4
 
5.2%
3 4
 
5.2%
Space Separator
ValueCountFrequency (%)
88
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 305
64.5%
Common 168
35.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
10.5%
30
9.8%
29
9.5%
29
9.5%
29
9.5%
29
9.5%
29
9.5%
29
9.5%
6
 
2.0%
5
 
1.6%
Other values (29) 58
19.0%
Common
ValueCountFrequency (%)
88
52.4%
1 20
 
11.9%
8 12
 
7.1%
4 10
 
6.0%
5 7
 
4.2%
2 6
 
3.6%
0 5
 
3.0%
9 5
 
3.0%
6 4
 
2.4%
7 4
 
2.4%
Other values (3) 7
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 305
64.5%
ASCII 168
35.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
88
52.4%
1 20
 
11.9%
8 12
 
7.1%
4 10
 
6.0%
5 7
 
4.2%
2 6
 
3.6%
0 5
 
3.0%
9 5
 
3.0%
6 4
 
2.4%
7 4
 
2.4%
Other values (3) 7
 
4.2%
Hangul
ValueCountFrequency (%)
32
10.5%
30
9.8%
29
9.5%
29
9.5%
29
9.5%
29
9.5%
29
9.5%
29
9.5%
6
 
2.0%
5
 
1.6%
Other values (29) 58
19.0%

전화번호
Text

MISSING 

Distinct26
Distinct (%)96.3%
Missing2
Missing (%)6.9%
Memory size364.0 B
2024-03-18T12:04:50.935457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters324
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)92.6%

Sample

1st row032-773-7549
2nd row032-763-7857
3rd row032-890-7696
4th row032-772-4787
5th row032-629-1753
ValueCountFrequency (%)
032-830-6823 2
 
7.4%
032-763-7857 1
 
3.7%
032-773-7549 1
 
3.7%
032-777-5021 1
 
3.7%
032-582-4057 1
 
3.7%
032-764-6969 1
 
3.7%
032-777-9996 1
 
3.7%
032-762-8100 1
 
3.7%
032-770-6433 1
 
3.7%
032-211-1325 1
 
3.7%
Other values (16) 16
59.3%
2024-03-18T12:04:51.270356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 54
16.7%
2 53
16.4%
0 47
14.5%
3 41
12.7%
7 33
10.2%
6 23
7.1%
8 18
 
5.6%
9 16
 
4.9%
1 15
 
4.6%
4 13
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 270
83.3%
Dash Punctuation 54
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 53
19.6%
0 47
17.4%
3 41
15.2%
7 33
12.2%
6 23
8.5%
8 18
 
6.7%
9 16
 
5.9%
1 15
 
5.6%
4 13
 
4.8%
5 11
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 54
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 324
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 54
16.7%
2 53
16.4%
0 47
14.5%
3 41
12.7%
7 33
10.2%
6 23
7.1%
8 18
 
5.6%
9 16
 
4.9%
1 15
 
4.6%
4 13
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 324
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 54
16.7%
2 53
16.4%
0 47
14.5%
3 41
12.7%
7 33
10.2%
6 23
7.1%
8 18
 
5.6%
9 16
 
4.9%
1 15
 
4.6%
4 13
 
4.0%

수거형태
Categorical

Distinct2
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size364.0 B
위탁처리
23 
자가처리

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row위탁처리
2nd row자가처리
3rd row위탁처리
4th row위탁처리
5th row위탁처리

Common Values

ValueCountFrequency (%)
위탁처리 23
79.3%
자가처리 6
 
20.7%

Length

2024-03-18T12:04:51.422875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T12:04:51.550261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
위탁처리 23
79.3%
자가처리 6
 
20.7%

예상배출량(월)
Real number (ℝ)

Distinct27
Distinct (%)93.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4291.4483
Minimum34
Maximum36000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2024-03-18T12:04:51.722182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum34
5-th percentile64.6
Q1654
median1900
Q33117
95-th percentile19050.4
Maximum36000
Range35966
Interquartile range (IQR)2463

Descriptive statistics

Standard deviation7642.3471
Coefficient of variation (CV)1.7808317
Kurtosis11.065362
Mean4291.4483
Median Absolute Deviation (MAD)1246
Skewness3.2234547
Sum124452
Variance58405470
MonotonicityNot monotonic
2024-03-18T12:04:51.840156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
1800 2
 
6.9%
600 2
 
6.9%
1200 1
 
3.4%
4974 1
 
3.4%
16126 1
 
3.4%
362 1
 
3.4%
34 1
 
3.4%
3262 1
 
3.4%
88 1
 
3.4%
49 1
 
3.4%
Other values (17) 17
58.6%
ValueCountFrequency (%)
34 1
3.4%
49 1
3.4%
88 1
3.4%
362 1
3.4%
600 2
6.9%
640 1
3.4%
654 1
3.4%
900 1
3.4%
930 1
3.4%
1200 1
3.4%
ValueCountFrequency (%)
36000 1
3.4%
21000 1
3.4%
16126 1
3.4%
6218 1
3.4%
5700 1
3.4%
4974 1
3.4%
3262 1
3.4%
3117 1
3.4%
3000 1
3.4%
2890 1
3.4%

Interactions

2024-03-18T12:04:49.018874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:04:48.781243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:04:49.103661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:04:48.930759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-18T12:04:51.918751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업소명업종주소전화번호수거형태예상배출량(월)
연번1.0001.0000.8690.8691.0000.0000.102
업소명1.0001.0001.0001.0001.0001.0001.000
업종0.8691.0001.0001.0001.0000.0000.449
주소0.8691.0001.0001.0001.0000.7070.000
전화번호1.0001.0001.0001.0001.0001.0000.631
수거형태0.0001.0000.0000.7071.0001.0000.146
예상배출량(월)0.1021.0000.4490.0000.6310.1461.000
2024-03-18T12:04:52.007350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종수거형태
업종1.0000.000
수거형태0.0001.000
2024-03-18T12:04:52.071682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번예상배출량(월)업종수거형태
연번1.0000.0370.6790.000
예상배출량(월)0.0371.0000.3590.153
업종0.6790.3591.0000.000
수거형태0.0000.1530.0001.000

Missing values

2024-03-18T12:04:49.224224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T12:04:49.324445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업소명업종주소전화번호수거형태예상배출량(월)
01동명초등학교집단급식소인천광역시 동구 송림로70번길 10032-773-7549위탁처리1200
12동산중고교집단급식소인천광역시 동구 동산로 58032-763-7857자가처리2890
23재능고등학교집단급식소인천광역시 동구 재능로 178032-890-7696위탁처리3117
34만석초등학교집단급식소인천광역시 동구 제물량로 412032-772-4787위탁처리640
45서림초등학교집단급식소인천광역시 동구 금곡로 105032-629-1753위탁처리1800
56서흥초등학교집단급식소인천광역시 동구 샛골로 189032-629-0021위탁처리1900
67송림초등학교집단급식소인천광역시 동구 배송로 2032-629-1806위탁처리1223
78송현초등학교집단급식소인천광역시 동구 송화로 45032-764-4482위탁처리2179
89영화국제관광고교집단급식소인천광역시 동구 우각로 39032-764-7920자가처리3000
910창영초등학교집단급식소인천광역시 동구 우각로15번길 16032-765-4332위탁처리930
연번업소명업종주소전화번호수거형태예상배출량(월)
1920(주)현대그린푸드 두산인프라코어 해안식당집단급식소인천광역시 동구 인중로 489032-211-1241자가처리2400
2021(주)현대그린푸드 두산산업차량집단급식소인천광역시 동구 인중로 468032-211-1325자가처리5700
2122(주)현대그린푸드 현대제철 인천집단급식소인천광역시 동구 중봉대로 63<NA>자가처리21000
2223(주)풀무원푸드앤컬처 현대두산인프라코어 등나무식당집단급식소인천광역시 동구 인중로 489032-770-6433위탁처리36000
2324행복한밥상일반음식점인천광역시 동구 만석부두로 11032-762-8100자가처리49
2425월수금 통돼지일반음식점인천광역시 동구 송미로 11-14<NA>위탁처리88
2526소플러스일반음식점인천광역시 동구 염전로40번길 85032-777-9996위탁처리3262
2627찐 풍천장어일반음식점인천광역시 동구 송림로118, 2-3층032-764-6969위탁처리34
2728송림공구상가관리(주)대규모점포인천광역시 동구 봉수대로 98032-582-4057위탁처리362
2829트레이더스 홀세일 클럽대규모점포인천광역시 동구 봉수대로 82032-722-1055위탁처리16126