Overview

Dataset statistics

Number of variables: 5
Number of observations: 38
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.7 KiB
Average record size in memory: 44.5 B

Variable types

Categorical: 2
Text: 2
Numeric: 1

Dataset

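Description: Provides information on the environment and environmental protection in Gyeongsangbuk-do. (Lists the company name (업체명), location (소재지), facility capacity (시설용량), and facility type (시설종류) of food waste treatment facilities in Gyeongsangbuk-do.)
Author: Gyeongsangbuk-do (경상북도)
URL: https://www.data.go.kr/data/15063151/fileData.do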

Reproduction

Analysis started: 2023-12-12 08:51:23.816238
Analysis finished: 2023-12-12 08:51:24.393708
Duration: 0.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
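
A report like this one can be regenerated along the following lines. This is a minimal sketch, not the exact procedure used for this run: the CSV path, the report title, and the `cp949` default encoding are assumptions, and `build_report` is a hypothetical helper. It needs `pandas` and `ydata-profiling` installed.

```python
def build_report(csv_path, html_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so this module loads even without the heavy dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the download is a cp949-encoded CSV; adjust if it is UTF-8.
    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
    return html_path
```

Calling `build_report("facilities.csv")` with a locally saved copy of the file (the file name is illustrative) would write `report.html` next to the script.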

Variables

구분
Categorical

Distinct: 3
Distinct (%): 7.9%
Missing: 0
Missing (%): 0.0%
Memory size: 436.0 B

Value | Count
민간시설 | 25
공공시설 | 11
민간시설 | 2

Length

Max length: 5
Median length: 5
Mean length: 4.6578947
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공공시설
2nd row: 공공시설
3rd row: 공공시설
4th row: 공공시설
5th row: 공공시설

Common Values

Value | Count | Frequency (%)
민간시설 | 25 | 65.8%
공공시설 | 11 | 28.9%
민간시설 | 2 | 5.3%
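
The value 민간시설 is listed twice because the two entries are distinct raw values, and the Length statistics (min 4, max 5) indicate that one variant carries an extra character. A minimal sketch of how such near-duplicates could be merged, assuming the extra character is trailing whitespace (the report does not show which character it is) and using only the counts reported above:

```python
from collections import Counter

# Raw counts as reported above. The trailing space on the first key is an
# assumption about what distinguishes the two 민간시설 variants.
raw_counts = {"민간시설 ": 25, "공공시설": 11, "민간시설": 2}

def normalize_counts(counts):
    """Merge values that differ only in leading/trailing whitespace."""
    merged = Counter()
    for value, n in counts.items():
        merged[value.strip()] += n
    return merged

cleaned = normalize_counts(raw_counts)
total = sum(cleaned.values())
for value, n in cleaned.most_common():
    print(f"{value}: {n} ({100 * n / total:.1f}%)")
```

Under that assumption the merged counts are 27 for 민간시설 and 11 for 공공시설, leaving two categories instead of three.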

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
민간시설 | 27 | 71.1%
공공시설 | 11 | 28.9%

소 재 지
Text

Distinct: 36
Distinct (%): 94.7%
Missing: 0
Missing (%): 0.0%
Memory size: 436.0 B

Length

Max length: 19
Median length: 17
Mean length: 14.815789
Min length: 8

Characters and Unicode

Total characters: 563
Distinct characters: 106
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
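
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The input is the second sample row listed for this variable; `category_counts` is a hypothetical helper, and the report itself may compute its tables differently.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category (e.g. Lo, Nd, Zs, Pd)."""
    return Counter(unicodedata.category(ch) for ch in text)

counts = category_counts("경주시 외동읍 문산리 864-5")
# Lo = Other Letter, Nd = Decimal Number, Zs = Space Separator, Pd = Dash Punctuation
for code, n in counts.most_common():
    print(code, n)
```

For this one address the counts are 9 Other Letter, 4 Decimal Number, 3 Space Separator, and 1 Dash Punctuation, the same four categories the report lists for the whole column.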

Unique

Unique: 34
Unique (%): 89.5%

Sample

1st row: 안동시 풍천면 도양리 1424
2nd row: 경주시 외동읍 문산리 864-5
3rd row: 경주시 천군동 1519
4th row: 김천시 대광동 850
5th row: 칠곡군 석적읍 3공단1로 62-6

Value | Count | Frequency (%)
경산시 | 10 | 7.1%
영천시 | 5 | 3.5%
봉화군 | 4 | 2.8%
경주시 | 4 | 2.8%
고령 | 3 | 2.1%
압량읍 | 2 | 1.4%
남산면 | 2 | 1.4%
서면 | 2 | 1.4%
진량읍 | 2 | 1.4%
다산 | 2 | 1.4%
Other values (96) | 105 | 74.5%

Most occurring characters

Value | Count | Frequency (%)
(space) | 104 | 18.5%
(not recovered) | 22 | 3.9%
1 | 21 | 3.7%
(not recovered) | 21 | 3.7%
(not recovered) | 18 | 3.2%
- | 16 | 2.8%
2 | 15 | 2.7%
4 | 15 | 2.7%
(not recovered) | 15 | 2.7%
6 | 15 | 2.7%
Other values (96) | 301 | 53.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 321 | 57.0%
Decimal Number | 122 | 21.7%
Space Separator | 104 | 18.5%
Dash Punctuation | 16 | 2.8%

Most frequent character per category

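Other Letter
Value | Count | Frequency (%)
(not recovered) | 22 | 6.9%
(not recovered) | 21 | 6.5%
(not recovered) | 18 | 5.6%
(not recovered) | 15 | 4.7%
(not recovered) | 13 | 4.0%
(not recovered) | 12 | 3.7%
(not recovered) | 12 | 3.7%
(not recovered) | 11 | 3.4%
(not recovered) | 9 | 2.8%
(not recovered) | 9 | 2.8%
Other values (84) | 179 | 55.8%

Decimal Number
Value | Count | Frequency (%)
1 | 21 | 17.2%
2 | 15 | 12.3%
4 | 15 | 12.3%
6 | 15 | 12.3%
8 | 14 | 11.5%
5 | 12 | 9.8%
7 | 9 | 7.4%
0 | 8 | 6.6%
9 | 7 | 5.7%
3 | 6 | 4.9%

Space Separator
Value | Count | Frequency (%)
(space) | 104 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 16 | 100.0%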

Most occurring scripts

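Value | Count | Frequency (%)
Hangul | 321 | 57.0%
Common | 242 | 43.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not recovered) | 22 | 6.9%
(not recovered) | 21 | 6.5%
(not recovered) | 18 | 5.6%
(not recovered) | 15 | 4.7%
(not recovered) | 13 | 4.0%
(not recovered) | 12 | 3.7%
(not recovered) | 12 | 3.7%
(not recovered) | 11 | 3.4%
(not recovered) | 9 | 2.8%
(not recovered) | 9 | 2.8%
Other values (84) | 179 | 55.8%

Common
Value | Count | Frequency (%)
(space) | 104 | 43.0%
1 | 21 | 8.7%
- | 16 | 6.6%
2 | 15 | 6.2%
4 | 15 | 6.2%
6 | 15 | 6.2%
8 | 14 | 5.8%
5 | 12 | 5.0%
7 | 9 | 3.7%
0 | 8 | 3.3%
Other values (2) | 13 | 5.4%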

Most occurring blocks

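Value | Count | Frequency (%)
Hangul | 321 | 57.0%
ASCII | 242 | 43.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 104 | 43.0%
1 | 21 | 8.7%
- | 16 | 6.6%
2 | 15 | 6.2%
4 | 15 | 6.2%
6 | 15 | 6.2%
8 | 14 | 5.8%
5 | 12 | 5.0%
7 | 9 | 3.7%
0 | 8 | 3.3%
Other values (2) | 13 | 5.4%

Hangul
Value | Count | Frequency (%)
(not recovered) | 22 | 6.9%
(not recovered) | 21 | 6.5%
(not recovered) | 18 | 5.6%
(not recovered) | 15 | 4.7%
(not recovered) | 13 | 4.0%
(not recovered) | 12 | 3.7%
(not recovered) | 12 | 3.7%
(not recovered) | 11 | 3.4%
(not recovered) | 9 | 2.8%
(not recovered) | 9 | 2.8%
Other values (84) | 179 | 55.8%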

업체명
Text

Distinct: 37
Distinct (%): 97.4%
Missing: 0
Missing (%): 0.0%
Memory size: 436.0 B

Length

Max length: 15
Median length: 10.5
Mean length: 5.9473684
Min length: 3

Characters and Unicode

Total characters: 226
Distinct characters: 97
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 36
Unique (%): 94.7%

Sample

1st row: 맑은누리파크
2nd row: 음식물자원화시설
3rd row: 경주시장
4th row: 김천시 음식물처리시설
5th row: 구미시 남은음식물 사료화시설

Value | Count | Frequency (%)
울릉군수 | 2 | 4.8%
음식물자원화시설 | 1 | 2.4%
제일산업 | 1 | 2.4%
금봉양돈 | 1 | 2.4%
계명농장 | 1 | 2.4%
황농장 | 1 | 2.4%
부자농장 | 1 | 2.4%
은성농장 | 1 | 2.4%
태백농장 | 1 | 2.4%
제2범수농장 | 1 | 2.4%
Other values (31) | 31 | 73.8%

Most occurring characters

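Value | Count | Frequency (%)
(not recovered) | 12 | 5.3%
(not recovered) | 10 | 4.4%
(not recovered) | 9 | 4.0%
(not recovered) | 9 | 4.0%
(not recovered) | 6 | 2.7%
) | 5 | 2.2%
( | 5 | 2.2%
(not recovered) | 5 | 2.2%
(not recovered) | 5 | 2.2%
(not recovered) | 5 | 2.2%
Other values (87) | 155 | 68.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 201 | 88.9%
Other Symbol | 10 | 4.4%
Close Punctuation | 5 | 2.2%
Open Punctuation | 5 | 2.2%
Space Separator | 4 | 1.8%
Decimal Number | 1 | 0.4%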
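
The Other Symbol characters themselves are not shown in this table. One candidate, visible in the sample rows at the end of the report, is the enclosed form ㈜ (U+321C), which appears alongside the spelled-out (주) in other company names. Assuming that is the symbol in question, Unicode NFKC normalization is one way to fold both spellings together. `normalize_company` is a hypothetical helper and not part of the profiled data.

```python
import unicodedata

def normalize_company(name):
    """Fold compatibility characters such as ㈜ into their plain equivalents."""
    return unicodedata.normalize("NFKC", name).strip()

# General category of the enclosed form (So = Other Symbol).
print(unicodedata.category("㈜"))
# Company name taken from the sample rows of this report.
print(normalize_company("㈜동양그린바이오"))
```

NFKC replaces the single enclosed character with the three-character sequence (주), so names written either way would compare equal after normalization.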

Most frequent character per category

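Other Letter
Value | Count | Frequency (%)
(not recovered) | 12 | 6.0%
(not recovered) | 9 | 4.5%
(not recovered) | 9 | 4.5%
(not recovered) | 6 | 3.0%
(not recovered) | 5 | 2.5%
(not recovered) | 5 | 2.5%
(not recovered) | 5 | 2.5%
(not recovered) | 5 | 2.5%
(not recovered) | 4 | 2.0%
(not recovered) | 4 | 2.0%
Other values (82) | 137 | 68.2%

Other Symbol
Value | Count | Frequency (%)
(not recovered) | 10 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 5 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 5 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 4 | 100.0%

Decimal Number
Value | Count | Frequency (%)
2 | 1 | 100.0%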

Most occurring scripts

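Value | Count | Frequency (%)
Hangul | 211 | 93.4%
Common | 15 | 6.6%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not recovered) | 12 | 5.7%
(not recovered) | 10 | 4.7%
(not recovered) | 9 | 4.3%
(not recovered) | 9 | 4.3%
(not recovered) | 6 | 2.8%
(not recovered) | 5 | 2.4%
(not recovered) | 5 | 2.4%
(not recovered) | 5 | 2.4%
(not recovered) | 5 | 2.4%
(not recovered) | 4 | 1.9%
Other values (83) | 141 | 66.8%

Common
Value | Count | Frequency (%)
) | 5 | 33.3%
( | 5 | 33.3%
(space) | 4 | 26.7%
2 | 1 | 6.7%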

Most occurring blocks

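Value | Count | Frequency (%)
Hangul | 201 | 88.9%
ASCII | 15 | 6.6%
None | 10 | 4.4%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not recovered) | 12 | 6.0%
(not recovered) | 9 | 4.5%
(not recovered) | 9 | 4.5%
(not recovered) | 6 | 3.0%
(not recovered) | 5 | 2.5%
(not recovered) | 5 | 2.5%
(not recovered) | 5 | 2.5%
(not recovered) | 5 | 2.5%
(not recovered) | 4 | 2.0%
(not recovered) | 4 | 2.0%
Other values (82) | 137 | 68.2%

None
Value | Count | Frequency (%)
(not recovered) | 10 | 100.0%

ASCII
Value | Count | Frequency (%)
) | 5 | 33.3%
( | 5 | 33.3%
(space) | 4 | 26.7%
2 | 1 | 6.7%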

시설용량(톤_일)
Real number (ℝ)

Distinct: 24
Distinct (%): 63.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 39.176842
Minimum: 1
Maximum: 120
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 474.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1.017
Q1: 2.425
median: 25
Q3: 69.5
95-th percentile: 99.6
Maximum: 120
Range: 119
Interquartile range (IQR): 67.075

Descriptive statistics

Standard deviation: 39.314478
Coefficient of variation (CV): 1.0035132
Kurtosis: -0.99104188
Mean: 39.176842
Median Absolute Deviation (MAD): 23.25
Skewness: 0.67158827
Sum: 1488.72
Variance: 1545.6282
Monotonicity: Not monotonic
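
Several of these figures are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the square of the standard deviation, the coefficient of variation is the standard deviation divided by the mean, and the IQR is Q3 minus Q1. The sketch below checks those identities against the reported values (n = 38 comes from the dataset statistics). It does not recompute anything from the raw data.

```python
import math

# Values copied from this report.
n = 38
total = 1488.72
mean = 39.176842
std = 39.314478
variance = 1545.6282
cv = 1.0035132
q1, q3, iqr = 2.425, 69.5, 67.075
minimum, maximum, value_range = 1, 120, 119

checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "variance = std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "cv = std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "iqr = q3 - q1": math.isclose(q3 - q1, iqr, rel_tol=1e-9),
    "range = max - min": maximum - minimum == value_range,
}
for name, ok in checks.items():
    print(f"{name}: {'ok' if ok else 'mismatch'}")
```

A CV close to 1 means the spread is about as large as the mean itself, which fits a column that mixes facilities of 1 to 2 tons per day with facilities of 90 to 120.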
Histogram with fixed size bins (bins=24)

Common Values

Value | Count | Frequency (%)
2.0 | 4 | 10.5%
20.0 | 3 | 7.9%
25.0 | 3 | 7.9%
90.0 | 3 | 7.9%
120.0 | 2 | 5.3%
1.0 | 2 | 5.3%
96.0 | 2 | 5.3%
50.0 | 2 | 5.3%
60.0 | 2 | 5.3%
6.0 | 1 | 2.6%
Other values (14) | 14 | 36.8%

Minimum 10 values

Value | Count | Frequency (%)
1.0 | 2 | 5.3%
1.02 | 1 | 2.6%
1.5 | 1 | 2.6%
2.0 | 4 | 10.5%
2.3 | 1 | 2.6%
2.4 | 1 | 2.6%
2.5 | 1 | 2.6%
4.0 | 1 | 2.6%
5.0 | 1 | 2.6%
6.0 | 1 | 2.6%

Maximum 10 values

Value | Count | Frequency (%)
120.0 | 2 | 5.3%
96.0 | 2 | 5.3%
95.0 | 1 | 2.6%
90.0 | 3 | 7.9%
88.0 | 1 | 2.6%
70.0 | 1 | 2.6%
68.0 | 1 | 2.6%
60.0 | 2 | 5.3%
50.0 | 2 | 5.3%
36.0 | 1 | 2.6%

시설종류
Categorical

Distinct: 10
Distinct (%): 26.3%
Missing: 0
Missing (%): 0.0%
Memory size: 436.0 B

Value | Count
사료화 | 18
퇴비화 | 10
하수병합 | 2
바이오가스화 | 2
바이오가스 | 1
Other values (5) | 5

Length

Max length: 8
Median length: 3
Mean length: 3.4736842
Min length: 2

Unique

Unique: 6
Unique (%): 15.8%
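
Distinct and Unique measure different things: Distinct counts how many different values occur, while Unique counts the values that occur exactly once. A small sketch using the value counts reported for this variable (the `summarize` helper is hypothetical):

```python
from collections import Counter

# Value counts as reported for 시설종류 in this report.
counts = Counter({
    "사료화": 18, "퇴비화": 10, "하수병합": 2, "바이오가스화": 2,
    "바이오가스": 1, "파쇄,탈수": 1, "건조": 1,
    "사료화, 비료화": 1, "사료·퇴비화": 1, "사료": 1,
})

def summarize(counts):
    """Return (distinct, unique, n), where unique = values seen exactly once."""
    n = sum(counts.values())
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique, n

distinct, unique, n = summarize(counts)
print(f"Distinct: {distinct} ({100 * distinct / n:.1f}%)")
print(f"Unique: {unique} ({100 * unique / n:.1f}%)")
```

With these counts the sketch gives 10 distinct values (26.3%) and 6 unique values (15.8%) out of 38 rows, matching the figures above.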

Sample

1st row: 바이오가스
2nd row: 하수병합
3rd row: 사료화
4th row: 하수병합
5th row: 사료화

Common Values

Value | Count | Frequency (%)
사료화 | 18 | 47.4%
퇴비화 | 10 | 26.3%
하수병합 | 2 | 5.3%
바이오가스화 | 2 | 5.3%
바이오가스 | 1 | 2.6%
파쇄,탈수 | 1 | 2.6%
건조 | 1 | 2.6%
사료화, 비료화 | 1 | 2.6%
사료·퇴비화 | 1 | 2.6%
사료 | 1 | 2.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
사료화 | 19 | 48.7%
퇴비화 | 10 | 25.6%
하수병합 | 2 | 5.1%
바이오가스화 | 2 | 5.1%
바이오가스 | 1 | 2.6%
파쇄,탈수 | 1 | 2.6%
건조 | 1 | 2.6%
비료화 | 1 | 2.6%
사료·퇴비화 | 1 | 2.6%
사료 | 1 | 2.6%
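
This table counts 39 items rather than 38 rows: 사료화 rises from 18 to 19 and 비료화 appears on its own. That pattern is consistent with word-level counting, where each value is split on whitespace and surrounding punctuation is stripped, so that `사료화, 비료화` contributes one token to each word. The sketch below reproduces the table under that assumption. The tokenization rule is inferred from the numbers and is not confirmed by the report.

```python
import string
from collections import Counter

# Row-level value counts from the Common Values table.
value_counts = {
    "사료화": 18, "퇴비화": 10, "하수병합": 2, "바이오가스화": 2,
    "바이오가스": 1, "파쇄,탈수": 1, "건조": 1,
    "사료화, 비료화": 1, "사료·퇴비화": 1, "사료": 1,
}

def word_counts(value_counts):
    """Split each value on whitespace, strip ASCII punctuation, count tokens."""
    words = Counter()
    for value, n in value_counts.items():
        for token in value.split():
            token = token.strip(string.punctuation)
            if token:
                words[token] += n
    return words

words = word_counts(value_counts)
total = sum(words.values())
for word, n in words.most_common():
    print(f"{word}: {n} ({100 * n / total:.1f}%)")
```

Note that `파쇄,탈수` stays a single token because its comma has no space after it, which also matches the table.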

Interactions


Correlations

Variable | 구분 | 소 재 지 | 업체명 | 시설용량(톤_일) | 시설종류
구분 | 1.000 | 1.000 | 1.000 | 0.448 | 0.563
소 재 지 | 1.000 | 1.000 | 1.000 | 0.908 | 0.833
업체명 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000
시설용량(톤_일) | 0.448 | 0.908 | 1.000 | 1.000 | 0.621
시설종류 | 0.563 | 0.833 | 0.000 | 0.621 | 1.000

Variable | 구분 | 시설종류
구분 | 1.000 | 0.356
시설종류 | 0.356 | 1.000

Variable | 시설용량(톤_일) | 구분 | 시설종류
시설용량(톤_일) | 1.000 | 0.311 | 0.348
구분 | 0.311 | 1.000 | 0.356
시설종류 | 0.348 | 0.356 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 구분 | 소 재 지 | 업체명 | 시설용량(톤_일) | 시설종류
0 | 공공시설 | 안동시 풍천면 도양리 1424 | 맑은누리파크 | 120.0 | 바이오가스
1 | 공공시설 | 경주시 외동읍 문산리 864-5 | 음식물자원화시설 | 4.0 | 하수병합
2 | 공공시설 | 경주시 천군동 1519 | 경주시장 | 68.0 | 사료화
3 | 공공시설 | 김천시 대광동 850 | 김천시 음식물처리시설 | 20.0 | 하수병합
4 | 공공시설 | 칠곡군 석적읍 3공단1로 62-6 | 구미시 남은음식물 사료화시설 | 95.0 | 사료화
5 | 공공시설 | 영천시 금호읍 구암리 724-2 | 영천시장 | 30.0 | 바이오가스화
6 | 공공시설 | 상주시 낙동면 분황리 464-11 | 상주시 축산환경사업소 | 25.0 | 퇴비화
7 | 공공시설 | 칠곡군 왜관읍 강변대로 888 | 칠곡군수 | 20.0 | 파쇄,탈수
8 | 공공시설 | 울진군 근남면 수산리198-1 | 가축분뇨공공처리시설 | 25.0 | 바이오가스화
9 | 공공시설 | 울릉군 서면 남서리 산 592 | 울릉군수 | 6.0 | 퇴비화

Last rows

Index | 구분 | 소 재 지 | 업체명 | 시설용량(톤_일) | 시설종류
28 | 민간시설 | 고령 다산 월암 | (주)원일환경 | 96.0 | 사료화
29 | 민간시설 | 고령 성산 지리골 | 제일산업 | 50.0 | 퇴비화
30 | 민간시설 | 고령 다산 다산산단 | (주)오케이산업 | 96.0 | 사료화
31 | 민간시설 | 성주군 선남면 선노로 545-37 | (주)앞선환경 | 60.0 | 사료화
32 | 민간시설 | 예천군 개포면 용개로 1416-60 | 대경운송 | 10.0 | 사료
33 | 민간시설 | 칠곡 약목 복성 | 그린농장 | 2.0 | 사료화
34 | 민간시설 | 봉화군 문단1길 475 | (영)봉화계분비료공장 | 90.0 | 퇴비화
35 | 민간시설 | 봉화군 의상로 538-8 | ㈜동양그린바이오 | 120.0 | 퇴비화
36 | 민간시설 | 봉화군 봉성면 금봉리 595 | 금봉양돈 | 1.0 | 사료화
37 | 민간시설 | 봉화군 상운면 운계리 242 | 청량농장 | 1.0 | 사료화