Overview

Dataset statistics

Number of variables: 4
Number of observations: 809
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 133
Duplicate rows (%): 16.4%
Total size in memory: 25.4 KiB
Average record size in memory: 32.2 B
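
The two derived figures above can be checked against the raw counts. The following is a minimal sketch of that arithmetic. It assumes the duplicate percentage is the duplicate count divided by the number of observations, and that the average record size is the total memory divided by the number of observations. The variable names are illustrative and do not come from the report.

```python
# Raw figures copied from the dataset statistics above.
n_observations = 809
n_duplicate_rows = 133
total_memory_kib = 25.4  # already rounded in the report

# Assumed definitions: share of observations, and bytes per observation.
duplicate_pct = 100 * n_duplicate_rows / n_observations
avg_record_bytes = total_memory_kib * 1024 / n_observations

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Because the total memory figure is itself rounded, the recomputed record size is only approximate.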

Variable types

Text: 3
DateTime: 1

Dataset

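Description: Data on the registration status of business-site waste generators in Jecheon-si, Chungcheongbuk-do. The provided fields include the business name (상호), the lot-number address of the business site (사업장 지번주소), and the waste type (폐기물종류).
Author: 충청북도 제천시 (Jecheon-si, Chungcheongbuk-do)
URL: https://www.data.go.kr/data/15060360/fileData.do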

Alerts

데이터기준일 has constant value "" (Constant)
Dataset has 133 (16.4%) duplicate rows (Duplicates)

Reproduction

Analysis started: 2023-12-12 12:36:54.874720
Analysis finished: 2023-12-12 12:36:55.488713
Duration: 0.61 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

상호
Text

Distinct: 259
Distinct (%): 32.0%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

Length

Max length: 29
Median length: 19
Mean length: 9.4313968
Min length: 2
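
As an illustration of how length statistics like these are derived, here is a minimal sketch using only the five sample values of 상호 listed in this report. It is not the full column, so the numbers differ from the ones above. It assumes length means the number of Unicode code points in each value.

```python
from statistics import mean, median

# The five sample values of 상호 shown in this report (not the full column).
sample = [
    "제천광역친환경영농조합법인",
    "제천광역친환경영농조합법인",
    "청풍호노인사랑병원",
    "이환산업",
    "이환산업",
]

# len() on a str counts code points, which is the assumed notion of length.
lengths = [len(value) for value in sample]
stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(stats)
```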

Characters and Unicode

Total characters: 7630
Distinct characters: 285
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
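
A minimal sketch of the category part of this analysis, using Python's standard `unicodedata` module. The helper name `category_counts` and the two input strings (business names that appear in this report) are chosen for illustration. The standard library reports two-letter category codes such as `Lo` (Other Letter), `Ps` (Open Punctuation) and `Pe` (Close Punctuation). It does not expose scripts or blocks, so those are left out here.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in the values."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


counts = category_counts(["(주)지구인컴퍼니", "세명대학교"])
for category, n in counts.most_common():
    print(category, n)
```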

Unique

Unique: 121
Unique (%): 15.0%
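
The report lists both a distinct count and a unique count. The sketch below shows one common reading of the two, assuming "distinct" means the number of different values and "unique" means the number of values that occur exactly once. The function name is illustrative, and the input is only the five sample values of 상호 shown in this report.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for n in counts.values() if n == 1)
    return n_distinct, n_unique


sample = [
    "제천광역친환경영농조합법인",
    "제천광역친환경영농조합법인",
    "청풍호노인사랑병원",
    "이환산업",
    "이환산업",
]
n_distinct, n_unique = distinct_and_unique(sample)
print(n_distinct, n_unique)
```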

Sample

1st row: 제천광역친환경영농조합법인
2nd row: 제천광역친환경영농조합법인
3rd row: 청풍호노인사랑병원
4th row: 이환산업
5th row: 이환산업

| Value | Count | Frequency (%) |
| 주식회사 | 73 | 7.2% |
| 한국철도공사 | 24 | 2.4% |
| 아세아시멘트(주)제천 | 22 | 2.2% |
| 일진글로벌 | 20 | 2.0% |
| 코스맥스바이오(주 | 20 | 2.0% |
| 대림비앤코주식회사 | 19 | 1.9% |
| 주)케이엠 | 16 | 1.6% |
| 청풍산업 | 14 | 1.4% |
| 성대산업(주 | 14 | 1.4% |
| 세명대학교 | 13 | 1.3% |
| Other values (271) | 776 | 76.8% |
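
Tokens such as `주)케이엠` and `코스맥스바이오(주` in the word counts above look like whitespace-separated words with punctuation stripped from both ends. The sketch below reproduces that behaviour under this assumption. It is not taken from the profiling tool's source, and the function name and inputs are illustrative.

```python
import string
from collections import Counter


def word_counts(values):
    """Split on whitespace, strip ASCII punctuation from both ends, then count."""
    counts = Counter()
    for value in values:
        for token in value.split():
            word = token.strip(string.punctuation)
            if word:  # skip tokens that were punctuation only
                counts[word] += 1
    return counts


counts = word_counts(["(주)케이엠", "코스맥스바이오(주)", "주식회사 청풍산업"])
for word, n in counts.items():
    print(word, n)
```

Stripping only at the ends explains why an inner parenthesis survives, as in `아세아시멘트(주)제천`, while a leading or trailing one is lost.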

Most occurring characters

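| Value | Count | Frequency (%) |
|  | 651 | 8.5% |
| ( | 572 | 7.5% |
| ) | 572 | 7.5% |
|  | 223 | 2.9% |
| (space) | 202 | 2.6% |
|  | 200 | 2.6% |
|  | 188 | 2.5% |
|  | 152 | 2.0% |
|  | 148 | 1.9% |
|  | 145 | 1.9% |
| Other values (275) | 4577 | 60.0% |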

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 6207 | 81.3% |
| Open Punctuation | 572 | 7.5% |
| Close Punctuation | 572 | 7.5% |
| Space Separator | 202 | 2.6% |
| Decimal Number | 67 | 0.9% |
| Uppercase Letter | 10 | 0.1% |

Most frequent character per category

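Other Letter
| Value | Count | Frequency (%) |
|  | 651 | 10.5% |
|  | 223 | 3.6% |
|  | 200 | 3.2% |
|  | 188 | 3.0% |
|  | 152 | 2.4% |
|  | 148 | 2.4% |
|  | 145 | 2.3% |
|  | 142 | 2.3% |
|  | 135 | 2.2% |
|  | 125 | 2.0% |
| Other values (262) | 4098 | 66.0% |

Decimal Number
| Value | Count | Frequency (%) |
| 2 | 21 | 31.3% |
| 3 | 19 | 28.4% |
| 1 | 11 | 16.4% |
| 4 | 6 | 9.0% |
| 6 | 5 | 7.5% |
| 0 | 5 | 7.5% |

Uppercase Letter
| Value | Count | Frequency (%) |
| D | 4 | 40.0% |
| S | 4 | 40.0% |
| V | 1 | 10.0% |
| I | 1 | 10.0% |

Open Punctuation
| Value | Count | Frequency (%) |
| ( | 572 | 100.0% |

Close Punctuation
| Value | Count | Frequency (%) |
| ) | 572 | 100.0% |

Space Separator
| Value | Count | Frequency (%) |
| (space) | 202 | 100.0% |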

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 6207 | 81.3% |
| Common | 1413 | 18.5% |
| Latin | 10 | 0.1% |

Most frequent character per script

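Hangul
| Value | Count | Frequency (%) |
|  | 651 | 10.5% |
|  | 223 | 3.6% |
|  | 200 | 3.2% |
|  | 188 | 3.0% |
|  | 152 | 2.4% |
|  | 148 | 2.4% |
|  | 145 | 2.3% |
|  | 142 | 2.3% |
|  | 135 | 2.2% |
|  | 125 | 2.0% |
| Other values (262) | 4098 | 66.0% |

Common
| Value | Count | Frequency (%) |
| ( | 572 | 40.5% |
| ) | 572 | 40.5% |
| (space) | 202 | 14.3% |
| 2 | 21 | 1.5% |
| 3 | 19 | 1.3% |
| 1 | 11 | 0.8% |
| 4 | 6 | 0.4% |
| 6 | 5 | 0.4% |
| 0 | 5 | 0.4% |

Latin
| Value | Count | Frequency (%) |
| D | 4 | 40.0% |
| S | 4 | 40.0% |
| V | 1 | 10.0% |
| I | 1 | 10.0% |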

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 6207 | 81.3% |
| ASCII | 1423 | 18.7% |

Most frequent character per block

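Hangul
| Value | Count | Frequency (%) |
|  | 651 | 10.5% |
|  | 223 | 3.6% |
|  | 200 | 3.2% |
|  | 188 | 3.0% |
|  | 152 | 2.4% |
|  | 148 | 2.4% |
|  | 145 | 2.3% |
|  | 142 | 2.3% |
|  | 135 | 2.2% |
|  | 125 | 2.0% |
| Other values (262) | 4098 | 66.0% |

ASCII
| Value | Count | Frequency (%) |
| ( | 572 | 40.2% |
| ) | 572 | 40.2% |
| (space) | 202 | 14.2% |
| 2 | 21 | 1.5% |
| 3 | 19 | 1.3% |
| 1 | 11 | 0.8% |
| 4 | 6 | 0.4% |
| 6 | 5 | 0.4% |
| 0 | 5 | 0.4% |
| D | 4 | 0.3% |
| Other values (3) | 6 | 0.4% |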

사업장지번주소
Text

Distinct: 246
Distinct (%): 30.4%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

Length

Max length: 36
Median length: 32
Mean length: 20.15204
Min length: 13

Characters and Unicode

Total characters: 16303
Distinct characters: 186
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 119
Unique (%): 14.7%

Sample

1st row: 충청북도 제천시 금성면 동막리 산 165-1
2nd row: 충청북도 제천시 금성면 동막리 산 165-1
3rd row: 충청북도 제천시 금성면 구룡리 25
4th row: 충청북도 제천시 송학면 포전리 351
5th row: 충청북도 제천시 송학면 포전리 351

| Value | Count | Frequency (%) |
| 충청북도 | 795 | 21.7% |
| 제천시 | 786 | 21.4% |
| 왕암동 | 232 | 6.3% |
| 봉양읍 | 120 | 3.3% |
| 송학면 | 116 | 3.2% |
| 고암동 | 57 | 1.6% |
| 강제동 | 51 | 1.4% |
| 금성면 | 42 | 1.1% |
| 연박리 | 41 | 1.1% |
| 입석리 | 41 | 1.1% |
| Other values (365) | 1388 | 37.8% |

Most occurring characters

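| Value | Count | Frequency (%) |
| (space) | 3537 | 21.7% |
|  | 874 | 5.4% |
|  | 867 | 5.3% |
|  | 861 | 5.3% |
|  | 839 | 5.1% |
|  | 821 | 5.0% |
|  | 798 | 4.9% |
|  | 798 | 4.9% |
| 1 | 568 | 3.5% |
|  | 525 | 3.2% |
| Other values (176) | 5815 | 35.7% |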

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 9518 | 58.4% |
| Space Separator | 3537 | 21.7% |
| Decimal Number | 2814 | 17.3% |
| Dash Punctuation | 352 | 2.2% |
| Uppercase Letter | 62 | 0.4% |
| Math Symbol | 6 | < 0.1% |
| Open Punctuation | 6 | < 0.1% |
| Close Punctuation | 6 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |

Most frequent character per category

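Other Letter
| Value | Count | Frequency (%) |
|  | 874 | 9.2% |
|  | 867 | 9.1% |
|  | 861 | 9.0% |
|  | 839 | 8.8% |
|  | 821 | 8.6% |
|  | 798 | 8.4% |
|  | 798 | 8.4% |
|  | 525 | 5.5% |
|  | 309 | 3.2% |
|  | 291 | 3.1% |
| Other values (148) | 2535 | 26.6% |

Uppercase Letter
| Value | Count | Frequency (%) |
| D | 14 | 22.6% |
| M | 9 | 14.5% |
| B | 7 | 11.3% |
| A | 6 | 9.7% |
| T | 6 | 9.7% |
| E | 5 | 8.1% |
| R | 5 | 8.1% |
| P | 4 | 6.5% |
| G | 4 | 6.5% |
| I | 1 | 1.6% |

Decimal Number
| Value | Count | Frequency (%) |
| 1 | 568 | 20.2% |
| 3 | 375 | 13.3% |
| 2 | 356 | 12.7% |
| 4 | 323 | 11.5% |
| 9 | 281 | 10.0% |
| 5 | 243 | 8.6% |
| 6 | 175 | 6.2% |
| 0 | 170 | 6.0% |
| 8 | 164 | 5.8% |
| 7 | 159 | 5.7% |

Space Separator
| Value | Count | Frequency (%) |
| (space) | 3537 | 100.0% |

Dash Punctuation
| Value | Count | Frequency (%) |
| - | 352 | 100.0% |

Math Symbol
| Value | Count | Frequency (%) |
| ~ | 6 | 100.0% |

Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6 | 100.0% |

Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6 | 100.0% |

Other Punctuation
| Value | Count | Frequency (%) |
| @ | 1 | 100.0% |

Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 | 100.0% |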

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 9518 | 58.4% |
| Common | 6723 | 41.2% |
| Latin | 62 | 0.4% |

Most frequent character per script

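Hangul
| Value | Count | Frequency (%) |
|  | 874 | 9.2% |
|  | 867 | 9.1% |
|  | 861 | 9.0% |
|  | 839 | 8.8% |
|  | 821 | 8.6% |
|  | 798 | 8.4% |
|  | 798 | 8.4% |
|  | 525 | 5.5% |
|  | 309 | 3.2% |
|  | 291 | 3.1% |
| Other values (148) | 2535 | 26.6% |

Common
| Value | Count | Frequency (%) |
| (space) | 3537 | 52.6% |
| 1 | 568 | 8.4% |
| 3 | 375 | 5.6% |
| 2 | 356 | 5.3% |
| - | 352 | 5.2% |
| 4 | 323 | 4.8% |
| 9 | 281 | 4.2% |
| 5 | 243 | 3.6% |
| 6 | 175 | 2.6% |
| 0 | 170 | 2.5% |
| Other values (7) | 343 | 5.1% |

Latin
| Value | Count | Frequency (%) |
| D | 14 | 22.6% |
| M | 9 | 14.5% |
| B | 7 | 11.3% |
| A | 6 | 9.7% |
| T | 6 | 9.7% |
| E | 5 | 8.1% |
| R | 5 | 8.1% |
| P | 4 | 6.5% |
| G | 4 | 6.5% |
| I | 1 | 1.6% |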

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 9518 | 58.4% |
| ASCII | 6785 | 41.6% |

Most frequent character per block

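ASCII
| Value | Count | Frequency (%) |
| (space) | 3537 | 52.1% |
| 1 | 568 | 8.4% |
| 3 | 375 | 5.5% |
| 2 | 356 | 5.2% |
| - | 352 | 5.2% |
| 4 | 323 | 4.8% |
| 9 | 281 | 4.1% |
| 5 | 243 | 3.6% |
| 6 | 175 | 2.6% |
| 0 | 170 | 2.5% |
| Other values (18) | 405 | 6.0% |

Hangul
| Value | Count | Frequency (%) |
|  | 874 | 9.2% |
|  | 867 | 9.1% |
|  | 861 | 9.0% |
|  | 839 | 8.8% |
|  | 821 | 8.6% |
|  | 798 | 8.4% |
|  | 798 | 8.4% |
|  | 525 | 5.5% |
|  | 309 | 3.2% |
|  | 291 | 3.1% |
| Other values (148) | 2535 | 26.6% |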

폐기물 종류
Text

Distinct: 100
Distinct (%): 12.4%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

Length

Max length: 84
Median length: 54
Mean length: 13.530284
Min length: 2

Characters and Unicode

Total characters: 10946
Distinct characters: 201
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 4

Unique

Unique: 33
Unique (%): 4.1%

Sample

1st row: 폐합성수지류(폐염화비닐수지류는 제외한다)
2nd row: 폐합성수지류(폐염화비닐수지류는 제외한다)
3rd row: 그 밖의 폐섬유
4th row: 그 밖의 폐목재류
5th row: 폐합성수지류(폐염화비닐수지류는 제외한다)

| Value | Count | Frequency (%) |
|  | 195 | 10.2% |
| 밖의 | 195 | 10.2% |
| 제외한다 | 195 | 10.2% |
| 폐합성수지류(폐염화비닐수지류는 | 173 | 9.0% |
| 폐수처리오니 | 54 | 2.8% |
| 폐합성수지류 | 53 | 2.8% |
| 폐목재류 | 46 | 2.4% |
| 공정오니 | 43 | 2.2% |
| 발생한 | 33 | 1.7% |
| 과정에서 | 32 | 1.7% |
| Other values (162) | 893 | 46.7% |

Most occurring characters

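| Value | Count | Frequency (%) |
| (space) | 1109 | 10.1% |
|  | 847 | 7.7% |
|  | 530 | 4.8% |
|  | 522 | 4.8% |
|  | 462 | 4.2% |
|  | 366 | 3.3% |
|  | 306 | 2.8% |
|  | 268 | 2.4% |
|  | 247 | 2.3% |
| ( | 244 | 2.2% |
| Other values (191) | 6045 | 55.2% |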

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 9253 | 84.5% |
| Space Separator | 1109 | 10.1% |
| Open Punctuation | 245 | 2.2% |
| Close Punctuation | 245 | 2.2% |
| Connector Punctuation | 61 | 0.6% |
| Decimal Number | 16 | 0.1% |
| Lowercase Letter | 12 | 0.1% |
| Other Punctuation | 3 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |

Most frequent character per category

Other Letter
| Value | Count | Frequency (%) |
|  | 847 | 9.2% |
|  | 530 | 5.7% |
|  | 522 | 5.6% |
|  | 462 | 5.0% |
|  | 366 | 4.0% |
|  | 306 | 3.3% |
|  | 268 | 2.9% |
|  | 247 | 2.7% |
|  | 239 | 2.6% |
|  | 237 | 2.6% |
| Other values (175) | 5229 | 56.5% |

Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4 | 33.3% |
| t | 2 | 16.7% |
| a | 2 | 16.7% |
| h | 2 | 16.7% |
| l | 2 | 16.7% |

Decimal Number
| Value | Count | Frequency (%) |
| 1 | 14 | 87.5% |
| 3 | 1 | 6.2% |
| 2 | 1 | 6.2% |

Open Punctuation
| Value | Count | Frequency (%) |
| ( | 244 | 99.6% |
|  | 1 | 0.4% |

Close Punctuation
| Value | Count | Frequency (%) |
| ) | 244 | 99.6% |
|  | 1 | 0.4% |

Space Separator
| Value | Count | Frequency (%) |
| (space) | 1109 | 100.0% |

Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 61 | 100.0% |

Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | 100.0% |

Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | 100.0% |

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 9253 | 84.5% |
| Common | 1679 | 15.3% |
| Latin | 14 | 0.1% |

Most frequent character per script

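Hangul
| Value | Count | Frequency (%) |
|  | 847 | 9.2% |
|  | 530 | 5.7% |
|  | 522 | 5.6% |
|  | 462 | 5.0% |
|  | 366 | 4.0% |
|  | 306 | 3.3% |
|  | 268 | 2.9% |
|  | 247 | 2.7% |
|  | 239 | 2.6% |
|  | 237 | 2.6% |
| Other values (175) | 5229 | 56.5% |

Common
| Value | Count | Frequency (%) |
| (space) | 1109 | 66.1% |
| ( | 244 | 14.5% |
| ) | 244 | 14.5% |
| _ | 61 | 3.6% |
| 1 | 14 | 0.8% |
| . | 3 | 0.2% |
| 3 | 1 | 0.1% |
| 2 | 1 | 0.1% |
|  | 1 | 0.1% |
|  | 1 | 0.1% |

Latin
| Value | Count | Frequency (%) |
| e | 4 | 28.6% |
| t | 2 | 14.3% |
| a | 2 | 14.3% |
| h | 2 | 14.3% |
| C | 2 | 14.3% |
| l | 2 | 14.3% |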

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 9178 | 83.8% |
| ASCII | 1691 | 15.4% |
| Compat Jamo | 75 | 0.7% |
| None | 2 | < 0.1% |

Most frequent character per block

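ASCII
| Value | Count | Frequency (%) |
| (space) | 1109 | 65.6% |
| ( | 244 | 14.4% |
| ) | 244 | 14.4% |
| _ | 61 | 3.6% |
| 1 | 14 | 0.8% |
| e | 4 | 0.2% |
| . | 3 | 0.2% |
| t | 2 | 0.1% |
| a | 2 | 0.1% |
| h | 2 | 0.1% |
| Other values (4) | 6 | 0.4% |

Hangul
| Value | Count | Frequency (%) |
|  | 847 | 9.2% |
|  | 530 | 5.8% |
|  | 522 | 5.7% |
|  | 462 | 5.0% |
|  | 366 | 4.0% |
|  | 306 | 3.3% |
|  | 268 | 2.9% |
|  | 247 | 2.7% |
|  | 239 | 2.6% |
|  | 237 | 2.6% |
| Other values (174) | 5154 | 56.2% |

Compat Jamo
| Value | Count | Frequency (%) |
|  | 75 | 100.0% |

None
| Value | Count | Frequency (%) |
|  | 1 | 50.0% |
|  | 1 | 50.0% |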

데이터기준일
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB
Minimum: 2023-11-28 00:00:00
Maximum: 2023-11-28 00:00:00
Histogram with fixed size bins (bins=1)

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

| # | 상호 | 사업장지번주소 | 폐기물 종류 | 데이터기준일 |
| 0 | 제천광역친환경영농조합법인 | 충청북도 제천시 금성면 동막리 산 165-1 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 2023-11-28 |
| 1 | 제천광역친환경영농조합법인 | 충청북도 제천시 금성면 동막리 산 165-1 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 2023-11-28 |
| 2 | 청풍호노인사랑병원 | 충청북도 제천시 금성면 구룡리 25 | 그 밖의 폐섬유 | 2023-11-28 |
| 3 | 이환산업 | 충청북도 제천시 송학면 포전리 351 | 그 밖의 폐목재류 | 2023-11-28 |
| 4 | 이환산업 | 충청북도 제천시 송학면 포전리 351 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 2023-11-28 |
| 5 | (주)지구인컴퍼니 | 충청북도 제천시 왕암동 0 | 그 밖의 폐수처리오니 | 2023-11-28 |
| 6 | (주)지구인컴퍼니 | 충청북도 제천시 왕암동 0 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 2023-11-28 |
| 7 | (주)지구인컴퍼니 | 충청북도 제천시 왕암동 0 | 음식물류폐기물 | 2023-11-28 |
| 8 | (주)지구인컴퍼니 | 충청북도 제천시 왕암동 0 | 폐식용유(식용을 목적으로 식품 재료와 원료를 제조ㆍ조리ㆍ가공하거나 식용유를 유통ㆍ사용 또는 음식물류 폐기물을 처리하는 과정에서 발생하는 기름을 말한다) | 2023-11-28 |
| 9 | (주)지구인컴퍼니 | 충청북도 제천시 왕암동 0 | 그 밖의 분진 | 2023-11-28 |

| # | 상호 | 사업장지번주소 | 폐기물 종류 | 데이터기준일 |
| 799 | 대림비앤코주식회사 | 충청북도 제천시 봉양읍 주포리 1-3 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 2023-11-28 |
| 800 | 대림비앤코주식회사 | 충청북도 제천시 봉양읍 주포리 1-3 | 폐도자기조각 | 2023-11-28 |
| 801 | 대림비앤코주식회사 | 충청북도 제천시 봉양읍 주포리 1-3 | 폐도자기조각 | 2023-11-28 |
| 802 | 대림비앤코주식회사 | 충청북도 제천시 봉양읍 주포리 1-3 | 폐수처리오니 | 2023-11-28 |
| 803 | 대림비앤코주식회사 | 충청북도 제천시 봉양읍 주포리 1-3 | 폐수처리오니 | 2023-11-28 |
| 804 | 대림비앤코주식회사 | 충청북도 제천시 봉양읍 주포리 1-3 | 그 밖의 폐목재류 | 2023-11-28 |
| 805 | 대림비앤코주식회사 | 충청북도 제천시 봉양읍 주포리 1-3 | 폐내화물 | 2023-11-28 |
| 806 | 대림비앤코주식회사 | 충청북도 제천시 봉양읍 주포리 1-3 | 폐도자기조각 | 2023-11-28 |
| 807 | 대림비앤코주식회사 | 충청북도 제천시 봉양읍 주포리 1-3 | 폐도자기조각 | 2023-11-28 |
| 808 | 대림비앤코주식회사 | 충청북도 제천시 봉양읍 주포리 1-3 | 폐도자기조각 | 2023-11-28 |

Duplicate rows

Most frequently occurring

| # | 상호 | 사업장지번주소 | 폐기물 종류 | 데이터기준일 | # duplicates |
| 36 | (주)케이엠 | 충청북도 제천시 천남동 426-7 | 그 밖의 무기성오니 | 2023-11-28 | 11 |
| 97 | 엠케이코리아(주) | 충청북도 제천시 봉양읍 장평리 36 | 석재ㆍ골재폐수처리오니(석재ㆍ골재 생산 시 발생한 폐수를 처리하는 과정에서 발생한 오니로 한정한다) | 2023-11-28 | 9 |
| 75 | 삼양마이닝 주식회사 | 충청북도 제천시 금성면 대장리 292-1 | 그 밖의 공정오니 | 2023-11-28 | 8 |
| 83 | 세명대학교 | 충청북도 제천시 신월동 579 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 2023-11-28 | 8 |
| 105 | 자연환경(주) | 충청북도 제천시 고암동 145-4 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 2023-11-28 | 7 |
| 6 | (주)밀리션리사이클링 | 충청북도 제천시 송학면 시곡리 895-6 | 폐합성수지류 | 2023-11-28 | 6 |
| 47 | (주)풀잎라인 | 충청북도 제천시 고암동 145-9 | 그 밖의 동ㆍ식물성잔재물 | 2023-11-28 | 6 |
| 80 | 성대산업(주) | 충청북도 제천시 고암동 145-4 | 폐주물사 | 2023-11-28 | 6 |
| 116 | 주식회사 청풍산업 | 충청북도 제천시 봉양읍 연박리 304 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 2023-11-28 | 6 |
| 2 | (주)금보식품제천공장 | 충청북도 제천시 산곡동 64 | 축산물가공잔재물(동물성 유지류는 제외한다) | 2023-11-28 | 5 |
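
A duplicate listing like the one above can be approximated by grouping identical rows and keeping the groups that occur more than once. The sketch below does this for the first five rows of the sample table. The report does not state whether "# duplicates" counts every occurrence or only the extra copies. This sketch counts every occurrence, and the function name is illustrative.

```python
from collections import Counter


def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}


# First five sample rows as (상호, 사업장지번주소, 폐기물 종류, 데이터기준일).
rows = [
    ("제천광역친환경영농조합법인", "충청북도 제천시 금성면 동막리 산 165-1",
     "폐합성수지류(폐염화비닐수지류는 제외한다)", "2023-11-28"),
    ("제천광역친환경영농조합법인", "충청북도 제천시 금성면 동막리 산 165-1",
     "폐합성수지류(폐염화비닐수지류는 제외한다)", "2023-11-28"),
    ("청풍호노인사랑병원", "충청북도 제천시 금성면 구룡리 25",
     "그 밖의 폐섬유", "2023-11-28"),
    ("이환산업", "충청북도 제천시 송학면 포전리 351",
     "그 밖의 폐목재류", "2023-11-28"),
    ("이환산업", "충청북도 제천시 송학면 포전리 351",
     "폐합성수지류(폐염화비닐수지류는 제외한다)", "2023-11-28"),
]

groups = duplicate_groups(rows)
for row, n in groups.items():
    print(n, row[0], row[2])
```

Rows count as duplicates only when all four fields match, which is why the two 이환산업 rows with different waste types are not grouped.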