Overview

Dataset statistics

Number of variables6
Number of observations225
Missing cells20
Missing cells (%)1.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.7 KiB
Average record size in memory48.6 B

Variable types

Text4
Categorical2

Dataset

Description충청남도 예산군에 허가 또는 신고받은 대기배출시설 설치신고업체 현황 자료로 사업장명, 서업장주소, 전화번호, 대표업종, 종수에 대한 정보를 제공
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=339&beforeMenuCd=DOM_000000201001001000&publicdatapk=15082057

Alerts

대표자 is highly imbalanced (67.2%)Imbalance
전화번호 has 20 (8.9%) missing valuesMissing

Reproduction

Analysis started2024-01-09 22:42:23.447521
Analysis finished2024-01-09 22:42:23.887486
Duration0.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct221
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2024-01-10T07:42:24.008021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length9.0533333
Min length2

Characters and Unicode

Total characters2037
Distinct characters258
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique219 ?
Unique (%)97.3%

Sample

1st row동림제재소
2nd row보명레미콘주식회사
3rd row(주)유아이헬리콥터
4th row예산토기
5th row(주)센텍
ValueCountFrequency (%)
주식회사 25
 
8.3%
예산공장 8
 
2.7%
농업회사법인 5
 
1.7%
예산군농협쌀조합공동사업법인 4
 
1.3%
주)신호인더스트리 4
 
1.3%
제2공장 3
 
1.0%
예산지점 3
 
1.0%
주)네오오토 3
 
1.0%
이엔지스틸(주 2
 
0.7%
2공장 2
 
0.7%
Other values (235) 242
80.4%
2024-01-10T07:42:24.310944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
175
 
8.6%
) 150
 
7.4%
( 150
 
7.4%
76
 
3.7%
57
 
2.8%
53
 
2.6%
48
 
2.4%
42
 
2.1%
42
 
2.1%
41
 
2.0%
Other values (248) 1203
59.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1635
80.3%
Close Punctuation 150
 
7.4%
Open Punctuation 150
 
7.4%
Space Separator 76
 
3.7%
Decimal Number 21
 
1.0%
Uppercase Letter 3
 
0.1%
Other Punctuation 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
175
 
10.7%
57
 
3.5%
53
 
3.2%
48
 
2.9%
42
 
2.6%
42
 
2.6%
41
 
2.5%
39
 
2.4%
37
 
2.3%
33
 
2.0%
Other values (235) 1068
65.3%
Decimal Number
ValueCountFrequency (%)
2 8
38.1%
1 7
33.3%
3 3
 
14.3%
0 2
 
9.5%
9 1
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
C 1
33.3%
P 1
33.3%
R 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 150
100.0%
Open Punctuation
ValueCountFrequency (%)
( 150
100.0%
Space Separator
ValueCountFrequency (%)
76
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1635
80.3%
Common 399
 
19.6%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
175
 
10.7%
57
 
3.5%
53
 
3.2%
48
 
2.9%
42
 
2.6%
42
 
2.6%
41
 
2.5%
39
 
2.4%
37
 
2.3%
33
 
2.0%
Other values (235) 1068
65.3%
Common
ValueCountFrequency (%)
) 150
37.6%
( 150
37.6%
76
19.0%
2 8
 
2.0%
1 7
 
1.8%
3 3
 
0.8%
0 2
 
0.5%
/ 1
 
0.3%
9 1
 
0.3%
- 1
 
0.3%
Latin
ValueCountFrequency (%)
C 1
33.3%
P 1
33.3%
R 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1635
80.3%
ASCII 402
 
19.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
175
 
10.7%
57
 
3.5%
53
 
3.2%
48
 
2.9%
42
 
2.6%
42
 
2.6%
41
 
2.5%
39
 
2.4%
37
 
2.3%
33
 
2.0%
Other values (235) 1068
65.3%
ASCII
ValueCountFrequency (%)
) 150
37.3%
( 150
37.3%
76
18.9%
2 8
 
2.0%
1 7
 
1.7%
3 3
 
0.7%
0 2
 
0.5%
/ 1
 
0.2%
C 1
 
0.2%
P 1
 
0.2%
Other values (3) 3
 
0.7%

대표자
Categorical

IMBALANCE 

Distinct7
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
대표이사
182 
개인
34 
조합장
 
5
사업소장
 
1
(재)충남테크노파크 이사
 
1
Other values (2)
 
2

Length

Max length13
Median length4
Mean length3.7111111
Min length2

Unique

Unique4 ?
Unique (%)1.8%

Sample

1st row개인
2nd row대표이사
3rd row대표이사
4th row개인
5th row대표이사

Common Values

ValueCountFrequency (%)
대표이사 182
80.9%
개인 34
 
15.1%
조합장 5
 
2.2%
사업소장 1
 
0.4%
(재)충남테크노파크 이사 1
 
0.4%
교육장 1
 
0.4%
예산군수 1
 
0.4%

Length

2024-01-10T07:42:24.419483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:42:24.510458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대표이사 182
80.5%
개인 34
 
15.0%
조합장 5
 
2.2%
사업소장 1
 
0.4%
재)충남테크노파크 1
 
0.4%
이사 1
 
0.4%
교육장 1
 
0.4%
예산군수 1
 
0.4%
Distinct222
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2024-01-10T07:42:24.672709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length25
Mean length21.986667
Min length19

Characters and Unicode

Total characters4947
Distinct characters117
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique220 ?
Unique (%)97.8%

Sample

1st row충청남도 예산군 오가면 신장원평길 278
2nd row충청남도 예산군 신양면 서계양배약길 22
3rd row충청남도 예산군 삽교읍 효림송석길 275
4th row충청남도 예산군 오가면 오촌중앙길 65-10
5th row충청남도 예산군 오가면 예산산업단지로 93-10
ValueCountFrequency (%)
충청남도 225
20.0%
예산군 225
20.0%
고덕면 57
 
5.1%
삽교읍 42
 
3.7%
예산읍 26
 
2.3%
신암면 25
 
2.2%
응봉면 19
 
1.7%
오가면 19
 
1.7%
대술면 14
 
1.2%
봉산면 13
 
1.2%
Other values (274) 460
40.9%
2024-01-10T07:42:24.949446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
900
18.2%
382
 
7.7%
306
 
6.2%
234
 
4.7%
227
 
4.6%
225
 
4.5%
225
 
4.5%
225
 
4.5%
157
 
3.2%
1 152
 
3.1%
Other values (107) 1914
38.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3165
64.0%
Space Separator 900
 
18.2%
Decimal Number 799
 
16.2%
Dash Punctuation 83
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
382
12.1%
306
 
9.7%
234
 
7.4%
227
 
7.2%
225
 
7.1%
225
 
7.1%
225
 
7.1%
157
 
5.0%
119
 
3.8%
116
 
3.7%
Other values (95) 949
30.0%
Decimal Number
ValueCountFrequency (%)
1 152
19.0%
2 121
15.1%
3 105
13.1%
5 82
10.3%
7 64
8.0%
4 63
7.9%
6 63
7.9%
8 51
 
6.4%
0 50
 
6.3%
9 48
 
6.0%
Space Separator
ValueCountFrequency (%)
900
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 83
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3165
64.0%
Common 1782
36.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
382
12.1%
306
 
9.7%
234
 
7.4%
227
 
7.2%
225
 
7.1%
225
 
7.1%
225
 
7.1%
157
 
5.0%
119
 
3.8%
116
 
3.7%
Other values (95) 949
30.0%
Common
ValueCountFrequency (%)
900
50.5%
1 152
 
8.5%
2 121
 
6.8%
3 105
 
5.9%
- 83
 
4.7%
5 82
 
4.6%
7 64
 
3.6%
4 63
 
3.5%
6 63
 
3.5%
8 51
 
2.9%
Other values (2) 98
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3165
64.0%
ASCII 1782
36.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
900
50.5%
1 152
 
8.5%
2 121
 
6.8%
3 105
 
5.9%
- 83
 
4.7%
5 82
 
4.6%
7 64
 
3.6%
4 63
 
3.5%
6 63
 
3.5%
8 51
 
2.9%
Other values (2) 98
 
5.5%
Hangul
ValueCountFrequency (%)
382
12.1%
306
 
9.7%
234
 
7.4%
227
 
7.2%
225
 
7.1%
225
 
7.1%
225
 
7.1%
157
 
5.0%
119
 
3.8%
116
 
3.7%
Other values (95) 949
30.0%

전화번호
Text

MISSING 

Distinct192
Distinct (%)93.7%
Missing20
Missing (%)8.9%
Memory size1.9 KiB
2024-01-10T07:42:25.160044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.004878
Min length11

Characters and Unicode

Total characters2461
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique180 ?
Unique (%)87.8%

Sample

1st row041-335-6830
2nd row041-333-6831
3rd row041-337-1991
4th row041-332-8301
5th row041-333-3272
ValueCountFrequency (%)
041-337-1730 3
 
1.5%
041-330-4545 2
 
1.0%
041-335-5111 2
 
1.0%
041-331-5567 2
 
1.0%
041-337-7411 2
 
1.0%
041-331-0980 2
 
1.0%
041-333-4352 2
 
1.0%
041-332-6346 2
 
1.0%
041-337-0164 2
 
1.0%
041-533-9322 2
 
1.0%
Other values (183) 185
89.8%
2024-01-10T07:42:25.492763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 481
19.5%
- 408
16.6%
0 355
14.4%
1 330
13.4%
4 285
11.6%
7 129
 
5.2%
8 116
 
4.7%
2 94
 
3.8%
5 91
 
3.7%
6 89
 
3.6%
Other values (2) 83
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2052
83.4%
Dash Punctuation 408
 
16.6%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 481
23.4%
0 355
17.3%
1 330
16.1%
4 285
13.9%
7 129
 
6.3%
8 116
 
5.7%
2 94
 
4.6%
5 91
 
4.4%
6 89
 
4.3%
9 82
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 408
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2461
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 481
19.5%
- 408
16.6%
0 355
14.4%
1 330
13.4%
4 285
11.6%
7 129
 
5.2%
8 116
 
4.7%
2 94
 
3.8%
5 91
 
3.7%
6 89
 
3.6%
Other values (2) 83
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2461
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 481
19.5%
- 408
16.6%
0 355
14.4%
1 330
13.4%
4 285
11.6%
7 129
 
5.2%
8 116
 
4.7%
2 94
 
3.8%
5 91
 
3.7%
6 89
 
3.6%
Other values (2) 83
 
3.4%
Distinct110
Distinct (%)48.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2024-01-10T07:42:25.773573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length19
Mean length13.16
Min length1

Characters and Unicode

Total characters2961
Distinct characters179
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)31.1%

Sample

1st row제재 및 목재 가공업
2nd row레미콘 제조업
3rd row항공기 우주선 및 부품 제조업
4th row도자기 및 기타 요업제품 제조업
5th row고무제품 제조업
ValueCountFrequency (%)
제조업 149
 
18.0%
87
 
10.5%
기타 59
 
7.1%
자동차 27
 
3.3%
그외 16
 
1.9%
처리업 16
 
1.9%
곡물 15
 
1.8%
도정업 15
 
1.8%
폐기물 15
 
1.8%
수리업 15
 
1.8%
Other values (190) 415
50.1%
2024-01-10T07:42:26.171338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
633
21.4%
226
 
7.6%
216
 
7.3%
170
 
5.7%
92
 
3.1%
88
 
3.0%
87
 
2.9%
75
 
2.5%
61
 
2.1%
49
 
1.7%
Other values (169) 1264
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2325
78.5%
Space Separator 633
 
21.4%
Other Punctuation 2
 
0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
226
 
9.7%
216
 
9.3%
170
 
7.3%
92
 
4.0%
88
 
3.8%
87
 
3.7%
75
 
3.2%
61
 
2.6%
49
 
2.1%
43
 
1.8%
Other values (166) 1218
52.4%
Space Separator
ValueCountFrequency (%)
633
100.0%
Other Punctuation
ValueCountFrequency (%)
· 2
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2325
78.5%
Common 636
 
21.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
226
 
9.7%
216
 
9.3%
170
 
7.3%
92
 
4.0%
88
 
3.8%
87
 
3.7%
75
 
3.2%
61
 
2.6%
49
 
2.1%
43
 
1.8%
Other values (166) 1218
52.4%
Common
ValueCountFrequency (%)
633
99.5%
· 2
 
0.3%
1 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2325
78.5%
ASCII 634
 
21.4%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
633
99.8%
1 1
 
0.2%
Hangul
ValueCountFrequency (%)
226
 
9.7%
216
 
9.3%
170
 
7.3%
92
 
4.0%
88
 
3.8%
87
 
3.7%
75
 
3.2%
61
 
2.6%
49
 
2.1%
43
 
1.8%
Other values (166) 1218
52.4%
None
ValueCountFrequency (%)
· 2
100.0%


Categorical

Distinct3
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
5종
115 
4종
97 
3종
13 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5종
2nd row5종
3rd row5종
4th row5종
5th row4종

Common Values

ValueCountFrequency (%)
5종 115
51.1%
4종 97
43.1%
3종 13
 
5.8%

Length

2024-01-10T07:42:26.277226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:42:26.362187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5종 115
51.1%
4종 97
43.1%
3종 13
 
5.8%

Correlations

2024-01-10T07:42:26.421865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대표자
대표자1.0000.155
0.1551.000
2024-01-10T07:42:26.514618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대표자
1.0000.103
대표자0.1031.000
2024-01-10T07:42:26.581768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대표자
대표자1.0000.103
0.1031.000

Missing values

2024-01-10T07:42:23.777353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:42:23.855474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명대표자도로명소재지전화번호대표업종
0동림제재소개인충청남도 예산군 오가면 신장원평길 278041-335-6830제재 및 목재 가공업5종
1보명레미콘주식회사대표이사충청남도 예산군 신양면 서계양배약길 22041-333-6831레미콘 제조업5종
2(주)유아이헬리콥터대표이사충청남도 예산군 삽교읍 효림송석길 275041-337-1991항공기 우주선 및 부품 제조업5종
3예산토기개인충청남도 예산군 오가면 오촌중앙길 65-10041-332-8301도자기 및 기타 요업제품 제조업5종
4(주)센텍대표이사충청남도 예산군 오가면 예산산업단지로 93-10041-333-3272고무제품 제조업4종
5농업회사법인유한회사 예산라이스대표이사충청남도 예산군 예산읍 관작중앙길 6041-334-4655곡물 도정업4종
6(주)중앙타프라대표이사충청남도 예산군 응봉면 예당로 1053-15041-331-0412기타 식품 제조업4종
7예산자동차정비공업사개인충청남도 예산군 예산읍 신례원로 103041-333-4955자동차 종합 수리업5종
8한광코팅센터개인충청남도 예산군 오가면 예산산업단지로 80041-332-6316기타 자동차부품 제조업4종
9(주)셰프라인대표이사충청남도 예산군 대술면 시루미길 29-5041-333-9100기타 조립금속제품 제조업4종
사업장명대표자도로명소재지전화번호대표업종
215대동농산개인충청남도 예산군 오가면 국사봉로 426<NA>곡물 도정업4종
216(주)미래스틸대표이사충청남도 예산군 대술면 대술로 583-28041-555-6479기타 구조용 금속제품 제조업4종
217(주)에이치에스켐트론대표이사충청남도 예산군 고덕면 예당산단5길 27031-494-9060그외 기타 분류안된 화학제품 제조업5종
218(주)유티아이 예산지점 제1공장대표이사충청남도 예산군 응봉면 충서로 90041-333-4352그외 기타 전자부품 제조업5종
219예산 1100년 기념관예산군수충청남도 예산군 예산읍 벚꽃로 214041-339-7717지방행정 집행기관5종
220(주)지-플라텍대표이사충청남도 예산군 예산읍 벚꽃로388번길 12-12062-570-0960그외 기타 자동차 부품 제조업4종
221인트라정공(주)대표이사충청남도 예산군 봉산면 한천로 311-11041-337-6750강주물 주조업4종
222에이치피코리아(주)대표이사충청남도 예산군 고덕면 예당산단2길 33041-338-0703구조용 금속판제품 및 금속공작물 제조업5종
223(주)네오오토 예산공장 제3공장대표이사충청남도 예산군 삽교읍 산단2길 85041-337-1730기타 자동차 부품 제조업5종
224(주)명성대표이사충청남도 예산군 고덕면 예당산단5길 5041-337-3933기타 비철금속 압연 압출 및 연신제품 제조업5종