Overview

Dataset statistics

Number of variables7
Number of observations272
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.3 KiB
Average record size in memory57.5 B

Variable types

Numeric1
Text4
Categorical1
DateTime1

Dataset

Description이 데이터는 대기배출시설을 설치하고 신고한 사업장에 대한 현황으로, 업체명, 대표자, 주소, 업종 등을 제공합니다.
Author충청남도 금산군
URLhttps://www.data.go.kr/data/15080575/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
is highly imbalanced (53.8%)Imbalance
업무구분 has unique valuesUnique

Reproduction

Analysis started2024-03-23 06:17:43.734761
Analysis finished2024-03-23 06:17:47.125710
Duration3.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업무구분
Real number (ℝ)

UNIQUE 

Distinct272
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean136.5
Minimum1
Maximum272
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2024-03-23T06:17:47.473921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile14.55
Q168.75
median136.5
Q3204.25
95-th percentile258.45
Maximum272
Range271
Interquartile range (IQR)135.5

Descriptive statistics

Standard deviation78.663842
Coefficient of variation (CV)0.57629188
Kurtosis-1.2
Mean136.5
Median Absolute Deviation (MAD)68
Skewness0
Sum37128
Variance6188
MonotonicityStrictly increasing
2024-03-23T06:17:48.194523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
181 1
 
0.4%
187 1
 
0.4%
186 1
 
0.4%
185 1
 
0.4%
184 1
 
0.4%
183 1
 
0.4%
182 1
 
0.4%
180 1
 
0.4%
138 1
 
0.4%
Other values (262) 262
96.3%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
272 1
0.4%
271 1
0.4%
270 1
0.4%
269 1
0.4%
268 1
0.4%
267 1
0.4%
266 1
0.4%
265 1
0.4%
264 1
0.4%
263 1
0.4%
Distinct265
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-03-23T06:17:48.900307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length7.8198529
Min length2

Characters and Unicode

Total characters2127
Distinct characters275
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique258 ?
Unique (%)94.9%

Sample

1st row삼남제약(주)
2nd row(주)광성화학
3rd row경기광업(주)
4th row중앙목욕탕
5th row광흥제면
ValueCountFrequency (%)
주식회사 10
 
3.3%
금산공장 4
 
1.3%
농업회사법인 3
 
1.0%
주)이에스에프씨티 2
 
0.7%
주)광성화학 2
 
0.7%
제2공장 2
 
0.7%
주)신화기전 2
 
0.7%
주)동신화학 2
 
0.7%
주)에스코알티에스 2
 
0.7%
주)유성화연테크 2
 
0.7%
Other values (265) 268
89.6%
2024-03-23T06:17:50.225156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
194
 
9.1%
( 180
 
8.5%
) 180
 
8.5%
87
 
4.1%
70
 
3.3%
43
 
2.0%
39
 
1.8%
35
 
1.6%
35
 
1.6%
33
 
1.6%
Other values (265) 1231
57.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1716
80.7%
Open Punctuation 181
 
8.5%
Close Punctuation 181
 
8.5%
Space Separator 27
 
1.3%
Decimal Number 12
 
0.6%
Other Symbol 4
 
0.2%
Uppercase Letter 4
 
0.2%
Dash Punctuation 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
194
 
11.3%
87
 
5.1%
70
 
4.1%
43
 
2.5%
39
 
2.3%
35
 
2.0%
35
 
2.0%
33
 
1.9%
33
 
1.9%
27
 
1.6%
Other values (248) 1120
65.3%
Decimal Number
ValueCountFrequency (%)
2 6
50.0%
9 2
 
16.7%
8 2
 
16.7%
1 1
 
8.3%
3 1
 
8.3%
Uppercase Letter
ValueCountFrequency (%)
E 1
25.0%
G 1
25.0%
S 1
25.0%
M 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 180
99.4%
[ 1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 180
99.4%
] 1
 
0.6%
Space Separator
ValueCountFrequency (%)
27
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1720
80.9%
Common 403
 
18.9%
Latin 4
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
194
 
11.3%
87
 
5.1%
70
 
4.1%
43
 
2.5%
39
 
2.3%
35
 
2.0%
35
 
2.0%
33
 
1.9%
33
 
1.9%
27
 
1.6%
Other values (249) 1124
65.3%
Common
ValueCountFrequency (%)
( 180
44.7%
) 180
44.7%
27
 
6.7%
2 6
 
1.5%
9 2
 
0.5%
8 2
 
0.5%
- 1
 
0.2%
/ 1
 
0.2%
] 1
 
0.2%
[ 1
 
0.2%
Other values (2) 2
 
0.5%
Latin
ValueCountFrequency (%)
E 1
25.0%
G 1
25.0%
S 1
25.0%
M 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1716
80.7%
ASCII 407
 
19.1%
None 4
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
194
 
11.3%
87
 
5.1%
70
 
4.1%
43
 
2.5%
39
 
2.3%
35
 
2.0%
35
 
2.0%
33
 
1.9%
33
 
1.9%
27
 
1.6%
Other values (248) 1120
65.3%
ASCII
ValueCountFrequency (%)
( 180
44.2%
) 180
44.2%
27
 
6.6%
2 6
 
1.5%
9 2
 
0.5%
8 2
 
0.5%
- 1
 
0.2%
/ 1
 
0.2%
] 1
 
0.2%
E 1
 
0.2%
Other values (6) 6
 
1.5%
None
ValueCountFrequency (%)
4
100.0%
Distinct207
Distinct (%)76.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-03-23T06:17:51.179322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length3
Mean length3.3419118
Min length3

Characters and Unicode

Total characters909
Distinct characters151
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique193 ?
Unique (%)71.0%

Sample

1st row대표이사
2nd row김광래
3rd row권희문
4th row김민수
5th row최광식
ValueCountFrequency (%)
대표이사 51
 
18.5%
장호윤 3
 
1.1%
강관백 3
 
1.1%
금산군수 2
 
0.7%
임종득 2
 
0.7%
김정림 2
 
0.7%
김용기 2
 
0.7%
조합장 2
 
0.7%
노민성 2
 
0.7%
송석진 2
 
0.7%
Other values (200) 205
74.3%
2024-03-23T06:17:52.596015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
88
 
9.7%
54
 
5.9%
52
 
5.7%
52
 
5.7%
33
 
3.6%
27
 
3.0%
16
 
1.8%
16
 
1.8%
15
 
1.7%
15
 
1.7%
Other values (141) 541
59.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 897
98.7%
Space Separator 7
 
0.8%
Open Punctuation 2
 
0.2%
Close Punctuation 2
 
0.2%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
88
 
9.8%
54
 
6.0%
52
 
5.8%
52
 
5.8%
33
 
3.7%
27
 
3.0%
16
 
1.8%
16
 
1.8%
15
 
1.7%
15
 
1.7%
Other values (137) 529
59.0%
Space Separator
ValueCountFrequency (%)
7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 897
98.7%
Common 12
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
88
 
9.8%
54
 
6.0%
52
 
5.8%
52
 
5.8%
33
 
3.7%
27
 
3.0%
16
 
1.8%
16
 
1.8%
15
 
1.7%
15
 
1.7%
Other values (137) 529
59.0%
Common
ValueCountFrequency (%)
7
58.3%
( 2
 
16.7%
) 2
 
16.7%
1 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 897
98.7%
ASCII 12
 
1.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
88
 
9.8%
54
 
6.0%
52
 
5.8%
52
 
5.8%
33
 
3.7%
27
 
3.0%
16
 
1.8%
16
 
1.8%
15
 
1.7%
15
 
1.7%
Other values (137) 529
59.0%
ASCII
ValueCountFrequency (%)
7
58.3%
( 2
 
16.7%
) 2
 
16.7%
1 1
 
8.3%

주소
Text

Distinct249
Distinct (%)91.5%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-03-23T06:17:53.524105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length36
Mean length22.911765
Min length17

Characters and Unicode

Total characters6232
Distinct characters153
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique233 ?
Unique (%)85.7%

Sample

1st row충청남도 금산군 금산읍 상리 99-1
2nd row충청남도 금산군 추부면 마전리 176-2
3rd row충청남도 금산군 진산면 삼가리 336-1
4th row충청남도 금산군 금산읍 중도리 506
5th row충청남도 금산군 추부면 장대리 577
ValueCountFrequency (%)
충청남도 272
19.2%
금산군 272
19.2%
추부면 99
 
7.0%
복수면 65
 
4.6%
금성면 34
 
2.4%
용진리 26
 
1.8%
진산면 25
 
1.8%
군북면 20
 
1.4%
금산읍 18
 
1.3%
마전리 17
 
1.2%
Other values (347) 569
40.2%
2024-03-23T06:17:54.908987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1393
22.4%
331
 
5.3%
331
 
5.3%
292
 
4.7%
281
 
4.5%
278
 
4.5%
272
 
4.4%
272
 
4.4%
254
 
4.1%
248
 
4.0%
Other values (143) 2280
36.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3718
59.7%
Space Separator 1393
 
22.4%
Decimal Number 951
 
15.3%
Dash Punctuation 145
 
2.3%
Close Punctuation 10
 
0.2%
Open Punctuation 10
 
0.2%
Other Symbol 3
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
331
 
8.9%
331
 
8.9%
292
 
7.9%
281
 
7.6%
278
 
7.5%
272
 
7.3%
272
 
7.3%
254
 
6.8%
248
 
6.7%
116
 
3.1%
Other values (127) 1043
28.1%
Decimal Number
ValueCountFrequency (%)
1 193
20.3%
2 106
11.1%
5 99
10.4%
6 98
10.3%
4 85
8.9%
8 83
8.7%
3 80
8.4%
9 71
 
7.5%
7 69
 
7.3%
0 67
 
7.0%
Space Separator
ValueCountFrequency (%)
1393
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 145
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3721
59.7%
Common 2509
40.3%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
331
 
8.9%
331
 
8.9%
292
 
7.8%
281
 
7.6%
278
 
7.5%
272
 
7.3%
272
 
7.3%
254
 
6.8%
248
 
6.7%
116
 
3.1%
Other values (128) 1046
28.1%
Common
ValueCountFrequency (%)
1393
55.5%
1 193
 
7.7%
- 145
 
5.8%
2 106
 
4.2%
5 99
 
3.9%
6 98
 
3.9%
4 85
 
3.4%
8 83
 
3.3%
3 80
 
3.2%
9 71
 
2.8%
Other values (4) 156
 
6.2%
Latin
ValueCountFrequency (%)
B 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3718
59.7%
ASCII 2511
40.3%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1393
55.5%
1 193
 
7.7%
- 145
 
5.8%
2 106
 
4.2%
5 99
 
3.9%
6 98
 
3.9%
4 85
 
3.4%
8 83
 
3.3%
3 80
 
3.2%
9 71
 
2.8%
Other values (5) 158
 
6.3%
Hangul
ValueCountFrequency (%)
331
 
8.9%
331
 
8.9%
292
 
7.9%
281
 
7.6%
278
 
7.5%
272
 
7.3%
272
 
7.3%
254
 
6.8%
248
 
6.7%
116
 
3.1%
Other values (127) 1043
28.1%
None
ValueCountFrequency (%)
3
100.0%

업종
Text

Distinct79
Distinct (%)29.0%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-03-23T06:17:55.655241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length1
Mean length6.5772059
Min length1

Characters and Unicode

Total characters1789
Distinct characters151
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)20.6%

Sample

1st row의약품 제조업
2nd row기타 비금속광물 광업
3rd row
4th row
5th row
ValueCountFrequency (%)
제조업 86
 
19.5%
39
 
8.9%
기타 32
 
7.3%
처리업 12
 
2.7%
폐기물 11
 
2.5%
인삼식품 9
 
2.0%
생산업 7
 
1.6%
자동차 7
 
1.6%
수리업 7
 
1.6%
화학제품 6
 
1.4%
Other values (133) 224
50.9%
2024-03-23T06:17:56.686846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
449
25.1%
140
 
7.8%
128
 
7.2%
102
 
5.7%
61
 
3.4%
49
 
2.7%
41
 
2.3%
35
 
2.0%
31
 
1.7%
27
 
1.5%
Other values (141) 726
40.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1339
74.8%
Space Separator 449
 
25.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
140
 
10.5%
128
 
9.6%
102
 
7.6%
61
 
4.6%
49
 
3.7%
41
 
3.1%
35
 
2.6%
31
 
2.3%
27
 
2.0%
26
 
1.9%
Other values (139) 699
52.2%
Space Separator
ValueCountFrequency (%)
449
100.0%
Other Punctuation
ValueCountFrequency (%)
· 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1339
74.8%
Common 450
 
25.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
140
 
10.5%
128
 
9.6%
102
 
7.6%
61
 
4.6%
49
 
3.7%
41
 
3.1%
35
 
2.6%
31
 
2.3%
27
 
2.0%
26
 
1.9%
Other values (139) 699
52.2%
Common
ValueCountFrequency (%)
449
99.8%
· 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1339
74.8%
ASCII 449
 
25.1%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
449
100.0%
Hangul
ValueCountFrequency (%)
140
 
10.5%
128
 
9.6%
102
 
7.6%
61
 
4.6%
49
 
3.7%
41
 
3.1%
35
 
2.6%
31
 
2.3%
27
 
2.0%
26
 
1.9%
Other values (139) 699
52.2%
None
ValueCountFrequency (%)
· 1
100.0%


Categorical

IMBALANCE 

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
5종
191 
4종
71 
3종
 
8
2종
 
1
 
1

Length

Max length2
Median length2
Mean length1.9963235
Min length1

Unique

Unique2 ?
Unique (%)0.7%

Sample

1st row4종
2nd row4종
3rd row4종
4th row5종
5th row5종

Common Values

ValueCountFrequency (%)
5종 191
70.2%
4종 71
 
26.1%
3종 8
 
2.9%
2종 1
 
0.4%
1
 
0.4%

Length

2024-03-23T06:17:57.296470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T06:17:57.874654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5종 191
70.5%
4종 71
 
26.2%
3종 8
 
3.0%
2종 1
 
0.4%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Minimum2024-03-20 00:00:00
Maximum2024-03-20 00:00:00
2024-03-23T06:17:58.265511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:17:59.042690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-23T06:17:44.957853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T06:17:59.331833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업무구분업종
업무구분1.0000.4910.255
업종0.4911.0000.000
0.2550.0001.000
2024-03-23T06:17:59.638977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업무구분
업무구분1.0000.107
0.1071.000

Missing values

2024-03-23T06:17:45.775065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T06:17:46.689153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업무구분사업장명대표자주소업종데이터기준일자
01삼남제약(주)대표이사충청남도 금산군 금산읍 상리 99-1의약품 제조업4종2024-03-20
12(주)광성화학김광래충청남도 금산군 추부면 마전리 176-2기타 비금속광물 광업4종2024-03-20
23경기광업(주)권희문충청남도 금산군 진산면 삼가리 336-14종2024-03-20
34중앙목욕탕김민수충청남도 금산군 금산읍 중도리 5065종2024-03-20
45광흥제면최광식충청남도 금산군 추부면 장대리 5775종2024-03-20
56주안아스콘(주)유인식충청남도 금산군 진산면 막현리 286-1아스콘 제조업3종2024-03-20
67(주)삼진당박동선충청남도 금산군 금산읍 양지리 16-14종2024-03-20
78(주)EG대표이사충청남도 금산군 추부면 신평리 8204종2024-03-20
89대륙화학공업(주) 금산공장송인혁충청남도 금산군 복수면 용진리 115-5산업용 비경화고무제품 제조업3종2024-03-20
910(주)금성방적윤용근충청남도 금산군 복수면 용진리 115-75종2024-03-20
업무구분사업장명대표자주소업종데이터기준일자
262263(주)대한환경이상경충청남도 금산군 복수면 다복리 267-2폐기물 처리업5종2024-03-20
263264대주산업 주식회사대표이사충청남도 금산군 진산면 막현리 286-1레미콘 제조업4종2024-03-20
264265서울플라스틱정승운충청남도 금산군 복수면 용진리 318폐기물 처리업5종2024-03-20
265266의성산업(주)오명진충청남도 금산군 추부면 서대리 97포장용 플라스틱제품 제조업5종2024-03-20
266267금산군청(금산인삼약초건강관)금산군수충청남도 금산군 금산읍 신대리 400 금산인삼약초건강관5종2024-03-20
267268석천기업(주)강형순충청남도 금산군 남일면 초현리 61-2모래 및 자갈 채취업5종2024-03-20
268269주안아스콘㈜ 건설폐기물 처리장유인식충청남도 금산군 진산면 막현리 286-1폐기물 처리업5종2024-03-20
269270금산환경재생산업㈜강관백충청남도 금산군 복수면 곡남리 10건축폐기물 처리업5종2024-03-20
270271두리산업이재윤충청남도 금산군 추부면 자부리 428주방용 및 음식점용 목재가구 제조업5종2024-03-20
271272(주)대호토건하봉순충청남도 금산군 군북면 보광리 5565종2024-03-20