Overview

Dataset statistics

Number of variables16
Number of observations2945
Missing cells32400
Missing cells (%)68.8%
Duplicate rows16
Duplicate rows (%)0.5%
Total size in memory399.9 KiB
Average record size in memory139.0 B

Variable types

Text4
Categorical1
Unsupported11

Dataset

Description전라남도 순천시 비산먼지 발생사업 현황 데이터로, 사업장명, 공사장소재지, 발생사업, 공사기간 등의 항목을 제공합니다.
Author전라남도 순천시
URLhttps://www.data.go.kr/data/3072836/fileData.do

Alerts

Dataset has 16 (0.5%) duplicate rowsDuplicates
발생사업 is highly imbalanced (94.9%)Imbalance
Unnamed: 5 has 2945 (100.0%) missing valuesMissing
Unnamed: 6 has 2945 (100.0%) missing valuesMissing
Unnamed: 7 has 2945 (100.0%) missing valuesMissing
Unnamed: 8 has 2945 (100.0%) missing valuesMissing
Unnamed: 9 has 2945 (100.0%) missing valuesMissing
Unnamed: 10 has 2945 (100.0%) missing valuesMissing
Unnamed: 11 has 2945 (100.0%) missing valuesMissing
Unnamed: 12 has 2945 (100.0%) missing valuesMissing
Unnamed: 13 has 2945 (100.0%) missing valuesMissing
Unnamed: 14 has 2945 (100.0%) missing valuesMissing
Unnamed: 15 has 2945 (100.0%) missing valuesMissing
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 15:27:03.756814
Analysis finished2023-12-12 15:27:05.021396
Duration1.26 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1555
Distinct (%)52.8%
Missing0
Missing (%)0.0%
Memory size23.1 KiB
2023-12-13T00:27:05.207073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length25
Mean length7.6003396
Min length2

Characters and Unicode

Total characters22383
Distinct characters409
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1132 ?
Unique (%)38.4%

Sample

1st row에이치디건설(주)
2nd row대유이엔에스(주)
3rd row대유이엔에스(주)
4th row(주)진호건설
5th row승보종합건설(주)
ValueCountFrequency (%)
개인 138
 
4.3%
주식회사 86
 
2.7%
순천시산림조합 55
 
1.7%
성주에너지(주 31
 
1.0%
주)삼덕기업 29
 
0.9%
주)임성건설 24
 
0.8%
대산종합건설(주 22
 
0.7%
거룡종합건설(주 21
 
0.7%
성우종합건설(주 21
 
0.7%
20
 
0.6%
Other values (1605) 2748
86.0%
2023-12-13T00:27:05.598385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2428
 
10.8%
) 2326
 
10.4%
( 2322
 
10.4%
1662
 
7.4%
1488
 
6.6%
659
 
2.9%
604
 
2.7%
362
 
1.6%
344
 
1.5%
306
 
1.4%
Other values (399) 9882
44.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17402
77.7%
Close Punctuation 2327
 
10.4%
Open Punctuation 2323
 
10.4%
Space Separator 270
 
1.2%
Decimal Number 32
 
0.1%
Uppercase Letter 19
 
0.1%
Math Symbol 3
 
< 0.1%
Dash Punctuation 3
 
< 0.1%
Lowercase Letter 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2428
 
14.0%
1662
 
9.6%
1488
 
8.6%
659
 
3.8%
604
 
3.5%
362
 
2.1%
344
 
2.0%
306
 
1.8%
292
 
1.7%
291
 
1.7%
Other values (369) 8966
51.5%
Uppercase Letter
ValueCountFrequency (%)
S 4
21.1%
K 3
15.8%
T 2
10.5%
P 2
10.5%
W 1
 
5.3%
E 1
 
5.3%
I 1
 
5.3%
B 1
 
5.3%
H 1
 
5.3%
L 1
 
5.3%
Other values (2) 2
10.5%
Decimal Number
ValueCountFrequency (%)
1 16
50.0%
2 8
25.0%
6 2
 
6.2%
4 2
 
6.2%
3 2
 
6.2%
5 1
 
3.1%
8 1
 
3.1%
Close Punctuation
ValueCountFrequency (%)
) 2326
> 99.9%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 2322
> 99.9%
1
 
< 0.1%
Lowercase Letter
ValueCountFrequency (%)
v 1
50.0%
e 1
50.0%
Space Separator
ValueCountFrequency (%)
270
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17402
77.7%
Common 4960
 
22.2%
Latin 21
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2428
 
14.0%
1662
 
9.6%
1488
 
8.6%
659
 
3.8%
604
 
3.5%
362
 
2.1%
344
 
2.0%
306
 
1.8%
292
 
1.7%
291
 
1.7%
Other values (369) 8966
51.5%
Common
ValueCountFrequency (%)
) 2326
46.9%
( 2322
46.8%
270
 
5.4%
1 16
 
0.3%
2 8
 
0.2%
~ 3
 
0.1%
- 3
 
0.1%
6 2
 
< 0.1%
4 2
 
< 0.1%
3 2
 
< 0.1%
Other values (6) 6
 
0.1%
Latin
ValueCountFrequency (%)
S 4
19.0%
K 3
14.3%
T 2
9.5%
P 2
9.5%
W 1
 
4.8%
E 1
 
4.8%
v 1
 
4.8%
e 1
 
4.8%
I 1
 
4.8%
B 1
 
4.8%
Other values (4) 4
19.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 17402
77.7%
ASCII 4979
 
22.2%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2428
 
14.0%
1662
 
9.6%
1488
 
8.6%
659
 
3.8%
604
 
3.5%
362
 
2.1%
344
 
2.0%
306
 
1.8%
292
 
1.7%
291
 
1.7%
Other values (369) 8966
51.5%
ASCII
ValueCountFrequency (%)
) 2326
46.7%
( 2322
46.6%
270
 
5.4%
1 16
 
0.3%
2 8
 
0.2%
S 4
 
0.1%
~ 3
 
0.1%
K 3
 
0.1%
- 3
 
0.1%
T 2
 
< 0.1%
Other values (18) 22
 
0.4%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct2796
Distinct (%)94.9%
Missing0
Missing (%)0.0%
Memory size23.1 KiB
2023-12-13T00:27:05.974474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length101
Median length75
Mean length26.527674
Min length14

Characters and Unicode

Total characters78124
Distinct characters357
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2671 ?
Unique (%)90.7%

Sample

1st row전라남도 순천시 덕월동 9-1 청미래아파트
2nd row순천시 승주읍 유흥리 산256번지
3rd row순천시 승주읍 유흥리 산231번지
4th row순천시 저전동 90-3번지
5th row순천시 교량동 146-24번지
ValueCountFrequency (%)
순천시 2939
 
16.7%
전라남도 2924
 
16.6%
일원 818
 
4.6%
해룡면 516
 
2.9%
351
 
2.0%
서면 306
 
1.7%
조례동 221
 
1.3%
별량면 202
 
1.1%
1호 179
 
1.0%
160
 
0.9%
Other values (3698) 9015
51.1%
2023-12-13T00:27:06.580493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15404
19.7%
3174
 
4.1%
3174
 
4.1%
3043
 
3.9%
2997
 
3.8%
2991
 
3.8%
2961
 
3.8%
2928
 
3.7%
1 2749
 
3.5%
2556
 
3.3%
Other values (347) 36147
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47833
61.2%
Space Separator 15404
 
19.7%
Decimal Number 12292
 
15.7%
Dash Punctuation 1590
 
2.0%
Math Symbol 309
 
0.4%
Open Punctuation 278
 
0.4%
Close Punctuation 278
 
0.4%
Uppercase Letter 98
 
0.1%
Other Punctuation 35
 
< 0.1%
Lowercase Letter 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3174
 
6.6%
3174
 
6.6%
3043
 
6.4%
2997
 
6.3%
2991
 
6.3%
2961
 
6.2%
2928
 
6.1%
2556
 
5.3%
1926
 
4.0%
1736
 
3.6%
Other values (310) 20347
42.5%
Uppercase Letter
ValueCountFrequency (%)
A 28
28.6%
B 20
20.4%
C 15
15.3%
L 11
 
11.2%
I 9
 
9.2%
N 6
 
6.1%
T 2
 
2.0%
K 2
 
2.0%
E 2
 
2.0%
R 1
 
1.0%
Other values (2) 2
 
2.0%
Decimal Number
ValueCountFrequency (%)
1 2749
22.4%
2 1610
13.1%
3 1308
10.6%
5 1135
9.2%
4 1107
9.0%
6 963
 
7.8%
7 945
 
7.7%
8 870
 
7.1%
9 837
 
6.8%
0 768
 
6.2%
Other Punctuation
ValueCountFrequency (%)
. 16
45.7%
: 14
40.0%
! 2
 
5.7%
/ 2
 
5.7%
@ 1
 
2.9%
Math Symbol
ValueCountFrequency (%)
~ 297
96.1%
10
 
3.2%
+ 2
 
0.6%
Lowercase Letter
ValueCountFrequency (%)
k 3
50.0%
m 3
50.0%
Space Separator
ValueCountFrequency (%)
15404
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1590
100.0%
Open Punctuation
ValueCountFrequency (%)
( 278
100.0%
Close Punctuation
ValueCountFrequency (%)
) 278
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47833
61.2%
Common 30187
38.6%
Latin 104
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3174
 
6.6%
3174
 
6.6%
3043
 
6.4%
2997
 
6.3%
2991
 
6.3%
2961
 
6.2%
2928
 
6.1%
2556
 
5.3%
1926
 
4.0%
1736
 
3.6%
Other values (310) 20347
42.5%
Common
ValueCountFrequency (%)
15404
51.0%
1 2749
 
9.1%
2 1610
 
5.3%
- 1590
 
5.3%
3 1308
 
4.3%
5 1135
 
3.8%
4 1107
 
3.7%
6 963
 
3.2%
7 945
 
3.1%
8 870
 
2.9%
Other values (13) 2506
 
8.3%
Latin
ValueCountFrequency (%)
A 28
26.9%
B 20
19.2%
C 15
14.4%
L 11
 
10.6%
I 9
 
8.7%
N 6
 
5.8%
k 3
 
2.9%
m 3
 
2.9%
T 2
 
1.9%
K 2
 
1.9%
Other values (4) 5
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47832
61.2%
ASCII 30281
38.8%
Math Operators 10
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15404
50.9%
1 2749
 
9.1%
2 1610
 
5.3%
- 1590
 
5.3%
3 1308
 
4.3%
5 1135
 
3.7%
4 1107
 
3.7%
6 963
 
3.2%
7 945
 
3.1%
8 870
 
2.9%
Other values (26) 2600
 
8.6%
Hangul
ValueCountFrequency (%)
3174
 
6.6%
3174
 
6.6%
3043
 
6.4%
2997
 
6.3%
2991
 
6.3%
2961
 
6.2%
2928
 
6.1%
2556
 
5.3%
1926
 
4.0%
1736
 
3.6%
Other values (309) 20346
42.5%
Math Operators
ValueCountFrequency (%)
10
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

발생사업
Categorical

IMBALANCE 

Distinct9
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size23.1 KiB
건설업
2896 
비금속물질채취·제조·가공업
 
24
금속제품제조·가공업
 
7
시멘트·석회·프라스터및시멘트관련제품제조및가공업
 
5
제1차금속제조업
 
4
Other values (4)
 
9

Length

Max length25
Median length3
Mean length3.170798
Min length3

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row건설업
2nd row건설업
3rd row건설업
4th row건설업
5th row건설업

Common Values

ValueCountFrequency (%)
건설업 2896
98.3%
비금속물질채취·제조·가공업 24
 
0.8%
금속제품제조·가공업 7
 
0.2%
시멘트·석회·프라스터및시멘트관련제품제조및가공업 5
 
0.2%
제1차금속제조업 4
 
0.1%
시멘트·석탄·토사등의운송업 4
 
0.1%
비료및사료제조업 3
 
0.1%
건설 1
 
< 0.1%
<NA> 1
 
< 0.1%

Length

2023-12-13T00:27:06.797360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:27:06.925405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건설업 2896
98.3%
비금속물질채취·제조·가공업 24
 
0.8%
금속제품제조·가공업 7
 
0.2%
시멘트·석회·프라스터및시멘트관련제품제조및가공업 5
 
0.2%
제1차금속제조업 4
 
0.1%
시멘트·석탄·토사등의운송업 4
 
0.1%
비료및사료제조업 3
 
0.1%
건설 1
 
< 0.1%
na 1
 
< 0.1%
Distinct2798
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size23.1 KiB
2023-12-13T00:27:07.185303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length21
Mean length20.904924
Min length5

Characters and Unicode

Total characters61565
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2670 ?
Unique (%)90.7%

Sample

1st row2021-07-27~2021-08-24
2nd row2021-08-01~2021-12-31
3rd row2021-08-01~2021-12-31
4th row2021-07-21~2021-12-28
5th row2021-07-21~2021-10-02
ValueCountFrequency (%)
2021-02-15~2021-04-15 4
 
0.1%
2021-06-01~2021-08-31 4
 
0.1%
2019-07-16~2019-08-15 4
 
0.1%
2020-05-25~2020-12-31 3
 
0.1%
2019-11-01~2020-01-31 3
 
0.1%
2008-04-05~2008-06-25 3
 
0.1%
2011-03-17~2011-05-03 3
 
0.1%
2009-03-23~2009-12-30 3
 
0.1%
2018-03-23~2018-07-30 3
 
0.1%
2020-05-25~2020-07-30 3
 
0.1%
Other values (2788) 2912
98.9%
2023-12-13T00:27:07.650641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 14955
24.3%
- 11780
19.1%
2 10619
17.2%
1 9733
15.8%
~ 2945
 
4.8%
3 2731
 
4.4%
9 1745
 
2.8%
8 1684
 
2.7%
6 1437
 
2.3%
5 1407
 
2.3%
Other values (2) 2529
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 46840
76.1%
Dash Punctuation 11780
 
19.1%
Math Symbol 2945
 
4.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 14955
31.9%
2 10619
22.7%
1 9733
20.8%
3 2731
 
5.8%
9 1745
 
3.7%
8 1684
 
3.6%
6 1437
 
3.1%
5 1407
 
3.0%
7 1393
 
3.0%
4 1136
 
2.4%
Dash Punctuation
ValueCountFrequency (%)
- 11780
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2945
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 61565
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 14955
24.3%
- 11780
19.1%
2 10619
17.2%
1 9733
15.8%
~ 2945
 
4.8%
3 2731
 
4.4%
9 1745
 
2.8%
8 1684
 
2.7%
6 1437
 
2.3%
5 1407
 
2.3%
Other values (2) 2529
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 61565
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 14955
24.3%
- 11780
19.1%
2 10619
17.2%
1 9733
15.8%
~ 2945
 
4.8%
3 2731
 
4.4%
9 1745
 
2.8%
8 1684
 
2.7%
6 1437
 
2.3%
5 1407
 
2.3%
Other values (2) 2529
 
4.1%
Distinct2848
Distinct (%)96.9%
Missing5
Missing (%)0.2%
Memory size23.1 KiB
2023-12-13T00:27:08.041571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length173
Median length94
Mean length21.881973
Min length6

Characters and Unicode

Total characters64333
Distinct characters336
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2765 ?
Unique (%)94.0%

Sample

1st row도장공사 (연면적 12987㎡)
2nd row토목공사 (공사면적 2004㎡)
3rd row토목공사 (공사면적 1910㎡)
4th row건축물축조공사 (연면적 11216.85㎡)
5th row토목공사 (공사면적 2454㎡)
ValueCountFrequency (%)
토목공사 1432
 
13.9%
건축물축조공사 680
 
6.6%
546
 
5.3%
공사 350
 
3.4%
345
 
3.4%
밖의 345
 
3.4%
연면적 311
 
3.0%
공사면적 229
 
2.2%
지반조성공사중건축물해체공사·토공사및정지공사 165
 
1.6%
총연장 134
 
1.3%
Other values (3820) 5757
55.9%
2023-12-13T00:27:08.631067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7843
 
12.2%
3824
 
5.9%
3685
 
5.7%
( 3533
 
5.5%
) 3526
 
5.5%
1 2375
 
3.7%
0 2040
 
3.2%
2 1985
 
3.1%
1976
 
3.1%
1947
 
3.0%
Other values (326) 31599
49.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27475
42.7%
Decimal Number 15806
24.6%
Space Separator 7843
 
12.2%
Open Punctuation 3537
 
5.5%
Close Punctuation 3530
 
5.5%
Other Symbol 2677
 
4.2%
Other Punctuation 2057
 
3.2%
Lowercase Letter 940
 
1.5%
Uppercase Letter 263
 
0.4%
Math Symbol 187
 
0.3%
Other values (2) 18
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3824
 
13.9%
3685
 
13.4%
1976
 
7.2%
1621
 
5.9%
1438
 
5.2%
1045
 
3.8%
962
 
3.5%
956
 
3.5%
946
 
3.4%
940
 
3.4%
Other values (266) 10082
36.7%
Uppercase Letter
ValueCountFrequency (%)
L 197
74.9%
A 17
 
6.5%
B 11
 
4.2%
K 9
 
3.4%
H 6
 
2.3%
U 4
 
1.5%
P 4
 
1.5%
D 4
 
1.5%
M 3
 
1.1%
C 3
 
1.1%
Other values (4) 5
 
1.9%
Decimal Number
ValueCountFrequency (%)
1 2375
15.0%
0 2040
12.9%
2 1985
12.6%
3 1632
10.3%
4 1490
9.4%
5 1407
8.9%
6 1299
8.2%
8 1236
7.8%
9 1196
7.6%
7 1146
7.3%
Other Punctuation
ValueCountFrequency (%)
. 968
47.1%
: 885
43.0%
· 171
 
8.3%
/ 28
 
1.4%
' 2
 
0.1%
; 1
 
< 0.1%
? 1
 
< 0.1%
* 1
 
< 0.1%
Lowercase Letter
ValueCountFrequency (%)
m 894
95.1%
a 17
 
1.8%
k 14
 
1.5%
t 8
 
0.9%
h 3
 
0.3%
f 2
 
0.2%
j 1
 
0.1%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
= 166
88.8%
~ 9
 
4.8%
+ 8
 
4.3%
× 2
 
1.1%
1
 
0.5%
> 1
 
0.5%
Other Symbol
ValueCountFrequency (%)
1947
72.7%
598
 
22.3%
129
 
4.8%
2
 
0.1%
1
 
< 0.1%
Other Number
ValueCountFrequency (%)
³ 1
33.3%
1
33.3%
1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 3533
99.9%
[ 4
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 3526
99.9%
] 4
 
0.1%
Space Separator
ValueCountFrequency (%)
7843
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 35656
55.4%
Hangul 27475
42.7%
Latin 1202
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3824
 
13.9%
3685
 
13.4%
1976
 
7.2%
1621
 
5.9%
1438
 
5.2%
1045
 
3.8%
962
 
3.5%
956
 
3.5%
946
 
3.4%
940
 
3.4%
Other values (266) 10082
36.7%
Common
ValueCountFrequency (%)
7843
22.0%
( 3533
9.9%
) 3526
9.9%
1 2375
 
6.7%
0 2040
 
5.7%
2 1985
 
5.6%
1947
 
5.5%
3 1632
 
4.6%
4 1490
 
4.2%
5 1407
 
3.9%
Other values (29) 7878
22.1%
Latin
ValueCountFrequency (%)
m 894
74.4%
L 197
 
16.4%
a 17
 
1.4%
A 17
 
1.4%
k 14
 
1.2%
B 11
 
0.9%
K 9
 
0.7%
t 8
 
0.7%
H 6
 
0.5%
U 4
 
0.3%
Other values (11) 25
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 34003
52.9%
Hangul 27473
42.7%
CJK Compat 2677
 
4.2%
None 174
 
0.3%
Compat Jamo 2
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%
Arrows 1
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7843
23.1%
( 3533
10.4%
) 3526
10.4%
1 2375
 
7.0%
0 2040
 
6.0%
2 1985
 
5.8%
3 1632
 
4.8%
4 1490
 
4.4%
5 1407
 
4.1%
6 1299
 
3.8%
Other values (38) 6873
20.2%
Hangul
ValueCountFrequency (%)
3824
 
13.9%
3685
 
13.4%
1976
 
7.2%
1621
 
5.9%
1438
 
5.2%
1045
 
3.8%
962
 
3.5%
956
 
3.5%
946
 
3.4%
940
 
3.4%
Other values (265) 10080
36.7%
CJK Compat
ValueCountFrequency (%)
1947
72.7%
598
 
22.3%
129
 
4.8%
2
 
0.1%
1
 
< 0.1%
None
ValueCountFrequency (%)
· 171
98.3%
× 2
 
1.1%
³ 1
 
0.6%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Arrows
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
50.0%
1
50.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2945
Missing (%)100.0%
Memory size26.0 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2945
Missing (%)100.0%
Memory size26.0 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2945
Missing (%)100.0%
Memory size26.0 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2945
Missing (%)100.0%
Memory size26.0 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2945
Missing (%)100.0%
Memory size26.0 KiB

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2945
Missing (%)100.0%
Memory size26.0 KiB

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2945
Missing (%)100.0%
Memory size26.0 KiB

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2945
Missing (%)100.0%
Memory size26.0 KiB

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2945
Missing (%)100.0%
Memory size26.0 KiB

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2945
Missing (%)100.0%
Memory size26.0 KiB

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2945
Missing (%)100.0%
Memory size26.0 KiB

Missing values

2023-12-13T00:27:04.737393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:27:04.938368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명공사장소재지발생사업공사기간대상사업(규모)Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15
0에이치디건설(주)전라남도 순천시 덕월동 9-1 청미래아파트건설업2021-07-27~2021-08-24도장공사 (연면적 12987㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1대유이엔에스(주)순천시 승주읍 유흥리 산256번지건설업2021-08-01~2021-12-31토목공사 (공사면적 2004㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2대유이엔에스(주)순천시 승주읍 유흥리 산231번지건설업2021-08-01~2021-12-31토목공사 (공사면적 1910㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
3(주)진호건설순천시 저전동 90-3번지건설업2021-07-21~2021-12-28건축물축조공사 (연면적 11216.85㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
4승보종합건설(주)순천시 교량동 146-24번지건설업2021-07-21~2021-10-02토목공사 (공사면적 2454㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
5(주)한국태양광발전연구소순천시 외서면 월암리 1078-2 1106-1 1106-2번지건설업2021-07-20~2022-01-19토목공사 (공사면적 4202㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
6(주)한국태양광발전연구소순천시 외서면 월암리 521 522-1 522-2번지건설업2021-07-20~2022-01-19토목공사 (공사면적 6125㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
7산수조경(주)순천시 조곡동 산80-1번지 일원건설업2021-07-19~2021-10-13토목공사 (토공사 8227㎡ 총연장 L335m)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
8대석종합건설(주)전라남도 순천시 조곡동 564 앞 도로일원(용당둑길~용당신흥길)건설업2021-08-01~2023-01-31토목공사 (중소도로 개설공사(2.2)㎞)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
9금호건설(주)순천시 서면 선평리 85번지 일원건설업2021-07-15~2021-08-31도장공사 (연면적 56946424.1㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
사업장명공사장소재지발생사업공사기간대상사업(규모)Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15
2935한신공영(주)전라남도 순천시 왕지동 조례동 일원건설업2006-08-11~2009-09-30(왕조 운곡지구 도시개발 사업 부지면적 227000㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2936수평광산전라남도 순천시 황전면 수평리 산6 산7-3번지비금속물질채취·제조·가공업2006-06-23~2019-05-07그 밖의 공사 (6618㎡) :: 그 밖의 공사 (6618㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2937혜림건설(주)전라남도 순천시 석현동 118번지 2호 외건설업2006-07-25~2008-09-30(5864.07㎡) :: 조경공사 (4200㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2938백석토건(주)전라남도 순천시 왕지동 11번지외 5필지시멘트·석회·프라스터및시멘트관련제품제조및가공업2006-07-18~2010-12-31시멘트제조·가공및저장업 (부지면적 12111㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2939백석토건(주)전라남도 순천시 왕지동 산50-5비금속물질채취·제조·가공업2006-04-30~2010-01-31비금속광물 분쇄물 생산업 (비금속광물 분쇄물 생산업 부지면적6.600㎡ 생산량 72211㎥㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2940대주건설(주)전라남도 순천시 용당동 431번지건설업2006-04-29~2009-08-31(용당동 대주피오레 신축공사 연면적 181116㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2941현대건설(주)전라남도 순천시 서면 학구리 야흥동 덕월동 삼거동 석현동 상사면 일원건설업2005-12-22~2010-12-31(전라선 남원~순천간 전철전원설비공사 A=82774㎡)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2942한신공영(주)전라남도 순천시 서면 압곡리 ~ 서면 청소리(7.12km)건설업2005-09-05~2011-06-30(지장물 해체공사 2805t(톤)) :: 토목공사 (전주~광양간 고속도로 건설공사 15공구 L=7120m) :: (지장물 해체공사 2805t(톤)) :: 비금속광물 분쇄물 생산업 (굴착 암 토사 채취 55492 골재생산 40000㎥) :: 콘크리트제품제조업 (콘크리트 제품 제조 28793㎥)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2943두산중공업(주)전라남도 순천시 서면 구상리 산 210번지 1호건설업2005-09-30~2011-06-30(총연장 6640m) :: (면적 25800㎡) :: 토사석광업 (터널 공사 160000㎥)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2944(주)삼원전력전라남도 순천시 낙안면 목촌리 557번지건설업2005-07-31~2009-05-31(송전선로공사및 토목공사 송전탑 67기 길이 26.072 면적 151089헤베㎞)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

사업장명공사장소재지발생사업공사기간대상사업(규모)# duplicates
10개인전라남도 순천시 해룡면 선월리 371번지건설업2016-01-11~2016-06-11지반조성공사중건축물해체공사·토공사및정지공사 (3585㎡)3
0(주)동서종합건설전라남도 순천시 상사면 비촌리 지내건설업2009-03-23~2009-12-30토목공사 (토목공사 등 6711㎥)2
1(주)씨아이티시스템전라남도 순천시 월등면 대평리 88번지건설업2017-02-24~2017-06-24건축물축조공사 (연면적 2414.28㎡)2
2(주)에스엘건설전라남도 순천시 대룡동 124번지 4호건설업2016-03-01~2016-09-30건축물축조공사 (연면적 : 1143.45㎡) :: 지반조성공사중건축물해체공사·토공사및정지공사 (성토량 : 1114.8㎥)2
3(주)엘에스건설전라남도 순천시 해룡면 남가리 718 720건설업2018-04-09~2018-06-30지반조성공사중건축물해체공사·토공사및정지공사 (1900㎥)2
4(주)축하종합건설전라남도 순천시 조곡동 131번지 62호건설업2015-04-27~2015-12-31건축물축조공사 (연면적 2847.519㎡)2
5(주)크레이전라남도 순천시 서면 선평리 248번지 외 5필지건설업2014-10-01~2015-06-30건축물축조공사 (건축면적 973.52 연면적 4493.70 (지하1 지상 4)㎡)2
6(주)토당전라남도 순천시 별량면 동송리 686번지 1호비금속물질채취·제조·가공업1993-11-05~--석탄제품제조업및아스콘제조업 (아스콘제조 2톤(160㎥/시간)t(톤))2
7(주)한화건설전라남도 순천시 서면 선평리 346건설업2020-08-20~2022-12-31건축물축조공사 (건축연면적 96356.82㎡)2
8개인전라남도 순천시 석현동 677-4건설업2018-02-10~2018-12-31토목공사 (3795㎡)2