Overview

Dataset statistics

Number of variables: 4
Number of observations: 1065
Missing cells: 9
Missing cells (%): 0.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 33.4 KiB
Average record size in memory: 32.1 B
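
The overview figures follow from simple counts over the table. The sketch below recomputes them for a small in-memory table using only the Python standard library. The helper name `dataset_statistics` and the rule that `None` or an empty string counts as missing are assumptions for illustration, not ydata-profiling's implementation.

```python
def dataset_statistics(rows, columns):
    """Recompute overview counts for a list of dict rows (hypothetical helper)."""
    n_obs = len(rows)
    n_cells = n_obs * len(columns)
    # Assumption: a cell counts as missing when it is None or an empty string.
    missing = sum(1 for r in rows for c in columns if r.get(c) in (None, ""))
    # A row is a duplicate when an identical row appeared earlier.
    seen, duplicates = set(), 0
    for r in rows:
        key = tuple(r.get(c) for c in columns)
        if key in seen:
            duplicates += 1
        else:
            seen.add(key)
    return {
        "variables": len(columns),
        "observations": n_obs,
        "missing_cells": missing,
        "missing_cells_pct": 100 * missing / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicates,
    }


# Check of the reported percentage: 9 missing cells out of 4 x 1065 cells.
reported_pct = round(100 * 9 / (4 * 1065), 1)
```

With 4 variables and 1065 observations there are 4260 cells, so 9 missing cells round to the 0.2% shown above.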

Variable types

Text: 4

Dataset

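Description: Data on the status of companies registered with the Korea Specialty Contractors Association in Daejeon Metropolitan City (대전광역시 대한전문건설협회 업체 현황), providing fields such as construction trade, company name, location, and telephone number.
URL: https://www.data.go.kr/data/15061026/fileData.do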

Alerts

상호명 (company name) has unique values: Unique

Reproduction

Analysis started: 2023-12-12 08:01:52.504460
Analysis finished: 2023-12-12 08:01:53.151337
Duration: 0.65 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
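
A report of this kind can typically be regenerated with ydata-profiling's `ProfileReport`. The following is a minimal sketch, not the exact command used for this report. The CSV path, the `cp949` encoding (common for Korean public-data files, but an assumption here), the report title, and the helper name `build_report` are all hypothetical. The third-party imports are deferred so the module still loads where pandas or ydata-profiling is not installed.

```python
def build_report(csv_path, html_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Deferred third-party imports: pandas and ydata-profiling must be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # dtype=str keeps phone numbers and every other field as strings.
    df = pd.read_csv(csv_path, encoding=encoding, dtype=str)
    report = ProfileReport(df, title="Profiling Report")
    report.to_file(html_path)
    return html_path
```

Calling `build_report("companies.csv")` with a hypothetical local file name would write `report.html` next to the script.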

Variables

업종 (construction trade)
Text

Distinct: 65
Distinct (%): 6.1%
Missing: 0
Missing (%): 0.0%
Memory size: 8.4 KiB

Length

Max length: 38
Median length: 29
Mean length: 8.3117371
Min length: 4
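
The mean length equals the total character count divided by the number of non-missing values, which can be checked against this report's own figures (8852 characters over 1065 values). The sketch below computes the four length statistics for a list of strings with the standard library. `length_stats` is a hypothetical helper, and skipping `None` values is an assumption about how missing cells are treated.

```python
from statistics import mean, median


def length_stats(values):
    """Return max/median/mean/min string length, ignoring None (hypothetical helper)."""
    lengths = [len(v) for v in values if v is not None]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


# The five sample rows of this variable, used as a small worked example.
sample = ["구조물해체비계상하수도설비", "실내건축", "조경식재시설물", "금속창호지붕건조", "금속창호지붕건조"]
stats = length_stats(sample)

# Consistency check on the reported mean: total characters / observations.
reported_mean = 8852 / 1065
```

The same check works for a variable with missing values if the divisor is the number of non-missing observations rather than the row count.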

Characters and Unicode

Total characters: 8852
Distinct characters: 45
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
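
As an illustration of those character properties, the sketch below counts the characters of a string by Unicode general category using Python's standard `unicodedata` module. The helper `category_counts` and the name mapping are hypothetical and cover only the categories that occur in this report. The standard library does not expose script or block properties, so those are not reproduced here.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Sm": "Math Symbol",
    "So": "Other Symbol",
    "Zs": "Space Separator",
}


def category_counts(text):
    """Count characters of `text` by Unicode general category (hypothetical helper)."""
    return Counter(
        CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
        for ch in text
    )


# A company name from this dataset: two digits, six Hangul syllables, one symbol.
counts = category_counts("21세기건설산업㈜")
```

Hangul syllables fall under Other Letter because they have no upper or lower case, which is why that category dominates the Korean text columns.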

Unique

Unique: 28
Unique (%): 2.6%

Sample

1st row: 구조물해체비계상하수도설비
2nd row: 실내건축
3rd row: 조경식재시설물
4th row: 금속창호지붕건조
5th row: 금속창호지붕건조

Value | Count | Frequency (%)
실내건축 | 173 | 16.2%
금속창호지붕건조 | 142 | 13.3%
도장습식방수석공 | 141 | 13.2%
조경식재시설물 | 133 | 12.5%
상하수도설비 | 96 | 9.0%
지반조성포장 | 58 | 5.4%
구조물해체비계 | 39 | 3.7%
철근콘크리트 | 37 | 3.5%
지반조성포장상하수도설비 | 29 | 2.7%
지반조성포장철근콘크리트상하수도설비 | 23 | 2.2%
Other values (55) | 194 | 18.2%
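
A frequency table of this shape lists the most common values and folds the remainder into a single "Other values" row. The sketch below shows one way to build it with `collections.Counter`. The helper name `frequency_table` and the one-decimal rounding are assumptions for illustration, not the profiler's own code.

```python
from collections import Counter


def frequency_table(values, top_n=10):
    """Return (value, count, percent) rows plus an 'Other values' row (hypothetical helper)."""
    counts = Counter(values)
    total = sum(counts.values())
    top = counts.most_common(top_n)
    rows = [(v, c, round(100 * c / total, 1)) for v, c in top]
    n_other = len(counts) - len(top)
    if n_other > 0:
        # Everything outside the top_n values is summed into one row.
        c_other = total - sum(c for _, c in top)
        rows.append((f"Other values ({n_other})", c_other, round(100 * c_other / total, 1)))
    return rows


table = frequency_table(["a", "a", "a", "b", "b", "c", "d"], top_n=2)
```

Applied to this variable, the arithmetic matches the table: 173 of 1065 rows is 16.2%, and the ten listed values sum to 871, leaving 194 rows spread over the other 55 distinct values.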

Most occurring characters

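Value | Count | Frequency (%)
(not preserved) | 632 | 7.1%
(not preserved) | 410 | 4.6%
(not preserved) | 408 | 4.6%
(not preserved) | 396 | 4.5%
(not preserved) | 378 | 4.3%
(not preserved) | 373 | 4.2%
(not preserved) | 367 | 4.1%
(not preserved) | 366 | 4.1%
(not preserved) | 276 | 3.1%
(not preserved) | 266 | 3.0%
Other values (35) | 4980 | 56.3%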

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8852 | 100.0%

Most frequent character per category

Other Letter
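Value | Count | Frequency (%)
(not preserved) | 632 | 7.1%
(not preserved) | 410 | 4.6%
(not preserved) | 408 | 4.6%
(not preserved) | 396 | 4.5%
(not preserved) | 378 | 4.3%
(not preserved) | 373 | 4.2%
(not preserved) | 367 | 4.1%
(not preserved) | 366 | 4.1%
(not preserved) | 276 | 3.1%
(not preserved) | 266 | 3.0%
Other values (35) | 4980 | 56.3%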

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8852 | 100.0%

Most frequent character per script

Hangul
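Value | Count | Frequency (%)
(not preserved) | 632 | 7.1%
(not preserved) | 410 | 4.6%
(not preserved) | 408 | 4.6%
(not preserved) | 396 | 4.5%
(not preserved) | 378 | 4.3%
(not preserved) | 373 | 4.2%
(not preserved) | 367 | 4.1%
(not preserved) | 366 | 4.1%
(not preserved) | 276 | 3.1%
(not preserved) | 266 | 3.0%
Other values (35) | 4980 | 56.3%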

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8852 | 100.0%

Most frequent character per block

Hangul
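Value | Count | Frequency (%)
(not preserved) | 632 | 7.1%
(not preserved) | 410 | 4.6%
(not preserved) | 408 | 4.6%
(not preserved) | 396 | 4.5%
(not preserved) | 378 | 4.3%
(not preserved) | 373 | 4.2%
(not preserved) | 367 | 4.1%
(not preserved) | 366 | 4.1%
(not preserved) | 276 | 3.1%
(not preserved) | 266 | 3.0%
Other values (35) | 4980 | 56.3%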

상호명 (company name)
Text

UNIQUE 

Distinct: 1065
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 8.4 KiB

Length

Max length: 15
Median length: 14
Mean length: 7.5802817
Min length: 2

Characters and Unicode

Total characters: 8073
Distinct characters: 333
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1065
Unique (%): 100.0%

Sample

1st row: 21세기건설산업㈜
2nd row: O.J건축인테리어
3rd row: (주)가가조경
4th row: (주)가경건설산업
5th row: 가나건설(주)

Value | Count | Frequency (%)
21세기건설산업㈜ | 1 | 0.1%
주)이든아이디 | 1 | 0.1%
유현건설(주 | 1 | 0.1%
주)이튼알렌 | 1 | 0.1%
유)윤우건설 | 1 | 0.1%
은돌건설(주 | 1 | 0.1%
주)은성기업 | 1 | 0.1%
은성산업(주 | 1 | 0.1%
주)은성씨앤알 | 1 | 0.1%
주)은성아트 | 1 | 0.1%
Other values (1055) | 1055 | 99.1%

Most occurring characters

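Value | Count | Frequency (%)
) | 1024 | 12.7%
( | 1024 | 12.7%
(not preserved) | 1001 | 12.4%
(not preserved) | 492 | 6.1%
(not preserved) | 430 | 5.3%
(not preserved) | 204 | 2.5%
(not preserved) | 198 | 2.5%
(not preserved) | 197 | 2.4%
(not preserved) | 125 | 1.5%
(not preserved) | 110 | 1.4%
Other values (323) | 3268 | 40.5%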

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6016 | 74.5%
Close Punctuation | 1024 | 12.7%
Open Punctuation | 1024 | 12.7%
Uppercase Letter | 4 | < 0.1%
Other Symbol | 2 | < 0.1%
Decimal Number | 2 | < 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

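Other Letter
Value | Count | Frequency (%)
(not preserved) | 1001 | 16.6%
(not preserved) | 492 | 8.2%
(not preserved) | 430 | 7.1%
(not preserved) | 204 | 3.4%
(not preserved) | 198 | 3.3%
(not preserved) | 197 | 3.3%
(not preserved) | 125 | 2.1%
(not preserved) | 110 | 1.8%
(not preserved) | 89 | 1.5%
(not preserved) | 86 | 1.4%
Other values (314) | 3084 | 51.3%

Uppercase Letter
Value | Count | Frequency (%)
J | 2 | 50.0%
O | 1 | 25.0%
S | 1 | 25.0%

Decimal Number
Value | Count | Frequency (%)
2 | 1 | 50.0%
1 | 1 | 50.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1024 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1024 | 100.0%

Other Symbol
Value | Count | Frequency (%)
㈜ | 2 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 1 | 100.0%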

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6018 | 74.5%
Common | 2051 | 25.4%
Latin | 4 | < 0.1%

Most frequent character per script

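Hangul
Value | Count | Frequency (%)
(not preserved) | 1001 | 16.6%
(not preserved) | 492 | 8.2%
(not preserved) | 430 | 7.1%
(not preserved) | 204 | 3.4%
(not preserved) | 198 | 3.3%
(not preserved) | 197 | 3.3%
(not preserved) | 125 | 2.1%
(not preserved) | 110 | 1.8%
(not preserved) | 89 | 1.5%
(not preserved) | 86 | 1.4%
Other values (315) | 3086 | 51.3%

Common
Value | Count | Frequency (%)
) | 1024 | 49.9%
( | 1024 | 49.9%
2 | 1 | < 0.1%
1 | 1 | < 0.1%
. | 1 | < 0.1%

Latin
Value | Count | Frequency (%)
J | 2 | 50.0%
O | 1 | 25.0%
S | 1 | 25.0%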

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6016 | 74.5%
ASCII | 2055 | 25.5%
None | 2 | < 0.1%

Most frequent character per block

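ASCII
Value | Count | Frequency (%)
) | 1024 | 49.8%
( | 1024 | 49.8%
J | 2 | 0.1%
2 | 1 | < 0.1%
1 | 1 | < 0.1%
. | 1 | < 0.1%
O | 1 | < 0.1%
S | 1 | < 0.1%

Hangul
Value | Count | Frequency (%)
(not preserved) | 1001 | 16.6%
(not preserved) | 492 | 8.2%
(not preserved) | 430 | 7.1%
(not preserved) | 204 | 3.4%
(not preserved) | 198 | 3.3%
(not preserved) | 197 | 3.3%
(not preserved) | 125 | 2.1%
(not preserved) | 110 | 1.8%
(not preserved) | 89 | 1.5%
(not preserved) | 86 | 1.4%
Other values (314) | 3084 | 51.3%

None
Value | Count | Frequency (%)
㈜ | 2 | 100.0%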

소재지 (location)
Text

Distinct: 1046
Distinct (%): 98.2%
Missing: 0
Missing (%): 0.0%
Memory size: 8.4 KiB

Length

Max length: 64
Median length: 43
Mean length: 29.139906
Min length: 13

Characters and Unicode

Total characters: 31034
Distinct characters: 325
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1029
Unique (%): 96.6%

Sample

1st row: 대전광역시 서구 유등로 353 (변동)
2nd row: 대전광역시 중구 계백로1615번길 34, 104호(유천동,현대2차상가)
3rd row: 대전광역시 유성구 신성로72번길 46, 201호 (신성동)
4th row: 대전광역시 유성구 박산로 62 (구암동)
5th row: 대전광역시 유성구 유성대로 615 (구암동)

Value | Count | Frequency (%)
대전광역시 | 952 | 16.5%
유성구 | 335 | 5.8%
서구 | 275 | 4.8%
대덕구 | 180 | 3.1%
중구 | 160 | 2.8%
동구 | 115 | 2.0%
대전 | 92 | 1.6%
2층 | 70 | 1.2%
오정동 | 46 | 0.8%
1층 | 43 | 0.7%
Other values (1705) | 3508 | 60.7%

Most occurring characters

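Value | Count | Frequency (%)
(space) | 4712 | 15.2%
(not preserved) | 1671 | 5.4%
(not preserved) | 1336 | 4.3%
1 | 1333 | 4.3%
(not preserved) | 1122 | 3.6%
(not preserved) | 1101 | 3.5%
(not preserved) | 981 | 3.2%
) | 957 | 3.1%
( | 957 | 3.1%
(not preserved) | 955 | 3.1%
Other values (315) | 15909 | 51.3%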

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 17388 | 56.0%
Decimal Number | 5856 | 18.9%
Space Separator | 4712 | 15.2%
Close Punctuation | 957 | 3.1%
Open Punctuation | 957 | 3.1%
Other Punctuation | 848 | 2.7%
Dash Punctuation | 293 | 0.9%
Uppercase Letter | 19 | 0.1%
Lowercase Letter | 3 | < 0.1%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 1671 | 9.6%
(not preserved) | 1336 | 7.7%
(not preserved) | 1122 | 6.5%
(not preserved) | 1101 | 6.3%
(not preserved) | 981 | 5.6%
(not preserved) | 955 | 5.5%
(not preserved) | 952 | 5.5%
(not preserved) | 933 | 5.4%
(not preserved) | 510 | 2.9%
(not preserved) | 482 | 2.8%
Other values (285) | 7345 | 42.2%

Decimal Number
Value | Count | Frequency (%)
1 | 1333 | 22.8%
2 | 861 | 14.7%
3 | 633 | 10.8%
0 | 623 | 10.6%
5 | 541 | 9.2%
4 | 474 | 8.1%
6 | 424 | 7.2%
7 | 360 | 6.1%
9 | 306 | 5.2%
8 | 301 | 5.1%

Uppercase Letter
Value | Count | Frequency (%)
B | 6 | 31.6%
A | 4 | 21.1%
T | 2 | 10.5%
K | 2 | 10.5%
C | 1 | 5.3%
D | 1 | 5.3%
G | 1 | 5.3%
H | 1 | 5.3%
E | 1 | 5.3%

Other Punctuation
Value | Count | Frequency (%)
, | 846 | 99.8%
. | 1 | 0.1%
& | 1 | 0.1%

Lowercase Letter
Value | Count | Frequency (%)
o | 1 | 33.3%
n | 1 | 33.3%
e | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 4712 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 957 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 957 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 293 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 17388 | 56.0%
Common | 13624 | 43.9%
Latin | 22 | 0.1%

Most frequent character per script

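Hangul
Value | Count | Frequency (%)
(not preserved) | 1671 | 9.6%
(not preserved) | 1336 | 7.7%
(not preserved) | 1122 | 6.5%
(not preserved) | 1101 | 6.3%
(not preserved) | 981 | 5.6%
(not preserved) | 955 | 5.5%
(not preserved) | 952 | 5.5%
(not preserved) | 933 | 5.4%
(not preserved) | 510 | 2.9%
(not preserved) | 482 | 2.8%
Other values (285) | 7345 | 42.2%

Common
Value | Count | Frequency (%)
(space) | 4712 | 34.6%
1 | 1333 | 9.8%
) | 957 | 7.0%
( | 957 | 7.0%
2 | 861 | 6.3%
, | 846 | 6.2%
3 | 633 | 4.6%
0 | 623 | 4.6%
5 | 541 | 4.0%
4 | 474 | 3.5%
Other values (8) | 1687 | 12.4%

Latin
Value | Count | Frequency (%)
B | 6 | 27.3%
A | 4 | 18.2%
T | 2 | 9.1%
K | 2 | 9.1%
C | 1 | 4.5%
D | 1 | 4.5%
G | 1 | 4.5%
o | 1 | 4.5%
H | 1 | 4.5%
E | 1 | 4.5%
Other values (2) | 2 | 9.1%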

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 17388 | 56.0%
ASCII | 13646 | 44.0%

Most frequent character per block

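ASCII
Value | Count | Frequency (%)
(space) | 4712 | 34.5%
1 | 1333 | 9.8%
) | 957 | 7.0%
( | 957 | 7.0%
2 | 861 | 6.3%
, | 846 | 6.2%
3 | 633 | 4.6%
0 | 623 | 4.6%
5 | 541 | 4.0%
4 | 474 | 3.5%
Other values (20) | 1709 | 12.5%

Hangul
Value | Count | Frequency (%)
(not preserved) | 1671 | 9.6%
(not preserved) | 1336 | 7.7%
(not preserved) | 1122 | 6.5%
(not preserved) | 1101 | 6.3%
(not preserved) | 981 | 5.6%
(not preserved) | 955 | 5.5%
(not preserved) | 952 | 5.5%
(not preserved) | 933 | 5.4%
(not preserved) | 510 | 2.9%
(not preserved) | 482 | 2.8%
Other values (285) | 7345 | 42.2%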

연락처 (contact number)
Text

Distinct: 1014
Distinct (%): 96.0%
Missing: 9
Missing (%): 0.8%
Memory size: 8.4 KiB

Length

Max length: 17
Median length: 12
Mean length: 12.110795
Min length: 11

Characters and Unicode

Total characters: 12789
Distinct characters: 13
Distinct categories: 4
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 981
Unique (%): 92.9%

Sample

1st row: 042-523-0121
2nd row: 070-8793-7025
3rd row: 042-863-5052
4th row: 042-824-7776
5th row: 042-825-0938

Value | Count | Frequency (%)
042-000-0000 | 8 | 0.8%
042-587-0900 | 3 | 0.3%
042-535-5588 | 3 | 0.3%
042-632-8312 | 3 | 0.3%
042-364-3877 | 2 | 0.2%
042-825-7882 | 2 | 0.2%
042-320-7715 | 2 | 0.2%
042-534-8015 | 2 | 0.2%
042-632-9677 | 2 | 0.2%
042-622-4616 | 2 | 0.2%
Other values (1004) | 1027 | 97.3%
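
The most frequent value, 042-000-0000, may be a placeholder rather than a real number, and the character counts for this variable include `~` and `,`, which suggests that some entries do not follow the plain three-part format. Both points are inferences from the tables, not statements in the source. The sketch below labels phone strings accordingly. The regular expression, the all-zero placeholder rule, and the helper name `classify_phone` are assumptions for illustration.

```python
import re

# Assumed pattern: area or service code, 3-4 digit exchange, 4-digit line number.
PHONE_RE = re.compile(r"^0\d{1,2}-\d{3,4}-\d{4}$")


def classify_phone(value):
    """Label a phone string as 'missing', 'placeholder', 'standard' or 'other'."""
    if value is None or value.strip() == "":
        return "missing"
    value = value.strip()
    if not PHONE_RE.match(value):
        return "other"
    # Assumption: an all-zero exchange and line number marks a placeholder.
    _, exchange, line = value.split("-")
    if set(exchange + line) == {"0"}:
        return "placeholder"
    return "standard"


labels = [classify_phone(v) for v in ["042-523-0121", "070-8793-7025", "042-000-0000", None]]
```

Counting the labels over the whole column would show how many rows need manual review before the numbers are used.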

Most occurring characters

Value | Count | Frequency (%)
- | 2112 | 16.5%
2 | 2108 | 16.5%
0 | 1864 | 14.6%
4 | 1689 | 13.2%
8 | 878 | 6.9%
5 | 866 | 6.8%
6 | 797 | 6.2%
3 | 760 | 5.9%
7 | 653 | 5.1%
1 | 630 | 4.9%
Other values (3) | 432 | 3.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 10637 | 83.2%
Dash Punctuation | 2112 | 16.5%
Math Symbol | 37 | 0.3%
Other Punctuation | 3 | < 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 2108 | 19.8%
0 | 1864 | 17.5%
4 | 1689 | 15.9%
8 | 878 | 8.3%
5 | 866 | 8.1%
6 | 797 | 7.5%
3 | 760 | 7.1%
7 | 653 | 6.1%
1 | 630 | 5.9%
9 | 392 | 3.7%

Dash Punctuation
Value | Count | Frequency (%)
- | 2112 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 37 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 3 | 100.0%
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 12789 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 2112 | 16.5%
2 | 2108 | 16.5%
0 | 1864 | 14.6%
4 | 1689 | 13.2%
8 | 878 | 6.9%
5 | 866 | 6.8%
6 | 797 | 6.2%
3 | 760 | 5.9%
7 | 653 | 5.1%
1 | 630 | 4.9%
Other values (3) | 432 | 3.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 12789 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 2112 | 16.5%
2 | 2108 | 16.5%
0 | 1864 | 14.6%
4 | 1689 | 13.2%
8 | 878 | 6.9%
5 | 866 | 6.8%
6 | 797 | 6.2%
3 | 760 | 5.9%
7 | 653 | 5.1%
1 | 630 | 4.9%
Other values (3) | 432 | 3.4%

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion by eye.
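
Both displays are built from the same information: whether each cell is present or missing. The sketch below derives per-column missing counts and a boolean nullity matrix for a list of dict rows with the standard library. The helper names are hypothetical, treating `None` and empty strings as missing is an assumption, and the second example row is invented for illustration.

```python
def missing_by_column(rows, columns):
    """Count missing cells per column for a list of dict rows (hypothetical helper)."""
    # Assumption: None and empty strings both count as missing.
    return {c: sum(1 for r in rows if r.get(c) in (None, "")) for c in columns}


def nullity_matrix(rows, columns):
    """One list of booleans per row: True where the cell is present."""
    return [[r.get(c) not in (None, "") for c in columns] for r in rows]


COLUMNS = ["업종", "상호명", "소재지", "연락처"]
rows = [
    # First sample row of this report.
    {"업종": "구조물해체비계상하수도설비", "상호명": "21세기건설산업㈜",
     "소재지": "대전광역시 서구 유등로 353 (변동)", "연락처": "042-523-0121"},
    # Hypothetical row with no phone number, for illustration only.
    {"업종": "실내건축", "상호명": "예시건설(주)", "소재지": "대전광역시 서구", "연락처": None},
]
counts = missing_by_column(rows, COLUMNS)
matrix = nullity_matrix(rows, COLUMNS)
```

In this report only 연락처 has missing cells (9 of 1065), so a per-column count over the full dataset would be zero for the other three variables.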

Sample

Row | 업종 | 상호명 | 소재지 | 연락처
0 | 구조물해체비계상하수도설비 | 21세기건설산업㈜ | 대전광역시 서구 유등로 353 (변동) | 042-523-0121
1 | 실내건축 | O.J건축인테리어 | 대전광역시 중구 계백로1615번길 34, 104호(유천동,현대2차상가) | 070-8793-7025
2 | 조경식재시설물 | (주)가가조경 | 대전광역시 유성구 신성로72번길 46, 201호 (신성동) | 042-863-5052
3 | 금속창호지붕건조 | (주)가경건설산업 | 대전광역시 유성구 박산로 62 (구암동) | 042-824-7776
4 | 금속창호지붕건조 | 가나건설(주) | 대전광역시 유성구 유성대로 615 (구암동) | 042-825-0938
5 | 도장습식방수석공 | 가나공영(주) | 대전광역시 대덕구 중리동로27번길 15, 1층(중리동) | 042-632-1075
6 | 지반조성포장철근콘크리트상하수도설비 | 가득건설(주) | 대전광역시 동구 옻밭2길 19 (신흥동) | 042-545-8870
7 | 구조물해체비계 | (주)가디언 | 대전광역시 동구 신기로101번길 44-8, 101호(가오동, 은어송빌라) | 042-273-1504
8 | 금속창호지붕건조 | (주)가람 | 대전광역시 중구 보문산로 363, 3층 (문화동) | 042-581-1404
9 | 조경식재시설물 | (주)가람아트조경 | 대전광역시 서구 둔산대로117번길 66, 1119호(만년동,골드벤처타운) | 042-862-1756

Row | 업종 | 상호명 | 소재지 | 연락처
1055 | 조경식재시설물상하수도설비 | 황소건설(주) | 대전 유성구 방동 708-1 | 042-825-6667
1056 | 지반조성포장 | (주)효림건설 | 대전광역시 대덕구 송촌로 1, 1층 101호(송촌동) | 042-581-3312
1057 | 실내건축 | (주)효성건축 | 대전광역시 중구 태평로113번길 26,1층(태평동) | 042-543-9111
1058 | 조경식재시설물철근콘크리트상하수도설비 | 효성조경개발(주) | 대전광역시 중구 대둔산로 419-4, 901-1호(산성동, 한밭프라자) | 042-584-2146
1059 | 조경식재시설물 | (주)휘게건설 | 대전광역시 유성구 반석로 100, 2층 206호(반석동) | 042-824-4680
1060 | 실내건축 | (주)휴앤 | 대전 서구 둔산동 2152 | 042-471-4466
1061 | 지반조성포장 | 흥남토건(주) | 대전광역시 유성구 유성대로729번길 25, 102호(장대동) | 042-823-9898
1062 | 지반조성포장철근콘크리트 | (주)흥용건설 | 대전광역시 동구 대전로994번길 87 (홍도동) | 042-634-5355
1063 | 지반조성포장상하수도설비 | 흥정건설(주) | 대전광역시 유성구 유성대로654번길 125, 207호 (구암동) | 041-522-7608
1064 | 철근콘크리트구조물해체비계 | 희망건설(주) | 대전광역시 유성구 유성대로736번길 19, 604호 (장대동,넥스투빌) | 042-825-5028