Overview

Dataset statistics

Number of variables4
Number of observations4694
Missing cells30
Missing cells (%)0.2%
Duplicate rows5
Duplicate rows (%)0.1%
Total size in memory146.8 KiB
Average record size in memory32.0 B

Variable types

Text3
Categorical1

Dataset

Description자동차관리법제34조(자동차의 튜닝) 2항에 따라 튜닝 작업을 완료한 자동차정비업자 또는튜닝자동차제작 등록 현황입니다. - 전화번호 미입력 업체 포함
URLhttps://www.data.go.kr/data/15117712/fileData.do

Alerts

Dataset has 5 (0.1%) duplicate rowsDuplicates
정비구분 is highly imbalanced (61.1%)Imbalance

Reproduction

Analysis started2023-12-12 17:40:39.733357
Analysis finished2023-12-12 17:40:40.987502
Duration1.25 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct4478
Distinct (%)95.4%
Missing0
Missing (%)0.0%
Memory size36.8 KiB
2023-12-13T02:40:41.216389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length22
Mean length7.8894333
Min length1

Characters and Unicode

Total characters37033
Distinct characters649
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4319 ?
Unique (%)92.0%

Sample

1st row 진모터스
2nd row(bieung)마린모터스
3rd row(뉴)글로벌모터스
4th row(유) 신원1급자동차정비공업사
5th row(유)1급전북자동차정비공업사
ValueCountFrequency (%)
주식회사 195
 
3.5%
모터스 46
 
0.8%
애니카랜드 27
 
0.5%
쌍용자동차 23
 
0.4%
1급 23
 
0.4%
motors 19
 
0.3%
유한회사 18
 
0.3%
스피드메이트 17
 
0.3%
오토오아시스 17
 
0.3%
덱스크루 14
 
0.3%
Other values (4697) 5189
92.9%
2023-12-13T02:40:41.651085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1505
 
4.1%
1479
 
4.0%
1371
 
3.7%
1365
 
3.7%
1346
 
3.6%
1285
 
3.5%
1124
 
3.0%
1105
 
3.0%
1070
 
2.9%
1026
 
2.8%
Other values (639) 24357
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32393
87.5%
Uppercase Letter 1157
 
3.1%
Space Separator 899
 
2.4%
Close Punctuation 876
 
2.4%
Open Punctuation 871
 
2.4%
Decimal Number 338
 
0.9%
Lowercase Letter 334
 
0.9%
Other Symbol 90
 
0.2%
Other Punctuation 50
 
0.1%
Dash Punctuation 23
 
0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1505
 
4.6%
1479
 
4.6%
1371
 
4.2%
1365
 
4.2%
1346
 
4.2%
1285
 
4.0%
1124
 
3.5%
1105
 
3.4%
1070
 
3.3%
1026
 
3.2%
Other values (564) 19717
60.9%
Uppercase Letter
ValueCountFrequency (%)
S 110
 
9.5%
O 99
 
8.6%
T 94
 
8.1%
R 84
 
7.3%
M 84
 
7.3%
A 71
 
6.1%
E 60
 
5.2%
C 59
 
5.1%
K 55
 
4.8%
N 51
 
4.4%
Other values (16) 390
33.7%
Lowercase Letter
ValueCountFrequency (%)
e 50
15.0%
o 45
13.5%
r 36
10.8%
t 33
9.9%
s 23
 
6.9%
a 22
 
6.6%
c 14
 
4.2%
i 14
 
4.2%
m 14
 
4.2%
p 12
 
3.6%
Other values (13) 71
21.3%
Decimal Number
ValueCountFrequency (%)
1 256
75.7%
2 29
 
8.6%
3 12
 
3.6%
0 9
 
2.7%
5 8
 
2.4%
4 6
 
1.8%
6 6
 
1.8%
9 5
 
1.5%
8 4
 
1.2%
7 3
 
0.9%
Other Punctuation
ValueCountFrequency (%)
. 28
56.0%
& 11
 
22.0%
, 4
 
8.0%
· 2
 
4.0%
? 2
 
4.0%
' 2
 
4.0%
; 1
 
2.0%
Close Punctuation
ValueCountFrequency (%)
) 875
99.9%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 870
99.9%
[ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
899
100.0%
Other Symbol
ValueCountFrequency (%)
90
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 23
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Modifier Symbol
ValueCountFrequency (%)
´ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32483
87.7%
Common 3059
 
8.3%
Latin 1491
 
4.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1505
 
4.6%
1479
 
4.6%
1371
 
4.2%
1365
 
4.2%
1346
 
4.1%
1285
 
4.0%
1124
 
3.5%
1105
 
3.4%
1070
 
3.3%
1026
 
3.2%
Other values (565) 19807
61.0%
Latin
ValueCountFrequency (%)
S 110
 
7.4%
O 99
 
6.6%
T 94
 
6.3%
R 84
 
5.6%
M 84
 
5.6%
A 71
 
4.8%
E 60
 
4.0%
C 59
 
4.0%
K 55
 
3.7%
N 51
 
3.4%
Other values (39) 724
48.6%
Common
ValueCountFrequency (%)
899
29.4%
) 875
28.6%
( 870
28.4%
1 256
 
8.4%
2 29
 
0.9%
. 28
 
0.9%
- 23
 
0.8%
3 12
 
0.4%
& 11
 
0.4%
0 9
 
0.3%
Other values (15) 47
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32392
87.5%
ASCII 4547
 
12.3%
None 93
 
0.3%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1505
 
4.6%
1479
 
4.6%
1371
 
4.2%
1365
 
4.2%
1346
 
4.2%
1285
 
4.0%
1124
 
3.5%
1105
 
3.4%
1070
 
3.3%
1026
 
3.2%
Other values (563) 19716
60.9%
ASCII
ValueCountFrequency (%)
899
19.8%
) 875
19.2%
( 870
19.1%
1 256
 
5.6%
S 110
 
2.4%
O 99
 
2.2%
T 94
 
2.1%
R 84
 
1.8%
M 84
 
1.8%
A 71
 
1.6%
Other values (62) 1105
24.3%
None
ValueCountFrequency (%)
90
96.8%
· 2
 
2.2%
´ 1
 
1.1%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

정비구분
Categorical

IMBALANCE 

Distinct20
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size36.8 KiB
자동차전문정비업
2055 
자동차종합정비업
1913 
제작
360 
소형자동차종합정비업
328 
<NA>
 
15
Other values (15)
 
23

Length

Max length12
Median length8
Mean length7.6723477
Min length1

Unique

Unique13 ?
Unique (%)0.3%

Sample

1st row소형자동차종합정비업
2nd row자동차전문정비업
3rd row자동차종합정비업
4th row자동차종합정비업
5th row자동차종합정비업

Common Values

ValueCountFrequency (%)
자동차전문정비업 2055
43.8%
자동차종합정비업 1913
40.8%
제작 360
 
7.7%
소형자동차종합정비업 328
 
7.0%
<NA> 15
 
0.3%
원동기전문정비업 8
 
0.2%
2
 
< 0.1%
261-3673 1
 
< 0.1%
051-303-5511 1
 
< 0.1%
051-305-4020 1
 
< 0.1%
Other values (10) 10
 
0.2%

Length

2023-12-13T02:40:41.817697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
자동차전문정비업 2055
43.8%
자동차종합정비업 1913
40.8%
제작 360
 
7.7%
소형자동차종합정비업 328
 
7.0%
na 15
 
0.3%
원동기전문정비업 8
 
0.2%
052-261-2721 1
 
< 0.1%
031-266-0697 1
 
< 0.1%
031-943-4010 1
 
< 0.1%
053-355-8255 1
 
< 0.1%
Other values (9) 9
 
0.2%
Distinct3317
Distinct (%)70.7%
Missing0
Missing (%)0.0%
Memory size36.8 KiB
2023-12-13T02:40:42.008371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length12
Mean length17.369195
Min length4

Characters and Unicode

Total characters81531
Distinct characters98
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3293 ?
Unique (%)70.2%

Sample

1st row031-944-4486
2nd row063-465-5119
3rd row062-954-1992
4th row061-337-6111
5th row063-532-6841
ValueCountFrequency (%)
연락처 1351
11.7%
입력으로 1351
11.7%
삭제(검색포털 1351
11.7%
검색 1351
11.7%
필요 1351
11.7%
미입력/휴대폰 1351
11.7%
남구 7
 
0.1%
울산광역시 7
 
0.1%
02-2105-8423 4
 
< 0.1%
부산광역시 3
 
< 0.1%
Other values (3343) 3372
29.3%
2023-12-13T02:40:42.380936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6806
 
8.3%
- 6538
 
8.0%
0 5546
 
6.8%
5 4127
 
5.1%
3 4104
 
5.0%
2 3597
 
4.4%
1 3309
 
4.1%
4 2973
 
3.6%
6 2725
 
3.3%
2702
 
3.3%
Other values (88) 39104
48.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 32803
40.2%
Other Letter 31287
38.4%
Space Separator 6806
 
8.3%
Dash Punctuation 6538
 
8.0%
Close Punctuation 1373
 
1.7%
Open Punctuation 1366
 
1.7%
Other Punctuation 1356
 
1.7%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2702
 
8.6%
2702
 
8.6%
2702
 
8.6%
2702
 
8.6%
1366
 
4.4%
1355
 
4.3%
1352
 
4.3%
1351
 
4.3%
1351
 
4.3%
1351
 
4.3%
Other values (69) 12353
39.5%
Decimal Number
ValueCountFrequency (%)
0 5546
16.9%
5 4127
12.6%
3 4104
12.5%
2 3597
11.0%
1 3309
10.1%
4 2973
9.1%
6 2725
8.3%
7 2392
7.3%
8 2330
7.1%
9 1700
 
5.2%
Other Punctuation
ValueCountFrequency (%)
/ 1351
99.6%
, 3
 
0.2%
? 1
 
0.1%
: 1
 
0.1%
Space Separator
ValueCountFrequency (%)
6806
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6538
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1373
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1366
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 50244
61.6%
Hangul 31287
38.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2702
 
8.6%
2702
 
8.6%
2702
 
8.6%
2702
 
8.6%
1366
 
4.4%
1355
 
4.3%
1352
 
4.3%
1351
 
4.3%
1351
 
4.3%
1351
 
4.3%
Other values (69) 12353
39.5%
Common
ValueCountFrequency (%)
6806
13.5%
- 6538
13.0%
0 5546
11.0%
5 4127
8.2%
3 4104
8.2%
2 3597
7.2%
1 3309
6.6%
4 2973
 
5.9%
6 2725
 
5.4%
7 2392
 
4.8%
Other values (9) 8127
16.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 50244
61.6%
Hangul 31287
38.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6806
13.5%
- 6538
13.0%
0 5546
11.0%
5 4127
8.2%
3 4104
8.2%
2 3597
7.2%
1 3309
6.6%
4 2973
 
5.9%
6 2725
 
5.4%
7 2392
 
4.8%
Other values (9) 8127
16.2%
Hangul
ValueCountFrequency (%)
2702
 
8.6%
2702
 
8.6%
2702
 
8.6%
2702
 
8.6%
1366
 
4.4%
1355
 
4.3%
1352
 
4.3%
1351
 
4.3%
1351
 
4.3%
1351
 
4.3%
Other values (69) 12353
39.5%

주소
Text

Distinct4599
Distinct (%)98.6%
Missing30
Missing (%)0.6%
Memory size36.8 KiB
2023-12-13T02:40:42.718255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length83
Median length55
Mean length23.655017
Min length1

Characters and Unicode

Total characters110327
Distinct characters483
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4546 ?
Unique (%)97.5%

Sample

1st row경기도 파주시 광탄면 장지산로200번길 47-48
2nd row전라북도 군산시 비응도동 41번지 12호
3rd row광주광역시 광산구 북문대로 599(수완동)
4th row전라남도 나주시 왕곡면 신원리 345-9, 346-1, 345-33
5th row전라북도 정읍시 서부산업도로 595(하북동)
ValueCountFrequency (%)
경기도 1126
 
5.2%
경상북도 427
 
2.0%
경상남도 357
 
1.6%
충청남도 301
 
1.4%
전라북도 294
 
1.4%
대구광역시 273
 
1.3%
서울특별시 214
 
1.0%
부산광역시 211
 
1.0%
서구 204
 
0.9%
전라남도 201
 
0.9%
Other values (8319) 18126
83.4%
2023-12-13T02:40:43.175982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17131
 
15.5%
4337
 
3.9%
4209
 
3.8%
1 3744
 
3.4%
3620
 
3.3%
3367
 
3.1%
( 2975
 
2.7%
) 2973
 
2.7%
2742
 
2.5%
2 2416
 
2.2%
Other values (473) 62813
56.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67847
61.5%
Decimal Number 17663
 
16.0%
Space Separator 17131
 
15.5%
Open Punctuation 2975
 
2.7%
Close Punctuation 2973
 
2.7%
Dash Punctuation 1151
 
1.0%
Other Punctuation 547
 
0.5%
Uppercase Letter 40
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4337
 
6.4%
4209
 
6.2%
3620
 
5.3%
3367
 
5.0%
2742
 
4.0%
2119
 
3.1%
1767
 
2.6%
1710
 
2.5%
1545
 
2.3%
1524
 
2.2%
Other values (444) 40907
60.3%
Uppercase Letter
ValueCountFrequency (%)
B 14
35.0%
A 7
17.5%
C 6
15.0%
E 3
 
7.5%
D 3
 
7.5%
L 2
 
5.0%
F 1
 
2.5%
R 1
 
2.5%
T 1
 
2.5%
N 1
 
2.5%
Decimal Number
ValueCountFrequency (%)
1 3744
21.2%
2 2416
13.7%
3 1997
11.3%
4 1729
9.8%
5 1566
8.9%
6 1395
 
7.9%
7 1303
 
7.4%
0 1244
 
7.0%
8 1170
 
6.6%
9 1099
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 525
96.0%
14
 
2.6%
. 7
 
1.3%
/ 1
 
0.2%
Space Separator
ValueCountFrequency (%)
17131
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2975
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2973
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1151
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67847
61.5%
Common 42440
38.5%
Latin 40
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4337
 
6.4%
4209
 
6.2%
3620
 
5.3%
3367
 
5.0%
2742
 
4.0%
2119
 
3.1%
1767
 
2.6%
1710
 
2.5%
1545
 
2.3%
1524
 
2.2%
Other values (444) 40907
60.3%
Common
ValueCountFrequency (%)
17131
40.4%
1 3744
 
8.8%
( 2975
 
7.0%
) 2973
 
7.0%
2 2416
 
5.7%
3 1997
 
4.7%
4 1729
 
4.1%
5 1566
 
3.7%
6 1395
 
3.3%
7 1303
 
3.1%
Other values (8) 5211
 
12.3%
Latin
ValueCountFrequency (%)
B 14
35.0%
A 7
17.5%
C 6
15.0%
E 3
 
7.5%
D 3
 
7.5%
L 2
 
5.0%
F 1
 
2.5%
R 1
 
2.5%
T 1
 
2.5%
N 1
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67847
61.5%
ASCII 42466
38.5%
None 14
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17131
40.3%
1 3744
 
8.8%
( 2975
 
7.0%
) 2973
 
7.0%
2 2416
 
5.7%
3 1997
 
4.7%
4 1729
 
4.1%
5 1566
 
3.7%
6 1395
 
3.3%
7 1303
 
3.1%
Other values (18) 5237
 
12.3%
Hangul
ValueCountFrequency (%)
4337
 
6.4%
4209
 
6.2%
3620
 
5.3%
3367
 
5.0%
2742
 
4.0%
2119
 
3.1%
1767
 
2.6%
1710
 
2.5%
1545
 
2.3%
1524
 
2.2%
Other values (444) 40907
60.3%
None
ValueCountFrequency (%)
14
100.0%

Missing values

2023-12-13T02:40:40.845703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:40:40.948031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호명정비구분연락처주소
0진모터스소형자동차종합정비업031-944-4486경기도 파주시 광탄면 장지산로200번길 47-48
1(bieung)마린모터스자동차전문정비업063-465-5119전라북도 군산시 비응도동 41번지 12호
2(뉴)글로벌모터스자동차종합정비업062-954-1992광주광역시 광산구 북문대로 599(수완동)
3(유) 신원1급자동차정비공업사자동차종합정비업061-337-6111전라남도 나주시 왕곡면 신원리 345-9, 346-1, 345-33
4(유)1급전북자동차정비공업사자동차종합정비업063-532-6841전라북도 정읍시 서부산업도로 595(하북동)
5(유)광목자동차종합정비업061-323-0055전라남도 함평군 학교면 죽정리 85번지
6(유)그린웨이 자동차공업사자동차종합정비업061-682-8113전라남도 여수시 봉계동 755번지
7(유)그린자동차공업사자동차종합정비업063-446-0033전라북도 군산시 조촌로 174, (조촌동)
8(유)금강자동차자동차전문정비업061-282-7988전라남도 목포시 영산로 715(석현동)
9(유)금강자동차공업사자동차종합정비업063-446-8866전라북도 군산시 내항2길 349(해망동)
상호명정비구분연락처주소
4684흑석카센타자동차전문정비업042-586-7410대전광역시 서구 벌곡로 631(매노동)
4685흥국정공(주)자동차종합정비업052-239-8787울산광역시 울주군 온산읍 덕신로 483
4686흥해종합정비자동차종합정비업054-262-9008경상북도 포항시 북구 흥해읍 동해대로 1934
4687흥화자동차공업사자동차종합정비업064-762-4531제주특별자치도 서귀포시 토평공단로 151(토평동)
4688히아브 만평특장제작연락처 미입력/휴대폰 입력으로 삭제(검색포털 검색 필요)대구광역시 달성군 현풍읍 현풍서로 114-1
4689히아브SBC제작연락처 미입력/휴대폰 입력으로 삭제(검색포털 검색 필요)경상남도 함안군 산인면 함마대로 2303
4690히아브특장 주식회사제작연락처 미입력/휴대폰 입력으로 삭제(검색포털 검색 필요)경상북도 경산시 와촌면 하양로 443
4691히아브한양자동차공업사자동차종합정비업연락처 미입력/휴대폰 입력으로 삭제(검색포털 검색 필요)경기도 양주시 광적면 백은로 485-39
4692히아트자동차전문정비업02-6264-6127서울특별시 서초구 마방로6길 14(양재동)
4693힐링카센타자동차전문정비업연락처 미입력/휴대폰 입력으로 삭제(검색포털 검색 필요)강원도 홍천군 홍천읍 홍천로 495, 104동

Duplicate rows

Most frequently occurring

상호명정비구분연락처주소# duplicates
0DS모터스포츠자동차전문정비업062-572-9733광주광역시 북구 양산택지로 138, 1층 103호104호(본촌동)2
1광주수양자동차공업사(주)자동차종합정비업연락처 미입력/휴대폰 입력으로 삭제(검색포털 검색 필요)경기도 광주시 곤지암읍 구수동길 14-182
2액티브양산종합검사정비자동차종합정비업055-375-3535경상남도 양산시 상북면 상북중앙로 412
3오토오아시스<NA>연락처 미입력/휴대폰 입력으로 삭제(검색포털 검색 필요)<NA>2
4주식회사안전기업목포지점자동차종합정비업연락처 미입력/휴대폰 입력으로 삭제(검색포털 검색 필요)전라남도 영암군 삼호읍 대불산단7로 2182