Overview

Dataset statistics

Number of variables: 5
Number of observations: 270
Missing cells: 48
Missing cells (%): 3.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 10.7 KiB
Average record size in memory: 40.5 B
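
The missing-cell percentage follows from the counts in this block. A minimal Python sketch of the arithmetic, assuming the percentage is the number of missing cells divided by the total number of cells (observations times variables); the variable names are illustrative:

```python
# Figures taken from the "Dataset statistics" block of this report.
n_variables = 5
n_observations = 270
missing_cells = 48

# Assumption: "Missing cells (%)" = missing cells / (observations * variables).
total_cells = n_observations * n_variables
missing_cells_pct = 100 * missing_cells / total_cells

# All 48 missing cells sit in one column, so that column's own rate is higher.
column_missing_pct = 100 * missing_cells / n_observations

print(f"{missing_cells_pct:.1f}%")   # dataset-wide share of missing cells
print(f"{column_missing_pct:.1f}%")  # share within the single affected column
```

The two results correspond to the 3.6% reported here and the 17.8% reported for the 소재지전화 column.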

Variable types

Categorical: 1
Text: 4

Dataset

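Description: 부산광역시연제구_단란주점현황_20221021 (Busan Metropolitan City, Yeonje-gu: status of danran jujeom, i.e. karaoke bars, 20221021)
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3082714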

Alerts

소재지전화 (location phone number) has 48 (17.8%) missing values [Missing]

Reproduction

Analysis started: 2023-12-10 16:51:08.160514
Analysis finished: 2023-12-10 16:51:09.162831
Duration: 1 second
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

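업종명 (business type)
Categorical

Distinct: 2
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

유흥주점영업 (entertainment bar business): 186
단란주점 (danran jujeom, a karaoke bar serving alcohol): 84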

Length

Max length: 6
Median length: 6
Mean length: 5.3777778
Min length: 4
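
The mean length can be cross-checked against the value counts for this variable. A small Python sketch, assuming length is counted in characters (code points), so 유흥주점영업 has 6 and 단란주점 has 4:

```python
# Value counts for 업종명 as reported in this section.
counts = {"유흥주점영업": 186, "단란주점": 84}

total_rows = sum(counts.values())
# Weighted mean of the string lengths; len() counts Unicode code points.
mean_length = sum(len(value) * n for value, n in counts.items()) / total_rows

print(total_rows)             # 270
print(round(mean_length, 7))  # 5.3777778
```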

Unique

Unique: 0
Unique (%): 0.0%
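
Distinct and Unique measure different things, which is why this column can have 2 distinct values and 0 unique ones. A hedged Python sketch, assuming "distinct" means the number of different values and "unique" means the number of values that occur exactly once; the helper name `distinct_and_unique` is illustrative and not part of ydata-profiling:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique): different values, and values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

# Rebuild the 업종명 column from its reported value counts.
column = ["유흥주점영업"] * 186 + ["단란주점"] * 84
print(distinct_and_unique(column))  # (2, 0)
```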

Sample

1st row: 유흥주점영업
2nd row: 유흥주점영업
3rd row: 유흥주점영업
4th row: 유흥주점영업
5th row: 유흥주점영업

Common Values

Value | Count | Frequency (%)
유흥주점영업 | 186 | 68.9%
단란주점 | 84 | 31.1%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of common values. 유흥주점영업: 186 (68.9%); 단란주점: 84 (31.1%)]

업소명 (business name)
Text

Distinct: 265
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

Length

Max length: 15
Median length: 11
Mean length: 5.4888889
Min length: 1

Characters and Unicode

Total characters: 1482
Distinct characters: 298
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
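
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories for two business names from this report. The `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative, and this is not necessarily how ydata-profiling computes its tables:

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Ll": "Lowercase Letter",
    "Nd": "Decimal Number", "Nl": "Letter Number", "Zs": "Space Separator",
    "Ps": "Open Punctuation", "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation", "Po": "Other Punctuation",
}

def category_counts(texts):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in texts:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

print(category_counts(["7080태평양", "술마시는 싱싱노래방"]))
```

Hangul syllables fall under "Other Letter", which is why that category dominates the text variables in this report.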

Unique

Unique: 260
Unique (%): 96.3%

Sample

1st row: 수정성인룸크럽
2nd row: 술마시는 싱싱노래방
3rd row: 7080태평양
4th row: 여궁 노래주점
5th row: 조아노래주점

Value | Count | Frequency (%)
노래방 | 11 | 3.3%
라이브 | 7 | 2.1%
노래주점 | 7 | 2.1%
술마시는 | 6 | 1.8%
단란주점 | 3 | 0.9%
7080 | 3 | 0.9%
노래타운 | 3 | 0.9%
바카스노래주점 | 2 | 0.6%
노래하는 | 2 | 0.6%
주점 | 2 | 0.6%
Other values (278) | 286 | 86.1%

Most occurring characters

ValueCountFrequency (%)
101
 
6.8%
98
 
6.6%
73
 
4.9%
70
 
4.7%
62
 
4.2%
35
 
2.4%
32
 
2.2%
31
 
2.1%
31
 
2.1%
30
 
2.0%
Other values (288) 919
62.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1364 | 92.0%
Space Separator | 62 | 4.2%
Decimal Number | 27 | 1.8%
Uppercase Letter | 10 | 0.7%
Close Punctuation | 7 | 0.5%
Open Punctuation | 7 | 0.5%
Lowercase Letter | 3 | 0.2%
Other Punctuation | 1 | 0.1%
Letter Number | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
101
 
7.4%
98
 
7.2%
73
 
5.4%
70
 
5.1%
35
 
2.6%
32
 
2.3%
31
 
2.3%
31
 
2.3%
30
 
2.2%
28
 
2.1%
Other values (267) 835
61.2%
Uppercase Letter
ValueCountFrequency (%)
K 2
20.0%
N 2
20.0%
J 2
20.0%
V 1
10.0%
U 1
10.0%
C 1
10.0%
O 1
10.0%
Decimal Number
ValueCountFrequency (%)
0 10
37.0%
8 5
18.5%
2 5
18.5%
7 5
18.5%
4 1
 
3.7%
1 1
 
3.7%
Lowercase Letter
ValueCountFrequency (%)
k 1
33.3%
i 1
33.3%
m 1
33.3%
Space Separator
ValueCountFrequency (%)
62
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Other Punctuation
ValueCountFrequency (%)
% 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1363 | 92.0%
Common | 104 | 7.0%
Latin | 14 | 0.9%
Han | 1 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
101
 
7.4%
98
 
7.2%
73
 
5.4%
70
 
5.1%
35
 
2.6%
32
 
2.3%
31
 
2.3%
31
 
2.3%
30
 
2.2%
28
 
2.1%
Other values (266) 834
61.2%
Latin
ValueCountFrequency (%)
K 2
14.3%
N 2
14.3%
J 2
14.3%
V 1
7.1%
U 1
7.1%
k 1
7.1%
i 1
7.1%
m 1
7.1%
C 1
7.1%
O 1
7.1%
Common
ValueCountFrequency (%)
62
59.6%
0 10
 
9.6%
) 7
 
6.7%
( 7
 
6.7%
8 5
 
4.8%
2 5
 
4.8%
7 5
 
4.8%
4 1
 
1.0%
1 1
 
1.0%
% 1
 
1.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1363 | 92.0%
ASCII | 117 | 7.9%
CJK | 1 | 0.1%
Number Forms | 1 | 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
101
 
7.4%
98
 
7.2%
73
 
5.4%
70
 
5.1%
35
 
2.6%
32
 
2.3%
31
 
2.3%
31
 
2.3%
30
 
2.2%
28
 
2.1%
Other values (266) 834
61.2%
ASCII
ValueCountFrequency (%)
62
53.0%
0 10
 
8.5%
) 7
 
6.0%
( 7
 
6.0%
8 5
 
4.3%
2 5
 
4.3%
7 5
 
4.3%
K 2
 
1.7%
N 2
 
1.7%
J 2
 
1.7%
Other values (10) 10
 
8.5%
CJK
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

소재지(도로명) (location, road-name address)
Text

Distinct: 218
Distinct (%): 80.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

Length

Max length: 44
Median length: 36
Mean length: 27.225926
Min length: 21

Characters and Unicode

Total characters: 7351
Distinct characters: 67
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 190
Unique (%): 70.4%

Sample

1st row: 부산광역시 연제구 중앙대로 1116-9 (연산동)
2nd row: 부산광역시 연제구 반송로 13-10 (연산동)
3rd row: 부산광역시 연제구 반송로 13-8 (연산동)
4th row: 부산광역시 연제구 거제천로230번길 98 (연산동)
5th row: 부산광역시 연제구 중앙대로 1116-11 (연산동,연산4동)

Value | Count | Frequency (%)
부산광역시 | 270 | 19.2%
연제구 | 270 | 19.2%
연산동 | 229 | 16.3%
반송로 | 52 | 3.7%
월드컵대로 | 40 | 2.8%
중앙대로1120번길 | 30 | 2.1%
과정로 | 26 | 1.8%
고분로 | 23 | 1.6%
고분로13번길 | 21 | 1.5%
중앙대로 | 20 | 1.4%
Other values (206) | 426 | 30.3%

Most occurring characters

ValueCountFrequency (%)
1137
 
15.5%
544
 
7.4%
539
 
7.3%
1 397
 
5.4%
( 287
 
3.9%
) 287
 
3.9%
287
 
3.9%
276
 
3.8%
276
 
3.8%
272
 
3.7%
Other values (57) 3049
41.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4367 | 59.4%
Space Separator | 1137 | 15.5%
Decimal Number | 1100 | 15.0%
Open Punctuation | 287 | 3.9%
Close Punctuation | 287 | 3.9%
Dash Punctuation | 88 | 1.2%
Other Punctuation | 76 | 1.0%
Uppercase Letter | 9 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
544
12.5%
539
12.3%
287
 
6.6%
276
 
6.3%
276
 
6.3%
272
 
6.2%
270
 
6.2%
270
 
6.2%
270
 
6.2%
270
 
6.2%
Other values (38) 1093
25.0%
Decimal Number
ValueCountFrequency (%)
1 397
36.1%
2 151
 
13.7%
4 100
 
9.1%
3 99
 
9.0%
0 72
 
6.5%
6 68
 
6.2%
5 68
 
6.2%
8 58
 
5.3%
7 45
 
4.1%
9 42
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
C 4
44.4%
B 2
22.2%
T 2
22.2%
M 1
 
11.1%
Space Separator
ValueCountFrequency (%)
1137
100.0%
Open Punctuation
ValueCountFrequency (%)
( 287
100.0%
Close Punctuation
ValueCountFrequency (%)
) 287
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 88
100.0%
Other Punctuation
ValueCountFrequency (%)
, 76
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4367 | 59.4%
Common | 2975 | 40.5%
Latin | 9 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
544
12.5%
539
12.3%
287
 
6.6%
276
 
6.3%
276
 
6.3%
272
 
6.2%
270
 
6.2%
270
 
6.2%
270
 
6.2%
270
 
6.2%
Other values (38) 1093
25.0%
Common
ValueCountFrequency (%)
1137
38.2%
1 397
 
13.3%
( 287
 
9.6%
) 287
 
9.6%
2 151
 
5.1%
4 100
 
3.4%
3 99
 
3.3%
- 88
 
3.0%
, 76
 
2.6%
0 72
 
2.4%
Other values (5) 281
 
9.4%
Latin
ValueCountFrequency (%)
C 4
44.4%
B 2
22.2%
T 2
22.2%
M 1
 
11.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4367 | 59.4%
ASCII | 2984 | 40.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1137
38.1%
1 397
 
13.3%
( 287
 
9.6%
) 287
 
9.6%
2 151
 
5.1%
4 100
 
3.4%
3 99
 
3.3%
- 88
 
2.9%
, 76
 
2.5%
0 72
 
2.4%
Other values (9) 290
 
9.7%
Hangul
ValueCountFrequency (%)
544
12.5%
539
12.3%
287
 
6.6%
276
 
6.3%
276
 
6.3%
272
 
6.2%
270
 
6.2%
270
 
6.2%
270
 
6.2%
270
 
6.2%
Other values (38) 1093
25.0%

소재지(지번) (location, lot-number address)
Text

Distinct: 207
Distinct (%): 76.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

Length

Max length: 32
Median length: 31
Mean length: 22.07037
Min length: 20

Characters and Unicode

Total characters: 5959
Distinct characters: 49
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 175
Unique (%): 64.8%

Sample

1st row: 부산광역시 연제구 연산동 724-6
2nd row: 부산광역시 연제구 연산동 723-14
3rd row: 부산광역시 연제구 연산동 723-16
4th row: 부산광역시 연제구 연산동 723-2
5th row: 부산광역시 연제구 연산동 724-7 연산4동

Value | Count | Frequency (%)
부산광역시 | 270 | 23.4%
연제구 | 270 | 23.4%
연산동 | 266 | 23.1%
지하1층 | 10 | 0.9%
1127-5 | 9 | 0.8%
1127-6 | 9 | 0.8%
2층 | 8 | 0.7%
730-20 | 6 | 0.5%
603-4 | 6 | 0.5%
3층 | 6 | 0.5%
Other values (201) | 292 | 25.3%

Most occurring characters

ValueCountFrequency (%)
1151
19.3%
539
 
9.0%
538
 
9.0%
1 284
 
4.8%
277
 
4.6%
275
 
4.6%
274
 
4.6%
270
 
4.5%
270
 
4.5%
270
 
4.5%
Other values (39) 1811
30.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3126 | 52.5%
Decimal Number | 1347 | 22.6%
Space Separator | 1151 | 19.3%
Dash Punctuation | 270 | 4.5%
Open Punctuation | 28 | 0.5%
Close Punctuation | 28 | 0.5%
Uppercase Letter | 8 | 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
539
17.2%
538
17.2%
277
8.9%
275
8.8%
274
8.8%
270
8.6%
270
8.6%
270
8.6%
270
8.6%
50
 
1.6%
Other values (20) 93
 
3.0%
Decimal Number
ValueCountFrequency (%)
1 284
21.1%
2 217
16.1%
7 212
15.7%
3 148
11.0%
0 117
8.7%
6 103
 
7.6%
4 94
 
7.0%
5 71
 
5.3%
9 63
 
4.7%
8 38
 
2.8%
Uppercase Letter
ValueCountFrequency (%)
C 4
50.0%
T 2
25.0%
B 1
 
12.5%
M 1
 
12.5%
Space Separator
ValueCountFrequency (%)
1151
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 270
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3126 | 52.5%
Common | 2825 | 47.4%
Latin | 8 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
539
17.2%
538
17.2%
277
8.9%
275
8.8%
274
8.8%
270
8.6%
270
8.6%
270
8.6%
270
8.6%
50
 
1.6%
Other values (20) 93
 
3.0%
Common
ValueCountFrequency (%)
1151
40.7%
1 284
 
10.1%
- 270
 
9.6%
2 217
 
7.7%
7 212
 
7.5%
3 148
 
5.2%
0 117
 
4.1%
6 103
 
3.6%
4 94
 
3.3%
5 71
 
2.5%
Other values (5) 158
 
5.6%
Latin
ValueCountFrequency (%)
C 4
50.0%
T 2
25.0%
B 1
 
12.5%
M 1
 
12.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3126 | 52.5%
ASCII | 2833 | 47.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1151
40.6%
1 284
 
10.0%
- 270
 
9.5%
2 217
 
7.7%
7 212
 
7.5%
3 148
 
5.2%
0 117
 
4.1%
6 103
 
3.6%
4 94
 
3.3%
5 71
 
2.5%
Other values (9) 166
 
5.9%
Hangul
ValueCountFrequency (%)
539
17.2%
538
17.2%
277
8.9%
275
8.8%
274
8.8%
270
8.6%
270
8.6%
270
8.6%
270
8.6%
50
 
1.6%
Other values (20) 93
 
3.0%

소재지전화 (location phone number)
Text

MISSING

Distinct: 218
Distinct (%): 98.2%
Missing: 48
Missing (%): 17.8%
Memory size: 2.2 KiB

Length

Max length: 14
Median length: 14
Mean length: 14
Min length: 14

Characters and Unicode

Total characters: 3108
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 214
Unique (%): 96.4%

Sample

1st row: "051 -853 -0666"
2nd row: " 051- 868-5587"
3rd row: " 051- 868-6466"
4th row: " 051- 867-8877"
5th row: "051 -868 -6585"
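
Every stored value is 14 characters long, yet the spaces fall in different places (`051 -853 -0666` versus ` 051- 868-5587`). If a consistent form is needed downstream, one option is to drop the whitespace. A minimal sketch; `normalize_phone` is a hypothetical helper, and representing missing values as `None` is an assumption:

```python
import re

def normalize_phone(raw):
    """Remove all whitespace from a phone string; pass missing values through."""
    if raw is None:
        return None
    return re.sub(r"\s+", "", raw)

samples = ["051 -853 -0666", " 051- 868-5587", None]
print([normalize_phone(s) for s in samples])
# ['051-853-0666', '051-868-5587', None]
```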

Value | Count | Frequency (%)
051 | 218 | 41.7%
852 | 13 | 2.5%
853 | 10 | 1.9%
867 | 10 | 1.9%
851 | 7 | 1.3%
865 | 7 | 1.3%
868 | 6 | 1.1%
864 | 5 | 1.0%
861 | 5 | 1.0%
866 | 5 | 1.0%
Other values (227) | 237 | 45.3%

Most occurring characters

Value | Count | Frequency (%)
(space) | 457 | 14.7%
- | 444 | 14.3%
5 | 413 | 13.3%
0 | 339 | 10.9%
1 | 337 | 10.8%
8 | 327 | 10.5%
6 | 233 | 7.5%
7 | 142 | 4.6%
2 | 122 | 3.9%
3 | 119 | 3.8%
Other values (2) | 175 | 5.6%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2207 | 71.0%
Space Separator | 457 | 14.7%
Dash Punctuation | 444 | 14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 413
18.7%
0 339
15.4%
1 337
15.3%
8 327
14.8%
6 233
10.6%
7 142
 
6.4%
2 122
 
5.5%
3 119
 
5.4%
4 90
 
4.1%
9 85
 
3.9%
Space Separator
ValueCountFrequency (%)
457
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 444
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3108
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
457
14.7%
- 444
14.3%
5 413
13.3%
0 339
10.9%
1 337
10.8%
8 327
10.5%
6 233
7.5%
7 142
 
4.6%
2 122
 
3.9%
3 119
 
3.8%
Other values (2) 175
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3108
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
457
14.7%
- 444
14.3%
5 413
13.3%
0 339
10.9%
1 337
10.8%
8 327
10.5%
6 233
7.5%
7 142
 
4.6%
2 122
 
3.9%
3 119
 
3.8%
Other values (2) 175
 
5.6%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
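
The same information can be summarised numerically. The sketch below counts missing values per column for a few rows shaped like the ones in this report, using `None` to stand in for `<NA>`. The row literals and the `missing_per_column` helper are illustrative:

```python
COLUMNS = ["업종명", "업소명", "소재지전화"]

# Three rows copied from the report's sample (address columns omitted for brevity).
rows = [
    ("유흥주점영업", "카네기 실내포장", None),
    ("유흥주점영업", "올리브", "051 -758 -9491"),
    ("단란주점", "미림", None),
]

def missing_per_column(rows, columns):
    """Return {column: number of None cells} for a list of row tuples."""
    return {
        name: sum(1 for row in rows if row[i] is None)
        for i, name in enumerate(columns)
    }

print(missing_per_column(rows, COLUMNS))
# {'업종명': 0, '업소명': 0, '소재지전화': 2}
```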

Sample

(index) | 업종명 | 업소명 | 소재지(도로명) | 소재지(지번) | 소재지전화
0 | 유흥주점영업 | 수정성인룸크럽 | 부산광역시 연제구 중앙대로 1116-9 (연산동) | 부산광역시 연제구 연산동 724-6 | 051 -853 -0666
1 | 유흥주점영업 | 술마시는 싱싱노래방 | 부산광역시 연제구 반송로 13-10 (연산동) | 부산광역시 연제구 연산동 723-14 | 051- 868-5587
2 | 유흥주점영업 | 7080태평양 | 부산광역시 연제구 반송로 13-8 (연산동) | 부산광역시 연제구 연산동 723-16 | 051- 868-6466
3 | 유흥주점영업 | 여궁 노래주점 | 부산광역시 연제구 거제천로230번길 98 (연산동) | 부산광역시 연제구 연산동 723-2 | 051- 867-8877
4 | 유흥주점영업 | 조아노래주점 | 부산광역시 연제구 중앙대로 1116-11 (연산동,연산4동) | 부산광역시 연제구 연산동 724-7 연산4동 | 051 -868 -6585
5 | 유흥주점영업 | 메리트 | 부산광역시 연제구 중앙대로1120번길 13 (연산동) | 부산광역시 연제구 연산동 675-16 | 051- 852-1254
6 | 유흥주점영업 | 술마시는 도화 노래방 | 부산광역시 연제구 반송로 16 (연산동) | 부산광역시 연제구 연산동 728-1 | 051- 864-4375
7 | 유흥주점영업 | 카네기 실내포장 | 부산광역시 연제구 중앙대로1120번길 14-6 (연산동) | 부산광역시 연제구 연산동 724-9 | <NA>
8 | 유흥주점영업 | 올리브 | 부산광역시 연제구 과정로 156 (연산동) | 부산광역시 연제구 연산동 478-7 | 051 -758 -9491
9 | 유흥주점영업 | 초콜릿 | 부산광역시 연제구 고분로13번길 5-20 (연산동) | 부산광역시 연제구 연산동 603-11 | 051 -865 -5200

(index) | 업종명 | 업소명 | 소재지(도로명) | 소재지(지번) | 소재지전화
260 | 단란주점 | 미림 | 부산광역시 연제구 고분로 12, 2층 (연산동) | 부산광역시 연제구 연산동 731-2 | <NA>
261 | 단란주점 | 발리노래방 단란주점 | 부산광역시 연제구 고분로13번길 43, 4층 (연산동) | 부산광역시 연제구 연산동 590-106 | 051 -862 -1900
262 | 단란주점 | 목화라이브클럽 | 부산광역시 연제구 거제천로182번길 50, 3층 (연산동) | 부산광역시 연제구 연산동 1127-2 | 051 -853 -5053
263 | 단란주점 | 샤인소맥클럽 단란주점 | 부산광역시 연제구 반송로 32-15, 2층 (연산동) | 부산광역시 연제구 연산동 590-12 | <NA>
264 | 단란주점 | 나도가수다 | 부산광역시 연제구 반송로 9-1, 3층 (연산동) | 부산광역시 연제구 연산동 726-5 | 051 -852 -5789
265 | 단란주점 | 캡틴원탁가라오케 | 부산광역시 연제구 중앙대로1120번길 8, 5층 (연산동) | 부산광역시 연제구 연산동 724-2 | <NA>
266 | 단란주점 | 퀸라이브 7080 | 부산광역시 연제구 반송로 13-8, 3층 (연산동) | 부산광역시 연제구 연산동 723-16 | <NA>
267 | 단란주점 | 신데렐라 | 부산광역시 연제구 고분로13번길 43, 3층 일부호 (연산동) | 부산광역시 연제구 연산동 590-106 | <NA>
268 | 단란주점 | U턴 원탁가라오케 | 부산광역시 연제구 고분로13번길 11, 2층 (연산동) | 부산광역시 연제구 연산동 603-29 | <NA>
269 | 단란주점 | 소금창고 | 부산광역시 연제구 과정로 166, 지하1층 (연산동) | 부산광역시 연제구 연산동 469-16 지하1층 | <NA>