Overview

Dataset statistics

Number of variables4
Number of observations84
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.8 KiB
Average record size in memory33.6 B

Variable types

Text2
Categorical1
DateTime1

Dataset

Description부산광역시_중구_석면조사대상건축물현황_20210726
Author부산광역시 중구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15026377

Alerts

건물명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:17:38.670372
Analysis finished2023-12-10 17:17:39.753674
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

건물명
Text

UNIQUE 

Distinct84
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size804.0 B
2023-12-11T02:17:40.150720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length15
Mean length8.4404762
Min length4

Characters and Unicode

Total characters709
Distinct characters182
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique84 ?
Unique (%)100.0%

Sample

1st row영주어린이집
2nd row한진중공업 R
3rd row국도타운
4th row롯데백화점 광복점
5th row한국전기안전공사부산울산지역본부
ValueCountFrequency (%)
주차장 4
 
3.4%
본관 3
 
2.5%
부산세관 2
 
1.7%
부산차량사업소 2
 
1.7%
중구청 2
 
1.7%
신동아수산물종합시장 2
 
1.7%
별관 2
 
1.7%
자갈치시장 2
 
1.7%
광복점 2
 
1.7%
롯데백화점 2
 
1.7%
Other values (94) 95
80.5%
2023-12-11T02:17:41.014022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
4.8%
32
 
4.5%
30
 
4.2%
21
 
3.0%
17
 
2.4%
16
 
2.3%
15
 
2.1%
15
 
2.1%
13
 
1.8%
13
 
1.8%
Other values (172) 503
70.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 640
90.3%
Space Separator 34
 
4.8%
Uppercase Letter 17
 
2.4%
Decimal Number 7
 
1.0%
Open Punctuation 5
 
0.7%
Close Punctuation 5
 
0.7%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
5.0%
30
 
4.7%
21
 
3.3%
17
 
2.7%
16
 
2.5%
15
 
2.3%
15
 
2.3%
13
 
2.0%
13
 
2.0%
11
 
1.7%
Other values (156) 457
71.4%
Uppercase Letter
ValueCountFrequency (%)
B 3
17.6%
K 2
11.8%
P 2
11.8%
C 2
11.8%
S 2
11.8%
I 2
11.8%
M 1
 
5.9%
Y 1
 
5.9%
R 1
 
5.9%
E 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
2 4
57.1%
1 3
42.9%
Space Separator
ValueCountFrequency (%)
34
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 640
90.3%
Common 52
 
7.3%
Latin 17
 
2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
5.0%
30
 
4.7%
21
 
3.3%
17
 
2.7%
16
 
2.5%
15
 
2.3%
15
 
2.3%
13
 
2.0%
13
 
2.0%
11
 
1.7%
Other values (156) 457
71.4%
Latin
ValueCountFrequency (%)
B 3
17.6%
K 2
11.8%
P 2
11.8%
C 2
11.8%
S 2
11.8%
I 2
11.8%
M 1
 
5.9%
Y 1
 
5.9%
R 1
 
5.9%
E 1
 
5.9%
Common
ValueCountFrequency (%)
34
65.4%
( 5
 
9.6%
) 5
 
9.6%
2 4
 
7.7%
1 3
 
5.8%
, 1
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 640
90.3%
ASCII 69
 
9.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
34
49.3%
( 5
 
7.2%
) 5
 
7.2%
2 4
 
5.8%
1 3
 
4.3%
B 3
 
4.3%
K 2
 
2.9%
P 2
 
2.9%
C 2
 
2.9%
S 2
 
2.9%
Other values (6) 7
 
10.1%
Hangul
ValueCountFrequency (%)
32
 
5.0%
30
 
4.7%
21
 
3.3%
17
 
2.7%
16
 
2.5%
15
 
2.3%
15
 
2.3%
13
 
2.0%
13
 
2.0%
11
 
1.7%
Other values (156) 457
71.4%
Distinct77
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Memory size804.0 B
2023-12-11T02:17:41.494547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length33
Mean length23.75
Min length16

Characters and Unicode

Total characters1995
Distinct characters118
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)84.5%

Sample

1st row부산광역시 중구 망양로 396-0 (동아아파트)
2nd row부산광역시 중구 충장대로 6 (중앙동4가)
3rd row부산광역시 중구 비프광장로 18 (남포동6가)
4th row부산광역시 중구 중앙대로 2-0 (롯데백화점광복점)
5th row부산광역시 중구 해관로 27-0
ValueCountFrequency (%)
부산광역시 84
21.5%
중구 84
21.5%
중앙대로 9
 
2.3%
충장대로 7
 
1.8%
중구청 6
 
1.5%
중구로 5
 
1.3%
주민센터 5
 
1.3%
구덕로 4
 
1.0%
자갈치로 4
 
1.0%
충장대로13번길 3
 
0.8%
Other values (142) 179
45.9%
2023-12-11T02:17:42.231410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
309
 
15.5%
116
 
5.8%
104
 
5.2%
101
 
5.1%
98
 
4.9%
98
 
4.9%
88
 
4.4%
0 86
 
4.3%
86
 
4.3%
- 80
 
4.0%
Other values (108) 829
41.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1217
61.0%
Space Separator 309
 
15.5%
Decimal Number 305
 
15.3%
Dash Punctuation 80
 
4.0%
Open Punctuation 39
 
2.0%
Close Punctuation 39
 
2.0%
Uppercase Letter 6
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
116
 
9.5%
104
 
8.5%
101
 
8.3%
98
 
8.1%
98
 
8.1%
88
 
7.2%
86
 
7.1%
77
 
6.3%
34
 
2.8%
33
 
2.7%
Other values (91) 382
31.4%
Decimal Number
ValueCountFrequency (%)
0 86
28.2%
1 45
14.8%
3 33
 
10.8%
2 32
 
10.5%
4 24
 
7.9%
5 22
 
7.2%
9 22
 
7.2%
6 15
 
4.9%
7 14
 
4.6%
8 12
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
X 2
33.3%
T 2
33.3%
S 2
33.3%
Space Separator
ValueCountFrequency (%)
309
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%
Open Punctuation
ValueCountFrequency (%)
( 39
100.0%
Close Punctuation
ValueCountFrequency (%)
) 39
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1217
61.0%
Common 772
38.7%
Latin 6
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
116
 
9.5%
104
 
8.5%
101
 
8.3%
98
 
8.1%
98
 
8.1%
88
 
7.2%
86
 
7.1%
77
 
6.3%
34
 
2.8%
33
 
2.7%
Other values (91) 382
31.4%
Common
ValueCountFrequency (%)
309
40.0%
0 86
 
11.1%
- 80
 
10.4%
1 45
 
5.8%
( 39
 
5.1%
) 39
 
5.1%
3 33
 
4.3%
2 32
 
4.1%
4 24
 
3.1%
5 22
 
2.8%
Other values (4) 63
 
8.2%
Latin
ValueCountFrequency (%)
X 2
33.3%
T 2
33.3%
S 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1217
61.0%
ASCII 778
39.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
309
39.7%
0 86
 
11.1%
- 80
 
10.3%
1 45
 
5.8%
( 39
 
5.0%
) 39
 
5.0%
3 33
 
4.2%
2 32
 
4.1%
4 24
 
3.1%
5 22
 
2.8%
Other values (7) 69
 
8.9%
Hangul
ValueCountFrequency (%)
116
 
9.5%
104
 
8.5%
101
 
8.3%
98
 
8.1%
98
 
8.1%
88
 
7.2%
86
 
7.1%
77
 
6.3%
34
 
2.8%
33
 
2.7%
Other values (91) 382
31.4%

건물용도
Categorical

Distinct12
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size804.0 B
업무시설
26 
판매시설
15 
자동차관련시설
11 
의료시설
노유자시설
Other values (7)
19 

Length

Max length10
Median length4
Mean length5.3333333
Min length4

Unique

Unique2 ?
Unique (%)2.4%

Sample

1st row노유자시설
2nd row자동차관련시설
3rd row판매시설
4th row판매시설
5th row업무시설

Common Values

ValueCountFrequency (%)
업무시설 26
31.0%
판매시설 15
17.9%
자동차관련시설 11
13.1%
의료시설 7
 
8.3%
노유자시설 6
 
7.1%
제1종 근린생활시설 6
 
7.1%
제2종 근린생활시설 3
 
3.6%
문화 및 집회시설 3
 
3.6%
운수시설 3
 
3.6%
교육연구시설 2
 
2.4%
Other values (2) 2
 
2.4%

Length

2023-12-11T02:17:42.501256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
업무시설 26
26.3%
판매시설 15
15.2%
자동차관련시설 11
11.1%
근린생활시설 9
 
9.1%
의료시설 7
 
7.1%
노유자시설 6
 
6.1%
제1종 6
 
6.1%
제2종 3
 
3.0%
문화 3
 
3.0%
3
 
3.0%
Other values (5) 10
 
10.1%
Distinct62
Distinct (%)73.8%
Missing0
Missing (%)0.0%
Memory size804.0 B
Minimum2012-06-28 00:00:00
Maximum2019-09-27 00:00:00
2023-12-11T02:17:42.708409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:17:42.941707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Correlations

2023-12-11T02:17:43.098375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
건물명도로명건물용도석면조사일
건물명1.0001.0001.0001.000
도로명1.0001.0000.9940.999
건물용도1.0000.9941.0000.917
석면조사일1.0000.9990.9171.000

Missing values

2023-12-11T02:17:39.469066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:17:39.674285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

건물명도로명건물용도석면조사일
0영주어린이집부산광역시 중구 망양로 396-0 (동아아파트)노유자시설2012-07-04
1한진중공업 R부산광역시 중구 충장대로 6 (중앙동4가)자동차관련시설2015-03-18
2국도타운부산광역시 중구 비프광장로 18 (남포동6가)판매시설2015-01-08
3롯데백화점 광복점부산광역시 중구 중앙대로 2-0 (롯데백화점광복점)판매시설2014-10-02
4한국전기안전공사부산울산지역본부부산광역시 중구 해관로 27-0업무시설2014-06-19
52층 게임랜드부산광역시 중구 구덕로84번길 5-0제2종 근린생활시설2014-06-12
6더락PC카페부산광역시 중구 광복로 38-0 (동아데파트)판매시설2014-05-30
7신창요양병원부산광역시 중구 중앙대로 55-0의료시설2014-05-29
8보수종합시장부산광역시 중구 보수대로 94-0판매시설2014-05-28
9IBK기업은행 부평동지점부산광역시 중구 흑교로 5-0업무시설2014-05-23
건물명도로명건물용도석면조사일
74국제지하도상가부산광역시 중구 중구로 31-0 (국제지하도상가)판매시설2012-11-22
75민주공원부산광역시 중구 민주공원길 19-0문화 및 집회시설2012-11-22
76대청어린이집부산광역시 중구 망양로383번안길 25-0노유자시설2012-09-19
77중구종합사회복지관부산광역시 중구 망양로 309-0노유자시설2012-08-23
78남포지하도상가부산광역시 중구 구덕로 44-0 (남포지하도상가)판매시설2012-06-28
79광복지하도상가부산광역시 중구 중앙대로 17-0 (광복지하도상가)판매시설2012-06-28
80롯데백화점 광복점 실내주차장부산광역시 중구 중앙대로 2-0 (롯데백화점광복점)자동차관련시설2014-10-02
81부산세관 별관부산광역시 중구 대청로155번길 6-0업무시설2013-06-18
82신동아수산물종합시장 주차장부산광역시 중구 자갈치로 42-0자동차관련시설2014-04-24
83아기산새어린이집부산광역시 중구 임소길 24 (영주동)노유자시설2019-09-27