Overview

Dataset statistics

Number of variables6
Number of observations152
Missing cells1
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.4 KiB
Average record size in memory49.9 B

Variable types

Categorical3
Text2
Numeric1

Dataset

Description부산광역시남구민방위시설현황_20210914
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3081613

Alerts

시설형태 is highly overall correlated with 시설구분High correlation
시설구분 is highly overall correlated with 시설형태High correlation

Reproduction

Analysis started2023-12-10 16:28:22.932776
Analysis finished2023-12-10 16:28:23.815136
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시설구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
대피시설
115 
급수시설
37 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대피시설
2nd row대피시설
3rd row대피시설
4th row대피시설
5th row대피시설

Common Values

ValueCountFrequency (%)
대피시설 115
75.7%
급수시설 37
 
24.3%

Length

2023-12-11T01:28:23.924848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:28:24.083890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대피시설 115
75.7%
급수시설 37
 
24.3%

동명
Categorical

Distinct17
Distinct (%)11.2%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
대연3동
16 
우암동
16 
용호3동
13 
감만1동
12 
용호1동
12 
Other values (12)
83 

Length

Max length4
Median length4
Mean length3.8486842
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대연1동
2nd row대연1동
3rd row대연1동
4th row대연1동
5th row대연1동

Common Values

ValueCountFrequency (%)
대연3동 16
10.5%
우암동 16
10.5%
용호3동 13
 
8.6%
감만1동 12
 
7.9%
용호1동 12
 
7.9%
대연5동 11
 
7.2%
용호2동 9
 
5.9%
문현1동 9
 
5.9%
대연4동 9
 
5.9%
대연1동 9
 
5.9%
Other values (7) 36
23.7%

Length

2023-12-11T01:28:24.256903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
대연3동 16
10.5%
우암동 16
10.5%
용호3동 13
 
8.6%
감만1동 12
 
7.9%
용호1동 12
 
7.9%
대연5동 11
 
7.2%
대연1동 9
 
5.9%
대연4동 9
 
5.9%
문현1동 9
 
5.9%
용호2동 9
 
5.9%
Other values (7) 36
23.7%
Distinct148
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-11T01:28:24.718934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length18
Mean length8.3355263
Min length3

Characters and Unicode

Total characters1267
Distinct characters206
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique144 ?
Unique (%)94.7%

Sample

1st row부산은행대연동지점
2nd row롯데슈퍼대연동점
3rd row하이마트
4th row법화빌딩
5th rowGS25시 편의점
ValueCountFrequency (%)
지하철 6
 
2.9%
엘지메트로시티 6
 
2.9%
4라인 4
 
1.9%
3 4
 
1.9%
자유2차아파트 3
 
1.4%
1 3
 
1.4%
삼성아파트 3
 
1.4%
자유1차아파트 3
 
1.4%
106동 2
 
1.0%
105동 2
 
1.0%
Other values (161) 171
82.6%
2023-12-11T01:28:25.379697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
77
 
6.1%
69
 
5.4%
67
 
5.3%
55
 
4.3%
55
 
4.3%
1 52
 
4.1%
46
 
3.6%
30
 
2.4%
22
 
1.7%
0 20
 
1.6%
Other values (196) 774
61.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1068
84.3%
Decimal Number 130
 
10.3%
Space Separator 55
 
4.3%
Open Punctuation 5
 
0.4%
Close Punctuation 5
 
0.4%
Uppercase Letter 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
77
 
7.2%
69
 
6.5%
67
 
6.3%
55
 
5.1%
46
 
4.3%
30
 
2.8%
22
 
2.1%
19
 
1.8%
18
 
1.7%
16
 
1.5%
Other values (180) 649
60.8%
Decimal Number
ValueCountFrequency (%)
1 52
40.0%
0 20
 
15.4%
2 19
 
14.6%
4 14
 
10.8%
3 9
 
6.9%
5 7
 
5.4%
9 3
 
2.3%
6 3
 
2.3%
8 2
 
1.5%
7 1
 
0.8%
Uppercase Letter
ValueCountFrequency (%)
S 2
50.0%
G 1
25.0%
K 1
25.0%
Space Separator
ValueCountFrequency (%)
55
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1068
84.3%
Common 195
 
15.4%
Latin 4
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
77
 
7.2%
69
 
6.5%
67
 
6.3%
55
 
5.1%
46
 
4.3%
30
 
2.8%
22
 
2.1%
19
 
1.8%
18
 
1.7%
16
 
1.5%
Other values (180) 649
60.8%
Common
ValueCountFrequency (%)
55
28.2%
1 52
26.7%
0 20
 
10.3%
2 19
 
9.7%
4 14
 
7.2%
3 9
 
4.6%
5 7
 
3.6%
( 5
 
2.6%
) 5
 
2.6%
9 3
 
1.5%
Other values (3) 6
 
3.1%
Latin
ValueCountFrequency (%)
S 2
50.0%
G 1
25.0%
K 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1068
84.3%
ASCII 199
 
15.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
77
 
7.2%
69
 
6.5%
67
 
6.3%
55
 
5.1%
46
 
4.3%
30
 
2.8%
22
 
2.1%
19
 
1.8%
18
 
1.7%
16
 
1.5%
Other values (180) 649
60.8%
ASCII
ValueCountFrequency (%)
55
27.6%
1 52
26.1%
0 20
 
10.1%
2 19
 
9.5%
4 14
 
7.0%
3 9
 
4.5%
5 7
 
3.5%
( 5
 
2.5%
) 5
 
2.5%
9 3
 
1.5%
Other values (6) 10
 
5.0%
Distinct140
Distinct (%)92.7%
Missing1
Missing (%)0.7%
Memory size1.3 KiB
2023-12-11T01:28:25.767829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length38
Mean length27.119205
Min length6

Characters and Unicode

Total characters4095
Distinct characters186
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)89.4%

Sample

1st row부산광역시 남구 수영로 234 (대연동, 부산은행대연동지점)
2nd row부산광역시 남구 천제등로 11 (대연동, 동성하이타운)
3rd row부산광역시 남구 수영로 184 (대연동, 하이마트)
4th row부산광역시 남구 수영로 190 (대연동)
5th row부산광역시 남구 수영로 244 (대연동, 남천빌딩)
ValueCountFrequency (%)
부산광역시 114
 
15.0%
남구 114
 
15.0%
대연동 38
 
5.0%
용호동 27
 
3.5%
우암동 15
 
2.0%
감만동 15
 
2.0%
문현동 15
 
2.0%
수영로 14
 
1.8%
지하 10
 
1.3%
동명로 10
 
1.3%
Other values (273) 390
51.2%
2023-12-11T01:28:26.477451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
611
 
14.9%
170
 
4.2%
157
 
3.8%
129
 
3.2%
1 126
 
3.1%
122
 
3.0%
120
 
2.9%
120
 
2.9%
120
 
2.9%
( 118
 
2.9%
Other values (176) 2302
56.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2528
61.7%
Space Separator 611
 
14.9%
Decimal Number 590
 
14.4%
Open Punctuation 118
 
2.9%
Close Punctuation 118
 
2.9%
Other Punctuation 105
 
2.6%
Dash Punctuation 25
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
170
 
6.7%
157
 
6.2%
129
 
5.1%
122
 
4.8%
120
 
4.7%
120
 
4.7%
120
 
4.7%
117
 
4.6%
117
 
4.6%
92
 
3.6%
Other values (160) 1264
50.0%
Decimal Number
ValueCountFrequency (%)
1 126
21.4%
2 95
16.1%
3 66
11.2%
5 53
9.0%
9 52
8.8%
4 49
 
8.3%
6 48
 
8.1%
7 38
 
6.4%
0 36
 
6.1%
8 27
 
4.6%
Other Punctuation
ValueCountFrequency (%)
, 104
99.0%
. 1
 
1.0%
Space Separator
ValueCountFrequency (%)
611
100.0%
Open Punctuation
ValueCountFrequency (%)
( 118
100.0%
Close Punctuation
ValueCountFrequency (%)
) 118
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2528
61.7%
Common 1567
38.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
170
 
6.7%
157
 
6.2%
129
 
5.1%
122
 
4.8%
120
 
4.7%
120
 
4.7%
120
 
4.7%
117
 
4.6%
117
 
4.6%
92
 
3.6%
Other values (160) 1264
50.0%
Common
ValueCountFrequency (%)
611
39.0%
1 126
 
8.0%
( 118
 
7.5%
) 118
 
7.5%
, 104
 
6.6%
2 95
 
6.1%
3 66
 
4.2%
5 53
 
3.4%
9 52
 
3.3%
4 49
 
3.1%
Other values (6) 175
 
11.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2528
61.7%
ASCII 1567
38.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
611
39.0%
1 126
 
8.0%
( 118
 
7.5%
) 118
 
7.5%
, 104
 
6.6%
2 95
 
6.1%
3 66
 
4.2%
5 53
 
3.4%
9 52
 
3.3%
4 49
 
3.1%
Other values (6) 175
 
11.2%
Hangul
ValueCountFrequency (%)
170
 
6.7%
157
 
6.2%
129
 
5.1%
122
 
4.8%
120
 
4.7%
120
 
4.7%
120
 
4.7%
117
 
4.6%
117
 
4.6%
92
 
3.6%
Other values (160) 1264
50.0%

시설형태
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
지하시설
115 
생활용수
23 
음 용 수
14 

Length

Max length5
Median length4
Mean length4.0921053
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지하시설
2nd row지하시설
3rd row지하시설
4th row지하시설
5th row지하시설

Common Values

ValueCountFrequency (%)
지하시설 115
75.7%
생활용수 23
 
15.1%
음 용 수 14
 
9.2%

Length

2023-12-11T01:28:26.703141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:28:26.854406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지하시설 115
63.9%
생활용수 23
 
12.8%
14
 
7.8%
14
 
7.8%
14
 
7.8%
Distinct132
Distinct (%)86.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2470.6316
Minimum30
Maximum16188
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-11T01:28:27.046651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum30
5-th percentile95.85
Q1185.5
median711.5
Q33993.25
95-th percentile8944.4
Maximum16188
Range16158
Interquartile range (IQR)3807.75

Descriptive statistics

Standard deviation3182.7375
Coefficient of variation (CV)1.2882283
Kurtosis1.7825046
Mean2470.6316
Median Absolute Deviation (MAD)611.5
Skewness1.4977418
Sum375536
Variance10129818
MonotonicityNot monotonic
2023-12-11T01:28:27.300219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100 8
 
5.3%
110 3
 
2.0%
115 3
 
2.0%
80 2
 
1.3%
353 2
 
1.3%
120 2
 
1.3%
148 2
 
1.3%
130 2
 
1.3%
150 2
 
1.3%
561 2
 
1.3%
Other values (122) 124
81.6%
ValueCountFrequency (%)
30 1
 
0.7%
60 1
 
0.7%
65 1
 
0.7%
70 1
 
0.7%
75 1
 
0.7%
80 2
 
1.3%
92 1
 
0.7%
99 1
 
0.7%
100 8
5.3%
102 1
 
0.7%
ValueCountFrequency (%)
16188 1
0.7%
11101 1
0.7%
10141 1
0.7%
9739 1
0.7%
9475 1
0.7%
9461 1
0.7%
9238 1
0.7%
9094 1
0.7%
8822 1
0.7%
8580 1
0.7%

Interactions

2023-12-11T01:28:23.428153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:28:27.454382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설구분동명시설형태규모(세제곱미터)_용량
시설구분1.0000.0991.0000.577
동명0.0991.0000.0000.406
시설형태1.0000.0001.0000.386
규모(세제곱미터)_용량0.5770.4060.3861.000
2023-12-11T01:28:27.593013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설형태시설구분동명
시설형태1.0000.9970.000
시설구분0.9971.0000.080
동명0.0000.0801.000
2023-12-11T01:28:27.733516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
규모(세제곱미터)_용량시설구분동명시설형태
규모(세제곱미터)_용량1.0000.4270.1750.262
시설구분0.4271.0000.0800.997
동명0.1750.0801.0000.000
시설형태0.2620.9970.0001.000

Missing values

2023-12-11T01:28:23.589104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:28:23.753877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시설구분동명시설명소재지시설형태규모(세제곱미터)_용량
0대피시설대연1동부산은행대연동지점부산광역시 남구 수영로 234 (대연동, 부산은행대연동지점)지하시설561
1대피시설대연1동롯데슈퍼대연동점부산광역시 남구 천제등로 11 (대연동, 동성하이타운)지하시설2040
2대피시설대연1동하이마트부산광역시 남구 수영로 184 (대연동, 하이마트)지하시설263
3대피시설대연1동법화빌딩부산광역시 남구 수영로 190 (대연동)지하시설236
4대피시설대연1동GS25시 편의점부산광역시 남구 수영로 244 (대연동, 남천빌딩)지하시설297
5대피시설대연1동항도빌라부산광역시 남구 유엔평화로9번길 42 (대연동, 항도맨션)지하시설676
6대피시설대연1동지하철 대연역부산광역시 남구 수영로 지하 242 (대연동, 대연역)지하시설9238
7대피시설대연3동여성회관부산광역시 남구 수영로 356 (대연동, 여성회관)지하시설413
8대피시설대연3동부경대학교 2호관부산광역시 남구 용소로 45 (대연동, 부경대학교대연캠퍼스)지하시설1076
9대피시설대연3동대우그린2차아파트부산광역시 남구 황령대로319번가길 189 (대연동, 대우그린2차아파트)지하시설3914
시설구분동명시설명소재지시설형태규모(세제곱미터)_용량
142급수시설용당동신선대(아래솔밭)신선대산복로 105음 용 수100
143급수시설감만1동감만탕우암로 64생활용수237
144급수시설감만1동신성여객자동차(주)우암로 58-1생활용수115
145급수시설우암동성지고등학교유엔로 39생활용수100
146급수시설문현1동문현여자고등학교고동골로 86-41생활용수60
147급수시설문현1동광원아파트고동골로97번길 67생활용수187
148급수시설문현1동인각사진남로 234음 용 수110
149급수시설문현1동문현초등학교고동골로 86-13생활용수102
150급수시설문현3동대도라이프타운수영로39번가길 83생활용수100
151급수시설문현4동벽산한성기린아파트지게골로 52-15생활용수75