Overview

Dataset statistics

Number of variables: 4
Number of observations: 102
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 1.0%
Total size in memory: 3.5 KiB
Average record size in memory: 35.3 B

Variable types

Categorical: 1
Text: 1
Numeric: 2

Dataset

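Description: Busan Metropolitan City Yeonje-gu, safe return-home route installation status, 2023-01-12 (original title: 부산광역시연제구_안심귀갓길설치현황_20230112)
Author: Yeonje-gu, Busan Metropolitan City (부산광역시 연제구)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15111858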

Alerts

Dataset has 1 (1.0%) duplicate rows [Duplicates]
구분 is highly imbalanced (86.1%) [Imbalance]
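
The 86.1% imbalance figure can be reproduced from the two category counts reported for 구분 (100 and 2). The sketch below assumes the score is one minus the Shannon entropy of the class distribution, normalized by the maximum entropy for that number of classes; the function name `imbalance_score` is hypothetical and is not taken from the report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy (base 2)
    of the class distribution and H_max = log2(number of classes).

    0.0 means perfectly balanced classes; values near 1.0 mean one
    class dominates. This definition is an assumption that happens
    to match the figure shown in the report.
    """
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    if len(probs) < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(probs))


# Counts taken from the report: 100 rows of one value, 2 of the other.
score = imbalance_score([100, 2])
print(f"{score:.1%}")
```

A perfectly balanced column such as `[51, 51]` scores 0.0 under this definition, so a high value flags a column in which nearly all rows share a single category.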

Reproduction

Analysis started: 2023-12-10 17:16:13.817403
Analysis finished: 2023-12-10 17:16:15.191061
Duration: 1.37 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

구분 (type)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 948.0 B
안심반딧불(센서등) [Ansim Firefly, sensor light]: 100
맘편한길(센서등) [Peace-of-Mind Road, sensor light]: 2

Length

Max length: 10
Median length: 10
Mean length: 9.9803922
Min length: 9
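
These length statistics follow directly from the two category values and their counts. A minimal sketch, assuming length is measured in Unicode code points after NFC normalization (so each Hangul syllable counts as one character):

```python
import unicodedata
from statistics import mean, median

# Value counts as reported for the 구분 column.
value_counts = {"안심반딧불(센서등)": 100, "맘편한길(센서등)": 2}

# Expand to one length per row; NFC keeps each Hangul syllable
# as a single precomposed code point.
lengths = [
    len(unicodedata.normalize("NFC", value))
    for value, count in value_counts.items()
    for _ in range(count)
]

print("max:", max(lengths))
print("median:", median(lengths))
print("mean:", round(mean(lengths), 7))
print("min:", min(lengths))
```

The mean is (100 × 10 + 2 × 9) / 102 = 1018 / 102, which rounds to the 9.9803922 shown above.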

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 안심반딧불(센서등)
2nd row: 안심반딧불(센서등)
3rd row: 안심반딧불(센서등)
4th row: 안심반딧불(센서등)
5th row: 안심반딧불(센서등)

Common Values

Value | Count | Frequency (%)
안심반딧불(센서등) | 100 | 98.0%
맘편한길(센서등) | 2 | 2.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot. 안심반딧불(센서등): 100 (98.0%); 맘편한길(센서등): 2 (2.0%)]

위치 (location)
Text

Distinct: 100
Distinct (%): 98.0%
Missing: 0
Missing (%): 0.0%
Memory size: 948.0 B

Length

Max length: 16
Median length: 13
Mean length: 11.676471
Min length: 5

Characters and Unicode

Total characters: 1191
Distinct characters: 68
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
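
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies the categories for one 위치 value from the sample; the category codes `Lo`, `Nd`, `Zs`, and `Pd` correspond to the names Other Letter, Decimal Number, Space Separator, and Dash Punctuation used in this report. The helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters in `text` by Unicode general category code."""
    # Normalize to NFC so each Hangul syllable is a single code point.
    normalized = unicodedata.normalize("NFC", text)
    return Counter(unicodedata.category(ch) for ch in normalized)


# One location value taken from the report's sample rows.
sample = "연수로121-4 안쪽"
counts = category_counts(sample)
for code, n in counts.most_common():
    print(code, n)
```

Applying the same tally to every value in the column and summing the counters would give a per-category total comparable to the tables in this section.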

Unique

Unique: 98
Unique (%): 96.1%
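
The report distinguishes distinct values (100) from unique values (98). A minimal sketch of that distinction, assuming "distinct" counts the different values present and "unique" counts the values that occur exactly once. The list below is a small illustrative subset assembled from values that appear in this report, not the full column.

```python
from collections import Counter

# Illustrative subset of 위치 values; two values are repeated.
locations = [
    "과정로225번길34 안쪽",
    "과정로225번길34 입구쪽",
    "연수로111번길24",
    "연수로111번길24",
    "해맞이로 31번가길 9-2",
    "해맞이로 31번가길 9-2",
    "해맞이로 31번가길9-12",   # differs from the next value only by a space
    "해맞이로 31번가길 9-12",
]

occurrences = Counter(locations)
distinct = len(occurrences)                               # different values
unique = sum(1 for n in occurrences.values() if n == 1)   # seen exactly once

print("distinct:", distinct)
print("unique:", unique)
```

The full-column figures are consistent with this reading: 98 values occurring once plus 2 values occurring twice gives 98 + 2 × 2 = 102 rows and 100 distinct values.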

Sample

1st row: 과정로225번길34 안쪽
2nd row: 과정로225번길34 입구쪽
3rd row: 연수로121-4 바깥쪽
4th row: 연수로121-4 안쪽
5th row: 고분로20번길79-9

Most frequent words:
Value | Count | Frequency (%)
대리로 | 6 | 2.9%
월드컵대로 | 6 | 2.9%
좌측 | 5 | 2.4%
우측 | 5 | 2.4%
쌍미천로 | 4 | 2.0%
해맞이로 | 4 | 2.0%
금련로 | 4 | 2.0%
거제대로 | 3 | 1.5%
31번가길 | 3 | 1.5%
금련로38번나길28 | 3 | 1.5%
Other values (119) | 162 | 79.0%

Most occurring characters

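Value | Count | Frequency (%)
(space) | 104 | 8.7%
(Hangul character lost in extraction) | 102 | 8.6%
1 | 102 | 8.6%
2 | 85 | 7.1%
(Hangul character lost in extraction) | 83 | 7.0%
(Hangul character lost in extraction) | 83 | 7.0%
- | 54 | 4.5%
3 | 52 | 4.4%
5 | 42 | 3.5%
4 | 34 | 2.9%
Other values (58) | 450 | 37.8%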

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 591 | 49.6%
Decimal Number | 442 | 37.1%
Space Separator | 104 | 8.7%
Dash Punctuation | 54 | 4.5%

Most frequent character per category

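Other Letter (the characters themselves were lost in extraction; only counts survive)
Value | Count | Frequency (%)
(lost) | 102 | 17.3%
(lost) | 83 | 14.0%
(lost) | 83 | 14.0%
(lost) | 26 | 4.4%
(lost) | 17 | 2.9%
(lost) | 14 | 2.4%
(lost) | 13 | 2.2%
(lost) | 13 | 2.2%
(lost) | 12 | 2.0%
(lost) | 11 | 1.9%
Other values (46) | 217 | 36.7%

Decimal Number
Value | Count | Frequency (%)
1 | 102 | 23.1%
2 | 85 | 19.2%
3 | 52 | 11.8%
5 | 42 | 9.5%
4 | 34 | 7.7%
8 | 33 | 7.5%
9 | 30 | 6.8%
7 | 23 | 5.2%
6 | 22 | 5.0%
0 | 19 | 4.3%

Space Separator
Value | Count | Frequency (%)
(space) | 104 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 54 | 100.0%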

Most occurring scripts

Value | Count | Frequency (%)
Common | 600 | 50.4%
Hangul | 591 | 49.6%

Most frequent character per script

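Hangul (the characters themselves were lost in extraction; only counts survive)
Value | Count | Frequency (%)
(lost) | 102 | 17.3%
(lost) | 83 | 14.0%
(lost) | 83 | 14.0%
(lost) | 26 | 4.4%
(lost) | 17 | 2.9%
(lost) | 14 | 2.4%
(lost) | 13 | 2.2%
(lost) | 13 | 2.2%
(lost) | 12 | 2.0%
(lost) | 11 | 1.9%
Other values (46) | 217 | 36.7%

Common
Value | Count | Frequency (%)
(space) | 104 | 17.3%
1 | 102 | 17.0%
2 | 85 | 14.2%
- | 54 | 9.0%
3 | 52 | 8.7%
5 | 42 | 7.0%
4 | 34 | 5.7%
8 | 33 | 5.5%
9 | 30 | 5.0%
7 | 23 | 3.8%
Other values (2) | 41 | 6.8%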

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 600 | 50.4%
Hangul | 591 | 49.6%

Most frequent character per block

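ASCII
Value | Count | Frequency (%)
(space) | 104 | 17.3%
1 | 102 | 17.0%
2 | 85 | 14.2%
- | 54 | 9.0%
3 | 52 | 8.7%
5 | 42 | 7.0%
4 | 34 | 5.7%
8 | 33 | 5.5%
9 | 30 | 5.0%
7 | 23 | 3.8%
Other values (2) | 41 | 6.8%

Hangul (the characters themselves were lost in extraction; only counts survive)
Value | Count | Frequency (%)
(lost) | 102 | 17.3%
(lost) | 83 | 14.0%
(lost) | 83 | 14.0%
(lost) | 26 | 4.4%
(lost) | 17 | 2.9%
(lost) | 14 | 2.4%
(lost) | 13 | 2.2%
(lost) | 13 | 2.2%
(lost) | 12 | 2.0%
(lost) | 11 | 1.9%
Other values (46) | 217 | 36.7%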

위도 (latitude)
Real number (ℝ)

Distinct: 84
Distinct (%): 82.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 35.179572
Minimum: 35.165559
Maximum: 35.193463
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 35.165559
5-th percentile: 35.170681
Q1: 35.175414
Median: 35.17909
Q3: 35.184066
95-th percentile: 35.189621
Maximum: 35.193463
Range: 0.02790371
Interquartile range (IQR): 0.0086517
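
The range and interquartile range are simple differences of the other quantile figures. A short sketch using the latitude values printed in this report (the full-precision extremes come from the report's value tables); because the displayed quartiles are rounded, the recomputed IQR only approximates the reported 0.0086517.

```python
# Full-precision extremes as listed in the report's value tables.
minimum = 35.16555931
maximum = 35.19346302

# Quartiles as displayed (rounded) in the quantile statistics.
q1 = 35.175414
q3 = 35.184066

value_range = maximum - minimum   # Range = Maximum - Minimum
iqr = q3 - q1                     # IQR = Q3 - Q1

print(f"range = {value_range:.8f}")
print(f"IQR   = {iqr:.7f}")
```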

Descriptive statistics

Standard deviation: 0.0060179783
Coefficient of variation (CV): 0.00017106457
Kurtosis: -0.58463489
Mean: 35.179572
Median Absolute Deviation (MAD): 0.0039784
Skewness: 0.0074240051
Sum: 3588.3163
Variance: 3.6216063 × 10^-5
Monotonicity: Not monotonic
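
Two of these descriptive statistics can be cross-checked against the others: the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. A small sketch using the latitude figures as printed in the report (so agreement is only up to display rounding):

```python
# Latitude figures as displayed in the report.
mean = 35.179572
std_dev = 0.0060179783

coefficient_of_variation = std_dev / mean   # CV = standard deviation / mean
variance = std_dev ** 2                     # variance = standard deviation squared

print(f"CV       = {coefficient_of_variation:.8e}")
print(f"variance = {variance:.7e}")
```

A CV this small reflects that all points lie within one district, so the spread is tiny relative to the absolute coordinate value.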
[Figure: Histogram with fixed size bins (bins=50)]

Most frequent values:
Value | Count | Frequency (%)
35.17473509 | 3 | 2.9%
35.17112698 | 3 | 2.9%
35.17122352 | 3 | 2.9%
35.18406598 | 3 | 2.9%
35.18644374 | 2 | 2.0%
35.17860211 | 2 | 2.0%
35.17852041 | 2 | 2.0%
35.18542585 | 2 | 2.0%
35.17541428 | 2 | 2.0%
35.17908988 | 2 | 2.0%
Other values (74) | 78 | 76.5%

10 smallest values:
Value | Count | Frequency (%)
35.16555931 | 1 | 1.0%
35.16602787 | 1 | 1.0%
35.16946145 | 1 | 1.0%
35.16946503 | 1 | 1.0%
35.17057158 | 1 | 1.0%
35.17067132 | 1 | 1.0%
35.17086194 | 1 | 1.0%
35.17096426 | 1 | 1.0%
35.17112698 | 3 | 2.9%
35.17122352 | 3 | 2.9%

10 largest values:
Value | Count | Frequency (%)
35.19346302 | 1 | 1.0%
35.19091587 | 1 | 1.0%
35.19029265 | 1 | 1.0%
35.19023023 | 1 | 1.0%
35.18971778 | 1 | 1.0%
35.18962795 | 1 | 1.0%
35.18949038 | 1 | 1.0%
35.18859188 | 1 | 1.0%
35.18736501 | 1 | 1.0%
35.18730246 | 1 | 1.0%

경도 (longitude)
Real number (ℝ)

Distinct: 84
Distinct (%): 82.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 129.0873
Minimum: 129.06613
Maximum: 129.10923
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 129.06613
5-th percentile: 129.06706
Q1: 129.08312
Median: 129.08695
Q3: 129.09497
95-th percentile: 129.10505
Maximum: 129.10923
Range: 0.0431045
Interquartile range (IQR): 0.0118518

Descriptive statistics

Standard deviation: 0.010439389
Coefficient of variation (CV): 8.0870765 × 10^-5
Kurtosis: -0.29026133
Mean: 129.0873
Median Absolute Deviation (MAD): 0.0067742
Skewness: -0.3174587
Sum: 13166.905
Variance: 0.00010898084
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=50)]

Most frequent values:
Value | Count | Frequency (%)
129.0963864 | 3 | 2.9%
129.0936813 | 3 | 2.9%
129.094967 | 3 | 2.9%
129.0801856 | 3 | 2.9%
129.1013858 | 2 | 2.0%
129.0661264 | 2 | 2.0%
129.0661912 | 2 | 2.0%
129.0953553 | 2 | 2.0%
129.0840341 | 2 | 2.0%
129.067033 | 2 | 2.0%
Other values (74) | 78 | 76.5%

10 smallest values:
Value | Count | Frequency (%)
129.0661264 | 2 | 2.0%
129.0661912 | 2 | 2.0%
129.067033 | 2 | 2.0%
129.0676387 | 1 | 1.0%
129.0682228 | 1 | 1.0%
129.0691663 | 1 | 1.0%
129.0703995 | 1 | 1.0%
129.0705852 | 1 | 1.0%
129.0706415 | 1 | 1.0%
129.0706762 | 1 | 1.0%

10 largest values:
Value | Count | Frequency (%)
129.1092309 | 1 | 1.0%
129.1090731 | 1 | 1.0%
129.106184 | 1 | 1.0%
129.1057458 | 1 | 1.0%
129.1052069 | 2 | 2.0%
129.1021392 | 1 | 1.0%
129.1013858 | 2 | 2.0%
129.0985051 | 1 | 1.0%
129.0984367 | 1 | 1.0%
129.0983746 | 1 | 1.0%

Interactions

[Figures: four interaction plots, not reproduced in this text version]

Correlations

[Figure: correlation heatmap]
 | 구분 | 위치 | 위도 | 경도
구분 | 1.000 | 0.000 | 0.148 | 0.221
위치 | 0.000 | 1.000 | 1.000 | 1.000
위도 | 0.148 | 1.000 | 1.000 | 0.850
경도 | 0.221 | 1.000 | 0.850 | 1.000

[Figure: correlation heatmap]
 | 위도 | 경도 | 구분
위도 | 1.000 | -0.083 | 0.105
경도 | -0.083 | 1.000 | 0.310
구분 | 0.105 | 0.310 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

First rows:
Index | 구분 | 위치 | 위도 | 경도
0 | 안심반딧불(센서등) | 과정로225번길34 안쪽 | 35.186444 | 129.101386
1 | 안심반딧불(센서등) | 과정로225번길34 입구쪽 | 35.186444 | 129.101386
2 | 안심반딧불(센서등) | 연수로121-4 바깥쪽 | 35.175414 | 129.084034
3 | 안심반딧불(센서등) | 연수로121-4 안쪽 | 35.175414 | 129.084034
4 | 안심반딧불(센서등) | 고분로20번길79-9 | 35.182093 | 129.085078
5 | 안심반딧불(센서등) | 월드컵대로92번길 26 오른쪽 | 35.182781 | 129.084747
6 | 안심반딧불(센서등) | 월드컵대로92번길 26 왼쪽 | 35.182781 | 129.084747
7 | 안심반딧불(센서등) | 대리로18-8 | 35.181212 | 129.085824
8 | 안심반딧불(센서등) | 대리로12번길22 | 35.180344 | 129.085143
9 | 안심반딧불(센서등) | 아시아드대로64번길8 | 35.190293 | 129.068223

Last rows:
Index | 구분 | 위치 | 위도 | 경도
92 | 안심반딧불(센서등) | 해맞이로 31번가길 9-2 | 35.17852 | 129.066191
93 | 안심반딧불(센서등) | 해맞이로 31번가길9-12 | 35.178602 | 129.066126
94 | 안심반딧불(센서등) | 쌍미천로 59번길 57 | 35.178082 | 129.086057
95 | 안심반딧불(센서등) | 대리로 22번길 20-1 | 35.180506 | 129.086166
96 | 안심반딧불(센서등) | 대리로 6번길 5-1 | 35.180978 | 129.084767
97 | 안심반딧불(센서등) | 대리로 6번길 25 | 35.180065 | 129.085095
98 | 안심반딧불(센서등) | 월드컵대로 187번길 61 | 35.185325 | 129.074416
99 | 안심반딧불(센서등) | 고분로 182-12 | 35.185426 | 129.095355
100 | 맘편한길(센서등) | 해맞이로 31번가길 9-2 | 35.17852 | 129.066191
101 | 맘편한길(센서등) | 해맞이로 31번가길 9-12 | 35.178602 | 129.066126

Duplicate rows

Most frequently occurring

Index | 구분 | 위치 | 위도 | 경도 | # duplicates
0 | 안심반딧불(센서등) | 연수로111번길24 | 35.176259 | 129.083354 | 2
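
A duplicate row is one whose values match another row in every column. The sketch below shows one way to find such rows with the standard library. The four rows are an illustrative subset built from values in this report: the repeated 연수로111번길24 row is the duplicate listed above, while the two 해맞이로 rows share a location and coordinates but differ in 구분, so they do not count as duplicates.

```python
from collections import Counter

# (구분, 위치, 위도, 경도) tuples; illustrative subset, not the full dataset.
rows = [
    ("안심반딧불(센서등)", "연수로111번길24", 35.176259, 129.083354),
    ("안심반딧불(센서등)", "연수로111번길24", 35.176259, 129.083354),
    ("안심반딧불(센서등)", "해맞이로 31번가길 9-2", 35.17852, 129.066191),
    ("맘편한길(센서등)", "해맞이로 31번가길 9-2", 35.17852, 129.066191),
]

# Count identical rows, then keep only those seen more than once.
occurrences = Counter(rows)
duplicated = {row: n for row, n in occurrences.items() if n > 1}

for row, n in duplicated.items():
    print(row[1], "occurs", n, "times")
```

Comparing whole rows rather than a single column is what keeps the two 해맞이로 entries apart here.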