Overview

Dataset statistics

Number of variables5
Number of observations754
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory31.8 KiB
Average record size in memory43.2 B

Variable types

Numeric3
Categorical1
Text1

Dataset

Description부산광역시남구_국공유재산현황_20211126
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3080532

Alerts

면적(제곱미터) is highly overall correlated with 담당부서명High correlation
담당부서명 is highly overall correlated with 면적(제곱미터)High correlation
담당부서명 is highly imbalanced (55.4%)Imbalance
면적(제곱미터) is highly skewed (γ1 = 24.21738645)Skewed
순번 has unique valuesUnique
소재지 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:48:43.225661
Analysis finished2023-12-10 16:48:45.033441
Duration1.81 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct754
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean377.5
Minimum1
Maximum754
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.8 KiB
2023-12-11T01:48:45.146634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile38.65
Q1189.25
median377.5
Q3565.75
95-th percentile716.35
Maximum754
Range753
Interquartile range (IQR)376.5

Descriptive statistics

Standard deviation217.80534
Coefficient of variation (CV)0.57696779
Kurtosis-1.2
Mean377.5
Median Absolute Deviation (MAD)188.5
Skewness0
Sum284635
Variance47439.167
MonotonicityStrictly increasing
2023-12-11T01:48:45.392647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
508 1
 
0.1%
499 1
 
0.1%
500 1
 
0.1%
501 1
 
0.1%
502 1
 
0.1%
503 1
 
0.1%
504 1
 
0.1%
505 1
 
0.1%
506 1
 
0.1%
Other values (744) 744
98.7%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
754 1
0.1%
753 1
0.1%
752 1
0.1%
751 1
0.1%
750 1
0.1%
749 1
0.1%
748 1
0.1%
747 1
0.1%
746 1
0.1%
745 1
0.1%

담당부서명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size6.0 KiB
재무담당관
401 
건축과
350 
일자리경제과
 
1
미래성장담당관
 
1
대연3동
 
1

Length

Max length7
Median length5
Mean length4.0742706
Min length3

Unique

Unique3 ?
Unique (%)0.4%

Sample

1st row건축과
2nd row건축과
3rd row건축과
4th row재무담당관
5th row재무담당관

Common Values

ValueCountFrequency (%)
재무담당관 401
53.2%
건축과 350
46.4%
일자리경제과 1
 
0.1%
미래성장담당관 1
 
0.1%
대연3동 1
 
0.1%

Length

2023-12-11T01:48:45.990667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:48:46.204545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
재무담당관 401
53.2%
건축과 350
46.4%
일자리경제과 1
 
0.1%
미래성장담당관 1
 
0.1%
대연3동 1
 
0.1%

소재지
Text

UNIQUE 

Distinct754
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size6.0 KiB
2023-12-11T01:48:46.735285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length28
Mean length19.822281
Min length17

Characters and Unicode

Total characters14946
Distinct characters45
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique754 ?
Unique (%)100.0%

Sample

1st row부산광역시 남구 대연동 219-36
2nd row부산광역시 남구 대연동 219-41
3rd row부산광역시 남구 대연동 219-45
4th row부산광역시 남구 대연동 225-3
5th row부산광역시 남구 대연동 235-1
ValueCountFrequency (%)
부산광역시 754
24.9%
남구 754
24.9%
문현동 494
16.3%
감만동 130
 
4.3%
대연동 67
 
2.2%
우암동 53
 
1.8%
용호동 5
 
0.2%
용당동 5
 
0.2%
잔여지 2
 
0.1%
도로개설 2
 
0.1%
Other values (758) 759
25.1%
2023-12-11T01:48:47.481167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3025
20.2%
756
 
5.1%
756
 
5.1%
756
 
5.1%
754
 
5.0%
754
 
5.0%
754
 
5.0%
754
 
5.0%
754
 
5.0%
- 746
 
5.0%
Other values (35) 5137
34.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7575
50.7%
Decimal Number 3599
24.1%
Space Separator 3025
 
20.2%
Dash Punctuation 746
 
5.0%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
756
10.0%
756
10.0%
756
10.0%
754
10.0%
754
10.0%
754
10.0%
754
10.0%
754
10.0%
494
6.5%
494
6.5%
Other values (22) 549
7.2%
Decimal Number
ValueCountFrequency (%)
1 727
20.2%
3 442
12.3%
2 404
11.2%
5 401
11.1%
6 324
9.0%
8 297
8.3%
4 295
8.2%
9 252
 
7.0%
7 240
 
6.7%
0 217
 
6.0%
Space Separator
ValueCountFrequency (%)
3025
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 746
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7575
50.7%
Common 7371
49.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
756
10.0%
756
10.0%
756
10.0%
754
10.0%
754
10.0%
754
10.0%
754
10.0%
754
10.0%
494
6.5%
494
6.5%
Other values (22) 549
7.2%
Common
ValueCountFrequency (%)
3025
41.0%
- 746
 
10.1%
1 727
 
9.9%
3 442
 
6.0%
2 404
 
5.5%
5 401
 
5.4%
6 324
 
4.4%
8 297
 
4.0%
4 295
 
4.0%
9 252
 
3.4%
Other values (3) 458
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7575
50.7%
ASCII 7371
49.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3025
41.0%
- 746
 
10.1%
1 727
 
9.9%
3 442
 
6.0%
2 404
 
5.5%
5 401
 
5.4%
6 324
 
4.4%
8 297
 
4.0%
4 295
 
4.0%
9 252
 
3.4%
Other values (3) 458
 
6.2%
Hangul
ValueCountFrequency (%)
756
10.0%
756
10.0%
756
10.0%
754
10.0%
754
10.0%
754
10.0%
754
10.0%
754
10.0%
494
6.5%
494
6.5%
Other values (22) 549
7.2%

면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct141
Distinct (%)18.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.118568
Minimum1
Maximum9017
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.8 KiB
2023-12-11T01:48:47.711118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median15
Q340
95-th percentile127.135
Maximum9017
Range9016
Interquartile range (IQR)36

Descriptive statistics

Standard deviation342.49401
Coefficient of variation (CV)6.8336751
Kurtosis627.67851
Mean50.118568
Median Absolute Deviation (MAD)12
Skewness24.217386
Sum37789.4
Variance117302.14
MonotonicityNot monotonic
2023-12-11T01:48:47.903675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.0 75
 
9.9%
3.0 58
 
7.7%
7.0 45
 
6.0%
2.0 36
 
4.8%
10.0 26
 
3.4%
20.0 23
 
3.1%
6.0 22
 
2.9%
4.0 21
 
2.8%
17.0 20
 
2.7%
23.0 17
 
2.3%
Other values (131) 411
54.5%
ValueCountFrequency (%)
1.0 75
9.9%
1.9 1
 
0.1%
2.0 36
4.8%
3.0 58
7.7%
4.0 21
 
2.8%
5.0 13
 
1.7%
5.8 1
 
0.1%
6.0 22
 
2.9%
7.0 45
6.0%
8.0 10
 
1.3%
ValueCountFrequency (%)
9017.0 1
0.1%
2083.0 1
0.1%
832.0 1
0.1%
676.0 1
0.1%
538.0 1
0.1%
530.0 1
0.1%
507.0 1
0.1%
412.0 1
0.1%
380.8 1
0.1%
380.0 1
0.1%

공시지가
Real number (ℝ)

Distinct430
Distinct (%)57.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean848788.33
Minimum123100
Maximum2915000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.8 KiB
2023-12-11T01:48:48.063532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum123100
5-th percentile148500
Q1529400
median796250
Q31114500
95-th percentile1664450
Maximum2915000
Range2791900
Interquartile range (IQR)585100

Descriptive statistics

Standard deviation495475.58
Coefficient of variation (CV)0.58374457
Kurtosis1.0092317
Mean848788.33
Median Absolute Deviation (MAD)296350
Skewness0.75304513
Sum6.399864 × 108
Variance2.4549605 × 1011
MonotonicityNot monotonic
2023-12-11T01:48:48.239790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
148500 54
 
7.2%
162500 18
 
2.4%
254100 9
 
1.2%
553600 9
 
1.2%
341500 8
 
1.1%
1046000 8
 
1.1%
372900 7
 
0.9%
783600 7
 
0.9%
149800 7
 
0.9%
1036000 7
 
0.9%
Other values (420) 620
82.2%
ValueCountFrequency (%)
123100 1
 
0.1%
138600 1
 
0.1%
148500 54
7.2%
149800 7
 
0.9%
151800 1
 
0.1%
162500 18
 
2.4%
190500 2
 
0.3%
193300 1
 
0.1%
196000 1
 
0.1%
200900 3
 
0.4%
ValueCountFrequency (%)
2915000 1
0.1%
2888000 1
0.1%
2733000 1
0.1%
2679000 1
0.1%
2626000 1
0.1%
2568000 1
0.1%
2520000 2
0.3%
2268000 1
0.1%
2220000 1
0.1%
2209000 1
0.1%

Interactions

2023-12-11T01:48:44.339613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:48:43.454518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:48:43.899209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:48:44.506350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:48:43.586411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:48:44.056097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:48:44.646762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:48:43.742522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:48:44.180334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:48:48.341718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번담당부서명면적(제곱미터)공시지가
순번1.0000.6530.0000.696
담당부서명0.6531.0000.7170.357
면적(제곱미터)0.0000.7171.0000.077
공시지가0.6960.3570.0771.000
2023-12-11T01:48:48.438028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번면적(제곱미터)공시지가담당부서명
순번1.0000.121-0.2960.327
면적(제곱미터)0.1211.000-0.1860.705
공시지가-0.296-0.1861.0000.156
담당부서명0.3270.7050.1561.000

Missing values

2023-12-11T01:48:44.836669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:48:44.966471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번담당부서명소재지면적(제곱미터)공시지가
01건축과부산광역시 남구 대연동 219-3621.0577300
12건축과부산광역시 남구 대연동 219-411.0220200
23건축과부산광역시 남구 대연동 219-456.0546600
34재무담당관부산광역시 남구 대연동 225-32.0697500
45재무담당관부산광역시 남구 대연동 235-150.0447100
56재무담당관부산광역시 남구 대연동 245-452.0911800
67재무담당관부산광역시 남구 대연동 245-981.0911800
78재무담당관부산광역시 남구 대연동 245-22218.0342800
89재무담당관부산광역시 남구 대연동 282-428.61517000
910재무담당관부산광역시 남구 대연동 317-7010.0463600
순번담당부서명소재지면적(제곱미터)공시지가
744745재무담당관부산광역시 남구 감만동 199-12182.0783200
745746재무담당관부산광역시 남구 감만동 199-19132.0775500
746747재무담당관부산광역시 남구 감만동 205-111108.01119000
747748재무담당관부산광역시 남구 감만동 217-11106.0802900
748749재무담당관부산광역시 남구 감만동 351-0 동항부녀경로당(감만1동 351번지98.0766900
749750재무담당관부산광역시 남구 감만동 484-0380.0764900
750751건축과부산광역시 남구 감만동 589-513.01202000
751752건축과부산광역시 남구 감만동 589-1414.0605700
752753건축과부산광역시 남구 감만동 590-37.0254100
753754건축과부산광역시 남구 감만동 590-1110.0746900