Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 673.8 KiB
Average record size in memory: 69.0 B

Variable types

Numeric: 3
Categorical: 4

Dataset

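Description: A service for querying the real transaction price index statistics provided by the Korea Real Estate Board (formerly the Korea Appraisal Board). It provides real transaction price index information for Chungnam for the selected period and region.
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=2551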

Alerts

지역구분 레벨 is highly overall correlated with 지역코드 and 1 other field (High correlation)
지역명 is highly overall correlated with 지역코드 and 1 other field (High correlation)
지역코드 is highly overall correlated with 지역명 and 1 other field (High correlation)
조사일자 is highly overall correlated with 지수 (High correlation)
지수 is highly overall correlated with 조사일자 (High correlation)
지역구분 레벨 is highly imbalanced (58.5%) (Imbalance)
번호 has unique values (Unique)
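
The 58.5% imbalance figure for 지역구분 레벨 is consistent with one minus the normalized Shannon entropy of the class counts (9164 and 836) reported for that variable. The sketch below is an assumption about how such a score can be derived, not a statement of the profiler's internals; the function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/H_max, where H is the Shannon entropy of the class shares.

    0 means perfectly balanced classes, 1 means a single class holds all rows.
    """
    if len(counts) < 2:
        return 0.0  # a single class has no balance to measure
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in shares)
    max_entropy = math.log2(len(counts))
    return 1 - entropy / max_entropy

# Class counts of 지역구분 레벨 as listed in this report (0: 9164, 1: 836).
score = imbalance_score([9164, 836])
print(f"{score:.1%}")  # 58.5%
```

A 50/50 split would score 0%, so a value near 58.5% indicates that one level dominates the column.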

Reproduction

Analysis started: 2024-01-09 23:18:47.548674
Analysis finished: 2024-01-09 23:18:49.044504
Duration: 1.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
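
The reported duration can be checked from the two timestamps above. This is a small arithmetic sketch using only the values printed in this section.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-01-09 23:18:47.548674")
finished = datetime.fromisoformat("2024-01-09 23:18:49.044504")

duration = (finished - started).total_seconds()
print(round(duration, 1))  # 1.5
```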

Variables

번호 (number)
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9109.189
Minimum: 1
Maximum: 18195
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5th percentile: 940.95
Q1: 4582.75
Median: 9082
Q3: 13601.5
95th percentile: 17292.05
Maximum: 18195
Range: 18194
Interquartile range (IQR): 9018.75

Descriptive statistics

Standard deviation: 5238.0764
Coefficient of variation (CV): 0.57503214
Kurtosis: -1.190104
Mean: 9109.189
Median Absolute Deviation (MAD): 4508.5
Skewness: -0.0017418546
Sum: 91091890
Variance: 27437445
Monotonicity: Not monotonic
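
Several of the figures above are derived from one another, so they can be cross-checked with simple arithmetic. The sketch below uses only the values printed for 번호; the variable names are illustrative.

```python
# Values copied from the statistics of 번호 in this report.
mean, std, variance = 9109.189, 5238.0764, 27437445
minimum, q1, q3, maximum = 1, 4582.75, 13601.5, 18195

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(round(cv, 5))              # 0.57503
print(iqr)                       # 9018.75
print(value_range)               # 18194
# The variance should equal the squared standard deviation up to rounding.
print(abs(std ** 2 - variance) / variance < 1e-6)  # True
```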
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
15387 | 1 | < 0.1%
11317 | 1 | < 0.1%
8170 | 1 | < 0.1%
5098 | 1 | < 0.1%
13930 | 1 | < 0.1%
5206 | 1 | < 0.1%
662 | 1 | < 0.1%
14489 | 1 | < 0.1%
3214 | 1 | < 0.1%
9911 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
3 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
9 | 1 | < 0.1%
11 | 1 | < 0.1%
13 | 1 | < 0.1%
14 | 1 | < 0.1%
15 | 1 | < 0.1%
16 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
18195 | 1 | < 0.1%
18194 | 1 | < 0.1%
18193 | 1 | < 0.1%
18191 | 1 | < 0.1%
18190 | 1 | < 0.1%
18186 | 1 | < 0.1%
18185 | 1 | < 0.1%
18184 | 1 | < 0.1%
18181 | 1 | < 0.1%
18180 | 1 | < 0.1%

지역코드 (region code)
Categorical

HIGH CORRELATION 

Distinct: 28
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Top categories: A1000 (1394), A2001 (1221), 11000 (1202), A2000 (1186), A5000 (394), other values (23): 4603

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: A2001
2nd row: A2000
3rd row: 11000
4th row: A2000
5th row: A1000

Common Values

Value | Count | Frequency (%)
A1000 | 1394 | 13.9%
A2001 | 1221 | 12.2%
11000 | 1202 | 12.0%
A2000 | 1186 | 11.9%
A5000 | 394 | 3.9%
28000 | 389 | 3.9%
A6000 | 386 | 3.9%
A3000 | 380 | 3.8%
41000 | 370 | 3.7%
11A12 | 173 | 1.7%
Other values (18) | 2905 | 29.0%

Length

Histogram of lengths of the category

지역명 (region name)
Categorical

HIGH CORRELATION 

Distinct: 28
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Top categories: 전국 (1394), 지방 (1221), 서울 (1202), 수도권 (1186), 5대광역시 (394), other values (23): 4603

Length

Max length: 5
Median length: 2
Mean length: 2.473
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지방
2nd row: 수도권
3rd row: 서울
4th row: 수도권
5th row: 전국

Common Values

Value | Count | Frequency (%)
전국 | 1394 | 13.9%
지방 | 1221 | 12.2%
서울 | 1202 | 12.0%
수도권 | 1186 | 11.9%
5대광역시 | 394 | 3.9%
인천 | 389 | 3.9%
8개도 | 386 | 3.9%
6대광역시 | 380 | 3.8%
경기 | 370 | 3.7%
동북권 | 173 | 1.7%
Other values (18) | 2905 | 29.0%

Length

Histogram of lengths of the category

조사일자 (survey date)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 197
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 201487.35
Minimum: 200601
Maximum: 202205
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 200601
5th percentile: 200701
Q1: 201108
Median: 201512
Q3: 201903
95th percentile: 202109
Maximum: 202205
Range: 1604
Interquartile range (IQR): 795

Descriptive statistics

Standard deviation: 461.44833
Coefficient of variation (CV): 0.0022902099
Kurtosis: -0.99970677
Mean: 201487.35
Median Absolute Deviation (MAD): 393
Skewness: -0.34229644
Sum: 2.0148735 × 10^9
Variance: 212934.56
Monotonicity: Not monotonic
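
The values of 조사일자 look like year-month codes (YYYYMM) that the profiler treated as plain numbers. That reading is an inference from the minimum 200601 and maximum 202205, not something the report states. Under that assumption, the sketch below counts the calendar months covered, which matches the 197 distinct values; the helper names are hypothetical.

```python
def yyyymm_to_year_month(code):
    """Split an integer such as 202205 into (2022, 5)."""
    return divmod(code, 100)

def months_between(first, last):
    """Count calendar months from first to last, both inclusive."""
    y1, m1 = yyyymm_to_year_month(first)
    y2, m2 = yyyymm_to_year_month(last)
    return (y2 - y1) * 12 + (m2 - m1) + 1

print(yyyymm_to_year_month(202205))    # (2022, 5)
print(months_between(200601, 202205))  # 197
print(202205 - 200601)                 # 1604, the numeric "Range"
```

If the assumption holds, numeric summaries such as the range of 1604 or the standard deviation describe the integer codes rather than elapsed time, because each year boundary adds a jump of 89 between consecutive months.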
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
202107 | 77 | 0.8%
202007 | 76 | 0.8%
202203 | 75 | 0.8%
201811 | 75 | 0.8%
201807 | 75 | 0.8%
201606 | 74 | 0.7%
202103 | 73 | 0.7%
201909 | 73 | 0.7%
201403 | 73 | 0.7%
202109 | 72 | 0.7%
Other values (187) | 9257 | 92.6%

Minimum 10 values

Value | Count | Frequency (%)
200601 | 37 | 0.4%
200602 | 40 | 0.4%
200603 | 35 | 0.4%
200604 | 33 | 0.3%
200605 | 38 | 0.4%
200606 | 41 | 0.4%
200607 | 36 | 0.4%
200608 | 39 | 0.4%
200609 | 39 | 0.4%
200610 | 47 | 0.5%

Maximum 10 values

Value | Count | Frequency (%)
202205 | 38 | 0.4%
202204 | 53 | 0.5%
202203 | 75 | 0.8%
202202 | 66 | 0.7%
202201 | 60 | 0.6%
202112 | 63 | 0.6%
202111 | 67 | 0.7%
202110 | 70 | 0.7%
202109 | 72 | 0.7%
202108 | 70 | 0.7%

계약 타입 (contract type)
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Categories: 0 (7377), 1 (2623)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 7377 | 73.8%
1 | 2623 | 26.2%

Length

Histogram of lengths of the category


지수 (index)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 9938
Distinct (%): 99.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 94.422698
Minimum: 36.356775
Maximum: 190.39182
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 36.356775
5th percentile: 61.961166
Q1: 82.606866
Median: 94.906972
Q3: 101.6599
95th percentile: 128.20567
Maximum: 190.39182
Range: 154.03504
Interquartile range (IQR): 19.053031

Descriptive statistics

Standard deviation: 20.239017
Coefficient of variation (CV): 0.21434483
Kurtosis: 2.5781761
Mean: 94.422698
Median Absolute Deviation (MAD): 9.6955353
Skewness: 0.86881014
Sum: 944226.98
Variance: 409.61783
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
100.0 | 63 | 0.6%
99.28566885978697 | 1 | < 0.1%
93.3499280590343 | 1 | < 0.1%
77.59527486292347 | 1 | < 0.1%
100.24605013892452 | 1 | < 0.1%
78.60823098654578 | 1 | < 0.1%
95.75381488962935 | 1 | < 0.1%
82.1874532845743 | 1 | < 0.1%
48.82312451775309 | 1 | < 0.1%
65.55748730466247 | 1 | < 0.1%
Other values (9928) | 9928 | 99.3%

Minimum 10 values

Value | Count | Frequency (%)
36.35677453293166 | 1 | < 0.1%
36.443093547245184 | 1 | < 0.1%
37.069668892510734 | 1 | < 0.1%
38.0887191127768 | 1 | < 0.1%
39.0643393636669 | 1 | < 0.1%
39.07592720058212 | 1 | < 0.1%
39.07832744070562 | 1 | < 0.1%
39.255725460013416 | 1 | < 0.1%
39.42535578629533 | 1 | < 0.1%
39.53102847465653 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
190.3918153090596 | 1 | < 0.1%
187.7713766095173 | 1 | < 0.1%
187.53247950477865 | 1 | < 0.1%
185.4862551024079 | 1 | < 0.1%
185.23918033687613 | 1 | < 0.1%
184.31381175523296 | 1 | < 0.1%
184.1361617869817 | 1 | < 0.1%
182.58671087019235 | 1 | < 0.1%
182.4974114427937 | 1 | < 0.1%
182.47406542982665 | 1 | < 0.1%

지역구분 레벨 (region classification level)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Categories: 0 (9164), 1 (836)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 9164 | 91.6%
1 | 836 | 8.4%

Length

Histogram of lengths of the category


Interactions


Correlations

Variable | 번호 | 지역코드 | 지역명 | 조사일자 | 계약 타입 | 지수 | 지역구분 레벨
번호 | 1.000 | 0.206 | 0.206 | 0.000 | 0.089 | 0.047 | 0.124
지역코드 | 0.206 | 1.000 | 1.000 | 0.000 | 0.189 | 0.464 | 1.000
지역명 | 0.206 | 1.000 | 1.000 | 0.000 | 0.189 | 0.464 | 1.000
조사일자 | 0.000 | 0.000 | 0.000 | 1.000 | 0.550 | 0.820 | 0.000
계약 타입 | 0.089 | 0.189 | 0.189 | 0.550 | 1.000 | 0.413 | 0.071
지수 | 0.047 | 0.464 | 0.464 | 0.820 | 0.413 | 1.000 | 0.221
지역구분 레벨 | 0.124 | 1.000 | 1.000 | 0.000 | 0.071 | 0.221 | 1.000

Variable | 지역구분 레벨 | 지역명 | 지역코드 | 계약 타입
지역구분 레벨 | 1.000 | 0.999 | 0.999 | 0.045
지역명 | 0.999 | 1.000 | 1.000 | 0.150
지역코드 | 0.999 | 1.000 | 1.000 | 0.150
계약 타입 | 0.045 | 0.150 | 0.150 | 1.000

Variable | 번호 | 조사일자 | 지수 | 지역코드 | 지역명 | 계약 타입 | 지역구분 레벨
번호 | 1.000 | 0.010 | 0.018 | 0.075 | 0.075 | 0.068 | 0.095
조사일자 | 0.010 | 1.000 | 0.845 | 0.000 | 0.000 | 0.425 | 0.000
지수 | 0.018 | 0.845 | 1.000 | 0.185 | 0.185 | 0.317 | 0.170
지역코드 | 0.075 | 0.000 | 0.185 | 1.000 | 1.000 | 0.150 | 0.999
지역명 | 0.075 | 0.000 | 0.185 | 1.000 | 1.000 | 0.150 | 0.999
계약 타입 | 0.068 | 0.425 | 0.317 | 0.150 | 0.150 | 1.000 | 0.045
지역구분 레벨 | 0.095 | 0.000 | 0.170 | 0.999 | 0.999 | 0.045 | 1.000
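
A correlation of 1.000 between 지역코드 and 지역명 is what one would expect if every code labels exactly one region name. The sketch below checks that on the 20 rows listed in the Sample section of this report only, so it does not prove the relationship for the full dataset; the helper name `is_one_to_one` is hypothetical.

```python
from collections import defaultdict

# (지역코드, 지역명) pairs copied from the sample rows of this report.
SAMPLE_PAIRS = [
    ("A2001", "지방"), ("A2000", "수도권"), ("11000", "서울"), ("A2000", "수도권"),
    ("A1000", "전국"), ("A5000", "5대광역시"), ("A1000", "전국"), ("A2000", "수도권"),
    ("44000", "충남"), ("A2001", "지방"),
    ("36000", "세종"), ("A2001", "지방"), ("A2001", "지방"), ("A1000", "전국"),
    ("47000", "경북"), ("A2000", "수도권"), ("31000", "울산"), ("A1000", "전국"),
    ("11A11", "도심권"), ("A1000", "전국"),
]

def is_one_to_one(pairs):
    """True if each code maps to one name and each name to one code."""
    names_by_code = defaultdict(set)
    codes_by_name = defaultdict(set)
    for code, name in pairs:
        names_by_code[code].add(name)
        codes_by_name[name].add(code)
    return (all(len(names) == 1 for names in names_by_code.values())
            and all(len(codes) == 1 for codes in codes_by_name.values()))

print(is_one_to_one(SAMPLE_PAIRS))  # True
```

When two columns carry the same information like this, keeping only one of them in a model or aggregation is usually enough.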

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

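(index) | 번호 | 지역코드 | 지역명 | 조사일자 | 계약 타입 | 지수 | 지역구분 레벨
15386 | 15387 | A2001 | 지방 | 201712 | 1 | 99.285669 | 0
4174 | 4175 | A2000 | 수도권 | 201606 | 0 | 92.55135 | 0
13221 | 13223 | 11000 | 서울 | 201305 | 0 | 78.025233 | 0
4933 | 4934 | A2000 | 수도권 | 201710 | 0 | 100.282542 | 0
8989 | 8990 | A1000 | 전국 | 201408 | 0 | 87.045669 | 0
14076 | 14077 | A5000 | 5대광역시 | 201807 | 0 | 99.867179 | 0
2091 | 2092 | A1000 | 전국 | 200702 | 0 | 63.750883 | 0
2865 | 2866 | A2000 | 수도권 | 201411 | 1 | 78.045122 | 0
8317 | 8318 | 44000 | 충남 | 201410 | 1 | 103.244656 | 0
16071 | 16072 | A2001 | 지방 | 202203 | 0 | 119.299751 | 0

(index) | 번호 | 지역코드 | 지역명 | 조사일자 | 계약 타입 | 지수 | 지역구분 레벨
12276 | 12277 | 36000 | 세종 | 201701 | 0 | 93.13135 | 0
1630 | 1631 | A2001 | 지방 | 202007 | 1 | 110.008473 | 0
15249 | 15250 | A2001 | 지방 | 201705 | 1 | 99.186479 | 0
13470 | 13471 | A1000 | 전국 | 201006 | 0 | 74.649019 | 0
15379 | 15380 | 47000 | 경북 | 202001 | 1 | 102.138788 | 0
13575 | 13576 | A2000 | 수도권 | 202004 | 0 | 112.292642 | 0
16851 | 16852 | 31000 | 울산 | 201904 | 0 | 86.278938 | 0
10449 | 10450 | A1000 | 전국 | 200606 | 0 | 54.278285 | 0
17863 | 17864 | 11A11 | 도심권 | 201708 | 1 | 99.936181 | 1
15609 | 15610 | A1000 | 전국 | 201801 | 0 | 100.283876 | 0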