Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 673.8 KiB |
Average record size in memory | 69.0 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 4 |
Dataset
Description | 한국부동산원(구.한국감정원)에서 제공하는 실거래가격지수 통계를 조회 할 수 있는 서비스로 충남의 해당기간, 해당지역의 실거래 가격지수 정보를 제공합니다. |
---|---|
Author | 충청남도 |
URL | https://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=2551 |
지역구분 레벨 is highly overall correlated with 지역코드 and 1 other fields | High correlation |
지역명 is highly overall correlated with 지역코드 and 1 other fields | High correlation |
지역코드 is highly overall correlated with 지역명 and 1 other fields | High correlation |
조사일자 is highly overall correlated with 지수 | High correlation |
지수 is highly overall correlated with 조사일자 | High correlation |
지역구분 레벨 is highly imbalanced (58.5%) | Imbalance |
번호 has unique values | Unique |
Reproduction
Analysis started | 2024-01-09 23:18:47.548674 |
---|---|
Analysis finished | 2024-01-09 23:18:49.044504 |
Duration | 1.5 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
번호
Real number (ℝ)
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9109.189 |
Minimum | 1 |
---|---|
Maximum | 18195 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 940.95 |
Q1 | 4582.75 |
median | 9082 |
Q3 | 13601.5 |
95-th percentile | 17292.05 |
Maximum | 18195 |
Range | 18194 |
Interquartile range (IQR) | 9018.75 |
Descriptive statistics
Standard deviation | 5238.0764 |
---|---|
Coefficient of variation (CV) | 0.57503214 |
Kurtosis | -1.190104 |
Mean | 9109.189 |
Median Absolute Deviation (MAD) | 4508.5 |
Skewness | -0.0017418546 |
Sum | 91091890 |
Variance | 27437445 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
15387 | 1 | < 0.1% |
11317 | 1 | < 0.1% |
8170 | 1 | < 0.1% |
5098 | 1 | < 0.1% |
13930 | 1 | < 0.1% |
5206 | 1 | < 0.1% |
662 | 1 | < 0.1% |
14489 | 1 | < 0.1% |
3214 | 1 | < 0.1% |
9911 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
1 | 1 | |
3 | 1 | |
6 | 1 | |
7 | 1 | |
9 | 1 | |
11 | 1 | |
13 | 1 | |
14 | 1 | |
15 | 1 | |
16 | 1 |
Value | Count | Frequency (%) |
18195 | 1 | |
18194 | 1 | |
18193 | 1 | |
18191 | 1 | |
18190 | 1 | |
18186 | 1 | |
18185 | 1 | |
18184 | 1 | |
18181 | 1 | |
18180 | 1 |
지역코드
Categorical
HIGH CORRELATION
 
Distinct | 28 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
A1000 | |
---|---|
A2001 | |
11000 | |
A2000 | |
A5000 | 394 |
Other values (23) |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | A2001 |
---|---|
2nd row | A2000 |
3rd row | 11000 |
4th row | A2000 |
5th row | A1000 |
Common Values
Value | Count | Frequency (%) |
A1000 | 1394 | |
A2001 | 1221 | |
11000 | 1202 | |
A2000 | 1186 | |
A5000 | 394 | 3.9% |
28000 | 389 | 3.9% |
A6000 | 386 | 3.9% |
A3000 | 380 | 3.8% |
41000 | 370 | 3.7% |
11A12 | 173 | 1.7% |
Other values (18) | 2905 |
Length
Value | Count | Frequency (%) |
a1000 | 1394 | |
a2001 | 1221 | |
11000 | 1202 | |
a2000 | 1186 | |
a5000 | 394 | 3.9% |
28000 | 389 | 3.9% |
a6000 | 386 | 3.9% |
a3000 | 380 | 3.8% |
41000 | 370 | 3.7% |
11a12 | 173 | 1.7% |
Other values (18) | 2905 |
지역명
Categorical
HIGH CORRELATION
 
Distinct | 28 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
전국 | |
---|---|
지방 | |
서울 | |
수도권 | |
5대광역시 | 394 |
Other values (23) |
Length
Max length | 5 |
---|---|
Median length | 2 |
Mean length | 2.473 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 지방 |
---|---|
2nd row | 수도권 |
3rd row | 서울 |
4th row | 수도권 |
5th row | 전국 |
Common Values
Value | Count | Frequency (%) |
전국 | 1394 | |
지방 | 1221 | |
서울 | 1202 | |
수도권 | 1186 | |
5대광역시 | 394 | 3.9% |
인천 | 389 | 3.9% |
8개도 | 386 | 3.9% |
6대광역시 | 380 | 3.8% |
경기 | 370 | 3.7% |
동북권 | 173 | 1.7% |
Other values (18) | 2905 |
Length
Value | Count | Frequency (%) |
전국 | 1394 | |
지방 | 1221 | |
서울 | 1202 | |
수도권 | 1186 | |
5대광역시 | 394 | 3.9% |
인천 | 389 | 3.9% |
8개도 | 386 | 3.9% |
6대광역시 | 380 | 3.8% |
경기 | 370 | 3.7% |
동북권 | 173 | 1.7% |
Other values (18) | 2905 |
조사일자
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 197 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 201487.35 |
Minimum | 200601 |
---|---|
Maximum | 202205 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 200601 |
---|---|
5-th percentile | 200701 |
Q1 | 201108 |
median | 201512 |
Q3 | 201903 |
95-th percentile | 202109 |
Maximum | 202205 |
Range | 1604 |
Interquartile range (IQR) | 795 |
Descriptive statistics
Standard deviation | 461.44833 |
---|---|
Coefficient of variation (CV) | 0.0022902099 |
Kurtosis | -0.99970677 |
Mean | 201487.35 |
Median Absolute Deviation (MAD) | 393 |
Skewness | -0.34229644 |
Sum | 2.0148735 × 109 |
Variance | 212934.56 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
202107 | 77 | 0.8% |
202007 | 76 | 0.8% |
202203 | 75 | 0.8% |
201811 | 75 | 0.8% |
201807 | 75 | 0.8% |
201606 | 74 | 0.7% |
202103 | 73 | 0.7% |
201909 | 73 | 0.7% |
201403 | 73 | 0.7% |
202109 | 72 | 0.7% |
Other values (187) | 9257 |
Value | Count | Frequency (%) |
200601 | 37 | |
200602 | 40 | |
200603 | 35 | |
200604 | 33 | |
200605 | 38 | |
200606 | 41 | |
200607 | 36 | |
200608 | 39 | |
200609 | 39 | |
200610 | 47 |
Value | Count | Frequency (%) |
202205 | 38 | |
202204 | 53 | |
202203 | 75 | |
202202 | 66 | |
202201 | 60 | |
202112 | 63 | |
202111 | 67 | |
202110 | 70 | |
202109 | 72 | |
202108 | 70 |
계약 타입
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 7377 | |
1 | 2623 | 26.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 7377 | |
1 | 2623 | 26.2% |
지수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9938 |
---|---|
Distinct (%) | 99.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 94.422698 |
Minimum | 36.356775 |
---|---|
Maximum | 190.39182 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 36.356775 |
---|---|
5-th percentile | 61.961166 |
Q1 | 82.606866 |
median | 94.906972 |
Q3 | 101.6599 |
95-th percentile | 128.20567 |
Maximum | 190.39182 |
Range | 154.03504 |
Interquartile range (IQR) | 19.053031 |
Descriptive statistics
Standard deviation | 20.239017 |
---|---|
Coefficient of variation (CV) | 0.21434483 |
Kurtosis | 2.5781761 |
Mean | 94.422698 |
Median Absolute Deviation (MAD) | 9.6955353 |
Skewness | 0.86881014 |
Sum | 944226.98 |
Variance | 409.61783 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
100.0 | 63 | 0.6% |
99.28566885978697 | 1 | < 0.1% |
93.3499280590343 | 1 | < 0.1% |
77.59527486292347 | 1 | < 0.1% |
100.24605013892452 | 1 | < 0.1% |
78.60823098654578 | 1 | < 0.1% |
95.75381488962935 | 1 | < 0.1% |
82.1874532845743 | 1 | < 0.1% |
48.82312451775309 | 1 | < 0.1% |
65.55748730466247 | 1 | < 0.1% |
Other values (9928) | 9928 |
Value | Count | Frequency (%) |
36.35677453293166 | 1 | |
36.443093547245184 | 1 | |
37.069668892510734 | 1 | |
38.0887191127768 | 1 | |
39.0643393636669 | 1 | |
39.07592720058212 | 1 | |
39.07832744070562 | 1 | |
39.255725460013416 | 1 | |
39.42535578629533 | 1 | |
39.53102847465653 | 1 |
Value | Count | Frequency (%) |
190.3918153090596 | 1 | |
187.7713766095173 | 1 | |
187.53247950477865 | 1 | |
185.4862551024079 | 1 | |
185.23918033687613 | 1 | |
184.31381175523296 | 1 | |
184.1361617869817 | 1 | |
182.58671087019235 | 1 | |
182.4974114427937 | 1 | |
182.47406542982665 | 1 |
지역구분 레벨
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 836 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9164 | |
1 | 836 | 8.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9164 | |
1 | 836 | 8.4% |
번호 | 지역코드 | 지역명 | 조사일자 | 계약 타입 | 지수 | 지역구분 레벨 | |
---|---|---|---|---|---|---|---|
번호 | 1.000 | 0.206 | 0.206 | 0.000 | 0.089 | 0.047 | 0.124 |
지역코드 | 0.206 | 1.000 | 1.000 | 0.000 | 0.189 | 0.464 | 1.000 |
지역명 | 0.206 | 1.000 | 1.000 | 0.000 | 0.189 | 0.464 | 1.000 |
조사일자 | 0.000 | 0.000 | 0.000 | 1.000 | 0.550 | 0.820 | 0.000 |
계약 타입 | 0.089 | 0.189 | 0.189 | 0.550 | 1.000 | 0.413 | 0.071 |
지수 | 0.047 | 0.464 | 0.464 | 0.820 | 0.413 | 1.000 | 0.221 |
지역구분 레벨 | 0.124 | 1.000 | 1.000 | 0.000 | 0.071 | 0.221 | 1.000 |
지역구분 레벨 | 지역명 | 지역코드 | 계약 타입 | |
---|---|---|---|---|
지역구분 레벨 | 1.000 | 0.999 | 0.999 | 0.045 |
지역명 | 0.999 | 1.000 | 1.000 | 0.150 |
지역코드 | 0.999 | 1.000 | 1.000 | 0.150 |
계약 타입 | 0.045 | 0.150 | 0.150 | 1.000 |
번호 | 조사일자 | 지수 | 지역코드 | 지역명 | 계약 타입 | 지역구분 레벨 | |
---|---|---|---|---|---|---|---|
번호 | 1.000 | 0.010 | 0.018 | 0.075 | 0.075 | 0.068 | 0.095 |
조사일자 | 0.010 | 1.000 | 0.845 | 0.000 | 0.000 | 0.425 | 0.000 |
지수 | 0.018 | 0.845 | 1.000 | 0.185 | 0.185 | 0.317 | 0.170 |
지역코드 | 0.075 | 0.000 | 0.185 | 1.000 | 1.000 | 0.150 | 0.999 |
지역명 | 0.075 | 0.000 | 0.185 | 1.000 | 1.000 | 0.150 | 0.999 |
계약 타입 | 0.068 | 0.425 | 0.317 | 0.150 | 0.150 | 1.000 | 0.045 |
지역구분 레벨 | 0.095 | 0.000 | 0.170 | 0.999 | 0.999 | 0.045 | 1.000 |
번호 | 지역코드 | 지역명 | 조사일자 | 계약 타입 | 지수 | 지역구분 레벨 | |
---|---|---|---|---|---|---|---|
15386 | 15387 | A2001 | 지방 | 201712 | 1 | 99.285669 | 0 |
4174 | 4175 | A2000 | 수도권 | 201606 | 0 | 92.55135 | 0 |
1322 | 1323 | 11000 | 서울 | 201305 | 0 | 78.025233 | 0 |
4933 | 4934 | A2000 | 수도권 | 201710 | 0 | 100.282542 | 0 |
8989 | 8990 | A1000 | 전국 | 201408 | 0 | 87.045669 | 0 |
14076 | 14077 | A5000 | 5대광역시 | 201807 | 0 | 99.867179 | 0 |
2091 | 2092 | A1000 | 전국 | 200702 | 0 | 63.750883 | 0 |
2865 | 2866 | A2000 | 수도권 | 201411 | 1 | 78.045122 | 0 |
8317 | 8318 | 44000 | 충남 | 201410 | 1 | 103.244656 | 0 |
16071 | 16072 | A2001 | 지방 | 202203 | 0 | 119.299751 | 0 |
번호 | 지역코드 | 지역명 | 조사일자 | 계약 타입 | 지수 | 지역구분 레벨 | |
---|---|---|---|---|---|---|---|
12276 | 12277 | 36000 | 세종 | 201701 | 0 | 93.13135 | 0 |
1630 | 1631 | A2001 | 지방 | 202007 | 1 | 110.008473 | 0 |
15249 | 15250 | A2001 | 지방 | 201705 | 1 | 99.186479 | 0 |
13470 | 13471 | A1000 | 전국 | 201006 | 0 | 74.649019 | 0 |
15379 | 15380 | 47000 | 경북 | 202001 | 1 | 102.138788 | 0 |
13575 | 13576 | A2000 | 수도권 | 202004 | 0 | 112.292642 | 0 |
16851 | 16852 | 31000 | 울산 | 201904 | 0 | 86.278938 | 0 |
10449 | 10450 | A1000 | 전국 | 200606 | 0 | 54.278285 | 0 |
17863 | 17864 | 11A11 | 도심권 | 201708 | 1 | 99.936181 | 1 |
15609 | 15610 | A1000 | 전국 | 201801 | 0 | 100.283876 | 0 |