Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 10000 |
Missing cells | 5767 |
Missing cells (%) | 4.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.2 MiB |
Average record size in memory | 124.0 B |
Variable types
Text | 1 |
---|---|
Numeric | 11 |
Categorical | 1 |
Dataset
Description | 관리_허가대장_PK,허가번호_년,허가번호_기관_코드,허가번호_구분_코드,허가번호_일련번호,건축_구분_코드,건축_허가_일,대지_면적,건폐_율,연면적,용적_율,주_용도_코드,외필지_수 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15400/S/1/datasetView.do |
외필지_수 has constant value "" | Constant |
허가번호_년 is highly overall correlated with 건축_허가_일 | High correlation |
허가번호_구분_코드 is highly overall correlated with 건축_구분_코드 and 3 other fields | High correlation |
건축_구분_코드 is highly overall correlated with 허가번호_구분_코드 | High correlation |
건축_허가_일 is highly overall correlated with 허가번호_년 | High correlation |
대지_면적 is highly overall correlated with 연면적 | High correlation |
건폐_율 is highly overall correlated with 허가번호_구분_코드 and 2 other fields | High correlation |
연면적 is highly overall correlated with 허가번호_구분_코드 and 3 other fields | High correlation |
용적_율 is highly overall correlated with 허가번호_구분_코드 and 2 other fields | High correlation |
건축_구분_코드 has 2882 (28.8%) missing values | Missing |
주_용도_코드 has 2885 (28.8%) missing values | Missing |
대지_면적 is highly skewed (γ1 = 37.49358473) | Skewed |
건폐_율 is highly skewed (γ1 = 69.3968301) | Skewed |
연면적 is highly skewed (γ1 = 82.68928223) | Skewed |
용적_율 is highly skewed (γ1 = 95.74149657) | Skewed |
관리_허가대장_PK has unique values | Unique |
대지_면적 has 1506 (15.1%) zeros | Zeros |
건폐_율 has 2967 (29.7%) zeros | Zeros |
연면적 has 1423 (14.2%) zeros | Zeros |
용적_율 has 2961 (29.6%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-18 04:09:23.578802 |
---|---|
Analysis finished | 2024-05-18 04:10:13.166924 |
Duration | 49.59 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
관리_허가대장_PK
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 17.639 |
Min length | 15 |
Characters and Unicode
Total characters | 176390 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11650-100117192 |
---|---|
2nd row | 11260-100040896 |
3rd row | 11170-1000000000000000063299 |
4th row | 11305-100083961 |
5th row | 11500-100106104 |
Value | Count | Frequency (%) |
11650-100117192 | 1 | < 0.1% |
11440-100108539 | 1 | < 0.1% |
11560-100090851 | 1 | < 0.1% |
11740-1000000000000000120334 | 1 | < 0.1% |
11680-100118949 | 1 | < 0.1% |
11545-100084919 | 1 | < 0.1% |
11230-100105732 | 1 | < 0.1% |
11170-100090073 | 1 | < 0.1% |
11440-100111922 | 1 | < 0.1% |
11590-100105401 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 68016 | |
1 | 40348 | |
- | 10000 | 5.7% |
5 | 8128 | 4.6% |
6 | 7969 | 4.5% |
2 | 7839 | 4.4% |
4 | 7421 | 4.2% |
3 | 7119 | 4.0% |
7 | 6877 | 3.9% |
8 | 6436 | 3.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 166390 | |
Dash Punctuation | 10000 | 5.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 68016 | |
1 | 40348 | |
5 | 8128 | 4.9% |
6 | 7969 | 4.8% |
2 | 7839 | 4.7% |
4 | 7421 | 4.5% |
3 | 7119 | 4.3% |
7 | 6877 | 4.1% |
8 | 6436 | 3.9% |
9 | 6237 | 3.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 176390 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 68016 | |
1 | 40348 | |
- | 10000 | 5.7% |
5 | 8128 | 4.6% |
6 | 7969 | 4.5% |
2 | 7839 | 4.4% |
4 | 7421 | 4.2% |
3 | 7119 | 4.0% |
7 | 6877 | 3.9% |
8 | 6436 | 3.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 176390 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 68016 | |
1 | 40348 | |
- | 10000 | 5.7% |
5 | 8128 | 4.6% |
6 | 7969 | 4.5% |
2 | 7839 | 4.4% |
4 | 7421 | 4.2% |
3 | 7119 | 4.0% |
7 | 6877 | 3.9% |
8 | 6436 | 3.6% |
허가번호_년
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2020.684 |
Minimum | 2018 |
---|---|
Maximum | 2024 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2018 |
---|---|
5-th percentile | 2018 |
Q1 | 2019 |
median | 2021 |
Q3 | 2022 |
95-th percentile | 2023 |
Maximum | 2024 |
Range | 6 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 1.5654357 |
---|---|
Coefficient of variation (CV) | 0.00077470586 |
Kurtosis | -0.8344364 |
Mean | 2020.684 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.24507401 |
Sum | 20206840 |
Variance | 2.4505891 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2019 | 2279 | |
2020 | 1998 | |
2021 | 1991 | |
2022 | 1740 | |
2023 | 1056 | |
2018 | 544 | 5.4% |
2024 | 392 | 3.9% |
Value | Count | Frequency (%) |
2018 | 544 | 5.4% |
2019 | 2279 | |
2020 | 1998 | |
2021 | 1991 | |
2022 | 1740 | |
2023 | 1056 | |
2024 | 392 | 3.9% |
Value | Count | Frequency (%) |
2024 | 392 | 3.9% |
2023 | 1056 | |
2022 | 1740 | |
2021 | 1991 | |
2020 | 1998 | |
2019 | 2279 | |
2018 | 544 | 5.4% |
허가번호_기관_코드
Real number (ℝ)
Distinct | 170 |
---|---|
Distinct (%) | 1.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3136827.9 |
Minimum | 3000000 |
---|---|
Maximum | 6114031 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 3000000 |
---|---|
5-th percentile | 3010178 |
Q1 | 3060180 |
median | 3150029 |
Q3 | 3210141 |
95-th percentile | 3230304 |
Maximum | 6114031 |
Range | 3114031 |
Interquartile range (IQR) | 149961 |
Descriptive statistics
Standard deviation | 117395.99 |
---|---|
Coefficient of variation (CV) | 0.037425066 |
Kurtosis | 369.81542 |
Mean | 3136827.9 |
Median Absolute Deviation (MAD) | 69855 |
Skewness | 14.577078 |
Sum | 3.1368279 × 1010 |
Variance | 1.3781818 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3220175 | 958 | 9.6% |
3200245 | 534 | 5.3% |
3150029 | 503 | 5.0% |
3180176 | 452 | 4.5% |
3130160 | 447 | 4.5% |
3210141 | 445 | 4.5% |
3230165 | 402 | 4.0% |
3240079 | 392 | 3.9% |
3060180 | 342 | 3.4% |
3000082 | 335 | 3.4% |
Other values (160) | 5190 |
Value | Count | Frequency (%) |
3000000 | 1 | < 0.1% |
3000082 | 335 | |
3000148 | 30 | 0.3% |
3000220 | 10 | 0.1% |
3000221 | 84 | 0.8% |
3010000 | 2 | < 0.1% |
3010075 | 7 | 0.1% |
3010134 | 2 | < 0.1% |
3010178 | 47 | 0.5% |
3010180 | 174 |
Value | Count | Frequency (%) |
6114031 | 3 | < 0.1% |
6113930 | 6 | 0.1% |
3240295 | 3 | < 0.1% |
3240172 | 40 | 0.4% |
3240159 | 28 | 0.3% |
3240079 | 392 | |
3230304 | 99 | 1.0% |
3230301 | 9 | 0.1% |
3230263 | 1 | < 0.1% |
3230262 | 24 | 0.2% |
허가번호_구분_코드
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 26 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1275.713 |
Minimum | 1101 |
---|---|
Maximum | 5804 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1101 |
---|---|
5-th percentile | 1101 |
Q1 | 1102 |
median | 1108 |
Q3 | 1208 |
95-th percentile | 1501 |
Maximum | 5804 |
Range | 4703 |
Interquartile range (IQR) | 106 |
Descriptive statistics
Standard deviation | 597.12611 |
---|---|
Coefficient of variation (CV) | 0.46807245 |
Kurtosis | 37.4506 |
Mean | 1275.713 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 6.1373234 |
Sum | 12757130 |
Variance | 356559.59 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1101 | 2441 | |
1108 | 2058 | |
1208 | 1491 | |
1501 | 1218 | |
1207 | 941 | 9.4% |
1107 | 513 | 5.1% |
1202 | 429 | 4.3% |
1102 | 246 | 2.5% |
5200 | 147 | 1.5% |
1210 | 122 | 1.2% |
Other values (16) | 394 | 3.9% |
Value | Count | Frequency (%) |
1101 | 2441 | |
1102 | 246 | 2.5% |
1103 | 11 | 0.1% |
1106 | 8 | 0.1% |
1107 | 513 | 5.1% |
1108 | 2058 | |
1201 | 85 | 0.9% |
1202 | 429 | 4.3% |
1203 | 7 | 0.1% |
1206 | 106 | 1.1% |
Value | Count | Frequency (%) |
5804 | 1 | < 0.1% |
5803 | 2 | < 0.1% |
5802 | 1 | < 0.1% |
5801 | 3 | < 0.1% |
5510 | 1 | < 0.1% |
5200 | 147 | 1.5% |
5100 | 62 | 0.6% |
1502 | 17 | 0.2% |
1501 | 1218 | |
1403 | 1 | < 0.1% |
허가번호_일련번호
Real number (ℝ)
Distinct | 446 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 65.4662 |
Minimum | 1 |
---|---|
Maximum | 635 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 11 |
median | 36 |
Q3 | 89 |
95-th percentile | 229.05 |
Maximum | 635 |
Range | 634 |
Interquartile range (IQR) | 78 |
Descriptive statistics
Standard deviation | 81.073654 |
---|---|
Coefficient of variation (CV) | 1.2384048 |
Kurtosis | 7.4985161 |
Mean | 65.4662 |
Median Absolute Deviation (MAD) | 30 |
Skewness | 2.3639509 |
Sum | 654662 |
Variance | 6572.9374 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 416 | 4.2% |
2 | 318 | 3.2% |
4 | 280 | 2.8% |
3 | 268 | 2.7% |
5 | 245 | 2.5% |
6 | 219 | 2.2% |
7 | 194 | 1.9% |
8 | 164 | 1.6% |
10 | 162 | 1.6% |
11 | 159 | 1.6% |
Other values (436) | 7575 |
Value | Count | Frequency (%) |
1 | 416 | |
2 | 318 | |
3 | 268 | |
4 | 280 | |
5 | 245 | |
6 | 219 | |
7 | 194 | |
8 | 164 | 1.6% |
9 | 154 | 1.5% |
10 | 162 | 1.6% |
Value | Count | Frequency (%) |
635 | 1 | |
627 | 1 | |
626 | 1 | |
623 | 1 | |
620 | 1 | |
604 | 1 | |
600 | 1 | |
598 | 1 | |
595 | 1 | |
590 | 1 |
건축_구분_코드
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 7 |
---|---|
Distinct (%) | 0.1% |
Missing | 2882 |
Missing (%) | 28.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 427.29699 |
Minimum | 100 |
---|---|
Maximum | 3000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 100 |
---|---|
5-th percentile | 100 |
Q1 | 100 |
median | 600 |
Q3 | 700 |
95-th percentile | 700 |
Maximum | 3000 |
Range | 2900 |
Interquartile range (IQR) | 600 |
Descriptive statistics
Standard deviation | 304.74819 |
---|---|
Coefficient of variation (CV) | 0.71319995 |
Kurtosis | 7.1191378 |
Mean | 427.29699 |
Median Absolute Deviation (MAD) | 100 |
Skewness | 0.95895845 |
Sum | 3041500 |
Variance | 92871.462 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
700 | 3017 | |
100 | 2583 | |
200 | 724 | 7.2% |
600 | 639 | 6.4% |
800 | 123 | 1.2% |
300 | 19 | 0.2% |
3000 | 13 | 0.1% |
(Missing) | 2882 |
Value | Count | Frequency (%) |
100 | 2583 | |
200 | 724 | 7.2% |
300 | 19 | 0.2% |
600 | 639 | 6.4% |
700 | 3017 | |
800 | 123 | 1.2% |
3000 | 13 | 0.1% |
Value | Count | Frequency (%) |
3000 | 13 | 0.1% |
800 | 123 | 1.2% |
700 | 3017 | |
600 | 639 | 6.4% |
300 | 19 | 0.2% |
200 | 724 | 7.2% |
100 | 2583 |
건축_허가_일
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 1445 |
---|---|
Distinct (%) | 14.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20207409 |
Minimum | 19871016 |
---|---|
Maximum | 20240513 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 19871016 |
---|---|
5-th percentile | 20181221 |
Q1 | 20191108 |
median | 20210208 |
Q3 | 20220510 |
95-th percentile | 20231123 |
Maximum | 20240513 |
Range | 369497 |
Interquartile range (IQR) | 29402 |
Descriptive statistics
Standard deviation | 16224.035 |
---|---|
Coefficient of variation (CV) | 0.00080287555 |
Kurtosis | 27.161195 |
Mean | 20207409 |
Median Absolute Deviation (MAD) | 10499 |
Skewness | -1.2213681 |
Sum | 2.0207409 × 1011 |
Variance | 2.632193 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20190307 | 24 | 0.2% |
20190305 | 23 | 0.2% |
20200429 | 22 | 0.2% |
20190228 | 19 | 0.2% |
20190313 | 18 | 0.2% |
20200406 | 18 | 0.2% |
20190118 | 18 | 0.2% |
20200114 | 18 | 0.2% |
20190416 | 17 | 0.2% |
20210521 | 17 | 0.2% |
Other values (1435) | 9806 |
Value | Count | Frequency (%) |
19871016 | 1 | < 0.1% |
19920307 | 1 | < 0.1% |
20150727 | 1 | < 0.1% |
20161021 | 1 | < 0.1% |
20170911 | 1 | < 0.1% |
20170913 | 1 | < 0.1% |
20180227 | 1 | < 0.1% |
20180410 | 1 | < 0.1% |
20181008 | 5 | |
20181010 | 11 |
Value | Count | Frequency (%) |
20240513 | 3 | |
20240511 | 1 | < 0.1% |
20240510 | 4 | |
20240509 | 2 | < 0.1% |
20240508 | 6 | |
20240507 | 3 | |
20240503 | 7 | |
20240502 | 3 | |
20240501 | 7 | |
20240430 | 7 |
대지_면적
Real number (ℝ)
HIGH CORRELATION
  SKEWED
  ZEROS
 
Distinct | 5388 |
---|---|
Distinct (%) | 53.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12589.517 |
Minimum | 0 |
---|---|
Maximum | 11440144 |
Zeros | 1506 |
Zeros (%) | 15.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 122.3 |
median | 263.6 |
Q3 | 795.025 |
95-th percentile | 17323.872 |
Maximum | 11440144 |
Range | 11440144 |
Interquartile range (IQR) | 672.725 |
Descriptive statistics
Standard deviation | 177513.35 |
---|---|
Coefficient of variation (CV) | 14.100092 |
Kurtosis | 1939.998 |
Mean | 12589.517 |
Median Absolute Deviation (MAD) | 211.4 |
Skewness | 37.493585 |
Sum | 1.2589517 × 108 |
Variance | 3.1510989 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 1506 | 15.1% |
18.0 | 28 | 0.3% |
165.0 | 23 | 0.2% |
132.0 | 20 | 0.2% |
162.0 | 19 | 0.2% |
231.0 | 18 | 0.2% |
149.0 | 18 | 0.2% |
152.0 | 17 | 0.2% |
330.0 | 16 | 0.2% |
202.0 | 15 | 0.1% |
Other values (5378) | 8320 |
Value | Count | Frequency (%) |
0.0 | 1506 | |
1.0 | 1 | < 0.1% |
4.1 | 1 | < 0.1% |
9.0 | 3 | < 0.1% |
9.2 | 1 | < 0.1% |
10.75 | 1 | < 0.1% |
12.0 | 1 | < 0.1% |
12.9 | 1 | < 0.1% |
15.0 | 1 | < 0.1% |
17.5 | 1 | < 0.1% |
Value | Count | Frequency (%) |
11440144.0 | 1 | < 0.1% |
4108394.0 | 1 | < 0.1% |
3895659.0 | 6 | |
3890567.0 | 1 | < 0.1% |
3425949.9 | 2 | < 0.1% |
2693724.0 | 1 | < 0.1% |
1612459.0 | 1 | < 0.1% |
1569434.3 | 1 | < 0.1% |
1414246.0 | 2 | < 0.1% |
1230943.3 | 1 | < 0.1% |
건폐_율
Real number (ℝ)
HIGH CORRELATION
  SKEWED
  ZEROS
 
Distinct | 3646 |
---|---|
Distinct (%) | 36.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 46.100851 |
Minimum | 0 |
---|---|
Maximum | 39316.049 |
Zeros | 2967 |
Zeros (%) | 29.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 49.76685 |
Q3 | 58.97 |
95-th percentile | 59.99 |
Maximum | 39316.049 |
Range | 39316.049 |
Interquartile range (IQR) | 58.97 |
Descriptive statistics
Standard deviation | 559.21714 |
---|---|
Coefficient of variation (CV) | 12.1303 |
Kurtosis | 4865.1616 |
Mean | 46.100851 |
Median Absolute Deviation (MAD) | 9.99315 |
Skewness | 69.39683 |
Sum | 461008.51 |
Variance | 312723.81 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 2967 | |
59.96 | 58 | 0.6% |
59.98 | 56 | 0.6% |
59.94 | 51 | 0.5% |
59.99 | 51 | 0.5% |
59.95 | 49 | 0.5% |
59.85 | 37 | 0.4% |
59.78 | 35 | 0.4% |
59.97 | 34 | 0.3% |
59.91 | 33 | 0.3% |
Other values (3636) | 6629 |
Value | Count | Frequency (%) |
0.0 | 2967 | |
0.0075 | 1 | < 0.1% |
0.0216 | 1 | < 0.1% |
0.0252 | 1 | < 0.1% |
0.0368 | 1 | < 0.1% |
0.0402 | 1 | < 0.1% |
0.0479 | 1 | < 0.1% |
0.0549 | 1 | < 0.1% |
0.0644 | 1 | < 0.1% |
0.0728 | 1 | < 0.1% |
Value | Count | Frequency (%) |
39316.049 | 2 | |
5959.2042 | 1 | |
220.78 | 1 | |
206.2509 | 1 | |
165.56 | 1 | |
143.3978 | 2 | |
138.6295 | 1 | |
128.6237 | 1 | |
111.1 | 1 | |
105.9818 | 1 |
연면적
Real number (ℝ)
HIGH CORRELATION
  SKEWED
  ZEROS
 
Distinct | 7042 |
---|---|
Distinct (%) | 70.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14949.818 |
Minimum | 0 |
---|---|
Maximum | 49967875 |
Zeros | 1423 |
Zeros (%) | 14.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 54 |
median | 363.375 |
Q3 | 999.7 |
95-th percentile | 22575.057 |
Maximum | 49967875 |
Range | 49967875 |
Interquartile range (IQR) | 945.7 |
Descriptive statistics
Standard deviation | 541641.64 |
---|---|
Coefficient of variation (CV) | 36.230653 |
Kurtosis | 7366.4601 |
Mean | 14949.818 |
Median Absolute Deviation (MAD) | 345.375 |
Skewness | 82.689282 |
Sum | 1.4949818 × 108 |
Variance | 2.9337567 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 1423 | 14.2% |
18.0 | 258 | 2.6% |
36.0 | 122 | 1.2% |
54.0 | 93 | 0.9% |
27.0 | 88 | 0.9% |
72.0 | 48 | 0.5% |
45.0 | 48 | 0.5% |
9.0 | 40 | 0.4% |
108.0 | 36 | 0.4% |
81.0 | 30 | 0.3% |
Other values (7032) | 7814 |
Value | Count | Frequency (%) |
0.0 | 1423 | |
0.95 | 1 | < 0.1% |
1.0 | 2 | < 0.1% |
1.17 | 1 | < 0.1% |
1.2 | 1 | < 0.1% |
1.5 | 1 | < 0.1% |
1.52 | 1 | < 0.1% |
1.55 | 1 | < 0.1% |
1.57 | 1 | < 0.1% |
1.62 | 1 | < 0.1% |
Value | Count | Frequency (%) |
49967875.0 | 1 | |
17884439.0 | 1 | |
9999344.0 | 1 | |
915921.58 | 1 | |
882279.87 | 1 | |
824351.71 | 1 | |
806049.78 | 1 | |
805927.36 | 1 | |
805872.45 | 1 | |
792584.63 | 1 |
용적_율
Real number (ℝ)
HIGH CORRELATION
  SKEWED
  ZEROS
 
Distinct | 5952 |
---|---|
Distinct (%) | 59.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 767.81169 |
Minimum | 0 |
---|---|
Maximum | 5037330.8 |
Zeros | 2961 |
Zeros (%) | 29.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 149.64 |
Q3 | 207.8513 |
95-th percentile | 583.89719 |
Maximum | 5037330.8 |
Range | 5037330.8 |
Interquartile range (IQR) | 207.8513 |
Descriptive statistics
Standard deviation | 51211.094 |
---|---|
Coefficient of variation (CV) | 66.697466 |
Kurtosis | 9369.8793 |
Mean | 767.81169 |
Median Absolute Deviation (MAD) | 102.375 |
Skewness | 95.741497 |
Sum | 7678116.9 |
Variance | 2.6225761 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 2961 | |
199.98 | 22 | 0.2% |
199.88 | 18 | 0.2% |
199.96 | 14 | 0.1% |
199.91 | 14 | 0.1% |
199.94 | 14 | 0.1% |
199.93 | 14 | 0.1% |
199.9 | 13 | 0.1% |
199.92 | 13 | 0.1% |
199.97 | 12 | 0.1% |
Other values (5942) | 6905 |
Value | Count | Frequency (%) |
0.0 | 2961 | |
0.0075 | 1 | < 0.1% |
0.0252 | 1 | < 0.1% |
0.0479 | 1 | < 0.1% |
0.0489 | 1 | < 0.1% |
0.0644 | 1 | < 0.1% |
0.0728 | 1 | < 0.1% |
0.0783 | 1 | < 0.1% |
0.0945 | 1 | < 0.1% |
0.1122 | 1 | < 0.1% |
Value | Count | Frequency (%) |
5037330.7617 | 1 | |
923113.53 | 1 | |
39345.5539 | 1 | |
1542.5582 | 1 | |
1471.9992 | 1 | |
1372.33 | 1 | |
1335.2621 | 1 | |
1315.99 | 1 | |
1249.69 | 2 | |
1199.75 | 2 |
주_용도_코드
Real number (ℝ)
MISSING
 
Distinct | 29 |
---|---|
Distinct (%) | 0.4% |
Missing | 2885 |
Missing (%) | 28.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4874.0689 |
Minimum | 1000 |
---|---|
Maximum | 31000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1000 |
---|---|
5-th percentile | 1000 |
Q1 | 2000 |
median | 4000 |
Q3 | 4000 |
95-th percentile | 14000 |
Maximum | 31000 |
Range | 30000 |
Interquartile range (IQR) | 2000 |
Descriptive statistics
Standard deviation | 4675.8456 |
---|---|
Coefficient of variation (CV) | 0.95933104 |
Kurtosis | 2.463997 |
Mean | 4874.0689 |
Median Absolute Deviation (MAD) | 2000 |
Skewness | 1.7431702 |
Sum | 34679000 |
Variance | 21863532 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4000 | 2047 | |
2000 | 1346 | |
1000 | 1231 | |
3000 | 955 | 9.6% |
14000 | 752 | 7.5% |
10000 | 175 | 1.8% |
7000 | 98 | 1.0% |
17000 | 88 | 0.9% |
9000 | 70 | 0.7% |
5000 | 62 | 0.6% |
Other values (19) | 291 | 2.9% |
(Missing) | 2885 |
Value | Count | Frequency (%) |
1000 | 1231 | |
2000 | 1346 | |
3000 | 955 | |
4000 | 2047 | |
5000 | 62 | 0.6% |
6000 | 48 | 0.5% |
7000 | 98 | 1.0% |
8000 | 8 | 0.1% |
9000 | 70 | 0.7% |
10000 | 175 | 1.8% |
Value | Count | Frequency (%) |
31000 | 1 | < 0.1% |
30000 | 2 | < 0.1% |
29000 | 1 | < 0.1% |
28000 | 8 | 0.1% |
27000 | 3 | < 0.1% |
26000 | 3 | < 0.1% |
24000 | 8 | 0.1% |
23000 | 6 | 0.1% |
21000 | 1 | < 0.1% |
20000 | 40 |
외필지_수
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 10000 |
허가번호_년 | 허가번호_기관_코드 | 허가번호_구분_코드 | 허가번호_일련번호 | 건축_구분_코드 | 건축_허가_일 | 대지_면적 | 건폐_율 | 연면적 | 용적_율 | 주_용도_코드 | |
---|---|---|---|---|---|---|---|---|---|---|---|
허가번호_년 | 1.000 | 0.000 | 0.064 | 0.222 | 0.116 | 0.740 | 0.004 | 0.000 | 0.027 | 0.071 | 0.086 |
허가번호_기관_코드 | 0.000 | 1.000 | NaN | 0.000 | 0.048 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
허가번호_구분_코드 | 0.064 | NaN | 1.000 | 0.109 | 0.194 | 0.261 | 0.067 | 0.000 | 0.000 | 0.000 | 0.348 |
허가번호_일련번호 | 0.222 | 0.000 | 0.109 | 1.000 | 0.218 | 0.137 | 0.000 | 0.000 | 0.000 | 0.000 | 0.107 |
건축_구분_코드 | 0.116 | 0.048 | 0.194 | 0.218 | 1.000 | 0.131 | 0.035 | 0.000 | 0.000 | 0.021 | 0.570 |
건축_허가_일 | 0.740 | 0.000 | 0.261 | 0.137 | 0.131 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
대지_면적 | 0.004 | 0.000 | 0.067 | 0.000 | 0.035 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.180 |
건폐_율 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.827 | 0.000 |
연면적 | 0.027 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.053 |
용적_율 | 0.071 | 0.000 | 0.000 | 0.000 | 0.021 | 0.000 | 0.000 | 0.827 | 0.000 | 1.000 | 0.087 |
주_용도_코드 | 0.086 | 0.000 | 0.348 | 0.107 | 0.570 | 0.000 | 0.180 | 0.000 | 0.053 | 0.087 | 1.000 |
허가번호_년 | 허가번호_기관_코드 | 허가번호_구분_코드 | 허가번호_일련번호 | 건축_구분_코드 | 건축_허가_일 | 대지_면적 | 건폐_율 | 연면적 | 용적_율 | 주_용도_코드 | |
---|---|---|---|---|---|---|---|---|---|---|---|
허가번호_년 | 1.000 | 0.039 | -0.140 | -0.233 | 0.120 | 0.981 | 0.261 | 0.134 | 0.201 | 0.147 | 0.072 |
허가번호_기관_코드 | 0.039 | 1.000 | -0.019 | 0.164 | -0.027 | 0.035 | 0.132 | -0.021 | 0.112 | 0.085 | 0.033 |
허가번호_구분_코드 | -0.140 | -0.019 | 1.000 | -0.180 | 0.776 | -0.143 | -0.127 | -0.654 | -0.521 | -0.597 | 0.251 |
허가번호_일련번호 | -0.233 | 0.164 | -0.180 | 1.000 | -0.156 | -0.141 | -0.203 | 0.046 | -0.076 | 0.031 | -0.076 |
건축_구분_코드 | 0.120 | -0.027 | 0.776 | -0.156 | 1.000 | 0.128 | 0.131 | -0.131 | 0.129 | 0.046 | 0.288 |
건축_허가_일 | 0.981 | 0.035 | -0.143 | -0.141 | 0.128 | 1.000 | 0.267 | 0.138 | 0.206 | 0.152 | 0.078 |
대지_면적 | 0.261 | 0.132 | -0.127 | -0.203 | 0.131 | 0.267 | 1.000 | 0.034 | 0.599 | 0.305 | 0.417 |
건폐_율 | 0.134 | -0.021 | -0.654 | 0.046 | -0.131 | 0.138 | 0.034 | 1.000 | 0.552 | 0.736 | -0.133 |
연면적 | 0.201 | 0.112 | -0.521 | -0.076 | 0.129 | 0.206 | 0.599 | 0.552 | 1.000 | 0.847 | 0.453 |
용적_율 | 0.147 | 0.085 | -0.597 | 0.031 | 0.046 | 0.152 | 0.305 | 0.736 | 0.847 | 1.000 | 0.291 |
주_용도_코드 | 0.072 | 0.033 | 0.251 | -0.076 | 0.288 | 0.078 | 0.417 | -0.133 | 0.453 | 0.291 | 1.000 |
관리_허가대장_PK | 허가번호_년 | 허가번호_기관_코드 | 허가번호_구분_코드 | 허가번호_일련번호 | 건축_구분_코드 | 건축_허가_일 | 대지_면적 | 건폐_율 | 연면적 | 용적_율 | 주_용도_코드 | 외필지_수 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
51037 | 11650-100117192 | 2020 | 3210141 | 1102 | 6 | 200 | 20200305 | 346.6 | 59.3191 | 914.83 | 191.8061 | 4000 | 0 |
48674 | 11260-100040896 | 2020 | 3060180 | 1101 | 69 | 100 | 20200417 | 306.8 | 56.22 | 719.64 | 186.33 | 4000 | 0 |
14073 | 11170-1000000000000000063299 | 2022 | 3020171 | 1101 | 110 | 100 | 20221007 | 535.9 | 58.9832 | 2442.174 | 407.2043 | 14000 | 0 |
37422 | 11305-100083961 | 2021 | 3080077 | 1208 | 2 | <NA> | 20210209 | 162.0 | 0.0 | 18.0 | 0.0 | <NA> | 0 |
22630 | 11500-100106104 | 2022 | 3150029 | 1202 | 3 | 200 | 20220216 | 570.2 | 49.93 | 731.29 | 128.25 | 4000 | 0 |
50809 | 11110-100043191 | 2020 | 3000082 | 1206 | 2 | 600 | 20200310 | 257.2 | 37.44 | 96.29 | 37.44 | 4000 | 0 |
24625 | 11110-100056696 | 2021 | 3000082 | 1108 | 212 | 700 | 20211222 | 470.4 | 33.26 | 324.76 | 63.42 | 3000 | 0 |
14947 | 11560-1000000000000000029965 | 2022 | 3180176 | 1207 | 48 | 700 | 20220907 | 103.0 | 50.83 | 157.05 | 101.65 | 1000 | 0 |
27473 | 11680-100164111 | 2021 | 3220175 | 1210 | 53 | 800 | 20211013 | 33696.1 | 49.9251 | 457994.318 | 919.6501 | 2000 | 0 |
47620 | 11680-100138870 | 2020 | 3220175 | 1501 | 95 | <NA> | 20200511 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 0 |
관리_허가대장_PK | 허가번호_년 | 허가번호_기관_코드 | 허가번호_구분_코드 | 허가번호_일련번호 | 건축_구분_코드 | 건축_허가_일 | 대지_면적 | 건폐_율 | 연면적 | 용적_율 | 주_용도_코드 | 외필지_수 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
72690 | 11410-100056114 | 2018 | 3120159 | 1501 | 145 | <NA> | 20181203 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 0 |
37578 | 11440-100124519 | 2021 | 3130160 | 1101 | 14 | 100 | 20210204 | 157.1 | 59.5481 | 313.85 | 199.7772 | 4000 | 0 |
46577 | 11650-100120109 | 2020 | 3210141 | 1501 | 15 | <NA> | 20200605 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 0 |
23629 | 11170-100089193 | 2022 | 3020171 | 1108 | 5 | 700 | 20220114 | 601.0 | 20.86 | 309.425 | 20.86 | 5000 | 0 |
58273 | 11230-100085771 | 2019 | 3050088 | 1208 | 28 | <NA> | 20190927 | 132.0 | 0.0 | 9.66 | 0.0 | <NA> | 0 |
53692 | 11560-100071511 | 2020 | 3180176 | 1101 | 5 | 100 | 20200109 | 165.0 | 57.72 | 467.2 | 283.15 | 4000 | 0 |
1885 | 11260-1000000000000000463829 | 2024 | 3060180 | 1101 | 8 | 100 | 20240221 | 239.7 | 59.77 | 478.35 | 199.56 | 2000 | 0 |
50191 | 11290-100070523 | 2020 | 3070271 | 5200 | 5 | 200 | 20200320 | 3903.0 | 27.26 | 6678.84 | 146.47 | 10000 | 0 |
65153 | 11470-100056348 | 2019 | 3140231 | 1101 | 49 | 100 | 20190424 | 209.16 | 59.02 | 470.07 | 199.97 | 2000 | 0 |
5354 | 11500-1000000000000000375375 | 2023 | 3150080 | 1208 | 6 | <NA> | 20230918 | 5366.9 | 0.0 | 105.6 | 0.0 | <NA> | 0 |