Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 615 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 84 |
Duplicate rows (%) | 13.7% |
Total size in memory | 15.7 KiB |
Average record size in memory | 26.2 B |
Variable types
Categorical | 1 |
---|---|
Text | 1 |
Numeric | 1 |
Dataset
Description | 경기도 경기통계시스템 추출 자료항목리스트 |
---|---|
Author | 경기도 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=4RT7D5AVAE7EU9J6E3LG33513440&infSeq=1 |
조직번호 has constant value "" | Constant |
Dataset has 84 (13.7%) duplicate rows | Duplicates |
Reproduction
Analysis started | 2023-12-10 21:54:09.449996 |
---|---|
Analysis finished | 2023-12-10 21:54:09.723531 |
Duration | 0.27 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
조직번호
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
210 |
---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 210 |
---|---|
2nd row | 210 |
3rd row | 210 |
4th row | 210 |
5th row | 210 |
Common Values
Value | Count | Frequency (%) |
210 | 615 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
210 | 615 |
통계표ID
Text
Distinct | 137 |
---|---|
Distinct (%) | 22.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
Length
Max length | 20 |
---|---|
Median length | 19 |
Mean length | 13.930081 |
Min length | 11 |
Characters and Unicode
Total characters | 8567 |
---|---|
Distinct characters | 28 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 55 ? |
---|---|
Unique (%) | 8.9% |
Sample
1st row | DT_20220025 |
---|---|
2nd row | DT_20220025 |
3rd row | DT_20220025 |
4th row | DT_20220025 |
5th row | DT_20220025 |
Value | Count | Frequency (%) |
dt_21002_j010 | 149 | |
dt_21002_l007 | 41 | 6.7% |
dt_21002_m016 | 17 | 2.8% |
dt_21002_m023 | 15 | 2.4% |
dt_21002_n001 | 12 | 2.0% |
dt_20114_2021026_04 | 12 | 2.0% |
dt_20114_2021026_05 | 12 | 2.0% |
dt_20114_2021026_01 | 12 | 2.0% |
dt_20114_2021026_02 | 12 | 2.0% |
dt_21002_k010 | 11 | 1.8% |
Other values (127) | 322 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 2400 | |
2 | 1476 | |
_ | 1214 | |
1 | 1087 | |
D | 643 | 7.5% |
T | 615 | 7.2% |
7 | 217 | 2.5% |
4 | 152 | 1.8% |
J | 150 | 1.8% |
5 | 102 | 1.2% |
Other values (18) | 511 | 6.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 5727 | |
Uppercase Letter | 1626 | 19.0% |
Connector Punctuation | 1214 | 14.2% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
D | 643 | |
T | 615 | |
J | 150 | 9.2% |
L | 41 | 2.5% |
M | 36 | 2.2% |
B | 25 | 1.5% |
I | 22 | 1.4% |
C | 21 | 1.3% |
E | 16 | 1.0% |
K | 15 | 0.9% |
Other values (7) | 42 | 2.6% |
Decimal Number
Value | Count | Frequency (%) |
0 | 2400 | |
2 | 1476 | |
1 | 1087 | |
7 | 217 | 3.8% |
4 | 152 | 2.7% |
5 | 102 | 1.8% |
6 | 88 | 1.5% |
8 | 88 | 1.5% |
3 | 81 | 1.4% |
9 | 36 | 0.6% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1214 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 6941 | |
Latin | 1626 | 19.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
D | 643 | |
T | 615 | |
J | 150 | 9.2% |
L | 41 | 2.5% |
M | 36 | 2.2% |
B | 25 | 1.5% |
I | 22 | 1.4% |
C | 21 | 1.3% |
E | 16 | 1.0% |
K | 15 | 0.9% |
Other values (7) | 42 | 2.6% |
Common
Value | Count | Frequency (%) |
0 | 2400 | |
2 | 1476 | |
_ | 1214 | |
1 | 1087 | |
7 | 217 | 3.1% |
4 | 152 | 2.2% |
5 | 102 | 1.5% |
6 | 88 | 1.3% |
8 | 88 | 1.3% |
3 | 81 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8567 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 2400 | |
2 | 1476 | |
_ | 1214 | |
1 | 1087 | |
D | 643 | 7.5% |
T | 615 | 7.2% |
7 | 217 | 2.5% |
4 | 152 | 1.8% |
J | 150 | 1.8% |
5 | 102 | 1.2% |
Other values (18) | 511 | 6.0% |
최종변경일
Real number (ℝ)
Distinct | 35 |
---|---|
Distinct (%) | 5.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20214232 |
Minimum | 20171117 |
---|---|
Maximum | 20230412 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.5 KiB |
Quantile statistics
Minimum | 20171117 |
---|---|
5-th percentile | 20181224 |
Q1 | 20211221 |
median | 20220712 |
Q3 | 20221219 |
95-th percentile | 20221219 |
Maximum | 20230412 |
Range | 59295 |
Interquartile range (IQR) | 9998 |
Descriptive statistics
Standard deviation | 12683.958 |
---|---|
Coefficient of variation (CV) | 0.00062747665 |
Kurtosis | 1.4674371 |
Mean | 20214232 |
Median Absolute Deviation (MAD) | 507 |
Skewness | -1.6218291 |
Sum | 1.2431753 × 1010 |
Variance | 1.608828 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20221219 | 149 | |
20220407 | 79 | |
20220712 | 59 | 9.6% |
20200509 | 48 | 7.8% |
20200508 | 35 | 5.7% |
20221012 | 35 | 5.7% |
20220711 | 31 | 5.0% |
20221210 | 29 | 4.7% |
20181224 | 27 | 4.4% |
20221206 | 17 | 2.8% |
Other values (25) | 106 |
Value | Count | Frequency (%) |
20171117 | 1 | 0.2% |
20180112 | 10 | 1.6% |
20180510 | 2 | 0.3% |
20180626 | 4 | 0.7% |
20180814 | 1 | 0.2% |
20180824 | 1 | 0.2% |
20180827 | 5 | 0.8% |
20181224 | 27 | |
20200421 | 16 | |
20200508 | 35 |
Value | Count | Frequency (%) |
20230412 | 3 | 0.5% |
20230329 | 6 | 1.0% |
20221219 | 149 | |
20221212 | 6 | 1.0% |
20221210 | 29 | 4.7% |
20221209 | 6 | 1.0% |
20221206 | 17 | 2.8% |
20221203 | 1 | 0.2% |
20221102 | 1 | 0.2% |
20221025 | 1 | 0.2% |
조직번호 | 통계표ID | 최종변경일 | |
---|---|---|---|
0 | 210 | DT_20220025 | 20220407 |
1 | 210 | DT_20220025 | 20220407 |
2 | 210 | DT_20220025 | 20220407 |
3 | 210 | DT_20220025 | 20220712 |
4 | 210 | DT_20220025 | 20220407 |
5 | 210 | DT_20114_2021012_06 | 20220711 |
6 | 210 | DT_20114_2021012_06 | 20220711 |
7 | 210 | DT_20114_2021012_06 | 20220711 |
8 | 210 | DT_20114_2021012_06 | 20220712 |
9 | 210 | DT_20114_2021035_03 | 20220712 |
조직번호 | 통계표ID | 최종변경일 | |
---|---|---|---|
605 | 210 | DT_20114_2021026_04 | 20220712 |
606 | 210 | DT_20114_2021026_04 | 20220712 |
607 | 210 | DT_20114_2021026_04 | 20220712 |
608 | 210 | DT_20114_2021026_04 | 20220712 |
609 | 210 | DT_20114_2021026_04 | 20220712 |
610 | 210 | DT_20114_2021035_02 | 20220712 |
611 | 210 | DT_20220014 | 20220407 |
612 | 210 | DT_20220014 | 20220407 |
613 | 210 | DT_20220014 | 20220407 |
614 | 210 | DT_20220014 | 20220407 |
Most frequently occurring
조직번호 | 통계표ID | 최종변경일 | # duplicates | |
---|---|---|---|---|
72 | 210 | DT_21002_J010 | 20221219 | 149 |
75 | 210 | DT_21002_L007 | 20221012 | 35 |
78 | 210 | DT_21002_M016 | 20221206 | 16 |
79 | 210 | DT_21002_M023 | 20221210 | 14 |
9 | 210 | DT_20114_2021026_02 | 20220712 | 12 |
10 | 210 | DT_20114_2021026_04 | 20220712 | 12 |
11 | 210 | DT_20114_2021026_05 | 20220712 | 12 |
80 | 210 | DT_21002_N001 | 20221210 | 12 |
8 | 210 | DT_20114_2021026_01 | 20220711 | 11 |
73 | 210 | DT_21002_K010 | 20211221 | 11 |