Overview

Dataset statistics

Number of variables: 4
Number of observations: 5908
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 202.1 KiB
Average record size in memory: 35.0 B

Variable types

Numeric: 3
Categorical: 1

Dataset

Description: Provides monthly move-in population data (administrative dong, number of people).
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=95

Alerts

행정동코드 is highly overall correlated with 인원수 (High correlation)
인원수 is highly overall correlated with 행정동코드 (High correlation)
인원수 has 895 (15.1%) zeros (Zeros)

Reproduction

Analysis started: 2024-01-09 21:16:30.810971
Analysis finished: 2024-01-09 21:16:31.927393
Duration: 1.12 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

기준연월 (reference year-month, YYYYMM)
Real number (ℝ)

Distinct: 166
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 201651.63
Minimum: 201001
Maximum: 202310
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 52.1 KiB

Quantile statistics

Minimum: 201001
5-th percentile: 201009
Q1: 201307
Median: 201612
Q3: 202005
95-th percentile: 202302
Maximum: 202310
Range: 1309
Interquartile range (IQR): 698

Descriptive statistics

Standard deviation: 395.92926
Coefficient of variation (CV): 0.001963432
Kurtosis: -1.1939268
Mean: 201651.63
Median Absolute Deviation (MAD): 310
Skewness: -0.0014356439
Sum: 1.1913578 × 10^9
Variance: 156759.98
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value               Count  Frequency (%)
201612              36     0.6%
201802              36     0.6%
201708              36     0.6%
201709              36     0.6%
201710              36     0.6%
201711              36     0.6%
201712              36     0.6%
201801              36     0.6%
201803              36     0.6%
201706              36     0.6%
Other values (156)  5548   93.9%

Minimum 10 values

Value   Count  Frequency (%)
201001  34     0.6%
201002  34     0.6%
201003  34     0.6%
201004  34     0.6%
201005  34     0.6%
201006  34     0.6%
201007  32     0.5%
201008  34     0.6%
201009  34     0.6%
201010  34     0.6%

Maximum 10 values

Value   Count  Frequency (%)
202310  36     0.6%
202309  36     0.6%
202308  36     0.6%
202307  18     0.3%
202306  36     0.6%
202305  36     0.6%
202304  36     0.6%
202303  36     0.6%
202302  36     0.6%
202301  36     0.6%
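
Because 기준연월 is profiled as a real number, the reported range (1309) and IQR (698) are differences between YYYYMM integers rather than spans of time. The following is a minimal sketch, assuming the values encode year and month as YYYYMM; the helper names `month_index` and `months_between` are illustrative and not part of the report.

```python
def month_index(yyyymm: int) -> int:
    """Map a YYYYMM integer to a running month number (year * 12 + month - 1)."""
    year, month = divmod(yyyymm, 100)
    if not 1 <= month <= 12:
        raise ValueError(f"not a valid YYYYMM value: {yyyymm}")
    return year * 12 + (month - 1)


def months_between(start: int, end: int) -> int:
    """Number of calendar months from start to end, inclusive."""
    return month_index(end) - month_index(start) + 1


# Minimum and maximum reported for 기준연월.
span = months_between(201001, 202310)
print(span)              # 166, the same as the reported distinct count
print(202310 - 201001)   # 1309, the numeric "Range" in the report
```

Under that assumption, the span of 166 months matches the 166 distinct values, so every month from 2010-01 through 2023-10 appears at least once.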

행정동코드 (administrative dong code)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 18
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.4523914 × 10^9
Minimum: 4.413 × 10^9
Maximum: 4.483 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 52.1 KiB

Quantile statistics

Minimum: 4.413 × 10^9
5-th percentile: 4.413 × 10^9
Q1: 4.421 × 10^9
Median: 4.473 × 10^9
Q3: 4.479 × 10^9
95-th percentile: 4.483 × 10^9
Maximum: 4.483 × 10^9
Range: 70000000
Interquartile range (IQR): 58000000

Descriptive statistics

Standard deviation: 28901544
Coefficient of variation (CV): 0.0064912406
Kurtosis: -1.8609957
Mean: 4.4523914 × 10^9
Median Absolute Deviation (MAD): 9500000
Skewness: -0.24361769
Sum: 2.6304728 × 10^13
Variance: 8.3529924 × 10^14
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=18)
Common values

Value             Count  Frequency (%)
4413000000        331    5.6%
4483000000        331    5.6%
4418000000        331    5.6%
4420000000        331    5.6%
4421000000        331    5.6%
4423000000        331    5.6%
4425000000        331    5.6%
4471000000        331    5.6%
4473000000        331    5.6%
4415000000        331    5.6%
Other values (8)  2598   44.0%

Minimum 10 values

Value       Count  Frequency (%)
4413000000  331    5.6%
4415000000  331    5.6%
4418000000  331    5.6%
4420000000  331    5.6%
4421000000  331    5.6%
4423000000  331    5.6%
4425000000  331    5.6%
4427000000  283    4.8%
4471000000  331    5.6%
4473000000  331    5.6%

Maximum 10 values

Value       Count  Frequency (%)
4483000000  331    5.6%
4482500000  331    5.6%
4481000000  331    5.6%
4480000000  331    5.6%
4479000000  331    5.6%
4477000000  331    5.6%
4476000000  331    5.6%
4475000000  329    5.6%
4473000000  331    5.6%
4471000000  331    5.6%
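
The per-code counts can be checked against a full month-by-code-by-category grid. The sketch below assumes that every combination of the 166 months, 18 codes, and 2 categories (구분) should appear exactly once; that expectation is an assumption made for illustration, not something the report states.

```python
MONTHS = 166        # distinct 기준연월 values
CODES = 18          # distinct 행정동코드 values
CATEGORIES = 2      # distinct 구분 values (전입, 전출)
OBSERVED_ROWS = 5908

# Row counts per 행정동코드 as listed in the tables above:
# sixteen codes with 331 rows, plus 4427000000 (283) and 4475000000 (329).
rows_per_code = [331] * 16 + [283, 329]

expected_rows = MONTHS * CODES * CATEGORIES   # size of the full grid
expected_per_code = MONTHS * CATEGORIES       # rows a complete code would have

assert len(rows_per_code) == CODES
assert sum(rows_per_code) == OBSERVED_ROWS

print(expected_rows)                    # 5976
print(expected_rows - OBSERVED_ROWS)    # 68 rows short of a full grid
print(expected_per_code)                # 332, one more than the largest count
```

If the full-grid assumption holds, no code has a complete series, and 4427000000 accounts for most of the shortfall.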

구분 (category: 전입 = move-in, 전출 = move-out)
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 46.3 KiB
전출: 2963
전입: 2945

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전입
2nd row: 전입
3rd row: 전입
4th row: 전입
5th row: 전입

Common Values

Value  Count  Frequency (%)
전출   2963   50.2%
전입   2945   49.8%

Length

Histogram of lengths of the category

인원수 (number of people)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 2280
Distinct (%): 38.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1311.4123
Minimum: 0
Maximum: 11789
Zeros: 895
Zeros (%): 15.1%
Negative: 0
Negative (%): 0.0%
Memory size: 52.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 366
Median: 651
Q3: 1377.25
95-th percentile: 6830.45
Maximum: 11789
Range: 11789
Interquartile range (IQR): 1011.25

Descriptive statistics

Standard deviation: 1901.5885
Coefficient of variation (CV): 1.450031
Kurtosis: 8.0826452
Mean: 1311.4123
Median Absolute Deviation (MAD): 445
Skewness: 2.8275236
Sum: 7747824
Variance: 3616038.7
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                Count  Frequency (%)
0                    895    15.1%
403                  14     0.2%
386                  13     0.2%
461                  12     0.2%
416                  12     0.2%
448                  12     0.2%
496                  12     0.2%
528                  11     0.2%
358                  11     0.2%
529                  11     0.2%
Other values (2270)  4905   83.0%

Minimum 10 values

Value  Count  Frequency (%)
0      895    15.1%
120    1      < 0.1%
144    1      < 0.1%
148    1      < 0.1%
157    1      < 0.1%
162    1      < 0.1%
163    1      < 0.1%
164    1      < 0.1%
165    2      < 0.1%
166    1      < 0.1%

Maximum 10 values

Value  Count  Frequency (%)
11789  1      < 0.1%
11744  1      < 0.1%
11724  1      < 0.1%
11678  1      < 0.1%
11653  1      < 0.1%
11471  1      < 0.1%
11380  1      < 0.1%
11039  1      < 0.1%
10732  1      < 0.1%
10522  1      < 0.1%
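
Several of the 인원수 statistics are derived from others, which makes them easy to sanity-check. The sketch below uses only figures reported above together with the standard definitions (CV = standard deviation / mean, IQR = Q3 − Q1); the variable names are illustrative.

```python
import math

n = 5908                 # number of observations
mean = 1311.4123
median = 651
std = 1901.5885
variance = 3616038.7
q1, q3 = 366, 1377.25
zeros = 895

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
zero_share = 100 * zeros / n     # percentage of zero values
total = mean * n                 # sum reconstructed from the rounded mean

print(round(cv, 4))              # 1.45
print(iqr)                       # 1011.25
print(round(zero_share, 1))      # 15.1
print(round(total))              # 7747824
print(math.isclose(std ** 2, variance, rel_tol=1e-6))  # True
print(mean > median)             # True, in line with the positive skewness
```

A mean roughly twice the median, together with a skewness of about 2.83, points to a long right tail: a few code-month combinations with very large counts.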

Correlations

            기준연월  행정동코드  구분    인원수
기준연월    1.000     0.000       0.000   0.133
행정동코드  0.000     1.000       0.000   0.590
구분        0.000     0.000       1.000   0.080
인원수      0.133     0.590       0.080   1.000

            기준연월  행정동코드  인원수  구분
기준연월    1.000     -0.003      -0.097  0.000
행정동코드  -0.003    1.000       -0.663  0.000
인원수      -0.097    -0.663      1.000   0.061
구분        0.000     0.000       0.061   1.000
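
The two High correlation alerts correspond to the 행정동코드 / 인원수 entries in the matrices above. As a rough sketch, the flagged pair can be recovered by thresholding the absolute coefficients. The 0.5 cut-off is an assumption about the profiler's configuration, and `high_pairs` is an illustrative helper; the values are copied from the second (signed) matrix.

```python
from itertools import combinations

columns = ["기준연월", "행정동코드", "인원수", "구분"]
matrix = [
    [1.000, -0.003, -0.097, 0.000],
    [-0.003, 1.000, -0.663, 0.000],
    [-0.097, -0.663, 1.000, 0.061],
    [0.000, 0.000, 0.061, 1.000],
]


def high_pairs(names, values, threshold=0.5):
    """Return (a, b, coefficient) for each column pair with |coefficient| >= threshold."""
    return [
        (names[i], names[j], values[i][j])
        for i, j in combinations(range(len(names)), 2)
        if abs(values[i][j]) >= threshold
    ]


print(high_pairs(columns, matrix))
# [('행정동코드', '인원수', -0.663)]
```

The same pair is the only one above 0.5 in the first matrix as well (0.590). Since 행정동코드 is an identifier rather than a measurement, the coefficient is best read as "인원수 differs systematically between codes" rather than as a numeric trend.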

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

      기준연월  행정동코드  구분  인원수
0     201001    4413000000  전입  8627
1     201001    4415000000  전입  1308
2     201001    4418000000  전입  1328
3     201001    4420000000  전입  4090
4     201001    4421000000  전입  2021
5     201001    4423000000  전입  1482
6     201001    4425000000  전입  1454
7     201001    4471000000  전입  516
8     201001    4473000000  전입  930
9     201001    4475000000  전입  0

Last rows

      기준연월  행정동코드  구분  인원수
5898  202310    4471000000  전입  310
5899  202310    4473000000  전입  0
5900  202310    4475000000  전입  0
5901  202310    4476000000  전입  323
5902  202310    4477000000  전입  270
5903  202310    4479000000  전입  246
5904  202310    4480000000  전입  964
5905  202310    4481000000  전입  539
5906  202310    4482500000  전입  378
5907  202310    4483000000  전입  0