Overview

Dataset statistics

Number of variables20
Number of observations35
Missing cells193
Missing cells (%)27.6%
Duplicate rows7
Duplicate rows (%)20.0%
Total size in memory5.6 KiB
Average record size in memory164.8 B

Variable types

Unsupported15
Text4
DateTime1

Dataset

Description2022-12-16
Author주민등록인구통계
URLhttps://bigdata.gwangju.go.kr/usr/dataSet/getDataDetailView.rd?dataSetUncd=DS000201926

Alerts

Unnamed: 12 has constant value ""Constant
Dataset has 7 (20.0%) duplicate rowsDuplicates
Unnamed: 0 has 35 (100.0%) missing valuesMissing
인구이동보고서(1호) has 19 (54.3%) missing valuesMissing
Unnamed: 2 has 24 (68.6%) missing valuesMissing
Unnamed: 3 has 23 (65.7%) missing valuesMissing
Unnamed: 4 has 31 (88.6%) missing valuesMissing
Unnamed: 5 has 2 (5.7%) missing valuesMissing
Unnamed: 6 has 2 (5.7%) missing valuesMissing
Unnamed: 7 has 2 (5.7%) missing valuesMissing
Unnamed: 8 has 2 (5.7%) missing valuesMissing
Unnamed: 9 has 2 (5.7%) missing valuesMissing
Unnamed: 10 has 1 (2.9%) missing valuesMissing
Unnamed: 11 has 2 (5.7%) missing valuesMissing
Unnamed: 12 has 34 (97.1%) missing valuesMissing
Unnamed: 13 has 2 (5.7%) missing valuesMissing
Unnamed: 14 has 2 (5.7%) missing valuesMissing
Unnamed: 15 has 2 (5.7%) missing valuesMissing
Unnamed: 16 has 2 (5.7%) missing valuesMissing
Unnamed: 17 has 2 (5.7%) missing valuesMissing
Unnamed: 18 has 2 (5.7%) missing valuesMissing
Unnamed: 19 has 2 (5.7%) missing valuesMissing
Unnamed: 0 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-02-10 10:00:44.262659
Analysis finished2024-02-10 10:00:47.563475
Duration3.3 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Unnamed: 0
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing35
Missing (%)100.0%
Memory size447.0 B
Distinct16
Distinct (%)100.0%
Missing19
Missing (%)54.3%
Memory size412.0 B
2024-02-10T10:00:47.904026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10.5
Mean length7.875
Min length5

Characters and Unicode

Total characters126
Distinct characters43
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)100.0%

Sample

1st row행정기관 :
2nd row작성기준 :
3rd row시, 군, 구(읍면동)
4th row전월말세대수
5th row전월말인구수
ValueCountFrequency (%)
2
 
7.7%
2
 
7.7%
2
 
7.7%
금월말거주불명자수 1
 
3.8%
금월말인구수 1
 
3.8%
금월말세대수 1
 
3.8%
거주불명자수증감 1
 
3.8%
인구수증감 1
 
3.8%
세대수증감 1
 
3.8%
1
 
3.8%
Other values (13) 13
50.0%
2024-02-10T10:00:48.992373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12
 
9.5%
11
 
8.7%
8
 
6.3%
8
 
6.3%
5
 
4.0%
5
 
4.0%
4
 
3.2%
4
 
3.2%
4
 
3.2%
4
 
3.2%
Other values (33) 61
48.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 104
82.5%
Control 12
 
9.5%
Space Separator 4
 
3.2%
Other Punctuation 4
 
3.2%
Open Punctuation 1
 
0.8%
Close Punctuation 1
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
10.6%
8
 
7.7%
8
 
7.7%
5
 
4.8%
5
 
4.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
Other values (27) 47
45.2%
Other Punctuation
ValueCountFrequency (%)
, 2
50.0%
: 2
50.0%
Control
ValueCountFrequency (%)
12
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 104
82.5%
Common 22
 
17.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
10.6%
8
 
7.7%
8
 
7.7%
5
 
4.8%
5
 
4.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
Other values (27) 47
45.2%
Common
ValueCountFrequency (%)
12
54.5%
4
 
18.2%
, 2
 
9.1%
: 2
 
9.1%
( 1
 
4.5%
) 1
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 104
82.5%
ASCII 22
 
17.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12
54.5%
4
 
18.2%
, 2
 
9.1%
: 2
 
9.1%
( 1
 
4.5%
) 1
 
4.5%
Hangul
ValueCountFrequency (%)
11
 
10.6%
8
 
7.7%
8
 
7.7%
5
 
4.8%
5
 
4.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
Other values (27) 47
45.2%

Unnamed: 2
Text

MISSING 

Distinct9
Distinct (%)81.8%
Missing24
Missing (%)68.6%
Memory size412.0 B
2024-02-10T10:00:49.382411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length2
Mean length2.3636364
Min length2

Characters and Unicode

Total characters26
Distinct characters17
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)63.6%

Sample

1st row전 입
2nd row복귀
3rd row출생
4th row등록
5th row국외
ValueCountFrequency (%)
국외 2
15.4%
기타 2
15.4%
2
15.4%
1
7.7%
복귀 1
7.7%
출생 1
7.7%
등록 1
7.7%
1
7.7%
사망 1
7.7%
말소 1
7.7%
2024-02-10T10:00:50.428801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
15.4%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (7) 7
26.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22
84.6%
Control 4
 
15.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (6) 6
27.3%
Control
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22
84.6%
Common 4
 
15.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (6) 6
27.3%
Common
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22
84.6%
ASCII 4
 
15.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4
100.0%
Hangul
ValueCountFrequency (%)
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (6) 6
27.3%

Unnamed: 3
Text

MISSING 

Distinct7
Distinct (%)58.3%
Missing23
Missing (%)65.7%
Memory size412.0 B
2024-02-10T10:00:50.844376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length3.4166667
Min length1

Characters and Unicode

Total characters41
Distinct characters20
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)16.7%

Sample

1st row광주광역시 동구
2nd row2022.11 현재
3rd row
4th row남자
5th row여자
ValueCountFrequency (%)
2
14.3%
남자 2
14.3%
여자 2
14.3%
시도내 2
14.3%
시도간 2
14.3%
광주광역시 1
7.1%
동구 1
7.1%
2022.11 1
7.1%
현재 1
7.1%
2024-02-10T10:00:51.821712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5
12.2%
4
 
9.8%
4
 
9.8%
3
 
7.3%
2 3
 
7.3%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
Other values (10) 12
29.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31
75.6%
Decimal Number 6
 
14.6%
Space Separator 3
 
7.3%
Other Punctuation 1
 
2.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
16.1%
4
12.9%
4
12.9%
2
 
6.5%
2
 
6.5%
2
 
6.5%
2
 
6.5%
2
 
6.5%
2
 
6.5%
1
 
3.2%
Other values (5) 5
16.1%
Decimal Number
ValueCountFrequency (%)
2 3
50.0%
1 2
33.3%
0 1
 
16.7%
Space Separator
ValueCountFrequency (%)
3
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31
75.6%
Common 10
 
24.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
16.1%
4
12.9%
4
12.9%
2
 
6.5%
2
 
6.5%
2
 
6.5%
2
 
6.5%
2
 
6.5%
2
 
6.5%
1
 
3.2%
Other values (5) 5
16.1%
Common
ValueCountFrequency (%)
3
30.0%
2 3
30.0%
1 2
20.0%
0 1
 
10.0%
. 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31
75.6%
ASCII 10
 
24.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5
16.1%
4
12.9%
4
12.9%
2
 
6.5%
2
 
6.5%
2
 
6.5%
2
 
6.5%
2
 
6.5%
2
 
6.5%
1
 
3.2%
Other values (5) 5
16.1%
ASCII
ValueCountFrequency (%)
3
30.0%
2 3
30.0%
1 2
20.0%
0 1
 
10.0%
. 1
 
10.0%

Unnamed: 4
Text

MISSING 

Distinct2
Distinct (%)50.0%
Missing31
Missing (%)88.6%
Memory size412.0 B
2024-02-10T10:00:52.267543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters16
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row시군구내
2nd row시군구간
3rd row시군구내
4th row시군구간
ValueCountFrequency (%)
시군구내 2
50.0%
시군구간 2
50.0%
2024-02-10T10:00:52.982611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
2
12.5%
2
12.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
2
12.5%
2
12.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
2
12.5%
2
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
2
12.5%
2
12.5%

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)2.9%
Memory size412.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Unnamed: 12
Date

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing34
Missing (%)97.1%
Memory size412.0 B
Minimum2022-12-01 00:00:00
Maximum2022-12-01 00:00:00
2024-02-10T10:00:53.332582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-02-10T10:00:53.648618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.7%
Memory size412.0 B

Correlations

2024-02-10T10:00:54.121222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인구이동보고서(1호)Unnamed: 2Unnamed: 3Unnamed: 4
인구이동보고서(1호)1.0000.0001.000NaN
Unnamed: 20.0001.000NaNNaN
Unnamed: 31.000NaN1.000NaN
Unnamed: 4NaNNaNNaN1.000

Missing values

2024-02-10T10:00:44.965630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-02-10T10:00:45.847994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-02-10T10:00:46.784675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Unnamed: 0인구이동보고서(1호)Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19
0<NA>행정기관 :<NA>광주광역시 동구<NA>NaNNaNNaNNaNNaN출력일자 :NaN2022.12.01NaNNaNNaNNaNNaNNaNNaN
1<NA>작성기준 :<NA>2022.11 현재<NA>NaNNaNNaNNaNNaNNaNNaN<NA>NaNNaNNaNNaNNaNNaNNaN
2<NA>시, 군, 구(읍면동)<NA><NA><NA>합 계충장동동명동계림1동계림2동산수1동산수2동<NA>지산1동지산2동서남동학동학운동지원1동지원2동
3<NA>전월말세대수<NA><NA><NA>54006392424775822532844164775<NA>2456237522693579522541817179
4<NA>전월말인구수<NA><NA><NA>105798502238301058312660826110209<NA>417644113026753111249896915871
5<NA>전월말거주불명자수<NA><NA><NA>6683411585256522<NA>39950534222107
6<NA>전월말재외국민등록자수<NA><NA><NA>10276131266<NA>36391786
7<NA>증 가 요 인전 입<NA>143998581362579584<NA>5650436781112302
8<NA><NA><NA>남자<NA>7274429701304842<NA>323021325160138
9<NA><NA><NA>여자<NA>7125429661274742<NA>242022353052164
Unnamed: 0인구이동보고서(1호)Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19
25<NA><NA>말소<NA><NA>1000000<NA>0001000
26<NA><NA>국외<NA><NA>0000000<NA>0000000
27<NA><NA>기타<NA><NA>0000000<NA>0000000
28<NA>세대수증감<NA><NA><NA>44-2-32-1561-4-11<NA>-4-5-114-1-367
29<NA>인구수증감<NA><NA><NA>477-60-46131-27-44<NA>-4-10-16-4-211140
30<NA>거주불명자수증감<NA><NA><NA>-5-21-2-100<NA>02-100-1-1
31<NA>금월말세대수<NA><NA><NA>54050392224455807538944124764<NA>2452237022583583522441787246
32<NA>금월말인구수<NA><NA><NA>105845502937701053712791823410165<NA>417244013010752711228897016011
33<NA>금월말거주불명자수<NA><NA><NA>6633211683246522<NA>391149534221106
34<NA>금월말재외국민등록자수<NA><NA><NA>10477131266<NA>36391886

Duplicate rows

Most frequently occurring

인구이동보고서(1호)Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 12# duplicates
0<NA>국외<NA><NA><NA>2
1<NA>기타<NA><NA><NA>2
2<NA><NA>남자<NA><NA>2
3<NA><NA>시도간<NA><NA>2
4<NA><NA>시도내시군구내<NA>2
5<NA><NA>여자<NA><NA>2
6<NA><NA><NA>시군구간<NA>2