Overview

Dataset statistics

Number of variables: 23
Number of observations: 36
Missing cells: 157
Missing cells (%): 19.0%
Duplicate rows: 7
Duplicate rows (%): 19.4%
Total size in memory: 6.6 KiB
Average record size in memory: 187.7 B
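
The derived figures in this table follow from the raw counts. The following is a minimal sketch that recomputes them, assuming only the numbers reported above; the variable names are illustrative and not part of the report.

```python
# Recompute the derived "Dataset statistics" figures from the raw counts.
n_variables = 23
n_observations = 36
missing_cells = 157
duplicate_rows = 7
total_size_kib = 6.6  # as reported, already rounded to one decimal place

total_cells = n_variables * n_observations              # 23 * 36 = 828 cells
missing_pct = 100 * missing_cells / total_cells         # share of empty cells
duplicate_pct = 100 * duplicate_rows / n_observations   # share of duplicated rows
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Because the total size is itself rounded, the average record size computed this way is only approximate.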

Variable types

Text: 4
Unsupported: 19

Dataset

Description: 2022-02-23
Author: 주민등록인구통계 (Resident Registration Population Statistics)
URL: https://bigdata.gwangju.go.kr/usr/dataSet/getDataDetailView.rd?dataSetUncd=DS000201928

Alerts

Dataset has 7 (19.4%) duplicate rows [Duplicates]
인구이동보고서(1호) has 20 (55.6%) missing values [Missing]
Unnamed: 1 has 25 (69.4%) missing values [Missing]
Unnamed: 2 has 24 (66.7%) missing values [Missing]
Unnamed: 3 has 32 (88.9%) missing values [Missing]
Unnamed: 4 has 3 (8.3%) missing values [Missing]
Unnamed: 5 has 3 (8.3%) missing values [Missing]
Unnamed: 6 has 3 (8.3%) missing values [Missing]
Unnamed: 7 has 3 (8.3%) missing values [Missing]
Unnamed: 8 has 3 (8.3%) missing values [Missing]
Unnamed: 9 has 2 (5.6%) missing values [Missing]
Unnamed: 10 has 3 (8.3%) missing values [Missing]
Unnamed: 11 has 3 (8.3%) missing values [Missing]
Unnamed: 12 has 3 (8.3%) missing values [Missing]
Unnamed: 13 has 3 (8.3%) missing values [Missing]
Unnamed: 14 has 3 (8.3%) missing values [Missing]
Unnamed: 15 has 3 (8.3%) missing values [Missing]
Unnamed: 16 has 3 (8.3%) missing values [Missing]
Unnamed: 17 has 3 (8.3%) missing values [Missing]
Unnamed: 18 has 3 (8.3%) missing values [Missing]
Unnamed: 19 has 3 (8.3%) missing values [Missing]
Unnamed: 20 has 3 (8.3%) missing values [Missing]
Unnamed: 21 has 3 (8.3%) missing values [Missing]
Unnamed: 22 has 3 (8.3%) missing values [Missing]
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 20 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 21 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 22 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
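
One plausible cause of the "unsupported type" alerts is that these columns mix missing values, header text, and numbers, because the title and header rows of the source sheet were read as data. The sketch below illustrates that idea in plain Python. It is not the profiler's own logic: `column` holds illustrative values in the style of the Sample section, and `type_mix` and `to_int_or_none` are hypothetical helpers.

```python
import math
from collections import Counter

# Illustrative values only: preamble gaps, a header label, then counts.
column = [math.nan, math.nan, "합 계", 132808, 291231, 951]

def type_mix(values):
    """Count the Python types in a column, treating float NaN as 'missing'."""
    kinds = Counter()
    for v in values:
        if isinstance(v, float) and math.isnan(v):
            kinds["missing"] += 1
        else:
            kinds[type(v).__name__] += 1
    return dict(kinds)

def to_int_or_none(v):
    """Keep integers (or digit strings); map everything else to None."""
    if isinstance(v, bool):
        return None
    if isinstance(v, int):
        return v
    if isinstance(v, str) and v.strip().lstrip("-").isdigit():
        return int(v.strip())
    return None

print(type_mix(column))
cleaned = [to_int_or_none(v) for v in column]
print(cleaned)
```

A column that reports more than one non-missing type is a candidate for cleaning. In practice it may be simpler to skip the preamble rows when loading the file, so that the district names become the column headers.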

Reproduction

Analysis started: 2024-02-10 07:13:04.991545
Analysis finished: 2024-02-10 07:13:05.449714
Duration: 0.46 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
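
The reported duration is the difference between the two timestamps. A small sketch, assuming only the values listed above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-02-10 07:13:04.991545")
finished = datetime.fromisoformat("2024-02-10 07:13:05.449714")

duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```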

Variables

인구이동보고서(1호)
Text

MISSING

Distinct: 16
Distinct (%): 100.0%
Missing: 20
Missing (%): 55.6%
Memory size: 420.0 B

Length

Max length: 11
Median length: 10
Mean length: 7.75
Min length: 5

Characters and Unicode

Total characters: 124
Distinct characters: 42
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
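
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It uses only the five values listed in this variable's Sample subsection, so the totals are smaller than the report's, which cover all 16 values. The `CATEGORY_NAMES` mapping is an assumption made here to match the report's labels. The module covers the general category only; scripts and blocks would need another data source.

```python
import unicodedata
from collections import Counter

# First five values of the 인구이동보고서(1호) column, as listed under "Sample".
samples = ["행정기관 :", "작성기준 :", "시 군 구(읍면동)", "전월말세대수", "전월말인구수"]

# Two-letter general-category codes mapped to the names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Cc": "Control",
}

category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for value in samples
    for ch in value
)

for name, count in category_counts.most_common():
    print(name, count)
```

The Control characters counted in the report do not occur in these five samples, so that category is absent from the output.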

Unique

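Unique: 16
Unique (%): 100.0%

Sample

1st row: 행정기관 : (Administrative agency:)
2nd row: 작성기준 : (Compilation basis:)
3rd row: 시 군 구(읍면동) (City/county/district (eup/myeon/dong))
4th row: 전월말세대수 (Households at end of previous month)
5th row: 전월말인구수 (Population at end of previous month)

Value | Count | Frequency (%)
(not shown) | 2 | 7.7%
(not shown) | 2 | 7.7%
(not shown) | 2 | 7.7%
행정기관 (administrative agency) | 1 | 3.8%
금월말거주불명자수 (persons of unknown residence at end of current month) | 1 | 3.8%
금월말인구수 (population at end of current month) | 1 | 3.8%
금월말세대수 (households at end of current month) | 1 | 3.8%
거주불명자수증감 (change in persons of unknown residence) | 1 | 3.8%
인구수증감 (change in population) | 1 | 3.8%
세대수증감 (change in households) | 1 | 3.8%
Other values (13) | 13 | 50.0%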

Most occurring characters

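Value | Count | Frequency (%)
(not shown) | 12 | 9.7%
(not shown) | 11 | 8.9%
(not shown) | 8 | 6.5%
(not shown) | 8 | 6.5%
(not shown) | 5 | 4.0%
(not shown) | 5 | 4.0%
(not shown) | 4 | 3.2%
(not shown) | 4 | 3.2%
(not shown) | 4 | 3.2%
(not shown) | 4 | 3.2%
Other values (32) | 59 | 47.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 104 | 83.9%
Control | 12 | 9.7%
Space Separator | 4 | 3.2%
Other Punctuation | 2 | 1.6%
Close Punctuation | 1 | 0.8%
Open Punctuation | 1 | 0.8%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 11 | 10.6%
(not shown) | 8 | 7.7%
(not shown) | 8 | 7.7%
(not shown) | 5 | 4.8%
(not shown) | 5 | 4.8%
(not shown) | 4 | 3.8%
(not shown) | 4 | 3.8%
(not shown) | 4 | 3.8%
(not shown) | 4 | 3.8%
(not shown) | 4 | 3.8%
Other values (27) | 47 | 45.2%
Control
Value | Count | Frequency (%)
(not shown) | 12 | 100.0%
Space Separator
Value | Count | Frequency (%)
(not shown) | 4 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
: | 2 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%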

Most occurring scripts

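Value | Count | Frequency (%)
Hangul | 104 | 83.9%
Common | 20 | 16.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not shown) | 11 | 10.6%
(not shown) | 8 | 7.7%
(not shown) | 8 | 7.7%
(not shown) | 5 | 4.8%
(not shown) | 5 | 4.8%
(not shown) | 4 | 3.8%
(not shown) | 4 | 3.8%
(not shown) | 4 | 3.8%
(not shown) | 4 | 3.8%
(not shown) | 4 | 3.8%
Other values (27) | 47 | 45.2%
Common
Value | Count | Frequency (%)
(not shown) | 12 | 60.0%
(not shown) | 4 | 20.0%
: | 2 | 10.0%
) | 1 | 5.0%
( | 1 | 5.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 104 | 83.9%
ASCII | 20 | 16.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(not shown) | 12 | 60.0%
(not shown) | 4 | 20.0%
: | 2 | 10.0%
) | 1 | 5.0%
( | 1 | 5.0%
Hangul
Value | Count | Frequency (%)
(not shown) | 11 | 10.6%
(not shown) | 8 | 7.7%
(not shown) | 8 | 7.7%
(not shown) | 5 | 4.8%
(not shown) | 5 | 4.8%
(not shown) | 4 | 3.8%
(not shown) | 4 | 3.8%
(not shown) | 4 | 3.8%
(not shown) | 4 | 3.8%
(not shown) | 4 | 3.8%
Other values (27) | 47 | 45.2%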

Unnamed: 1
Text

MISSING 

Distinct: 9
Distinct (%): 81.8%
Missing: 25
Missing (%): 69.4%
Memory size: 420.0 B

Length

Max length: 4
Median length: 2
Mean length: 2.3636364
Min length: 2

Characters and Unicode

Total characters: 26
Distinct characters: 17
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7
Unique (%): 63.6%
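
The percentages for this variable are consistent with two different denominators: the missing share is relative to all 36 observations, while the distinct and unique shares are relative to the 11 non-missing values. The sketch below checks that reading against the reported figures. It is an inference from the numbers, not a statement about the profiler's internals, and the variable names are illustrative.

```python
# Figures reported for "Unnamed: 1".
n_observations = 36
missing = 25
distinct = 9
unique = 7
total_characters = 26

non_missing = n_observations - missing           # 11 values actually present

missing_pct = 100 * missing / n_observations     # relative to all rows
distinct_pct = 100 * distinct / non_missing      # relative to non-missing values
unique_pct = 100 * unique / non_missing
mean_length = total_characters / non_missing

print(f"Missing (%): {missing_pct:.1f}%")
print(f"Distinct (%): {distinct_pct:.1f}%")
print(f"Unique (%): {unique_pct:.1f}%")
print(f"Mean length: {mean_length:.7f}")
```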

Sample

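1st row: 전 입 (move-in)
2nd row: 복귀 (return)
3rd row: 출생 (birth)
4th row: 등록 (registration)
5th row: 국외 (overseas)

Value | Count | Frequency (%)
국외 (overseas) | 2 | 15.4%
기타 (other) | 2 | 15.4%
(not shown) | 2 | 15.4%
(not shown) | 1 | 7.7%
복귀 (return) | 1 | 7.7%
출생 (birth) | 1 | 7.7%
등록 (registration) | 1 | 7.7%
(not shown) | 1 | 7.7%
사망 (death) | 1 | 7.7%
말소 (cancellation) | 1 | 7.7%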

Most occurring characters

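Value | Count | Frequency (%)
(not shown) | 4 | 15.4%
(not shown) | 2 | 7.7%
(not shown) | 2 | 7.7%
(not shown) | 2 | 7.7%
(not shown) | 2 | 7.7%
(not shown) | 2 | 7.7%
(not shown) | 2 | 7.7%
(not shown) | 1 | 3.8%
(not shown) | 1 | 3.8%
(not shown) | 1 | 3.8%
Other values (7) | 7 | 26.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 22 | 84.6%
Control | 4 | 15.4%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 1 | 4.5%
(not shown) | 1 | 4.5%
(not shown) | 1 | 4.5%
(not shown) | 1 | 4.5%
Other values (6) | 6 | 27.3%
Control
Value | Count | Frequency (%)
(not shown) | 4 | 100.0%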

Most occurring scripts

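Value | Count | Frequency (%)
Hangul | 22 | 84.6%
Common | 4 | 15.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 1 | 4.5%
(not shown) | 1 | 4.5%
(not shown) | 1 | 4.5%
(not shown) | 1 | 4.5%
Other values (6) | 6 | 27.3%
Common
Value | Count | Frequency (%)
(not shown) | 4 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 22 | 84.6%
ASCII | 4 | 15.4%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(not shown) | 4 | 100.0%
Hangul
Value | Count | Frequency (%)
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 2 | 9.1%
(not shown) | 1 | 4.5%
(not shown) | 1 | 4.5%
(not shown) | 1 | 4.5%
(not shown) | 1 | 4.5%
Other values (6) | 6 | 27.3%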

Unnamed: 2
Text

MISSING 

Distinct: 7
Distinct (%): 58.3%
Missing: 24
Missing (%): 66.7%
Memory size: 420.0 B

Length

Max length: 10
Median length: 9
Mean length: 3.4166667
Min length: 1

Characters and Unicode

Total characters: 41
Distinct characters: 20
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

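Unique: 2
Unique (%): 16.7%

Sample

1st row: 광주광역시 서구 (Seo-gu, Gwangju Metropolitan City)
2nd row: 2022.01 현재 (as of 2022.01)
3rd row: (not shown)
4th row: 남자 (male)
5th row: 여자 (female)

Value | Count | Frequency (%)
(not shown) | 2 | 14.3%
남자 (male) | 2 | 14.3%
여자 (female) | 2 | 14.3%
시도내 (within the same city/province) | 2 | 14.3%
시도간 (between cities/provinces) | 2 | 14.3%
광주광역시 (Gwangju Metropolitan City) | 1 | 7.1%
서구 (Seo-gu) | 1 | 7.1%
2022.01 | 1 | 7.1%
현재 (as of) | 1 | 7.1%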

Most occurring characters

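Value | Count | Frequency (%)
(not shown) | 5 | 12.2%
(not shown) | 4 | 9.8%
(not shown) | 4 | 9.8%
(not shown) | 3 | 7.3%
2 | 3 | 7.3%
(not shown) | 2 | 4.9%
0 | 2 | 4.9%
(not shown) | 2 | 4.9%
(not shown) | 2 | 4.9%
(not shown) | 2 | 4.9%
Other values (10) | 12 | 29.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 31 | 75.6%
Decimal Number | 6 | 14.6%
Space Separator | 3 | 7.3%
Other Punctuation | 1 | 2.4%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 5 | 16.1%
(not shown) | 4 | 12.9%
(not shown) | 4 | 12.9%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 1 | 3.2%
Other values (5) | 5 | 16.1%
Decimal Number
Value | Count | Frequency (%)
2 | 3 | 50.0%
0 | 2 | 33.3%
1 | 1 | 16.7%
Space Separator
Value | Count | Frequency (%)
(not shown) | 3 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
. | 1 | 100.0%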

Most occurring scripts

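Value | Count | Frequency (%)
Hangul | 31 | 75.6%
Common | 10 | 24.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not shown) | 5 | 16.1%
(not shown) | 4 | 12.9%
(not shown) | 4 | 12.9%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 1 | 3.2%
Other values (5) | 5 | 16.1%
Common
Value | Count | Frequency (%)
(not shown) | 3 | 30.0%
2 | 3 | 30.0%
0 | 2 | 20.0%
1 | 1 | 10.0%
. | 1 | 10.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 31 | 75.6%
ASCII | 10 | 24.4%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not shown) | 5 | 16.1%
(not shown) | 4 | 12.9%
(not shown) | 4 | 12.9%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 2 | 6.5%
(not shown) | 1 | 3.2%
Other values (5) | 5 | 16.1%
ASCII
Value | Count | Frequency (%)
(not shown) | 3 | 30.0%
2 | 3 | 30.0%
0 | 2 | 20.0%
1 | 1 | 10.0%
. | 1 | 10.0%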

Unnamed: 3
Text

MISSING 

Distinct: 2
Distinct (%): 50.0%
Missing: 32
Missing (%): 88.9%
Memory size: 420.0 B

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Characters and Unicode

Total characters: 16
Distinct characters: 5
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 시군구내 (within the same city/county/district)
2nd row: 시군구간 (between cities/counties/districts)
3rd row: 시군구내 (within the same city/county/district)
4th row: 시군구간 (between cities/counties/districts)

Value | Count | Frequency (%)
시군구내 | 2 | 50.0%
시군구간 | 2 | 50.0%

Most occurring characters

Value | Count | Frequency (%)
시 | 4 | 25.0%
군 | 4 | 25.0%
구 | 4 | 25.0%
내 | 2 | 12.5%
간 | 2 | 12.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 16 | 100.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
시 | 4 | 25.0%
군 | 4 | 25.0%
구 | 4 | 25.0%
내 | 2 | 12.5%
간 | 2 | 12.5%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 16 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
시 | 4 | 25.0%
군 | 4 | 25.0%
구 | 4 | 25.0%
내 | 2 | 12.5%
간 | 2 | 12.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 16 | 100.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
시 | 4 | 25.0%
군 | 4 | 25.0%
구 | 4 | 25.0%
내 | 2 | 12.5%
간 | 2 | 12.5%
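
Because all four non-missing values of Unnamed: 3 appear in its Sample subsection, the figures for this variable can be recomputed by hand. The sketch below does so with the standard library only. It is an illustration of how such statistics can be derived, not the profiler's implementation, and the variable names are assumptions.

```python
from collections import Counter
from statistics import mean, median

# The four non-missing values of "Unnamed: 3", as listed under "Sample".
values = ["시군구내", "시군구간", "시군구내", "시군구간"]

value_counts = Counter(values)
distinct = len(value_counts)                                  # different values
unique = sum(1 for n in value_counts.values() if n == 1)      # values seen once
lengths = [len(v) for v in values]
char_counts = Counter("".join(values))                        # per-character tally
total_chars = sum(char_counts.values())

print("Distinct:", distinct, f"({100 * distinct / len(values):.1f}%)")
print("Unique:", unique, f"({100 * unique / len(values):.1f}%)")
print("Length max/median/mean/min:",
      max(lengths), median(lengths), mean(lengths), min(lengths))
print("Total characters:", total_chars)
print("Distinct characters:", len(char_counts))
for ch, n in char_counts.most_common():
    print(ch, n, f"{100 * n / total_chars:.1f}%")
```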

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 2
Missing (%): 5.6%
Memory size: 420.0 B

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 20
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 21
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Unnamed: 22
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 8.3%
Memory size: 420.0 B

Sample

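(index) | 인구이동보고서(1호) | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13 | Unnamed: 14 | Unnamed: 15 | Unnamed: 16 | Unnamed: 17 | Unnamed: 18 | Unnamed: 19 | Unnamed: 20 | Unnamed: 21 | Unnamed: 22
0 | <NA> | <NA> | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
1 | 행정기관 : | <NA> | 광주광역시 서구 | <NA> | NaN | NaN | NaN | NaN | NaN | 출력일자 : 2022.02.03 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
2 | 작성기준 : | <NA> | 2022.01 현재 | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
3 | 시 군 구(읍면동) | <NA> | <NA> | <NA> | 합 계 | 양동 | 양3동 | 농성1동 | 농성2동 | 광천동 | 유덕동 | 치평동 | 상무1동 | 상무2동 | 화정1동 | 화정2동 | 화정3동 | 화정4동 | 서창동 | 금호1동 | 금호2동 | 풍암동 | 동천동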
4전월말세대수<NA><NA><NA>13280821212174621829444234489813617121831308384948006462565082698881610554152436392
5전월말인구수<NA><NA><NA>2912313686457811008490080841097530293248022386115542206041006815428579220012285723678116245
6전월말거주불명자수<NA><NA><NA>9512931815681157311612845373245264023849
7전월말재외국민등록자수<NA><NA><NA>139305212510997171210447212
8증 가 요 인전 입<NA>31112528157918610434935635023119311812492153190320144
9<NA><NA>남자<NA>1555101275534854180172157101976465478110016871
(index) | 인구이동보고서(1호) | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13 | Unnamed: 14 | Unnamed: 15 | Unnamed: 16 | Unnamed: 17 | Unnamed: 18 | Unnamed: 19 | Unnamed: 20 | Unnamed: 21 | Unnamed: 22
26<NA>말소<NA><NA>5000000010201000010
27<NA>국외<NA><NA>0000000000000000000
28<NA>기타<NA><NA>2000000000000000011
29세대수증감<NA><NA><NA>123-1501112-16-11281743-127-31712218
30인구수증감<NA><NA><NA>4-23-200-40-738713-26341-262-322113
31거주불명자수증감<NA><NA><NA>401-1-200-28-2-1000203-20
32금월말세대수<NA><NA><NA>13293121062174622929564218488713645122001312684938008463265052699882310566152646400
33금월말인구수<NA><NA><NA>2912353663457611008490080441096830331248092387415516206381006915402579420009285743680216258
34금월말거주불명자수<NA><NA><NA>9552932805481157112412644373245284026829
35금월말재외국민등록자수<NA><NA><NA>140305212510997171111446223

Duplicate rows

Most frequently occurring

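(index) | 인구이동보고서(1호) | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | # duplicates
0 | <NA> | 국외 | <NA> | <NA> | 2
1 | <NA> | 기타 | <NA> | <NA> | 2
2 | <NA> | <NA> | 남자 | <NA> | 2
3 | <NA> | <NA> | 시도간 | <NA> | 2
4 | <NA> | <NA> | 시도내 | 시군구내 | 2
5 | <NA> | <NA> | 여자 | <NA> | 2
6 | <NA> | <NA> | <NA> | 시군구간 | 2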
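
The table lists only the four text columns, which suggests that duplicates were judged on those columns alone. That is an inference from the table, not something the report states. Under that assumption, a duplicate count can be sketched with the standard library as follows. The `rows` list is an illustrative subset in which `None` stands for `<NA>`; it is not the full dataset.

```python
from collections import Counter

# Tuples of (인구이동보고서(1호), Unnamed: 1, Unnamed: 2, Unnamed: 3).
# Illustrative subset only; None stands for <NA>.
rows = [
    (None, "국외", None, None),
    (None, None, "남자", None),
    (None, "국외", None, None),
    (None, None, "시도내", "시군구내"),
    (None, None, "남자", None),
    (None, None, "시도내", "시군구내"),
    ("세대수증감", None, None, None),
]

counts = Counter(rows)
# Keep only the rows that occur more than once.
duplicates = {row: n for row, n in counts.items() if n > 1}

for row, n in duplicates.items():
    print(row, n)
```

Rows that differ only in the numeric columns would collapse into one group under this assumption, which may explain why labels such as 남자 and 여자 are reported as duplicates.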