Overview

Dataset statistics

Number of variables26
Number of observations36
Missing cells166
Missing cells (%)17.7%
Duplicate rows7
Duplicate rows (%)19.4%
Total size in memory7.4 KiB
Average record size in memory211.7 B

Variable types

Text4
Unsupported22

Dataset

Description2022-02-23
Author주민등록인구통계
URLhttps://bigdata.gwangju.go.kr/usr/dataSet/getDataDetailView.rd?dataSetUncd=DS000201924

Alerts

Dataset has 7 (19.4%) duplicate rowsDuplicates
인구이동보고서(1호) has 20 (55.6%) missing valuesMissing
Unnamed: 1 has 25 (69.4%) missing valuesMissing
Unnamed: 2 has 24 (66.7%) missing valuesMissing
Unnamed: 3 has 32 (88.9%) missing valuesMissing
Unnamed: 4 has 3 (8.3%) missing valuesMissing
Unnamed: 5 has 3 (8.3%) missing valuesMissing
Unnamed: 6 has 3 (8.3%) missing valuesMissing
Unnamed: 7 has 3 (8.3%) missing valuesMissing
Unnamed: 8 has 3 (8.3%) missing valuesMissing
Unnamed: 9 has 2 (5.6%) missing valuesMissing
Unnamed: 10 has 3 (8.3%) missing valuesMissing
Unnamed: 11 has 3 (8.3%) missing valuesMissing
Unnamed: 12 has 3 (8.3%) missing valuesMissing
Unnamed: 13 has 3 (8.3%) missing valuesMissing
Unnamed: 14 has 3 (8.3%) missing valuesMissing
Unnamed: 15 has 3 (8.3%) missing valuesMissing
Unnamed: 16 has 3 (8.3%) missing valuesMissing
Unnamed: 17 has 3 (8.3%) missing valuesMissing
Unnamed: 18 has 3 (8.3%) missing valuesMissing
Unnamed: 19 has 3 (8.3%) missing valuesMissing
Unnamed: 20 has 3 (8.3%) missing valuesMissing
Unnamed: 21 has 3 (8.3%) missing valuesMissing
Unnamed: 22 has 3 (8.3%) missing valuesMissing
Unnamed: 23 has 3 (8.3%) missing valuesMissing
Unnamed: 24 has 3 (8.3%) missing valuesMissing
Unnamed: 25 has 3 (8.3%) missing valuesMissing
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 20 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 21 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 22 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 23 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 24 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 25 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-02-10 06:47:22.253895
Analysis finished2024-02-10 06:47:22.801642
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct16
Distinct (%)100.0%
Missing20
Missing (%)55.6%
Memory size420.0 B
2024-02-10T06:47:23.079849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length7.75
Min length5

Characters and Unicode

Total characters124
Distinct characters42
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)100.0%

Sample

1st row행정기관 :
2nd row작성기준 :
3rd row시 군 구(읍면동)
4th row전월말세대수
5th row전월말인구수
ValueCountFrequency (%)
2
 
7.7%
2
 
7.7%
2
 
7.7%
행정기관 1
 
3.8%
금월말거주불명자수 1
 
3.8%
금월말인구수 1
 
3.8%
금월말세대수 1
 
3.8%
거주불명자수증감 1
 
3.8%
인구수증감 1
 
3.8%
세대수증감 1
 
3.8%
Other values (13) 13
50.0%
2024-02-10T06:47:23.945137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12
 
9.7%
11
 
8.9%
8
 
6.5%
8
 
6.5%
5
 
4.0%
5
 
4.0%
4
 
3.2%
4
 
3.2%
4
 
3.2%
4
 
3.2%
Other values (32) 59
47.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 104
83.9%
Control 12
 
9.7%
Space Separator 4
 
3.2%
Other Punctuation 2
 
1.6%
Close Punctuation 1
 
0.8%
Open Punctuation 1
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
10.6%
8
 
7.7%
8
 
7.7%
5
 
4.8%
5
 
4.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
Other values (27) 47
45.2%
Control
ValueCountFrequency (%)
12
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Other Punctuation
ValueCountFrequency (%)
: 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 104
83.9%
Common 20
 
16.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
10.6%
8
 
7.7%
8
 
7.7%
5
 
4.8%
5
 
4.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
Other values (27) 47
45.2%
Common
ValueCountFrequency (%)
12
60.0%
4
 
20.0%
: 2
 
10.0%
) 1
 
5.0%
( 1
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 104
83.9%
ASCII 20
 
16.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12
60.0%
4
 
20.0%
: 2
 
10.0%
) 1
 
5.0%
( 1
 
5.0%
Hangul
ValueCountFrequency (%)
11
 
10.6%
8
 
7.7%
8
 
7.7%
5
 
4.8%
5
 
4.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
Other values (27) 47
45.2%

Unnamed: 1
Text

MISSING 

Distinct9
Distinct (%)81.8%
Missing25
Missing (%)69.4%
Memory size420.0 B
2024-02-10T06:47:24.461139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length2
Mean length2.3636364
Min length2

Characters and Unicode

Total characters26
Distinct characters17
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)63.6%

Sample

1st row전 입
2nd row복귀
3rd row출생
4th row등록
5th row국외
ValueCountFrequency (%)
국외 2
15.4%
기타 2
15.4%
2
15.4%
1
7.7%
복귀 1
7.7%
출생 1
7.7%
등록 1
7.7%
1
7.7%
사망 1
7.7%
말소 1
7.7%
2024-02-10T06:47:25.349439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
15.4%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (7) 7
26.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22
84.6%
Control 4
 
15.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (6) 6
27.3%
Control
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22
84.6%
Common 4
 
15.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (6) 6
27.3%
Common
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22
84.6%
ASCII 4
 
15.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4
100.0%
Hangul
ValueCountFrequency (%)
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (6) 6
27.3%

Unnamed: 2
Text

MISSING 

Distinct7
Distinct (%)58.3%
Missing24
Missing (%)66.7%
Memory size420.0 B
2024-02-10T06:47:25.783784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length3
Mean length3.5
Min length1

Characters and Unicode

Total characters42
Distinct characters20
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)16.7%

Sample

1st row광주광역시 광산구
2nd row2022.01 현재
3rd row
4th row남자
5th row여자
ValueCountFrequency (%)
2
14.3%
남자 2
14.3%
여자 2
14.3%
시도내 2
14.3%
시도간 2
14.3%
광주광역시 1
7.1%
광산구 1
7.1%
2022.01 1
7.1%
현재 1
7.1%
2024-02-10T06:47:26.623606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5
11.9%
4
 
9.5%
4
 
9.5%
3
 
7.1%
3
 
7.1%
2 3
 
7.1%
2
 
4.8%
0 2
 
4.8%
2
 
4.8%
2
 
4.8%
Other values (10) 12
28.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32
76.2%
Decimal Number 6
 
14.3%
Space Separator 3
 
7.1%
Other Punctuation 1
 
2.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
15.6%
4
12.5%
4
12.5%
3
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
Other values (5) 5
15.6%
Decimal Number
ValueCountFrequency (%)
2 3
50.0%
0 2
33.3%
1 1
 
16.7%
Space Separator
ValueCountFrequency (%)
3
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32
76.2%
Common 10
 
23.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
15.6%
4
12.5%
4
12.5%
3
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
Other values (5) 5
15.6%
Common
ValueCountFrequency (%)
3
30.0%
2 3
30.0%
0 2
20.0%
1 1
 
10.0%
. 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32
76.2%
ASCII 10
 
23.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5
15.6%
4
12.5%
4
12.5%
3
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
Other values (5) 5
15.6%
ASCII
ValueCountFrequency (%)
3
30.0%
2 3
30.0%
0 2
20.0%
1 1
 
10.0%
. 1
 
10.0%

Unnamed: 3
Text

MISSING 

Distinct2
Distinct (%)50.0%
Missing32
Missing (%)88.9%
Memory size420.0 B
2024-02-10T06:47:27.062331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters16
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row시군구내
2nd row시군구간
3rd row시군구내
4th row시군구간
ValueCountFrequency (%)
시군구내 2
50.0%
시군구간 2
50.0%
2024-02-10T06:47:27.940311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
2
12.5%
2
12.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
2
12.5%
2
12.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
2
12.5%
2
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
2
12.5%
2
12.5%

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.6%
Memory size420.0 B

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 20
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 21
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 22
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 23
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 24
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Unnamed: 25
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)8.3%
Memory size420.0 B

Sample

인구이동보고서(1호)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 21Unnamed: 22Unnamed: 23Unnamed: 24Unnamed: 25
0<NA><NA><NA><NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
1행정기관 :<NA>광주광역시 광산구<NA>NaNNaNNaNNaNNaN출력일자 : 2022.02.07NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
2작성기준 :<NA>2022.01 현재<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
3시 군 구(읍면동)<NA><NA><NA>합 계송정1동송정2동도산동신흥동어룡동우산동월곡1동월곡2동비아동첨단1동첨단2동신가동운남동수완동하남동임곡동동곡동평동삼도동본량동신창동
4전월말세대수<NA><NA><NA>16912548793483668220561400914708496266443480104411841069731231328302104951273105825221335122813872
5전월말인구수<NA><NA><NA>4042211086365581527945943303528905108841559474812745943297184573085677438262662139185640642199200934988
6전월말거주불명자수<NA><NA><NA>99979129411677774941272778483280383620138677
7전월말재외국민등록자수<NA><NA><NA>129164336456132469148143009
8증 가 요 인전 입<NA>38741088813846300275891141033094881302207082831914621715348
9<NA><NA>남자<NA>197050476822153124416256150254679536115611939107188
인구이동보고서(1호)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 21Unnamed: 22Unnamed: 23Unnamed: 24Unnamed: 25
26<NA>말소<NA><NA>5000200010110000000000
27<NA>국외<NA><NA>0000000000000000000000
28<NA>기타<NA><NA>0000000000000000000000
29세대수증감<NA><NA><NA>277131015-122-120-27117344-41367473-51-3-414
30인구수증감<NA><NA><NA>9828315-32021-10-34356311-48-21-142-2-51-3-6-8
31거주불명자수증감<NA><NA><NA>-910000-2-101-10-31-3000100-3
32금월말세대수<NA><NA><NA>16940248923493669720551403114696496266173491105141845469691232628369105421276105325231332122413886
33금월말인구수<NA><NA><NA>4043191089165611529445913305528926108741556075162752243308184093083577437263082137185140652196200334980
34금월말거주불명자수<NA><NA><NA>99080129411677754841282678453377383620148674
35금월말재외국민등록자수<NA><NA><NA>128164336456132369148143009

Duplicate rows

Most frequently occurring

인구이동보고서(1호)Unnamed: 1Unnamed: 2Unnamed: 3# duplicates
0<NA>국외<NA><NA>2
1<NA>기타<NA><NA>2
2<NA><NA>남자<NA>2
3<NA><NA>시도간<NA>2
4<NA><NA>시도내시군구내2
5<NA><NA>여자<NA>2
6<NA><NA><NA>시군구간2