Overview

Dataset statistics

Number of variables4
Number of observations42
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory37.1 B

Variable types

Numeric2
Text1
Categorical1

Dataset

Description공무원연금 재직기간별 가입자 현황입니다. (연단위로 구분되어 있으며 40년 이상 및 1년 이하는 일괄로 표시합니다)
URLhttps://www.data.go.kr/data/15095354/fileData.do

Alerts

기준일 has constant value ""Constant
순번 is highly overall correlated with 인원수High correlation
인원수 is highly overall correlated with 순번High correlation
순번 has unique valuesUnique
구분 has unique valuesUnique
인원수 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:33:10.763725
Analysis finished2023-12-12 10:33:11.545201
Duration0.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct42
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.5
Minimum1
Maximum42
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size510.0 B
2023-12-12T19:33:11.644799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.05
Q111.25
median21.5
Q331.75
95-th percentile39.95
Maximum42
Range41
Interquartile range (IQR)20.5

Descriptive statistics

Standard deviation12.267844
Coefficient of variation (CV)0.5705974
Kurtosis-1.2
Mean21.5
Median Absolute Deviation (MAD)10.5
Skewness0
Sum903
Variance150.5
MonotonicityStrictly increasing
2023-12-12T19:33:11.823280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
1 1
 
2.4%
33 1
 
2.4%
25 1
 
2.4%
26 1
 
2.4%
27 1
 
2.4%
28 1
 
2.4%
29 1
 
2.4%
30 1
 
2.4%
31 1
 
2.4%
32 1
 
2.4%
Other values (32) 32
76.2%
ValueCountFrequency (%)
1 1
2.4%
2 1
2.4%
3 1
2.4%
4 1
2.4%
5 1
2.4%
6 1
2.4%
7 1
2.4%
8 1
2.4%
9 1
2.4%
10 1
2.4%
ValueCountFrequency (%)
42 1
2.4%
41 1
2.4%
40 1
2.4%
39 1
2.4%
38 1
2.4%
37 1
2.4%
36 1
2.4%
35 1
2.4%
34 1
2.4%
33 1
2.4%

구분
Text

UNIQUE 

Distinct42
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size468.0 B
2023-12-12T19:33:12.087549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.1190476
Min length2

Characters and Unicode

Total characters131
Distinct characters18
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)100.0%

Sample

1st row1년 미만
2nd row1년 이상
3rd row2년
4th row3년
5th row4년
ValueCountFrequency (%)
이상 3
 
6.4%
1년 2
 
4.3%
33년 2
 
4.3%
29년 1
 
2.1%
39년 1
 
2.1%
23년 1
 
2.1%
24년 1
 
2.1%
25년 1
 
2.1%
26년 1
 
2.1%
27년 1
 
2.1%
Other values (33) 33
70.2%
2023-12-12T19:33:12.607004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
32.1%
3 16
 
12.2%
1 15
 
11.5%
2 14
 
10.7%
5
 
3.8%
4 5
 
3.8%
6 4
 
3.1%
0 4
 
3.1%
8 4
 
3.1%
7 4
 
3.1%
Other values (8) 18
13.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 74
56.5%
Other Letter 52
39.7%
Space Separator 5
 
3.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 16
21.6%
1 15
20.3%
2 14
18.9%
4 5
 
6.8%
6 4
 
5.4%
0 4
 
5.4%
8 4
 
5.4%
7 4
 
5.4%
5 4
 
5.4%
9 4
 
5.4%
Other Letter
ValueCountFrequency (%)
42
80.8%
3
 
5.8%
3
 
5.8%
1
 
1.9%
1
 
1.9%
1
 
1.9%
1
 
1.9%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 79
60.3%
Hangul 52
39.7%

Most frequent character per script

Common
ValueCountFrequency (%)
3 16
20.3%
1 15
19.0%
2 14
17.7%
5
 
6.3%
4 5
 
6.3%
6 4
 
5.1%
0 4
 
5.1%
8 4
 
5.1%
7 4
 
5.1%
5 4
 
5.1%
Hangul
ValueCountFrequency (%)
42
80.8%
3
 
5.8%
3
 
5.8%
1
 
1.9%
1
 
1.9%
1
 
1.9%
1
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 79
60.3%
Hangul 52
39.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
80.8%
3
 
5.8%
3
 
5.8%
1
 
1.9%
1
 
1.9%
1
 
1.9%
1
 
1.9%
ASCII
ValueCountFrequency (%)
3 16
20.3%
1 15
19.0%
2 14
17.7%
5
 
6.3%
4 5
 
6.3%
6 4
 
5.1%
0 4
 
5.1%
8 4
 
5.1%
7 4
 
5.1%
5 4
 
5.1%

인원수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct42
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean30499.857
Minimum1871
Maximum64070
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size510.0 B
2023-12-12T19:33:12.792349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1871
5-th percentile3326.2
Q124209.5
median28052
Q337785.5
95-th percentile57381.55
Maximum64070
Range62199
Interquartile range (IQR)13576

Descriptive statistics

Standard deviation15242.84
Coefficient of variation (CV)0.4997676
Kurtosis0.007314927
Mean30499.857
Median Absolute Deviation (MAD)8546.5
Skewness0.16513203
Sum1280994
Variance2.3234418 × 108
MonotonicityNot monotonic
2023-12-12T19:33:12.939163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
50252 1
 
2.4%
26010 1
 
2.4%
17565 1
 
2.4%
26425 1
 
2.4%
24158 1
 
2.4%
27847 1
 
2.4%
26146 1
 
2.4%
24996 1
 
2.4%
31106 1
 
2.4%
27296 1
 
2.4%
Other values (32) 32
76.2%
ValueCountFrequency (%)
1871 1
2.4%
3108 1
2.4%
3194 1
2.4%
5838 1
2.4%
7187 1
2.4%
12151 1
2.4%
17565 1
2.4%
17934 1
2.4%
22488 1
2.4%
22584 1
2.4%
ValueCountFrequency (%)
64070 1
2.4%
61921 1
2.4%
57438 1
2.4%
56309 1
2.4%
50252 1
2.4%
46286 1
2.4%
45459 1
2.4%
44745 1
2.4%
42872 1
2.4%
38615 1
2.4%

기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size468.0 B
2022-12-31
42 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-12-31
2nd row2022-12-31
3rd row2022-12-31
4th row2022-12-31
5th row2022-12-31

Common Values

ValueCountFrequency (%)
2022-12-31 42
100.0%

Length

2023-12-12T19:33:13.063206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:33:13.167586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-12-31 42
100.0%

Interactions

2023-12-12T19:33:11.097200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:33:10.882237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:33:11.214907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:33:10.987957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:33:13.238840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구분인원수
순번1.0001.0000.902
구분1.0001.0001.000
인원수0.9021.0001.000
2023-12-12T19:33:13.321552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번인원수
순번1.000-0.902
인원수-0.9021.000

Missing values

2023-12-12T19:33:11.374479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:33:11.501366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번구분인원수기준일
011년 미만502522022-12-31
121년 이상640702022-12-31
232년574382022-12-31
343년619212022-12-31
454년563092022-12-31
565년462862022-12-31
676년454592022-12-31
787년447452022-12-31
898년428722022-12-31
9109년386152022-12-31
순번구분인원수기준일
323332년260102022-12-31
333433년18712022-12-31
343533년 초과243642022-12-31
353634년 이상225842022-12-31
363735년179342022-12-31
373836년121512022-12-31
383937년71872022-12-31
394038년58382022-12-31
404139년31082022-12-31
414240년 이상31942022-12-31