Overview

Dataset statistics

Number of variables4
Number of observations186
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.3 KiB
Average record size in memory34.7 B

Variable types

Categorical2
Numeric2

Dataset

Description국외인적자원관리시스템은 등록된 장학생을 대상으로 온라인증명서 발급을 서비스중이며, 해당 서비스에 대한 연도별, 사업별 발급 현황임.
Author교육부 국립국제교육원
URLhttps://www.data.go.kr/data/15069762/fileData.do

Alerts

사업명 is highly overall correlated with 증명서명High correlation
증명서명 is highly overall correlated with 사업명High correlation

Reproduction

Analysis started2023-12-12 07:01:48.798959
Analysis finished2023-12-12 07:01:49.555397
Duration0.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업명
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)11.3%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
정부초청 외국인 장학생 관리
35 
GKS 우수자비유학생 선발관리
23 
GKS 우수교환학생 선발관리
22 
한일공동이공계학부유학생선발파견사업
18 
교원해외파견
16 
Other values (16)
72 

Length

Max length20
Median length16
Mean length13.623656
Min length6

Unique

Unique5 ?
Unique (%)2.7%

Sample

1st rowGKS 우수교환학생 선발관리
2nd rowGKS 우수교환학생 선발관리
3rd rowGKS 우수교환학생 선발관리
4th rowGKS 우수교환학생 선발관리
5th rowGKS 우수교환학생 선발관리

Common Values

ValueCountFrequency (%)
정부초청 외국인 장학생 관리 35
18.8%
GKS 우수자비유학생 선발관리 23
12.4%
GKS 우수교환학생 선발관리 22
11.8%
한일공동이공계학부유학생선발파견사업 18
9.7%
교원해외파견 16
8.6%
외국정부초청 장학생 관리 11
 
5.9%
한일 학술문화 청소년 교류 11
 
5.9%
한일중고생 교류 10
 
5.4%
상대국어 선택 고교생 교류 10
 
5.4%
국비유학생 선발파견 6
 
3.2%
Other values (11) 24
12.9%

Length

2023-12-12T16:01:49.652773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
gks 49
 
9.3%
장학생 46
 
8.7%
관리 46
 
8.7%
선발관리 45
 
8.5%
정부초청 35
 
6.6%
외국인 35
 
6.6%
교류 31
 
5.9%
우수자비유학생 23
 
4.3%
우수교환학생 22
 
4.2%
한일공동이공계학부유학생선발파견사업 18
 
3.4%
Other values (34) 179
33.8%

발급년도
Real number (ℝ)

Distinct14
Distinct (%)7.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2016.7097
Minimum2009
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-12T16:01:49.783800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2009
5-th percentile2011
Q12013.25
median2017
Q32020
95-th percentile2022
Maximum2022
Range13
Interquartile range (IQR)6.75

Descriptive statistics

Standard deviation3.7288666
Coefficient of variation (CV)0.0018489853
Kurtosis-1.1562228
Mean2016.7097
Median Absolute Deviation (MAD)3
Skewness-0.22311132
Sum375108
Variance13.904446
MonotonicityNot monotonic
2023-12-12T16:01:49.932770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
2019 20
10.8%
2022 20
10.8%
2020 19
10.2%
2013 18
9.7%
2021 16
8.6%
2014 14
7.5%
2017 14
7.5%
2015 13
7.0%
2018 12
6.5%
2016 11
 
5.9%
Other values (4) 29
15.6%
ValueCountFrequency (%)
2009 2
 
1.1%
2010 7
 
3.8%
2011 11
5.9%
2012 9
4.8%
2013 18
9.7%
2014 14
7.5%
2015 13
7.0%
2016 11
5.9%
2017 14
7.5%
2018 12
6.5%
ValueCountFrequency (%)
2022 20
10.8%
2021 16
8.6%
2020 19
10.2%
2019 20
10.8%
2018 12
6.5%
2017 14
7.5%
2016 11
5.9%
2015 13
7.0%
2014 14
7.5%
2013 18
9.7%

증명서명
Categorical

HIGH CORRELATION 

Distinct23
Distinct (%)12.4%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
연수증명서
40 
장학생증명(외국정부초청)
27 
장학생증명서(영문)
22 
장학생증명서(국문)
21 
유학생증명(한국정부초청)
16 
Other values (18)
60 

Length

Max length13
Median length10
Mean length9.0645161
Min length5

Unique

Unique6 ?
Unique (%)3.2%

Sample

1st row장학생증명서(국문)
2nd row장학생증명서(영문)
3rd row장학생증명서(국문)
4th row장학생증명서(영문)
5th row장학생증명서(국문)

Common Values

ValueCountFrequency (%)
연수증명서 40
21.5%
장학생증명(외국정부초청) 27
14.5%
장학생증명서(영문) 22
11.8%
장학생증명서(국문) 21
11.3%
유학생증명(한국정부초청) 16
 
8.6%
국비유학확인서 12
 
6.5%
영문장학생증명 11
 
5.9%
국비유학사실증명서 6
 
3.2%
경력증명서(한글) 4
 
2.2%
경력증명서(영문) 4
 
2.2%
Other values (13) 23
12.4%

Length

2023-12-12T16:01:50.120908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
연수증명서 40
21.5%
장학생증명(외국정부초청 27
14.5%
장학생증명서(영문 22
11.8%
장학생증명서(국문 21
11.3%
유학생증명(한국정부초청 16
 
8.6%
국비유학확인서 12
 
6.5%
영문장학생증명 11
 
5.9%
국비유학사실증명서 6
 
3.2%
경력증명서(한글 4
 
2.2%
경력증명서(영문 4
 
2.2%
Other values (13) 23
12.4%

발급건수
Real number (ℝ)

Distinct93
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean251.7043
Minimum1
Maximum4380
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-12T16:01:50.298507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median15.5
Q384.75
95-th percentile2341.5
Maximum4380
Range4379
Interquartile range (IQR)82.75

Descriptive statistics

Standard deviation732.81525
Coefficient of variation (CV)2.9114133
Kurtosis14.077171
Mean251.7043
Median Absolute Deviation (MAD)14.5
Skewness3.7647872
Sum46817
Variance537018.19
MonotonicityNot monotonic
2023-12-12T16:01:50.475812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 33
 
17.7%
2 16
 
8.6%
6 9
 
4.8%
4 8
 
4.3%
3 5
 
2.7%
8 4
 
2.2%
11 4
 
2.2%
9 4
 
2.2%
10 3
 
1.6%
18 3
 
1.6%
Other values (83) 97
52.2%
ValueCountFrequency (%)
1 33
17.7%
2 16
8.6%
3 5
 
2.7%
4 8
 
4.3%
5 2
 
1.1%
6 9
 
4.8%
7 3
 
1.6%
8 4
 
2.2%
9 4
 
2.2%
10 3
 
1.6%
ValueCountFrequency (%)
4380 1
0.5%
4256 1
0.5%
2952 1
0.5%
2883 1
0.5%
2879 1
0.5%
2744 1
0.5%
2727 1
0.5%
2632 1
0.5%
2577 1
0.5%
2381 1
0.5%

Interactions

2023-12-12T16:01:49.189001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:01:48.982887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:01:49.287069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:01:49.089114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:01:50.598745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업명발급년도증명서명발급건수
사업명1.0000.3150.9460.000
발급년도0.3151.0000.0000.248
증명서명0.9460.0001.0000.000
발급건수0.0000.2480.0001.000
2023-12-12T16:01:50.703147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업명증명서명
사업명1.0000.622
증명서명0.6221.000
2023-12-12T16:01:50.821456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발급년도발급건수사업명증명서명
발급년도1.0000.0080.1330.000
발급건수0.0081.0000.0000.000
사업명0.1330.0001.0000.622
증명서명0.0000.0000.6221.000

Missing values

2023-12-12T16:01:49.411028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:01:49.514292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업명발급년도증명서명발급건수
0GKS 우수교환학생 선발관리2013장학생증명서(국문)2
1GKS 우수교환학생 선발관리2013장학생증명서(영문)2
2GKS 우수교환학생 선발관리2014장학생증명서(국문)30
3GKS 우수교환학생 선발관리2014장학생증명서(영문)52
4GKS 우수교환학생 선발관리2015장학생증명서(국문)2
5GKS 우수교환학생 선발관리2015장학생증명서(영문)8
6GKS 우수교환학생 선발관리2016장학생증명서(국문)8
7GKS 우수교환학생 선발관리2016장학생증명서(영문)11
8GKS 우수교환학생 선발관리2017장학생증명서(국문)9
9GKS 우수교환학생 선발관리2017장학생증명서(영문)20
사업명발급년도증명서명발급건수
176한일중고생 교류2011연수증명서211
177한일중고생 교류2012연수증명서14
178한일중고생 교류2013연수증명서6
179한일중고생 교류2014연수증명서6
180한일중고생 교류2015연수증명서2
181한일중고생 교류2017연수증명서5
182한일중고생 교류2018연수증명서1
183한일중고생 교류2020연수증명서3
184한중중학생교류연수2013연수증명서6
185한중중학생교류연수2020연수증명서1