Overview

Dataset statistics

Number of variables14
Number of observations1139
Missing cells6146
Missing cells (%)38.5%
Duplicate rows54
Duplicate rows (%)4.7%
Total size in memory132.5 KiB
Average record size in memory119.1 B

Variable types

Categorical6
Text2
Unsupported1
Numeric5

Dataset

Description2020학년도 전문대학 9월학기 순수외국인 모집결과(대학명,학과명,모집인원,지원인원,등록인원 등)
Author한국전문대학교육협의회
URLhttps://www.data.go.kr/data/15068977/fileData.do

Alerts

Dataset has 54 (4.7%) duplicate rowsDuplicates
설립 is highly overall correlated with 지원인원 and 3 other fieldsHigh correlation
대학명 is highly overall correlated with 모집인원 and 2 other fieldsHigh correlation
중분류 is highly overall correlated with 대분류(계열)High correlation
대분류(계열) is highly overall correlated with 중분류High correlation
지역 is highly overall correlated with 대학명High correlation
모집인원 is highly overall correlated with 지원인원 and 2 other fieldsHigh correlation
지원인원 is highly overall correlated with 모집인원 and 3 other fieldsHigh correlation
등록인원 is highly overall correlated with 모집인원 and 3 other fieldsHigh correlation
지원율 is highly overall correlated with 지원인원 and 2 other fieldsHigh correlation
등록율 is highly overall correlated with 지원율 and 1 other fieldsHigh correlation
설립 is highly imbalanced (84.3%)Imbalance
모집단위 has 553 (48.6%) missing valuesMissing
주야 has 1139 (100.0%) missing valuesMissing
모집인원 has 611 (53.6%) missing valuesMissing
지원인원 has 956 (83.9%) missing valuesMissing
등록인원 has 975 (85.6%) missing valuesMissing
지원율 has 937 (82.3%) missing valuesMissing
등록율 has 975 (85.6%) missing valuesMissing
주야 is an unsupported type, check if it needs cleaning or further analysisUnsupported
지원율 has 22 (1.9%) zerosZeros

Reproduction

Analysis started2023-12-12 01:39:40.751132
Analysis finished2023-12-12 01:39:45.891096
Duration5.14 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
경기
322 
대구
104 
부산
98 
경남
96 
인천
90 
Other values (12)
429 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원
2nd row강원
3rd row강원
4th row강원
5th row강원

Common Values

ValueCountFrequency (%)
경기 322
28.3%
대구 104
 
9.1%
부산 98
 
8.6%
경남 96
 
8.4%
인천 90
 
7.9%
서울 58
 
5.1%
제주 58
 
5.1%
전북 54
 
4.7%
세종 44
 
3.9%
경북 44
 
3.9%
Other values (7) 171
15.0%

Length

2023-12-12T10:39:45.972152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기 322
28.3%
대구 104
 
9.1%
부산 98
 
8.6%
경남 96
 
8.4%
인천 90
 
7.9%
서울 58
 
5.1%
제주 58
 
5.1%
전북 54
 
4.7%
경북 44
 
3.9%
세종 44
 
3.9%
Other values (7) 171
15.0%

설립
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
사립
1113 
국립
 
26

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사립
2nd row사립
3rd row사립
4th row사립
5th row사립

Common Values

ValueCountFrequency (%)
사립 1113
97.7%
국립 26
 
2.3%

Length

2023-12-12T10:39:46.205676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:39:46.405291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사립 1113
97.7%
국립 26
 
2.3%

대학명
Categorical

HIGH CORRELATION 

Distinct46
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
한양여자대학교
 
58
제주한라대학교
 
58
경남정보대학교
 
56
마산대학교
 
54
용인송담대학교
 
52
Other values (41)
861 

Length

Max length9
Median length7
Mean length6.4433714
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한림성심대학교
2nd row한림성심대학교
3rd row한림성심대학교
4th row한림성심대학교
5th row한림성심대학교

Common Values

ValueCountFrequency (%)
한양여자대학교 58
 
5.1%
제주한라대학교 58
 
5.1%
경남정보대학교 56
 
4.9%
마산대학교 54
 
4.7%
용인송담대학교 52
 
4.6%
인하공업전문대학 50
 
4.4%
경복대학교 50
 
4.4%
부천대학교 50
 
4.4%
김포대학교 46
 
4.0%
영남이공대학교 44
 
3.9%
Other values (36) 621
54.5%

Length

2023-12-12T10:39:46.591496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한양여자대학교 58
 
5.1%
제주한라대학교 58
 
5.1%
경남정보대학교 56
 
4.9%
마산대학교 54
 
4.7%
용인송담대학교 52
 
4.6%
인하공업전문대학 50
 
4.4%
경복대학교 50
 
4.4%
부천대학교 50
 
4.4%
김포대학교 46
 
4.0%
영남이공대학교 44
 
3.9%
Other values (36) 621
54.5%

대분류(계열)
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
공학계열
372 
인문사회계열
353 
자연과학계열
234 
예체능계열
180 

Length

Max length6
Median length6
Mean length5.1887621
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공학계열
2nd row공학계열
3rd row자연과학계열
4th row자연과학계열
5th row인문사회계열

Common Values

ValueCountFrequency (%)
공학계열 372
32.7%
인문사회계열 353
31.0%
자연과학계열 234
20.5%
예체능계열 180
15.8%

Length

2023-12-12T10:39:46.799072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:39:46.999650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공학계열 372
32.7%
인문사회계열 353
31.0%
자연과학계열 234
20.5%
예체능계열 180
15.8%

중분류
Categorical

HIGH CORRELATION 

Distinct41
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
사회과학
151 
경영,경제
122 
기계
102 
전기,전자,컴퓨터
82 
보건
70 
Other values (36)
612 

Length

Max length10
Median length9
Mean length4.1158911
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건축
2nd row건축
3rd row생활과학
4th row생활과학
5th row관광,음료,언어

Common Values

ValueCountFrequency (%)
사회과학 151
13.3%
경영,경제 122
 
10.7%
기계 102
 
9.0%
전기,전자,컴퓨터 82
 
7.2%
보건 70
 
6.1%
컴퓨터,통신 62
 
5.4%
간호보건 54
 
4.7%
외식,영양 48
 
4.2%
응용예술 48
 
4.2%
생활과학 42
 
3.7%
Other values (31) 358
31.4%

Length

2023-12-12T10:39:47.200674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
사회과학 151
13.3%
경영,경제 122
 
10.7%
기계 102
 
9.0%
전기,전자,컴퓨터 82
 
7.2%
보건 70
 
6.1%
컴퓨터,통신 62
 
5.4%
간호보건 54
 
4.7%
외식,영양 48
 
4.2%
응용예술 48
 
4.2%
디자인 42
 
3.7%
Other values (31) 358
31.4%
Distinct142
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
2023-12-12T10:39:47.553185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length4.1861282
Min length2

Characters and Unicode

Total characters4768
Distinct characters174
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건축
2nd row건축
3rd row조리과학
4th row조리과학
5th row음료,언어
ValueCountFrequency (%)
피부미용 46
 
4.0%
식품조리 36
 
3.2%
관광학 34
 
3.0%
사회복지 34
 
3.0%
자동차 32
 
2.8%
전기 28
 
2.5%
컴퓨터정보응용 26
 
2.3%
경영학 26
 
2.3%
임상보건 24
 
2.1%
디자인 24
 
2.1%
Other values (132) 829
72.8%
2023-12-12T10:39:48.129067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
388
 
8.1%
202
 
4.2%
143
 
3.0%
140
 
2.9%
116
 
2.4%
110
 
2.3%
106
 
2.2%
99
 
2.1%
, 96
 
2.0%
94
 
2.0%
Other values (164) 3274
68.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4636
97.2%
Other Punctuation 110
 
2.3%
Uppercase Letter 18
 
0.4%
Dash Punctuation 2
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
388
 
8.4%
202
 
4.4%
143
 
3.1%
140
 
3.0%
116
 
2.5%
110
 
2.4%
106
 
2.3%
99
 
2.1%
94
 
2.0%
94
 
2.0%
Other values (156) 3144
67.8%
Other Punctuation
ValueCountFrequency (%)
, 96
87.3%
. 12
 
10.9%
· 2
 
1.8%
Uppercase Letter
ValueCountFrequency (%)
N 6
33.3%
C 6
33.3%
E 6
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4636
97.2%
Common 112
 
2.3%
Latin 20
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
388
 
8.4%
202
 
4.4%
143
 
3.1%
140
 
3.0%
116
 
2.5%
110
 
2.4%
106
 
2.3%
99
 
2.1%
94
 
2.0%
94
 
2.0%
Other values (156) 3144
67.8%
Common
ValueCountFrequency (%)
, 96
85.7%
. 12
 
10.7%
- 2
 
1.8%
· 2
 
1.8%
Latin
ValueCountFrequency (%)
N 6
30.0%
C 6
30.0%
E 6
30.0%
e 2
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4636
97.2%
ASCII 130
 
2.7%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
388
 
8.4%
202
 
4.4%
143
 
3.1%
140
 
3.0%
116
 
2.5%
110
 
2.4%
106
 
2.3%
99
 
2.1%
94
 
2.0%
94
 
2.0%
Other values (156) 3144
67.8%
ASCII
ValueCountFrequency (%)
, 96
73.8%
. 12
 
9.2%
N 6
 
4.6%
C 6
 
4.6%
E 6
 
4.6%
- 2
 
1.5%
e 2
 
1.5%
None
ValueCountFrequency (%)
· 2
100.0%

모집단위
Text

MISSING 

Distinct411
Distinct (%)70.1%
Missing553
Missing (%)48.6%
Memory size9.0 KiB
2023-12-12T10:39:48.451963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length18
Mean length6.3276451
Min length3

Characters and Unicode

Total characters3708
Distinct characters263
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique334 ?
Unique (%)57.0%

Sample

1st row건축과
2nd row관광외식조리과
3rd row글로벌관광과
4th row디지털문화콘텐츠과
5th row보건환경과
ValueCountFrequency (%)
사회복지과 14
 
2.3%
유아교육과 8
 
1.3%
건축과 8
 
1.3%
자동차과 7
 
1.2%
컴퓨터정보과 7
 
1.2%
보건행정과 6
 
1.0%
전기과 6
 
1.0%
경영과 6
 
1.0%
세무회계과 6
 
1.0%
호텔외식조리과 6
 
1.0%
Other values (405) 524
87.6%
2023-12-12T10:39:48.999957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
493
 
13.3%
94
 
2.5%
93
 
2.5%
87
 
2.3%
86
 
2.3%
77
 
2.1%
69
 
1.9%
66
 
1.8%
65
 
1.8%
61
 
1.6%
Other values (253) 2517
67.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3625
97.8%
Uppercase Letter 21
 
0.6%
Other Punctuation 15
 
0.4%
Lowercase Letter 14
 
0.4%
Space Separator 12
 
0.3%
Open Punctuation 9
 
0.2%
Close Punctuation 9
 
0.2%
Dash Punctuation 2
 
0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
493
 
13.6%
94
 
2.6%
93
 
2.6%
87
 
2.4%
86
 
2.4%
77
 
2.1%
69
 
1.9%
66
 
1.8%
65
 
1.8%
61
 
1.7%
Other values (231) 2434
67.1%
Lowercase Letter
ValueCountFrequency (%)
e 5
35.7%
l 2
 
14.3%
u 1
 
7.1%
r 1
 
7.1%
v 1
 
7.1%
i 1
 
7.1%
a 1
 
7.1%
n 1
 
7.1%
c 1
 
7.1%
Uppercase Letter
ValueCountFrequency (%)
I 7
33.3%
T 7
33.3%
C 3
14.3%
D 2
 
9.5%
S 1
 
4.8%
A 1
 
4.8%
Other Punctuation
ValueCountFrequency (%)
· 9
60.0%
& 6
40.0%
Space Separator
ValueCountFrequency (%)
12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3625
97.8%
Common 48
 
1.3%
Latin 35
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
493
 
13.6%
94
 
2.6%
93
 
2.6%
87
 
2.4%
86
 
2.4%
77
 
2.1%
69
 
1.9%
66
 
1.8%
65
 
1.8%
61
 
1.7%
Other values (231) 2434
67.1%
Latin
ValueCountFrequency (%)
I 7
20.0%
T 7
20.0%
e 5
14.3%
C 3
8.6%
l 2
 
5.7%
D 2
 
5.7%
S 1
 
2.9%
u 1
 
2.9%
r 1
 
2.9%
v 1
 
2.9%
Other values (5) 5
14.3%
Common
ValueCountFrequency (%)
12
25.0%
( 9
18.8%
) 9
18.8%
· 9
18.8%
& 6
12.5%
- 2
 
4.2%
3 1
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3622
97.7%
ASCII 74
 
2.0%
None 9
 
0.2%
Compat Jamo 3
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
493
 
13.6%
94
 
2.6%
93
 
2.6%
87
 
2.4%
86
 
2.4%
77
 
2.1%
69
 
1.9%
66
 
1.8%
65
 
1.8%
61
 
1.7%
Other values (230) 2431
67.1%
ASCII
ValueCountFrequency (%)
12
16.2%
( 9
12.2%
) 9
12.2%
I 7
9.5%
T 7
9.5%
& 6
8.1%
e 5
6.8%
C 3
 
4.1%
l 2
 
2.7%
D 2
 
2.7%
Other values (11) 12
16.2%
None
ValueCountFrequency (%)
· 9
100.0%
Compat Jamo
ValueCountFrequency (%)
3
100.0%

학제
Categorical

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
<NA>
704 
2
295 
3
130 
4
 
10

Length

Max length4
Median length4
Mean length2.8542581
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row<NA>
3rd row2
4th row<NA>
5th row2

Common Values

ValueCountFrequency (%)
<NA> 704
61.8%
2 295
25.9%
3 130
 
11.4%
4 10
 
0.9%

Length

2023-12-12T10:39:49.200232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:39:49.326351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 704
61.8%
2 295
25.9%
3 130
 
11.4%
4 10
 
0.9%

주야
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1139
Missing (%)100.0%
Memory size10.1 KiB

모집인원
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct32
Distinct (%)6.1%
Missing611
Missing (%)53.6%
Infinite0
Infinite (%)0.0%
Mean10.734848
Minimum1
Maximum360
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.1 KiB
2023-12-12T10:39:49.460101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median8
Q310
95-th percentile30
Maximum360
Range359
Interquartile range (IQR)7

Descriptive statistics

Standard deviation21.315862
Coefficient of variation (CV)1.9856696
Kurtosis175.07217
Mean10.734848
Median Absolute Deviation (MAD)3
Skewness11.896019
Sum5668
Variance454.36599
MonotonicityNot monotonic
2023-12-12T10:39:49.644248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
10 134
 
11.8%
5 96
 
8.4%
1 75
 
6.6%
20 53
 
4.7%
2 36
 
3.2%
3 23
 
2.0%
8 18
 
1.6%
4 15
 
1.3%
30 13
 
1.1%
6 12
 
1.1%
Other values (22) 53
 
4.7%
(Missing) 611
53.6%
ValueCountFrequency (%)
1 75
6.6%
2 36
 
3.2%
3 23
 
2.0%
4 15
 
1.3%
5 96
8.4%
6 12
 
1.1%
7 6
 
0.5%
8 18
 
1.6%
9 2
 
0.2%
10 134
11.8%
ValueCountFrequency (%)
360 1
 
0.1%
265 1
 
0.1%
100 1
 
0.1%
60 3
 
0.3%
50 2
 
0.2%
44 1
 
0.1%
42 1
 
0.1%
40 5
 
0.4%
35 2
 
0.2%
30 13
1.1%

지원인원
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct39
Distinct (%)21.3%
Missing956
Missing (%)83.9%
Infinite0
Infinite (%)0.0%
Mean11.15847
Minimum1
Maximum315
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.1 KiB
2023-12-12T10:39:49.821640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q39
95-th percentile37.8
Maximum315
Range314
Interquartile range (IQR)8

Descriptive statistics

Standard deviation29.951463
Coefficient of variation (CV)2.6841909
Kurtosis70.719752
Mean11.15847
Median Absolute Deviation (MAD)2
Skewness7.8019964
Sum2042
Variance897.09013
MonotonicityNot monotonic
2023-12-12T10:39:49.986312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
1 52
 
4.6%
3 22
 
1.9%
2 18
 
1.6%
4 12
 
1.1%
5 10
 
0.9%
7 9
 
0.8%
8 6
 
0.5%
10 5
 
0.4%
6 5
 
0.4%
9 4
 
0.4%
Other values (29) 40
 
3.5%
(Missing) 956
83.9%
ValueCountFrequency (%)
1 52
4.6%
2 18
 
1.6%
3 22
1.9%
4 12
 
1.1%
5 10
 
0.9%
6 5
 
0.4%
7 9
 
0.8%
8 6
 
0.5%
9 4
 
0.4%
10 5
 
0.4%
ValueCountFrequency (%)
315 1
0.1%
220 1
0.1%
63 1
0.1%
62 1
0.1%
61 1
0.1%
52 1
0.1%
49 1
0.1%
41 2
0.2%
38 1
0.1%
36 1
0.1%

등록인원
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct31
Distinct (%)18.9%
Missing975
Missing (%)85.6%
Infinite0
Infinite (%)0.0%
Mean10.47561
Minimum1
Maximum311
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.1 KiB
2023-12-12T10:39:50.155325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median4
Q39
95-th percentile28
Maximum311
Range310
Interquartile range (IQR)8

Descriptive statistics

Standard deviation30.379492
Coefficient of variation (CV)2.9000213
Kurtosis70.361061
Mean10.47561
Median Absolute Deviation (MAD)3
Skewness7.9090067
Sum1718
Variance922.91351
MonotonicityNot monotonic
2023-12-12T10:39:50.333113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
1 45
 
4.0%
2 19
 
1.7%
3 17
 
1.5%
4 13
 
1.1%
5 11
 
1.0%
7 7
 
0.6%
10 6
 
0.5%
8 5
 
0.4%
6 5
 
0.4%
11 4
 
0.4%
Other values (21) 32
 
2.8%
(Missing) 975
85.6%
ValueCountFrequency (%)
1 45
4.0%
2 19
1.7%
3 17
 
1.5%
4 13
 
1.1%
5 11
 
1.0%
6 5
 
0.4%
7 7
 
0.6%
8 5
 
0.4%
9 4
 
0.4%
10 6
 
0.5%
ValueCountFrequency (%)
311 1
0.1%
212 1
0.1%
58 1
0.1%
57 1
0.1%
54 1
0.1%
50 1
0.1%
44 1
0.1%
36 1
0.1%
28 2
0.2%
27 1
0.1%

지원율
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct23
Distinct (%)11.4%
Missing937
Missing (%)82.3%
Infinite0
Infinite (%)0.0%
Mean0.58712871
Minimum0
Maximum4.6
Zeros22
Zeros (%)1.9%
Negative0
Negative (%)0.0%
Memory size10.1 KiB
2023-12-12T10:39:50.492752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10.2
median0.4
Q31
95-th percentile1.3
Maximum4.6
Range4.6
Interquartile range (IQR)0.8

Descriptive statistics

Standard deviation0.58686508
Coefficient of variation (CV)0.99955098
Kurtosis13.311274
Mean0.58712871
Median Absolute Deviation (MAD)0.3
Skewness2.7066049
Sum118.6
Variance0.34441062
MonotonicityNot monotonic
2023-12-12T10:39:50.662775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
1.0 34
 
3.0%
0.1 25
 
2.2%
0.3 24
 
2.1%
0.0 22
 
1.9%
0.2 17
 
1.5%
0.4 17
 
1.5%
0.5 14
 
1.2%
0.8 8
 
0.7%
0.6 7
 
0.6%
0.7 6
 
0.5%
Other values (13) 28
 
2.5%
(Missing) 937
82.3%
ValueCountFrequency (%)
0.0 22
1.9%
0.1 25
2.2%
0.2 17
1.5%
0.3 24
2.1%
0.4 17
1.5%
0.5 14
1.2%
0.6 7
 
0.6%
0.7 6
 
0.5%
0.8 8
 
0.7%
0.9 5
 
0.4%
ValueCountFrequency (%)
4.6 1
 
0.1%
3.4 1
 
0.1%
3.0 1
 
0.1%
2.3 1
 
0.1%
1.8 1
 
0.1%
1.7 1
 
0.1%
1.6 1
 
0.1%
1.5 1
 
0.1%
1.4 2
 
0.2%
1.3 5
0.4%

등록율
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct50
Distinct (%)30.5%
Missing975
Missing (%)85.6%
Infinite0
Infinite (%)0.0%
Mean54.495732
Minimum2.4
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.1 KiB
2023-12-12T10:39:50.853917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.4
5-th percentile5
Q125
median50
Q3100
95-th percentile100
Maximum100
Range97.6
Interquartile range (IQR)75

Descriptive statistics

Standard deviation34.451503
Coefficient of variation (CV)0.63218718
Kurtosis-1.4854595
Mean54.495732
Median Absolute Deviation (MAD)30
Skewness0.15487242
Sum8937.3
Variance1186.9061
MonotonicityNot monotonic
2023-12-12T10:39:51.021500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100.0 43
 
3.8%
50.0 11
 
1.0%
20.0 10
 
0.9%
33.3 8
 
0.7%
30.0 7
 
0.6%
60.0 7
 
0.6%
40.0 7
 
0.6%
5.0 7
 
0.6%
80.0 6
 
0.5%
25.0 6
 
0.5%
Other values (40) 52
 
4.6%
(Missing) 975
85.6%
ValueCountFrequency (%)
2.4 1
 
0.1%
2.5 1
 
0.1%
3.3 1
 
0.1%
5.0 7
0.6%
5.9 1
 
0.1%
6.7 1
 
0.1%
10.0 5
0.4%
10.5 1
 
0.1%
13.3 2
 
0.2%
14.3 2
 
0.2%
ValueCountFrequency (%)
100.0 43
3.8%
96.7 1
 
0.1%
95.0 1
 
0.1%
93.3 1
 
0.1%
90.0 2
 
0.2%
86.4 1
 
0.1%
83.3 1
 
0.1%
80.0 6
 
0.5%
75.0 1
 
0.1%
73.3 1
 
0.1%

Interactions

2023-12-12T10:39:44.718036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:42.132413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:42.702240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:43.258181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:44.126848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:44.853175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:42.244215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:42.823693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:43.688127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:44.233721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:44.980427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:42.368462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:42.921407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:43.788655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:44.348484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:45.130708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:42.483022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:43.038264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:43.910139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:44.462438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:45.261471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:42.603673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:43.148947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:44.005638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:39:44.564082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:39:51.163483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역설립대학명대분류(계열)중분류학제모집인원지원인원등록인원지원율등록율
지역1.0000.2391.0000.4610.7330.5630.4290.1970.4600.4100.431
설립0.2391.0001.0000.1210.2620.1580.000NaNNaN0.000NaN
대학명1.0001.0001.0000.6120.8740.6190.8780.3520.4810.0000.691
대분류(계열)0.4610.1210.6121.0000.9980.1120.0000.0000.0000.0750.189
중분류0.7330.2620.8740.9981.0000.7200.7790.0000.0000.0000.000
학제0.5630.1580.6190.1120.7201.0000.0000.0000.0000.6100.000
모집인원0.4290.0000.8780.0000.7790.0001.0000.8760.9090.0000.541
지원인원0.197NaN0.3520.0000.0000.0000.8761.0000.9960.4360.511
등록인원0.460NaN0.4810.0000.0000.0000.9090.9961.0000.0000.488
지원율0.4100.0000.0000.0750.0000.6100.0000.4360.0001.0000.709
등록율0.431NaN0.6910.1890.0000.0000.5410.5110.4880.7091.000
2023-12-12T10:39:51.356862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설립대학명중분류대분류(계열)학제지역
설립1.0000.9800.2160.0800.2600.213
대학명0.9801.0000.3200.3480.3450.987
중분류0.2160.3201.0000.9760.4690.282
대분류(계열)0.0800.3480.9761.0000.1050.272
학제0.2600.3450.4690.1051.0000.361
지역0.2130.9870.2820.2720.3611.000
2023-12-12T10:39:51.886995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
모집인원지원인원등록인원지원율등록율지역설립대학명대분류(계열)중분류학제
모집인원1.0000.5780.618-0.211-0.2860.2340.0000.6070.0000.4490.000
지원인원0.5781.0000.9670.5830.4240.1041.0000.1600.0000.0000.000
등록인원0.6180.9671.0000.5360.4760.2211.0000.2260.0000.0000.000
지원율-0.2110.5830.5361.0000.9390.1800.0000.0000.0310.0000.498
등록율-0.2860.4240.4760.9391.0000.1791.0000.2800.1100.0000.000
지역0.2340.1040.2210.1800.1791.0000.2130.9870.2720.2820.361
설립0.0001.0001.0000.0001.0000.2131.0000.9800.0800.2160.260
대학명0.6070.1600.2260.0000.2800.9870.9801.0000.3480.3200.345
대분류(계열)0.0000.0000.0000.0310.1100.2720.0800.3481.0000.9760.105
중분류0.4490.0000.0000.0000.0000.2820.2160.3200.9761.0000.469
학제0.0000.0000.0000.4980.0000.3610.2600.3450.1050.4691.000

Missing values

2023-12-12T10:39:45.402896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:39:45.601334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T10:39:45.766968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

지역설립대학명대분류(계열)중분류소분류모집단위학제주야모집인원지원인원등록인원지원율등록율
0강원사립한림성심대학교공학계열건축건축건축과3<NA>2<NA><NA><NA><NA>
1강원사립한림성심대학교공학계열건축건축<NA><NA><NA><NA><NA><NA><NA><NA>
2강원사립한림성심대학교자연과학계열생활과학조리과학관광외식조리과2<NA>2<NA><NA><NA><NA>
3강원사립한림성심대학교자연과학계열생활과학조리과학<NA><NA><NA><NA><NA><NA><NA><NA>
4강원사립한림성심대학교인문사회계열관광,음료,언어음료,언어글로벌관광과2<NA>2<NA><NA><NA><NA>
5강원사립한림성심대학교인문사회계열관광,음료,언어음료,언어<NA><NA><NA><NA><NA><NA><NA><NA>
6강원사립한림성심대학교공학계열전기,전자,컴퓨터전산학,컴퓨터공학디지털문화콘텐츠과2<NA>2<NA><NA><NA><NA>
7강원사립한림성심대학교공학계열전기,전자,컴퓨터전산학,컴퓨터공학<NA><NA><NA><NA><NA><NA><NA><NA>
8강원사립한림성심대학교자연과학계열보건보건관리보건환경과3<NA>2<NA><NA><NA><NA>
9강원사립한림성심대학교자연과학계열보건보건관리<NA><NA><NA><NA><NA><NA><NA><NA>
지역설립대학명대분류(계열)중분류소분류모집단위학제주야모집인원지원인원등록인원지원율등록율
1129충북사립충청대학교예체능계열음악실용음악분야<NA><NA><NA><NA><NA><NA><NA><NA>
1130충북사립충청대학교자연과학계열간호보건피부미용의료미용과3<NA>5220.440.0
1131충북사립충청대학교자연과학계열간호보건피부미용<NA><NA><NA><NA><NA><NA><NA><NA>
1132충북사립충청대학교공학계열전기,전자,컴퓨터전자공학컴퓨터전자과2<NA>3034241.180.0
1133충북사립충청대학교공학계열전기,전자,컴퓨터전자공학<NA><NA><NA><NA><NA><NA><NA><NA>
1134충북사립충청대학교인문사회계열사회과학항공운항항공관광과2<NA>5531.060.0
1135충북사립충청대학교인문사회계열사회과학항공운항<NA><NA><NA><NA><NA><NA><NA><NA>
1136충북사립충청대학교공학계열기계자동차항공자동차기계학부2<NA>3041281.493.3
1137충북사립충청대학교공학계열기계자동차<NA><NA><NA><NA><NA><NA><NA><NA>
1138충북사립충청대학교인문사회계열사회과학호텔경영호텔ㆍ바리스타전공2<NA>151991.360.0

Duplicate rows

Most frequently occurring

지역설립대학명대분류(계열)중분류소분류모집단위학제모집인원지원인원등록인원지원율등록율# duplicates
34세종사립한국영상대학교예체능계열응용예술영상예술<NA><NA><NA><NA><NA><NA><NA>9
18경남사립마산대학교자연과학계열보건임상보건<NA><NA><NA><NA><NA><NA><NA>7
13경기사립용인송담대학교예체능계열미술디자인<NA><NA><NA><NA><NA><NA><NA>5
33세종사립한국영상대학교예체능계열응용예술방송공연<NA><NA><NA><NA><NA><NA><NA>5
48제주사립제주한라대학교인문사회계열경영,경제경영학<NA><NA><NA><NA><NA><NA><NA>4
19경남사립마산대학교자연과학계열보건재활치료<NA><NA><NA><NA><NA><NA><NA>3
23대구사립영남이공대학교인문사회계열경영,경제관광학<NA><NA><NA><NA><NA><NA><NA>3
31서울사립한양여자대학교예체능계열디자인패션디자인<NA><NA><NA><NA><NA><NA><NA>3
44인천사립인하공업전문대학인문사회계열사회과학항공운항<NA><NA><NA><NA><NA><NA><NA>3
51제주사립제주한라대학교자연과학계열보건임상보건<NA><NA><NA><NA><NA><NA><NA>3