Overview

Dataset statistics

Number of variables: 3
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 332.0 KiB
Average record size in memory: 34.0 B

Variable types

Numeric: 2
Categorical: 1

Dataset

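Description: Provides information on the course scoring scheme of the smart vocational training platform (STEP) run by the Online Lifelong Education Institute of Korea University of Technology and Education.
Author: Korea University of Technology and Education (한국기술교육대학교)
URL: https://www.data.go.kr/data/15091101/fileData.do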

Alerts

비중 (weight) has 7431 (74.3%) zeros (alert type: Zeros)

Reproduction

Analysis started: 2023-12-12 13:56:50.027857
Analysis finished: 2023-12-12 13:56:50.744900
Duration: 0.72 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

과정코드 (course code)
Real number (ℝ)

Distinct: 7334
Distinct (%): 73.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 63189.702
Minimum: 3
Maximum: 413464
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 3
5-th percentile: 1517.5
Q1: 5505
Median: 9212
Q3: 98104.75
95-th percentile: 279917.2
Maximum: 413464
Range: 413461
Interquartile range (IQR): 92599.75

Descriptive statistics

Standard deviation: 95887.791
Coefficient of variation (CV): 1.5174591
Kurtosis: 2.3303686
Mean: 63189.702
Median Absolute Deviation (MAD): 6062.5
Skewness: 1.78508
Sum: 6.3189702 × 10^8
Variance: 9.1944684 × 10^9
Monotonicity: Not monotonic
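
Several of the figures above are derived from others, so they can be cross-checked with plain arithmetic. The sketch below is only a consistency check on the rounded values printed in this report (variable names are illustrative, and the observation count of 10000 is taken from the overview); it does not recompute anything from the underlying dataset.

```python
import math

# Values copied from the report for 과정코드 (already rounded by the report).
minimum, maximum = 3, 413464
q1, q3 = 5505, 98104.75
mean, std = 63189.702, 95887.791
n_observations = 10_000  # from the dataset overview

value_range = maximum - minimum   # reported Range: 413461
iqr = q3 - q1                     # reported IQR: 92599.75
cv = std / mean                   # reported CV: 1.5174591
variance = std ** 2               # reported Variance: 9.1944684 × 10^9
total = mean * n_observations     # reported Sum: 6.3189702 × 10^8

print(value_range, iqr, f"{cv:.7f}", f"{variance:.7e}", f"{total:.7e}")
```

Because the inputs are rounded, the derived values agree with the report only to the printed precision.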
[Figure: Histogram with fixed size bins (bins=50)]
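Most frequent values:

| Value | Count | Frequency (%) |
|---|---|---|
| 10379 | 6 | 0.1% |
| 6412 | 5 | 0.1% |
| 6689 | 5 | 0.1% |
| 6093 | 5 | 0.1% |
| 8548 | 4 | < 0.1% |
| 167057 | 4 | < 0.1% |
| 334177 | 4 | < 0.1% |
| 9874 | 4 | < 0.1% |
| 7591 | 4 | < 0.1% |
| 4077 | 4 | < 0.1% |
| Other values (7324) | 9955 | 99.6% |
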
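Smallest 10 values:

| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 3 | < 0.1% |
| 31 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 122 | 1 | < 0.1% |
| 123 | 1 | < 0.1% |
| 124 | 1 | < 0.1% |
| 126 | 1 | < 0.1% |
| 142 | 2 | < 0.1% |
| 147 | 1 | < 0.1% |
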
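Largest 10 values:

| Value | Count | Frequency (%) |
|---|---|---|
| 413464 | 2 | < 0.1% |
| 407104 | 1 | < 0.1% |
| 407101 | 2 | < 0.1% |
| 407086 | 1 | < 0.1% |
| 407080 | 1 | < 0.1% |
| 407071 | 1 | < 0.1% |
| 407062 | 1 | < 0.1% |
| 407059 | 1 | < 0.1% |
| 407053 | 1 | < 0.1% |
| 407044 | 4 | < 0.1% |
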
카테고리 코드 (category code)
Categorical

Distinct: 8
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

과제: 1481
출석: 1456
시험: 1453
라이브세미나: 1448
퀴즈: 1438
Other values (3): 2724

Length

Max length: 6
Median length: 2
Mean length: 3.0292
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 출석
2nd row: 출석
3rd row: 라이브세미나
4th row: 과제
5th row: 시험

Common Values

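| Value | Count | Frequency (%) |
|---|---|---|
| 과제 (assignment) | 1481 | 14.8% |
| 출석 (attendance) | 1456 | 14.6% |
| 시험 (exam) | 1453 | 14.5% |
| 라이브세미나 (live seminar) | 1448 | 14.5% |
| 퀴즈 (quiz) | 1438 | 14.4% |
| 토론 (discussion) | 1379 | 13.8% |
| 게시판 참여 (board participation) | 1125 | 11.2% |
| 기타 (other) | 220 | 2.2% |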
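
The frequency column and the reported mean length (3.0292) both follow from the category counts. As a minimal sketch, assuming only the counts listed above and Python's character-based `len` (the dictionary name is illustrative), they can be recomputed like this:

```python
# Category counts copied from the Common Values table.
category_counts = {
    "과제": 1481,
    "출석": 1456,
    "시험": 1453,
    "라이브세미나": 1448,
    "퀴즈": 1438,
    "토론": 1379,
    "게시판 참여": 1125,
    "기타": 220,
}

n_rows = sum(category_counts.values())

# Share of rows per category, in percent.
frequency_pct = {cat: 100 * cnt / n_rows for cat, cnt in category_counts.items()}

# Count-weighted mean label length; the space in "게시판 참여" counts as a character.
mean_length = sum(len(cat) * cnt for cat, cnt in category_counts.items()) / n_rows

for cat, pct in frequency_pct.items():
    print(f"{cat}: {pct:.1f}%")
print(f"mean length: {mean_length:.4f}")
```

The two six-character labels (라이브세미나 and 게시판 참여) account for the gap between the median length of 2 and the mean of about 3.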

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

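Word frequencies (category labels split on whitespace, so 게시판 참여 is counted as two words):

| Value | Count | Frequency (%) |
|---|---|---|
| 과제 | 1481 | 13.3% |
| 출석 | 1456 | 13.1% |
| 시험 | 1453 | 13.1% |
| 라이브세미나 | 1448 | 13.0% |
| 퀴즈 | 1438 | 12.9% |
| 토론 | 1379 | 12.4% |
| 게시판 | 1125 | 10.1% |
| 참여 | 1125 | 10.1% |
| 기타 | 220 | 2.0% |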
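
The percentages in this table are lower than those under Common Values because the denominator is the number of words, not the number of rows. The sketch below reproduces that with the standard library, assuming the labels are split on whitespace; the report's actual tokenization is not shown here, so treat the splitting rule as an assumption.

```python
from collections import Counter

# Row counts per category label, copied from the Common Values table.
category_counts = {
    "과제": 1481, "출석": 1456, "시험": 1453, "라이브세미나": 1448,
    "퀴즈": 1438, "토론": 1379, "게시판 참여": 1125, "기타": 220,
}

# Split each label on whitespace and credit every word with the label's row count.
word_counts = Counter()
for label, count in category_counts.items():
    for word in label.split():
        word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(f"{word}: {count} ({100 * count / total_words:.1f}%)")
```

The two-word label adds 1125 extra words, which gives 11125 words in total for 10000 rows.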

비중 (weight)
Real number (ℝ)

Alert: ZEROS

Distinct: 12
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14.494
Minimum: 0
Maximum: 100
Zeros: 7431
Zeros (%): 74.3%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 40
95-th percentile: 60
Maximum: 100
Range: 100
Interquartile range (IQR): 40

Descriptive statistics

Standard deviation: 26.584485
Coefficient of variation (CV): 1.8341717
Kurtosis: 1.7864929
Mean: 14.494
Median Absolute Deviation (MAD): 0
Skewness: 1.6714037
Sum: 144940
Variance: 706.73484
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=12)]
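Most frequent values:

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 7431 | 74.3% |
| 60 | 999 | 10.0% |
| 40 | 985 | 9.8% |
| 100 | 344 | 3.4% |
| 50 | 189 | 1.9% |
| 30 | 33 | 0.3% |
| 20 | 6 | 0.1% |
| 10 | 5 | 0.1% |
| 80 | 5 | 0.1% |
| 99 | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
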
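Smallest 10 values:

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 7431 | 74.3% |
| 1 | 1 | < 0.1% |
| 10 | 5 | 0.1% |
| 20 | 6 | 0.1% |
| 30 | 33 | 0.3% |
| 40 | 985 | 9.8% |
| 50 | 189 | 1.9% |
| 60 | 999 | 10.0% |
| 80 | 5 | 0.1% |
| 90 | 1 | < 0.1% |
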
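Largest 10 values:

| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 344 | 3.4% |
| 99 | 1 | < 0.1% |
| 90 | 1 | < 0.1% |
| 80 | 5 | 0.1% |
| 60 | 999 | 10.0% |
| 50 | 189 | 1.9% |
| 40 | 985 | 9.8% |
| 30 | 33 | 0.3% |
| 20 | 6 | 0.1% |
| 10 | 5 | 0.1% |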
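
Because 비중 takes only 12 distinct values, its full distribution can be rebuilt from the frequency tables above and the summary statistics recomputed from it. The following is a sketch using Python's `statistics` module; the `inclusive` quantile method (linear interpolation between order statistics) is an assumption about how the report's quartiles were computed, and the variable names are illustrative.

```python
import math
import statistics

# Value -> count, combined from the frequency and extreme-value tables for 비중.
value_counts = {
    0: 7431, 1: 1, 10: 5, 20: 6, 30: 33, 40: 985,
    50: 189, 60: 999, 80: 5, 90: 1, 99: 1, 100: 344,
}

# Expand the counts back into one sorted list of observations.
values = sorted(v for v, count in value_counts.items() for _ in range(count))
n = len(values)

total = sum(values)                      # reported Sum: 144940
mean = total / n                         # reported Mean: 14.494
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = math.sqrt(variance)                # reported: 26.584485
cv = std / mean                          # reported: 1.8341717
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
zeros_pct = 100 * value_counts[0] / n    # reported Zeros (%): 74.3%

print(n, total, mean, f"{variance:.5f}", f"{std:.6f}", f"{cv:.7f}")
print(q1, median, q3, f"{zeros_pct:.1f}%")
```

The reported variance of 706.73484 matches the sample variance with an n − 1 denominator. With almost three quarters of the rows at zero, Q1 and the median are both 0, which is also why the median absolute deviation is 0.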

Interactions

[Figures: four interaction plots (images not preserved)]

Correlations

[Figure: correlation heatmap]

| | 과정코드 | 카테고리 코드 | 비중 |
|---|---|---|---|
| 과정코드 | 1.000 | 0.224 | 0.329 |
| 카테고리 코드 | 0.224 | 1.000 | 0.728 |
| 비중 | 0.329 | 0.728 | 1.000 |

[Figure: correlation heatmap]

| | 과정코드 | 비중 | 카테고리 코드 |
|---|---|---|---|
| 과정코드 | 1.000 | -0.072 | 0.108 |
| 비중 | -0.072 | 1.000 | 0.470 |
| 카테고리 코드 | 0.108 | 0.470 | 1.000 |

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

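| Index | 과정코드 | 카테고리 코드 | 비중 |
|---|---|---|---|
| 56193 | 21803 | 출석 | 60 |
| 44433 | 8883 | 출석 | 60 |
| 55310 | 17445 | 라이브세미나 | 0 |
| 17206 | 4563 | 과제 | 0 |
| 55284 | 17063 | 시험 | 0 |
| 84581 | 221603 | 출석 | 100 |
| 2512 | 860 | 퀴즈 | 0 |
| 28313 | 6201 | 시험 | 40 |
| 69937 | 97996 | 기타 | 0 |
| 28014 | 6158 | 과제 | 0 |
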
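| Index | 과정코드 | 카테고리 코드 | 비중 |
|---|---|---|---|
| 74504 | 117031 | 과제 | 0 |
| 24800 | 5682 | 퀴즈 | 0 |
| 26224 | 5897 | 게시판 참여 | 0 |
| 79665 | 170365 | 시험 | 0 |
| 24997 | 5710 | 과제 | 0 |
| 83166 | 212335 | 라이브세미나 | 0 |
| 82531 | 189533 | 퀴즈 | 0 |
| 55 | 122 | 출석 | 60 |
| 81530 | 189247 | 퀴즈 | 0 |
| 73816 | 109513 | 출석 | 100 |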