Overview

Dataset statistics

Number of variables10
Number of observations30
Missing cells30
Missing cells (%)10.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory88.4 B

Variable types

Text1
Categorical6
Boolean2
Unsupported1

Dataset

Description샘플 데이터
Author경기도일자리재단
URLhttps://www.bigdata-region.kr/#/dataset/2345f48e-3d9a-4598-b52b-fc7fb4055fa4

Alerts

학습기수년도 has constant value ""Constant
학점설명 has constant value ""Constant
학습기수차수 has constant value ""Constant
학습과정여부 has constant value ""Constant
사용여부 has constant value ""Constant
학습기수명 is highly overall correlated with 학습기수코드 and 1 other fieldsHigh correlation
데이터기준일자 is highly overall correlated with 학습기수코드 and 1 other fieldsHigh correlation
학습기수코드 is highly overall correlated with 학습기수명 and 1 other fieldsHigh correlation
데이터기준일자 is highly imbalanced (64.7%)Imbalance
연수일정명 has 30 (100.0%) missing valuesMissing
학습계획정보번호 has unique valuesUnique
연수일정명 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 14:11:34.463634
Analysis finished2023-12-10 14:11:35.398030
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:11:35.675600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length14.066667
Min length14

Characters and Unicode

Total characters422
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st rowBI02_28_28_1019
2nd rowBI02_28_28_1020
3rd rowBI02_28_28_748
4th rowBI02_28_28_764
5th rowBI02_28_28_772
ValueCountFrequency (%)
bi02_28_28_1019 1
 
3.3%
bi02_28_28_1020 1
 
3.3%
bi04_31_30_756 1
 
3.3%
bi03_28_29_885 1
 
3.3%
bi03_28_29_884 1
 
3.3%
bi03_28_29_755 1
 
3.3%
bi02_28_28_993 1
 
3.3%
bi02_28_28_923 1
 
3.3%
bi02_28_28_922 1
 
3.3%
bi02_28_28_921 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T23:11:36.355653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 90
21.3%
2 88
20.9%
8 77
18.2%
0 39
9.2%
B 30
 
7.1%
I 30
 
7.1%
7 16
 
3.8%
3 13
 
3.1%
9 12
 
2.8%
1 11
 
2.6%
Other values (3) 16
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 272
64.5%
Connector Punctuation 90
 
21.3%
Uppercase Letter 60
 
14.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 88
32.4%
8 77
28.3%
0 39
14.3%
7 16
 
5.9%
3 13
 
4.8%
9 12
 
4.4%
1 11
 
4.0%
4 7
 
2.6%
5 6
 
2.2%
6 3
 
1.1%
Uppercase Letter
ValueCountFrequency (%)
B 30
50.0%
I 30
50.0%
Connector Punctuation
ValueCountFrequency (%)
_ 90
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 362
85.8%
Latin 60
 
14.2%

Most frequent character per script

Common
ValueCountFrequency (%)
_ 90
24.9%
2 88
24.3%
8 77
21.3%
0 39
10.8%
7 16
 
4.4%
3 13
 
3.6%
9 12
 
3.3%
1 11
 
3.0%
4 7
 
1.9%
5 6
 
1.7%
Latin
ValueCountFrequency (%)
B 30
50.0%
I 30
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 422
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 90
21.3%
2 88
20.9%
8 77
18.2%
0 39
9.2%
B 30
 
7.1%
I 30
 
7.1%
7 16
 
3.8%
3 13
 
3.1%
9 12
 
2.8%
1 11
 
2.6%
Other values (3) 16
 
3.8%

학습기수코드
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2
25 
3
4
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 25
83.3%
3 3
 
10.0%
4 2
 
6.7%

Length

2023-12-10T23:11:36.697546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:11:36.982559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 25
83.3%
3 3
 
10.0%
4 2
 
6.7%

학습기수년도
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2016
30 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2016
2nd row2016
3rd row2016
4th row2016
5th row2016

Common Values

ValueCountFrequency (%)
2016 30
100.0%

Length

2023-12-10T23:11:37.172791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:11:37.387035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2016 30
100.0%

학습기수명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2014년 여성창업전문교육_1차
25 
2014년 여성IT전문교육-1차
특성화고등학교 취업준비 과정_시범기수
 
2

Length

Max length20
Median length17
Mean length17.2
Min length17

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2014년 여성창업전문교육_1차
2nd row2014년 여성창업전문교육_1차
3rd row2014년 여성창업전문교육_1차
4th row2014년 여성창업전문교육_1차
5th row2014년 여성창업전문교육_1차

Common Values

ValueCountFrequency (%)
2014년 여성창업전문교육_1차 25
83.3%
2014년 여성IT전문교육-1차 3
 
10.0%
특성화고등학교 취업준비 과정_시범기수 2
 
6.7%

Length

2023-12-10T23:11:37.768362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:11:38.062010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2014년 28
45.2%
여성창업전문교육_1차 25
40.3%
여성it전문교육-1차 3
 
4.8%
특성화고등학교 2
 
3.2%
취업준비 2
 
3.2%
과정_시범기수 2
 
3.2%

학점설명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
없음
30 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row없음
2nd row없음
3rd row없음
4th row없음
5th row없음

Common Values

ValueCountFrequency (%)
없음 30
100.0%

Length

2023-12-10T23:11:38.266063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:11:38.426346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
없음 30
100.0%

학습기수차수
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
1
30 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 30
100.0%

Length

2023-12-10T23:11:38.825668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:11:39.056844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 30
100.0%

학습과정여부
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
False
30 
ValueCountFrequency (%)
False 30
100.0%
2023-12-10T23:11:39.267176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

사용여부
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
True
30 
ValueCountFrequency (%)
True 30
100.0%
2023-12-10T23:11:39.439682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

연수일정명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing30
Missing (%)100.0%
Memory size402.0 B

데이터기준일자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
2014-06-03
28 
2014-06-09
 
2

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2014-06-03
2nd row2014-06-03
3rd row2014-06-03
4th row2014-06-03
5th row2014-06-03

Common Values

ValueCountFrequency (%)
2014-06-03 28
93.3%
2014-06-09 2
 
6.7%

Length

2023-12-10T23:11:39.729391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:11:39.968697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2014-06-03 28
93.3%
2014-06-09 2
 
6.7%

Correlations

2023-12-10T23:11:40.231123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학습계획정보번호학습기수코드학습기수명데이터기준일자
학습계획정보번호1.0001.0001.0001.000
학습기수코드1.0001.0001.0001.000
학습기수명1.0001.0001.0001.000
데이터기준일자1.0001.0001.0001.000
2023-12-10T23:11:40.433468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학습기수명데이터기준일자학습기수코드
학습기수명1.0000.9821.000
데이터기준일자0.9821.0000.982
학습기수코드1.0000.9821.000
2023-12-10T23:11:40.666298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학습기수코드학습기수명데이터기준일자
학습기수코드1.0001.0000.982
학습기수명1.0001.0000.982
데이터기준일자0.9820.9821.000

Missing values

2023-12-10T23:11:34.964017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:11:35.294028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

학습계획정보번호학습기수코드학습기수년도학습기수명학점설명학습기수차수학습과정여부사용여부연수일정명데이터기준일자
0BI02_28_28_1019220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
1BI02_28_28_1020220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
2BI02_28_28_748220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
3BI02_28_28_764220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
4BI02_28_28_772220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
5BI02_28_28_773220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
6BI02_28_28_774220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
7BI02_28_28_783220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
8BI02_28_28_784220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
9BI02_28_28_809220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
학습계획정보번호학습기수코드학습기수년도학습기수명학점설명학습기수차수학습과정여부사용여부연수일정명데이터기준일자
20BI02_28_28_883220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
21BI02_28_28_921220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
22BI02_28_28_922220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
23BI02_28_28_923220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
24BI02_28_28_993220162014년 여성창업전문교육_1차없음1NY<NA>2014-06-03
25BI03_28_29_755320162014년 여성IT전문교육-1차없음1NY<NA>2014-06-03
26BI03_28_29_884320162014년 여성IT전문교육-1차없음1NY<NA>2014-06-03
27BI03_28_29_885320162014년 여성IT전문교육-1차없음1NY<NA>2014-06-03
28BI04_31_30_75642016특성화고등학교 취업준비 과정_시범기수없음1NY<NA>2014-06-09
29BI04_31_30_75742016특성화고등학교 취업준비 과정_시범기수없음1NY<NA>2014-06-09