Overview

Dataset statistics

Number of variables6
Number of observations45
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory52.9 B

Variable types

Numeric2
Text2
DateTime1
Categorical1

Dataset

Description중앙다문화교육센터에서 다문화교육포털을 통해 제공하고 있는 다문화교육 관련 교육자료에 대한 자료 유형 정보(분류 체계) 입니다.
Author국가평생교육진흥원
URLhttps://www.data.go.kr/data/15090355/fileData.do

Alerts

유형등록일 has constant value ""Constant
유형분류자 has constant value ""Constant
연번 is highly overall correlated with 자료유형넘버High correlation
자료유형넘버 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
자료유형넘버 has unique valuesUnique
자료유형 has unique valuesUnique
유형분류코드 has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:11:51.727169
Analysis finished2023-12-11 23:11:52.529235
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23
Minimum1
Maximum45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-12T08:11:52.589248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.2
Q112
median23
Q334
95-th percentile42.8
Maximum45
Range44
Interquartile range (IQR)22

Descriptive statistics

Standard deviation13.133926
Coefficient of variation (CV)0.57104024
Kurtosis-1.2
Mean23
Median Absolute Deviation (MAD)11
Skewness0
Sum1035
Variance172.5
MonotonicityStrictly increasing
2023-12-12T08:11:52.712929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
1 1
 
2.2%
35 1
 
2.2%
26 1
 
2.2%
27 1
 
2.2%
28 1
 
2.2%
29 1
 
2.2%
30 1
 
2.2%
31 1
 
2.2%
32 1
 
2.2%
33 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
1 1
2.2%
2 1
2.2%
3 1
2.2%
4 1
2.2%
5 1
2.2%
6 1
2.2%
7 1
2.2%
8 1
2.2%
9 1
2.2%
10 1
2.2%
ValueCountFrequency (%)
45 1
2.2%
44 1
2.2%
43 1
2.2%
42 1
2.2%
41 1
2.2%
40 1
2.2%
39 1
2.2%
38 1
2.2%
37 1
2.2%
36 1
2.2%

자료유형넘버
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23
Minimum1
Maximum45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-12T08:11:52.880709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.2
Q112
median23
Q334
95-th percentile42.8
Maximum45
Range44
Interquartile range (IQR)22

Descriptive statistics

Standard deviation13.133926
Coefficient of variation (CV)0.57104024
Kurtosis-1.2
Mean23
Median Absolute Deviation (MAD)11
Skewness0
Sum1035
Variance172.5
MonotonicityStrictly increasing
2023-12-12T08:11:53.001897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
1 1
 
2.2%
35 1
 
2.2%
26 1
 
2.2%
27 1
 
2.2%
28 1
 
2.2%
29 1
 
2.2%
30 1
 
2.2%
31 1
 
2.2%
32 1
 
2.2%
33 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
1 1
2.2%
2 1
2.2%
3 1
2.2%
4 1
2.2%
5 1
2.2%
6 1
2.2%
7 1
2.2%
8 1
2.2%
9 1
2.2%
10 1
2.2%
ValueCountFrequency (%)
45 1
2.2%
44 1
2.2%
43 1
2.2%
42 1
2.2%
41 1
2.2%
40 1
2.2%
39 1
2.2%
38 1
2.2%
37 1
2.2%
36 1
2.2%

자료유형
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-12T08:11:53.181696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length9
Mean length4.3111111
Min length2

Characters and Unicode

Total characters194
Distinct characters83
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row도서/간행물류
2nd row문서류
3rd row사진류
4th row시청각류
5th row박물류
ValueCountFrequency (%)
기타 5
 
8.9%
전자파일 4
 
7.1%
사진 3
 
5.4%
시청각 2
 
3.6%
문서 2
 
3.6%
도서/간행물 2
 
3.6%
릴테이프 1
 
1.8%
베타캠 1
 
1.8%
u-matic 1
 
1.8%
베타 1
 
1.8%
Other values (34) 34
60.7%
2023-12-12T08:11:53.468122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
 
5.7%
10
 
5.2%
7
 
3.6%
/ 7
 
3.6%
6
 
3.1%
6
 
3.1%
6
 
3.1%
6
 
3.1%
5
 
2.6%
5
 
2.6%
Other values (73) 125
64.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 157
80.9%
Uppercase Letter 14
 
7.2%
Space Separator 11
 
5.7%
Other Punctuation 7
 
3.6%
Lowercase Letter 2
 
1.0%
Decimal Number 2
 
1.0%
Dash Punctuation 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
6.4%
7
 
4.5%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
Other values (57) 96
61.1%
Uppercase Letter
ValueCountFrequency (%)
D 3
21.4%
V 2
14.3%
C 2
14.3%
I 1
 
7.1%
A 1
 
7.1%
T 1
 
7.1%
M 1
 
7.1%
H 1
 
7.1%
S 1
 
7.1%
U 1
 
7.1%
Decimal Number
ValueCountFrequency (%)
6 1
50.0%
8 1
50.0%
Space Separator
ValueCountFrequency (%)
11
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 7
100.0%
Lowercase Letter
ValueCountFrequency (%)
m 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 157
80.9%
Common 21
 
10.8%
Latin 16
 
8.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
6.4%
7
 
4.5%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
Other values (57) 96
61.1%
Latin
ValueCountFrequency (%)
D 3
18.8%
V 2
12.5%
C 2
12.5%
m 2
12.5%
I 1
 
6.2%
A 1
 
6.2%
T 1
 
6.2%
M 1
 
6.2%
H 1
 
6.2%
S 1
 
6.2%
Common
ValueCountFrequency (%)
11
52.4%
/ 7
33.3%
- 1
 
4.8%
6 1
 
4.8%
8 1
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 157
80.9%
ASCII 37
 
19.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11
29.7%
/ 7
18.9%
D 3
 
8.1%
V 2
 
5.4%
C 2
 
5.4%
m 2
 
5.4%
I 1
 
2.7%
A 1
 
2.7%
T 1
 
2.7%
M 1
 
2.7%
Other values (6) 6
16.2%
Hangul
ValueCountFrequency (%)
10
 
6.4%
7
 
4.5%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
Other values (57) 96
61.1%

유형등록일
Date

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
Minimum2015-07-21 00:00:00
Maximum2015-07-21 00:00:00
2023-12-12T08:11:53.593664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:11:53.680872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

유형분류자
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
관리자
45 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row관리자
2nd row관리자
3rd row관리자
4th row관리자
5th row관리자

Common Values

ValueCountFrequency (%)
관리자 45
100.0%

Length

2023-12-12T08:11:53.780162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:11:53.865478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
관리자 45
100.0%

유형분류코드
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-12T08:11:54.049157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length4
Mean length3.8666667
Min length2

Characters and Unicode

Total characters174
Distinct characters12
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st rowF1
2nd rowF2
3rd rowF3
4th rowF4
5th rowF5
ValueCountFrequency (%)
f1 1
 
2.2%
f4-1 1
 
2.2%
f4-3 1
 
2.2%
f4-4 1
 
2.2%
f4-5 1
 
2.2%
f4-6 1
 
2.2%
f4-7 1
 
2.2%
f4-8 1
 
2.2%
f4-9 1
 
2.2%
f4-10 1
 
2.2%
Other values (35) 35
77.8%
2023-12-12T08:11:54.391886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
F 45
25.9%
- 40
23.0%
1 20
11.5%
4 17
 
9.8%
5 17
 
9.8%
2 11
 
6.3%
3 11
 
6.3%
6 3
 
1.7%
7 3
 
1.7%
8 3
 
1.7%
Other values (2) 4
 
2.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 89
51.1%
Uppercase Letter 45
25.9%
Dash Punctuation 40
23.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 20
22.5%
4 17
19.1%
5 17
19.1%
2 11
12.4%
3 11
12.4%
6 3
 
3.4%
7 3
 
3.4%
8 3
 
3.4%
9 2
 
2.2%
0 2
 
2.2%
Uppercase Letter
ValueCountFrequency (%)
F 45
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 129
74.1%
Latin 45
 
25.9%

Most frequent character per script

Common
ValueCountFrequency (%)
- 40
31.0%
1 20
15.5%
4 17
13.2%
5 17
13.2%
2 11
 
8.5%
3 11
 
8.5%
6 3
 
2.3%
7 3
 
2.3%
8 3
 
2.3%
9 2
 
1.6%
Latin
ValueCountFrequency (%)
F 45
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 174
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
F 45
25.9%
- 40
23.0%
1 20
11.5%
4 17
 
9.8%
5 17
 
9.8%
2 11
 
6.3%
3 11
 
6.3%
6 3
 
1.7%
7 3
 
1.7%
8 3
 
1.7%
Other values (2) 4
 
2.3%

Interactions

2023-12-12T08:11:52.020138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:11:51.883299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:11:52.086597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:11:51.949351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:11:54.482893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번자료유형넘버자료유형유형분류코드
연번1.0001.0001.0001.000
자료유형넘버1.0001.0001.0001.000
자료유형1.0001.0001.0001.000
유형분류코드1.0001.0001.0001.000
2023-12-12T08:11:54.613015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번자료유형넘버
연번1.0001.000
자료유형넘버1.0001.000

Missing values

2023-12-12T08:11:52.401094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:11:52.495244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번자료유형넘버자료유형유형등록일유형분류자유형분류코드
011도서/간행물류2015-07-21관리자F1
122문서류2015-07-21관리자F2
233사진류2015-07-21관리자F3
344시청각류2015-07-21관리자F4
455박물류2015-07-21관리자F5
566발행도서2015-07-21관리자F1-1
677간행물2015-07-21관리자F1-2
788논문2015-07-21관리자F1-3
899신문2015-07-21관리자F1-4
91010잡지2015-07-21관리자F1-5
연번자료유형넘버자료유형유형등록일유형분류자유형분류코드
353636의복/잡화2015-07-21관리자F5-2
363737사무집기2015-07-21관리자F5-3
373838훈장2015-07-21관리자F5-4
383939공예품2015-07-21관리자F5-5
394040액자2015-07-21관리자F5-6
404141도면2015-07-21관리자F5-7
414242현수막2015-07-21관리자F5-8
424343그림2015-07-21관리자F5-9
434444서예2015-07-21관리자F5-10
444545박물 기타2015-07-21관리자F5-11