Overview

Dataset statistics

Number of variables: 16
Number of observations: 30
Missing cells: 90
Missing cells (%): 18.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.1 KiB
Average record size in memory: 139.4 B
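
The missing-cell percentage and the total memory figure follow from the other numbers in this block. A minimal sketch of that arithmetic, using only values reported above (the variable names are illustrative):

```python
# Figures copied from the dataset statistics above.
n_variables = 16
n_observations = 30
missing_cells = 90
avg_record_bytes = 139.4

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Total size is the average record size times the number of records.
total_kib = avg_record_bytes * n_observations / 1024

print(f"{total_cells} cells, {missing_pct:.2f}% missing, {total_kib:.1f} KiB")
```

The 90 missing cells correspond to the three fully empty columns (3 x 30) listed in the alerts.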

Variable types

Numeric: 2
Text: 1
Boolean: 2
Categorical: 6
Unsupported: 3
DateTime: 2

Dataset

Description: Sample data
Author: 경기도일자리재단
URL: https://www.bigdata-region.kr/#/dataset/2dd7492a-8d4c-4e08-93a7-a28f9e6f7f98

Alerts

학습분수 has constant value "0" (Constant)
차시비용 has constant value "0" (Constant)
정산분배선택명 has constant value "W" (Constant)
사용여부 has constant value "True" (Constant)
하위범주명 is highly overall correlated with 컨텐츠정보번호 and 2 other fields (High correlation)
상위범주명 is highly overall correlated with 컨텐츠정보번호 and 2 other fields (High correlation)
서비스설정명 is highly overall correlated with 컨텐츠정보번호 and 3 other fields (High correlation)
컨텐츠정보번호 is highly overall correlated with 총페이지수 and 3 other fields (High correlation)
총페이지수 is highly overall correlated with 컨텐츠정보번호 and 1 other field (High correlation)
맛보기설정여부 is highly imbalanced (78.9%) (Imbalance)
컨텐츠URL has 30 (100.0%) missing values (Missing)
컨텐츠모바일URL has 30 (100.0%) missing values (Missing)
컨텐츠설명 has 30 (100.0%) missing values (Missing)
컨텐츠정보번호 has unique values (Unique)
컨텐츠URL is an unsupported type; check whether it needs cleaning or further analysis (Unsupported)
컨텐츠모바일URL is an unsupported type; check whether it needs cleaning or further analysis (Unsupported)
컨텐츠설명 is an unsupported type; check whether it needs cleaning or further analysis (Unsupported)
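
The 78.9% imbalance figure for 맛보기설정여부 can be reproduced from its class counts (29 False, 1 True). The sketch below assumes the score is one minus the Shannon entropy normalised by log2 of the number of classes; that definition is an assumption made here because it matches the reported value, and `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class shares and k is the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumed convention for a single-class column
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Class counts of 맛보기설정여부: 29 False, 1 True.
score = imbalance_score([29, 1])
print(f"{score:.1%}")
```

A perfectly balanced column scores 0 under this definition, and the score approaches 1 as one class dominates.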

Reproduction

Analysis started: 2023-12-10 13:54:35.639610
Analysis finished: 2023-12-10 13:54:37.534751
Duration: 1.9 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
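
The reported duration is the difference between the two timestamps. A small sketch with the standard library, using the timestamps exactly as printed above:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction block.
started = datetime.strptime("2023-12-10 13:54:35.639610", FMT)
finished = datetime.strptime("2023-12-10 13:54:37.534751", FMT)

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.1f} seconds")
```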

Variables

컨텐츠정보번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 846.5
Minimum: 683
Maximum: 977
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 683
5th percentile: 684.45
Q1: 726.25
Median: 882.5
Q3: 912.25
95th percentile: 962.15
Maximum: 977
Range: 294
Interquartile range (IQR): 186

Descriptive statistics

Standard deviation: 102.91635
Coefficient of variation (CV): 0.12157868
Kurtosis: -0.97966141
Mean: 846.5
Median Absolute Deviation (MAD): 50
Skewness: -0.77746742
Sum: 25395
Variance: 10591.776
Monotonicity: Not monotonic
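
Several of these statistics are derived from one another. The sketch below recomputes the mean, coefficient of variation, variance, IQR and range from the figures reported for 컨텐츠정보번호; it is a consistency check on the printed values, not a recomputation from the raw data.

```python
# Figures copied from the statistics reported for 컨텐츠정보번호.
n = 30
total = 25395
std = 102.91635
q1, q3 = 726.25, 912.25
minimum, maximum = 683, 977

mean = total / n               # sum divided by the number of observations
cv = std / mean                # coefficient of variation
variance = std ** 2            # variance is the squared standard deviation
iqr = q3 - q1                  # interquartile range
value_range = maximum - minimum

print(mean, round(cv, 8), round(variance, 3), iqr, value_range)
```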
Histogram with fixed size bins (bins=30)

Common Values

Value | Count | Frequency (%)
683 | 1 | 3.3%
949 | 1 | 3.3%
689 | 1 | 3.3%
688 | 1 | 3.3%
687 | 1 | 3.3%
686 | 1 | 3.3%
685 | 1 | 3.3%
684 | 1 | 3.3%
904 | 1 | 3.3%
893 | 1 | 3.3%
Other values (20) | 20 | 66.7%

Minimum 10 values

Value | Count | Frequency (%)
683 | 1 | 3.3%
684 | 1 | 3.3%
685 | 1 | 3.3%
686 | 1 | 3.3%
687 | 1 | 3.3%
688 | 1 | 3.3%
689 | 1 | 3.3%
692 | 1 | 3.3%
829 | 1 | 3.3%
876 | 1 | 3.3%

Maximum 10 values

Value | Count | Frequency (%)
977 | 1 | 3.3%
968 | 1 | 3.3%
955 | 1 | 3.3%
949 | 1 | 3.3%
943 | 1 | 3.3%
938 | 1 | 3.3%
929 | 1 | 3.3%
915 | 1 | 3.3%
904 | 1 | 3.3%
893 | 1 | 3.3%

컨텐츠명
Text

Distinct: 18
Distinct (%): 60.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 33
Median length: 30
Mean length: 11.966667
Min length: 6

Characters and Unicode

Total characters: 359
Distinct characters: 110
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
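
As an illustration of those character properties, the sketch below counts Unicode general categories for one value of this column (the third sample row) with Python's standard `unicodedata` module. The two-letter codes map to the category names used in this report: `Lo` is Other Letter, `Zs` Space Separator, `Ll` Lowercase Letter, `Lu` Uppercase Letter, `Ps` Open Punctuation and `Pe` Close Punctuation. This is an independent sketch, not the profiler's own code.

```python
import unicodedata
from collections import Counter

# Third sample row of the text column.
text = "강모 팁(Bristles Tip) 기능과 브러시 옵션 익히기"

# Count the Unicode general category of every character.
category_counts = Counter(unicodedata.category(ch) for ch in text)

for category, count in category_counts.most_common():
    print(category, count)
```

The string is 33 characters long, which matches the maximum length reported for the column.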

Unique

Unique: 17
Unique (%): 56.7%

Sample

1st row: 놀이치료의 상담환경에 대한 이해
2nd row: 오리엔테이션
3rd row: 강모 팁(Bristles Tip) 기능과 브러시 옵션 익히기
4th row: 오리엔테이션
5th row: 타이포그래피 1

Words

Value | Count | Frequency (%)
오리엔테이션 | 13 | 16.5%
미술치료의 | 3 | 3.8%
과정 | 3 | 3.8%
놀이치료의 | 3 | 3.8%
주요기능 | 3 | 3.8%
프로그램의 | 3 | 3.8%
1 | 3 | 3.8%
2 | 3 | 3.8%
adobe | 2 | 2.5%
indesign | 2 | 2.5%
Other values (38) | 41 | 51.9%

Most occurring characters

Value | Count | Frequency (%)
(space) | 49 | 13.6%
 | 24 | 6.7%
 | 14 | 3.9%
 | 13 | 3.6%
 | 13 | 3.6%
 | 13 | 3.6%
 | 13 | 3.6%
 | 10 | 2.8%
 | 9 | 2.5%
 | 8 | 2.2%
Other values (100) | 193 | 53.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 252 | 70.2%
Space Separator | 49 | 13.6%
Lowercase Letter | 37 | 10.3%
Uppercase Letter | 11 | 3.1%
Decimal Number | 6 | 1.7%
Dash Punctuation | 2 | 0.6%
Close Punctuation | 1 | 0.3%
Open Punctuation | 1 | 0.3%

Most frequent character per category

Other Letter

Value | Count | Frequency (%)
 | 24 | 9.5%
 | 14 | 5.6%
 | 13 | 5.2%
 | 13 | 5.2%
 | 13 | 5.2%
 | 13 | 5.2%
 | 10 | 4.0%
 | 9 | 3.6%
 | 8 | 3.2%
 | 8 | 3.2%
Other values (71) | 127 | 50.4%

Lowercase Letter

Value | Count | Frequency (%)
s | 6 | 16.2%
e | 6 | 16.2%
i | 4 | 10.8%
n | 4 | 10.8%
r | 3 | 8.1%
b | 2 | 5.4%
d | 2 | 5.4%
o | 2 | 5.4%
g | 2 | 5.4%
p | 1 | 2.7%
Other values (5) | 5 | 13.5%

Uppercase Letter

Value | Count | Frequency (%)
A | 2 | 18.2%
I | 2 | 18.2%
D | 2 | 18.2%
T | 1 | 9.1%
B | 1 | 9.1%
Q | 1 | 9.1%
X | 1 | 9.1%
P | 1 | 9.1%

Decimal Number

Value | Count | Frequency (%)
1 | 3 | 50.0%
2 | 3 | 50.0%

Space Separator

Value | Count | Frequency (%)
(space) | 49 | 100.0%

Dash Punctuation

Value | Count | Frequency (%)
- | 2 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 1 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 252 | 70.2%
Common | 59 | 16.4%
Latin | 48 | 13.4%

Most frequent character per script

Hangul

Value | Count | Frequency (%)
 | 24 | 9.5%
 | 14 | 5.6%
 | 13 | 5.2%
 | 13 | 5.2%
 | 13 | 5.2%
 | 13 | 5.2%
 | 10 | 4.0%
 | 9 | 3.6%
 | 8 | 3.2%
 | 8 | 3.2%
Other values (71) | 127 | 50.4%

Latin

Value | Count | Frequency (%)
s | 6 | 12.5%
e | 6 | 12.5%
i | 4 | 8.3%
n | 4 | 8.3%
r | 3 | 6.2%
b | 2 | 4.2%
A | 2 | 4.2%
d | 2 | 4.2%
o | 2 | 4.2%
I | 2 | 4.2%
Other values (13) | 15 | 31.2%

Common

Value | Count | Frequency (%)
(space) | 49 | 83.1%
1 | 3 | 5.1%
2 | 3 | 5.1%
- | 2 | 3.4%
) | 1 | 1.7%
( | 1 | 1.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 252 | 70.2%
ASCII | 107 | 29.8%

Most frequent character per block

ASCII

Value | Count | Frequency (%)
(space) | 49 | 45.8%
s | 6 | 5.6%
e | 6 | 5.6%
i | 4 | 3.7%
n | 4 | 3.7%
r | 3 | 2.8%
1 | 3 | 2.8%
2 | 3 | 2.8%
b | 2 | 1.9%
A | 2 | 1.9%
Other values (19) | 25 | 23.4%

Hangul

Value | Count | Frequency (%)
 | 24 | 9.5%
 | 14 | 5.6%
 | 13 | 5.2%
 | 13 | 5.2%
 | 13 | 5.2%
 | 13 | 5.2%
 | 10 | 4.0%
 | 9 | 3.6%
 | 8 | 3.2%
 | 8 | 3.2%
Other values (71) | 127 | 50.4%

맛보기설정여부
Boolean

IMBALANCE 

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 162.0 B

Value | Count | Frequency (%)
False | 29 | 96.7%
True | 1 | 3.3%

총페이지수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 8
Distinct (%): 26.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.4333333
Minimum: 1
Maximum: 15
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 1
5th percentile: 1
Q1: 1
Median: 11
Q3: 12
95th percentile: 13.55
Maximum: 15
Range: 14
Interquartile range (IQR): 11

Descriptive statistics

Standard deviation: 5.5253855
Coefficient of variation (CV): 0.74332541
Kurtosis: -1.9124752
Mean: 7.4333333
Median Absolute Deviation (MAD): 3.5
Skewness: -0.17880923
Sum: 223
Variance: 30.529885
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=8)

Common Values

Value | Count | Frequency (%)
1 | 11 | 36.7%
12 | 9 | 30.0%
13 | 3 | 10.0%
5 | 2 | 6.7%
11 | 2 | 6.7%
4 | 1 | 3.3%
14 | 1 | 3.3%
15 | 1 | 3.3%

Minimum 10 values

Value | Count | Frequency (%)
1 | 11 | 36.7%
4 | 1 | 3.3%
5 | 2 | 6.7%
11 | 2 | 6.7%
12 | 9 | 30.0%
13 | 3 | 10.0%
14 | 1 | 3.3%
15 | 1 | 3.3%

Maximum 10 values

Value | Count | Frequency (%)
15 | 1 | 3.3%
14 | 1 | 3.3%
13 | 3 | 10.0%
12 | 9 | 30.0%
11 | 2 | 6.7%
5 | 2 | 6.7%
4 | 1 | 3.3%
1 | 11 | 36.7%
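
Because 총페이지수 has only eight distinct values, its summary statistics can be rebuilt from the value counts alone. The sketch below expands the counts into the 30 observations and recomputes the sum, mean, median, variance and standard deviation with the standard library. It assumes the reported variance is the sample variance (divisor n - 1), which is what `statistics.variance` computes.

```python
import statistics

# Value -> count, copied from the value counts of 총페이지수.
value_counts = {1: 11, 12: 9, 13: 3, 5: 2, 11: 2, 4: 1, 14: 1, 15: 1}

# Expand the counts back into the 30 individual observations.
values = sorted(v for v, c in value_counts.items() for _ in range(c))

total = sum(values)
mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (n - 1)
std = statistics.stdev(values)          # sample standard deviation

print(total, mean, median, variance, std)
```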

학습분수
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 30 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
0 | 30 | 100.0%

차시비용
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 30 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
0 | 30 | 100.0%

정산분배선택명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: W
2nd row: W
3rd row: W
4th row: W
5th row: W

Common Values

Value | Count | Frequency (%)
W | 30 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
w | 30 | 100.0%

서비스설정명
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 3
Median length: 3
Mean length: 2.3333333
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: W;M
2nd row: W;M
3rd row: W;M
4th row: W;M
5th row: W;M

Common Values

Value | Count | Frequency (%)
W;M | 20 | 66.7%
W | 10 | 33.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
w;m | 20 | 66.7%
w | 10 | 33.3%

사용여부
Boolean

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 162.0 B

Value | Count | Frequency (%)
True | 30 | 100.0%

컨텐츠URL
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 30
Missing (%): 100.0%
Memory size: 402.0 B

컨텐츠모바일URL
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 30
Missing (%): 100.0%
Memory size: 402.0 B

컨텐츠설명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 30
Missing (%): 100.0%
Memory size: 402.0 B

상위범주명
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 13.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 4
Median length: 2
Mean length: 2.4333333
Min length: 2

Unique

Unique: 1
Unique (%): 3.3%

Sample

1st row: 취업
2nd row: 취업
3rd row: 정보화
4th row: 취업
5th row: 취업

Common Values

Value | Count | Frequency (%)
취업 | 19 | 63.3%
여성취업 | 6 | 20.0%
IT | 4 | 13.3%
정보화 | 1 | 3.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
취업 | 19 | 63.3%
여성취업 | 6 | 20.0%
it | 4 | 13.3%
정보화 | 1 | 3.3%

하위범주명
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 20.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 7
Median length: 6
Mean length: 4.4666667
Min length: 3

Unique

Unique: 1
Unique (%): 3.3%

Sample

1st row: 교육; 컨설팅
2nd row: 교육; 컨설팅
3rd row: 멀티미디어
4th row: 디자인
5th row: 디자인

Common Values

Value | Count | Frequency (%)
디자인 | 11 | 36.7%
교육; 컨설팅 | 8 | 26.7%
경영직업군 | 4 | 13.3%
O/A | 4 | 13.3%
취업일반 | 2 | 6.7%
멀티미디어 | 1 | 3.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
디자인 | 11 | 28.9%
교육 | 8 | 21.1%
컨설팅 | 8 | 21.1%
경영직업군 | 4 | 10.5%
o/a | 4 | 10.5%
취업일반 | 2 | 5.3%
멀티미디어 | 1 | 2.6%
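
The plot table for 하위범주명 counts words rather than whole values: "교육; 컨설팅" contributes to both 교육 and 컨설팅, so the percentages are shares of 38 words instead of 30 rows. The sketch below reproduces those numbers under an assumed tokenisation (lowercase, split on whitespace, strip a trailing semicolon); the profiler's actual rule is not stated in the report.

```python
from collections import Counter

# Whole-value counts copied from the Common Values table of 하위범주명.
value_counts = {
    "디자인": 11,
    "교육; 컨설팅": 8,
    "경영직업군": 4,
    "O/A": 4,
    "취업일반": 2,
    "멀티미디어": 1,
}

# Assumed tokenisation: lowercase, split on whitespace, drop semicolons.
word_counts = Counter()
for value, count in value_counts.items():
    for token in value.lower().split():
        word = token.strip(";")
        if word:
            word_counts[word] += count

total_words = sum(word_counts.values())
shares = {w: round(100 * c / total_words, 1) for w, c in word_counts.items()}

for word, count in word_counts.most_common():
    print(word, count, f"{shares[word]}%")
```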

등록일시
DateTime

Distinct: 13
Distinct (%): 43.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2012-02-07 05:02:00
Maximum: 2014-05-20 04:25:00
Histogram with fixed size bins (bins=13)

데이터기준일자
DateTime

Distinct: 8
Distinct (%): 26.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2012-02-07 00:00:00
Maximum: 2014-05-20 00:00:00
Histogram with fixed size bins (bins=8)

Interactions

[Figures: four interaction plots]

Correlations

[Figure: correlation heatmap]

 | 컨텐츠정보번호 | 컨텐츠명 | 맛보기설정여부 | 총페이지수 | 서비스설정명 | 상위범주명 | 하위범주명 | 등록일시 | 데이터기준일자
컨텐츠정보번호 | 1.000 | 0.000 | 0.000 | 0.585 | 1.000 | 1.000 | 0.998 | 0.986 | 0.890
컨텐츠명 | 0.000 | 1.000 | 1.000 | 0.860 | 0.565 | 0.000 | 0.000 | 0.000 | 0.000
맛보기설정여부 | 0.000 | 1.000 | 1.000 | 0.371 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000
총페이지수 | 0.585 | 0.860 | 0.371 | 1.000 | 0.786 | 0.553 | 0.608 | 0.829 | 0.650
서비스설정명 | 1.000 | 0.565 | 0.000 | 0.786 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
상위범주명 | 1.000 | 0.000 | 0.000 | 0.553 | 1.000 | 1.000 | 1.000 | 0.900 | 0.940
하위범주명 | 0.998 | 0.000 | 0.000 | 0.608 | 1.000 | 1.000 | 1.000 | 0.949 | 0.910
등록일시 | 0.986 | 0.000 | 1.000 | 0.829 | 1.000 | 0.900 | 0.949 | 1.000 | 1.000
데이터기준일자 | 0.890 | 0.000 | 0.000 | 0.650 | 1.000 | 0.940 | 0.910 | 1.000 | 1.000

[Figure: correlation heatmap]

 | 하위범주명 | 상위범주명 | 맛보기설정여부 | 서비스설정명
하위범주명 | 1.000 | 0.961 | 0.000 | 0.926
상위범주명 | 0.961 | 1.000 | 0.000 | 0.964
맛보기설정여부 | 0.000 | 0.000 | 1.000 | 0.000
서비스설정명 | 0.926 | 0.964 | 0.000 | 1.000

[Figure: correlation heatmap]

 | 컨텐츠정보번호 | 총페이지수 | 맛보기설정여부 | 서비스설정명 | 상위범주명 | 하위범주명
컨텐츠정보번호 | 1.000 | -0.639 | 0.000 | 0.926 | 0.961 | 0.938
총페이지수 | -0.639 | 1.000 | 0.423 | 0.867 | 0.467 | 0.452
맛보기설정여부 | 0.000 | 0.423 | 1.000 | 0.000 | 0.000 | 0.000
서비스설정명 | 0.926 | 0.867 | 0.000 | 1.000 | 0.964 | 0.926
상위범주명 | 0.961 | 0.467 | 0.000 | 0.964 | 1.000 | 0.961
하위범주명 | 0.938 | 0.452 | 0.000 | 0.926 | 0.961 | 1.000
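
The high-correlation alerts in the overview appear to follow from the last matrix above. The sketch below lists, for each variable, the other variables whose absolute coefficient is at least 0.5. The 0.5 cutoff and the helper name `highly_correlated` are assumptions made for illustration; with that cutoff the partner counts match the "and N other fields" wording of the alerts.

```python
THRESHOLD = 0.5  # assumed cutoff, not stated in the report

names = ["컨텐츠정보번호", "총페이지수", "맛보기설정여부",
         "서비스설정명", "상위범주명", "하위범주명"]

# Six-variable correlation matrix, copied row by row.
matrix = [
    [1.000, -0.639, 0.000, 0.926, 0.961, 0.938],
    [-0.639, 1.000, 0.423, 0.867, 0.467, 0.452],
    [0.000, 0.423, 1.000, 0.000, 0.000, 0.000],
    [0.926, 0.867, 0.000, 1.000, 0.964, 0.926],
    [0.961, 0.467, 0.000, 0.964, 1.000, 0.961],
    [0.938, 0.452, 0.000, 0.926, 0.961, 1.000],
]


def highly_correlated(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    return {
        name: [names[j] for j, value in enumerate(matrix[i])
               if j != i and abs(value) >= threshold]
        for i, name in enumerate(names)
    }


partners = highly_correlated(names, matrix, THRESHOLD)
for name, others in partners.items():
    print(name, len(others), others)
```

The absolute value matters here: 총페이지수 is flagged against 컨텐츠정보번호 because of a negative coefficient (-0.639).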

Missing values

[Figure: count] A simple visualization of nullity by column.
[Figure: matrix] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

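First rows

Index | 컨텐츠정보번호 | 컨텐츠명 | 맛보기설정여부 | 총페이지수 | 학습분수 | 차시비용 | 정산분배선택명 | 서비스설정명 | 사용여부 | 컨텐츠URL | 컨텐츠모바일URL | 컨텐츠설명 | 상위범주명 | 하위범주명 | 등록일시 | 데이터기준일자
0 | 683 | 놀이치료의 상담환경에 대한 이해 | N | 13 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 교육; 컨설팅 | 2012-02-17 07:02 | 2012-02-17
1 | 692 | 오리엔테이션 | N | 5 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 교육; 컨설팅 | 2012-02-07 05:02 | 2012-02-07
2 | 829 | 강모 팁(Bristles Tip) 기능과 브러시 옵션 익히기 | N | 1 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 정보화 | 멀티미디어 | 2012-02-07 06:02 | 2012-02-07
3 | 876 | 오리엔테이션 | N | 4 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 디자인 | 2012-02-10 05:02 | 2012-02-10
4 | 878 | 타이포그래피 1 | N | 12 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 디자인 | 2012-02-10 05:02 | 2012-02-10
5 | 879 | 타이포그래피 2 | N | 12 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 디자인 | 2012-02-10 05:02 | 2012-02-10
6 | 880 | 이미지와 컬러 | N | 12 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 디자인 | 2012-02-10 05:02 | 2012-02-10
7 | 881 | 레이아웃 1 | N | 12 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 디자인 | 2012-02-10 05:02 | 2012-02-10
8 | 882 | 레이아웃 2 | N | 12 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 디자인 | 2012-02-10 05:02 | 2012-02-10
9 | 883 | Adobe InDesign 프로그램의 주요기능 1 | N | 12 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 디자인 | 2012-02-10 05:02 | 2012-02-10

Last rows

Index | 컨텐츠정보번호 | 컨텐츠명 | 맛보기설정여부 | 총페이지수 | 학습분수 | 차시비용 | 정산분배선택명 | 서비스설정명 | 사용여부 | 컨텐츠URL | 컨텐츠모바일URL | 컨텐츠설명 | 상위범주명 | 하위범주명 | 등록일시 | 데이터기준일자
20 | 929 | 오리엔테이션 | N | 1 | 0 | 0 | W | W | Y | <NA> | <NA> | <NA> | 여성취업 | 경영직업군 | 2013-03-20 03:03 | 2013-03-20
21 | 955 | 오리엔테이션 | N | 1 | 0 | 0 | W | W | Y | <NA> | <NA> | <NA> | IT | O/A | 2013-03-20 03:03 | 2013-03-20
22 | 893 | 오리엔테이션 | N | 1 | 0 | 0 | W | W | Y | <NA> | <NA> | <NA> | 여성취업 | 취업일반 | 2013-05-20 09:05 | 2013-05-20
23 | 904 | 오리엔테이션 | N | 1 | 0 | 0 | W | W | Y | <NA> | <NA> | <NA> | 여성취업 | 취업일반 | 2013-05-20 10:05 | 2013-05-20
24 | 684 | 놀이치료의 과정 - 접수상담의 이론과 실제 | N | 11 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 교육; 컨설팅 | 2014-05-20 04:10 | 2014-05-20
25 | 685 | 놀이치료의 과정 - 상담초기부터 종료까지 | N | 13 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 교육; 컨설팅 | 2014-05-20 04:10 | 2014-05-20
26 | 686 | 오리엔테이션 | N | 5 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 교육; 컨설팅 | 2014-05-20 04:18 | 2014-05-20
27 | 687 | 미술치료의 개념과 미술치료 과정 | Y | 13 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 교육; 컨설팅 | 2014-05-20 04:25 | 2014-05-20
28 | 688 | 미술치료의 세 가지 이론적 관점 | N | 15 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 교육; 컨설팅 | 2014-05-20 04:19 | 2014-05-20
29 | 689 | 미술치료의 장점과 치료목적 | N | 11 | 0 | 0 | W | W;M | Y | <NA> | <NA> | <NA> | 취업 | 교육; 컨설팅 | 2014-05-20 04:19 | 2014-05-20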