Overview

Dataset statistics

Number of variables6
Number of observations351
Missing cells22
Missing cells (%)1.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory17.6 KiB
Average record size in memory51.4 B

Variable types

Text2
Categorical2
Numeric2

Dataset

Description문화포털(https://www.culture.go.kr/)내에 제공하는 예술지식백과(상위메뉴: 문화지식) 메뉴에서 제공하는 예술용어 관련 설명자료 입니다.
Author한국문화정보원
URLhttps://www.data.go.kr/data/15067530/fileData.do

Alerts

분류체계 사용여부 has constant value ""Constant
정보분류체계 상위코드 has 22 (6.3%) missing valuesMissing

Reproduction

Analysis started2023-12-12 22:51:12.990940
Analysis finished2023-12-12 22:51:13.843559
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct242
Distinct (%)68.9%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-13T07:51:14.125590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length4.8490028
Min length1

Characters and Unicode

Total characters1702
Distinct characters201
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique190 ?
Unique (%)54.1%

Sample

1st row무용
2nd row연극
3rd row음악
4th row미술
5th row건축
ValueCountFrequency (%)
이후 10
 
2.5%
작품 8
 
2.0%
예술가 8
 
2.0%
예술단체 8
 
2.0%
출판목록 8
 
2.0%
영상 7
 
1.7%
교육/학술 7
 
1.7%
웹사이트 6
 
1.5%
음반 6
 
1.5%
출판 6
 
1.5%
Other values (251) 328
81.6%
2023-12-13T07:51:14.683469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 103
 
6.1%
9 98
 
5.8%
89
 
5.2%
51
 
3.0%
0 51
 
3.0%
40
 
2.4%
38
 
2.2%
34
 
2.0%
33
 
1.9%
33
 
1.9%
Other values (191) 1132
66.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1259
74.0%
Decimal Number 352
 
20.7%
Space Separator 51
 
3.0%
Math Symbol 25
 
1.5%
Other Punctuation 12
 
0.7%
Dash Punctuation 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
89
 
7.1%
40
 
3.2%
38
 
3.0%
34
 
2.7%
33
 
2.6%
33
 
2.6%
32
 
2.5%
31
 
2.5%
26
 
2.1%
26
 
2.1%
Other values (175) 877
69.7%
Decimal Number
ValueCountFrequency (%)
1 103
29.3%
9 98
27.8%
0 51
14.5%
5 18
 
5.1%
4 17
 
4.8%
8 16
 
4.5%
6 14
 
4.0%
2 13
 
3.7%
7 12
 
3.4%
3 10
 
2.8%
Other Punctuation
ValueCountFrequency (%)
/ 9
75.0%
, 2
 
16.7%
· 1
 
8.3%
Space Separator
ValueCountFrequency (%)
51
100.0%
Math Symbol
ValueCountFrequency (%)
~ 25
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1259
74.0%
Common 443
 
26.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
 
7.1%
40
 
3.2%
38
 
3.0%
34
 
2.7%
33
 
2.6%
33
 
2.6%
32
 
2.5%
31
 
2.5%
26
 
2.1%
26
 
2.1%
Other values (175) 877
69.7%
Common
ValueCountFrequency (%)
1 103
23.3%
9 98
22.1%
51
11.5%
0 51
11.5%
~ 25
 
5.6%
5 18
 
4.1%
4 17
 
3.8%
8 16
 
3.6%
6 14
 
3.2%
2 13
 
2.9%
Other values (6) 37
 
8.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1259
74.0%
ASCII 442
 
26.0%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 103
23.3%
9 98
22.2%
51
11.5%
0 51
11.5%
~ 25
 
5.7%
5 18
 
4.1%
4 17
 
3.8%
8 16
 
3.6%
6 14
 
3.2%
2 13
 
2.9%
Other values (5) 36
 
8.1%
Hangul
ValueCountFrequency (%)
89
 
7.1%
40
 
3.2%
38
 
3.0%
34
 
2.7%
33
 
2.6%
33
 
2.6%
32
 
2.5%
31
 
2.5%
26
 
2.1%
26
 
2.1%
Other values (175) 877
69.7%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct116
Distinct (%)35.3%
Missing22
Missing (%)6.3%
Memory size2.9 KiB
2023-12-13T07:51:15.053538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters1316
Distinct characters22
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)14.0%

Sample

1st rowF000
2nd rowJ000
3rd rowJ000
4th rowB000
5th rowB000
ValueCountFrequency (%)
d001 10
 
3.0%
a001 9
 
2.7%
h051 8
 
2.4%
h031 7
 
2.1%
g011 7
 
2.1%
a021 7
 
2.1%
h041 7
 
2.1%
b002 6
 
1.8%
h061 6
 
1.8%
a000 5
 
1.5%
Other values (106) 257
78.1%
2023-12-13T07:51:15.572685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 523
39.7%
1 230
17.5%
2 90
 
6.8%
A 63
 
4.8%
B 54
 
4.1%
H 52
 
4.0%
5 38
 
2.9%
3 37
 
2.8%
4 37
 
2.8%
D 35
 
2.7%
Other values (12) 157
 
11.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 987
75.0%
Uppercase Letter 329
 
25.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 63
19.1%
B 54
16.4%
H 52
15.8%
D 35
10.6%
G 26
7.9%
C 26
7.9%
E 21
 
6.4%
J 19
 
5.8%
X 14
 
4.3%
F 9
 
2.7%
Other values (2) 10
 
3.0%
Decimal Number
ValueCountFrequency (%)
0 523
53.0%
1 230
23.3%
2 90
 
9.1%
5 38
 
3.9%
3 37
 
3.7%
4 37
 
3.7%
6 17
 
1.7%
9 6
 
0.6%
7 6
 
0.6%
8 3
 
0.3%

Most occurring scripts

ValueCountFrequency (%)
Common 987
75.0%
Latin 329
 
25.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 63
19.1%
B 54
16.4%
H 52
15.8%
D 35
10.6%
G 26
7.9%
C 26
7.9%
E 21
 
6.4%
J 19
 
5.8%
X 14
 
4.3%
F 9
 
2.7%
Other values (2) 10
 
3.0%
Common
ValueCountFrequency (%)
0 523
53.0%
1 230
23.3%
2 90
 
9.1%
5 38
 
3.9%
3 37
 
3.7%
4 37
 
3.7%
6 17
 
1.7%
9 6
 
0.6%
7 6
 
0.6%
8 3
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1316
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 523
39.7%
1 230
17.5%
2 90
 
6.8%
A 63
 
4.8%
B 54
 
4.1%
H 52
 
4.0%
5 38
 
2.9%
3 37
 
2.8%
4 37
 
2.8%
D 35
 
2.7%
Other values (12) 157
 
11.9%
Distinct4
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
4
188 
3
102 
2
47 
1
 
14

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
4 188
53.6%
3 102
29.1%
2 47
 
13.4%
1 14
 
4.0%

Length

2023-12-13T07:51:15.750290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:51:15.887459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
4 188
53.6%
3 102
29.1%
2 47
 
13.4%
1 14
 
4.0%
Distinct77
Distinct (%)21.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24.965812
Minimum1
Maximum82
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-13T07:51:16.036751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q19
median20
Q337
95-th percentile63.5
Maximum82
Range81
Interquartile range (IQR)28

Descriptive statistics

Standard deviation19.787268
Coefficient of variation (CV)0.79257458
Kurtosis-0.16055902
Mean24.965812
Median Absolute Deviation (MAD)14
Skewness0.83012301
Sum8763
Variance391.53597
MonotonicityNot monotonic
2023-12-13T07:51:16.207870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 14
 
4.0%
4 11
 
3.1%
5 11
 
3.1%
6 11
 
3.1%
2 11
 
3.1%
9 11
 
3.1%
8 10
 
2.8%
7 10
 
2.8%
14 9
 
2.6%
15 9
 
2.6%
Other values (67) 244
69.5%
ValueCountFrequency (%)
1 14
4.0%
2 11
3.1%
3 9
2.6%
4 11
3.1%
5 11
3.1%
6 11
3.1%
7 10
2.8%
8 10
2.8%
9 11
3.1%
10 9
2.6%
ValueCountFrequency (%)
82 1
0.3%
81 1
0.3%
80 1
0.3%
79 1
0.3%
78 1
0.3%
73 1
0.3%
72 2
0.6%
71 1
0.3%
69 2
0.6%
68 1
0.3%
Distinct15
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.5698006
Minimum1
Maximum22
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-13T07:51:16.374295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q39
95-th percentile21
Maximum22
Range21
Interquartile range (IQR)6

Descriptive statistics

Standard deviation5.2246777
Coefficient of variation (CV)0.79525666
Kurtosis2.245353
Mean6.5698006
Median Absolute Deviation (MAD)3
Skewness1.4816822
Sum2306
Variance27.297257
MonotonicityNot monotonic
2023-12-13T07:51:16.494941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
1 64
18.2%
3 55
15.7%
9 53
15.1%
5 37
10.5%
4 28
8.0%
8 27
7.7%
11 26
7.4%
7 22
 
6.3%
22 15
 
4.3%
21 9
 
2.6%
Other values (5) 15
 
4.3%
ValueCountFrequency (%)
1 64
18.2%
3 55
15.7%
4 28
8.0%
5 37
10.5%
6 9
 
2.6%
7 22
 
6.3%
8 27
7.7%
9 53
15.1%
10 3
 
0.9%
11 26
7.4%
ValueCountFrequency (%)
22 15
 
4.3%
21 9
 
2.6%
19 1
 
0.3%
13 1
 
0.3%
12 1
 
0.3%
11 26
7.4%
10 3
 
0.9%
9 53
15.1%
8 27
7.7%
7 22
6.3%

분류체계 사용여부
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
사용
351 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row 사용
2nd row 사용
3rd row 사용
4th row 사용
5th row 사용

Common Values

ValueCountFrequency (%)
사용 351
100.0%

Length

2023-12-13T07:51:16.633762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:51:16.737695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사용 351
100.0%

Interactions

2023-12-13T07:51:13.471543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:51:13.259663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:51:13.568714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:51:13.379804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:51:16.828596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정보 분류체계 깊이정보 분류체계 일련번호정보분류체계 그룹 번호
정보 분류체계 깊이1.0000.3300.564
정보 분류체계 일련번호0.3301.0000.345
정보분류체계 그룹 번호0.5640.3451.000
2023-12-13T07:51:16.925220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정보 분류체계 일련번호정보분류체계 그룹 번호정보 분류체계 깊이
정보 분류체계 일련번호1.000-0.2980.201
정보분류체계 그룹 번호-0.2981.0000.278
정보 분류체계 깊이0.2010.2781.000

Missing values

2023-12-13T07:51:13.675333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:51:13.802311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

정보분류체계 제목정보분류체계 상위코드정보 분류체계 깊이정보 분류체계 일련번호정보분류체계 그룹 번호분류체계 사용여부
0무용<NA>114사용
1연극<NA>111사용
2음악<NA>113사용
3미술<NA>115사용
4건축<NA>117사용
5영상<NA>118사용
6문학<NA>119사용
7문화정책<NA>1110사용
8축제문화공간<NA>1111사용
9관람지원정보<NA>1122사용
정보분류체계 제목정보분류체계 상위코드정보 분류체계 깊이정보 분류체계 일련번호정보분류체계 그룹 번호분류체계 사용여부
341문학관<NA>45511사용
342공연장<NA>41311사용
343영화관<NA>41411사용
344박물관J02144811사용
345전시실J02144911사용
346미술관J02145011사용
347화랑J02145111사용
348문화복지시설<NA>45211사용
349도서관<NA>45311사용
350문화보급전수시설<NA>45411사용