Overview

Dataset statistics

Number of variables6
Number of observations122
Missing cells217
Missing cells (%)29.6%
Duplicate rows1
Duplicate rows (%)0.8%
Total size in memory6.3 KiB
Average record size in memory53.1 B

Variable types

Categorical2
Numeric4

Dataset

Description2014-2019년 문예진흥기금 공모사업 중 문학 분야 "집필공간운영" 지원 사업의 분야별 계량성과(예: 분야, 출간건수, 발표건수 등)
Author한국문화예술위원회
URLhttps://www.data.go.kr/data/15076471/fileData.do

Alerts

Dataset has 1 (0.8%) duplicate rowsDuplicates
출간(건) is highly overall correlated with 출간발표_계(건)High correlation
발표(건) is highly overall correlated with 출간발표_계(건)High correlation
출간발표_계(건) is highly overall correlated with 출간(건) and 1 other fieldsHigh correlation
출간(건) has 101 (82.8%) missing valuesMissing
발표(건) has 101 (82.8%) missing valuesMissing
출간발표_계(건) has 15 (12.3%) missing valuesMissing
출간(건) has 5 (4.1%) zerosZeros
출간발표_계(건) has 17 (13.9%) zerosZeros

Reproduction

Analysis started2023-12-12 11:53:00.601530
Analysis finished2023-12-12 11:53:04.015371
Duration3.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

문학단체명
Categorical

Distinct7
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
*을**집
26 
*지**단
24 
*버**집
24 
*1**학
19 
*날**날
18 
Other values (2)
11 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique1 ?
Unique (%)0.8%

Sample

1st row*악**원
2nd row*을**집
3rd row*1**학
4th row*지**단
5th row*날**날

Common Values

ValueCountFrequency (%)
*을**집 26
21.3%
*지**단 24
19.7%
*버**집 24
19.7%
*1**학 19
15.6%
*날**날 18
14.8%
*악**원 10
 
8.2%
*산**꽃 1
 
0.8%

Length

2023-12-12T20:53:04.130608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:53:04.320943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
을**집 26
21.3%
지**단 24
19.7%
버**집 24
19.7%
1**학 19
15.6%
날**날 18
14.8%
악**원 10
 
8.2%
산**꽃 1
 
0.8%

사업연도
Real number (ℝ)

Distinct6
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2017.5164
Minimum2014
Maximum2019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-12T20:53:04.517254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2014
5-th percentile2015
Q12017
median2018
Q32018
95-th percentile2019
Maximum2019
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.2212553
Coefficient of variation (CV)0.0006053261
Kurtosis0.90355943
Mean2017.5164
Median Absolute Deviation (MAD)1
Skewness-0.91047513
Sum246137
Variance1.4914646
MonotonicityIncreasing
2023-12-12T20:53:04.694360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2017 43
35.2%
2018 36
29.5%
2019 28
23.0%
2015 6
 
4.9%
2016 5
 
4.1%
2014 4
 
3.3%
ValueCountFrequency (%)
2014 4
 
3.3%
2015 6
 
4.9%
2016 5
 
4.1%
2017 43
35.2%
2018 36
29.5%
2019 28
23.0%
ValueCountFrequency (%)
2019 28
23.0%
2018 36
29.5%
2017 43
35.2%
2016 5
 
4.1%
2015 6
 
4.9%
2014 4
 
3.3%

분야
Categorical

Distinct8
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
아동문학
19 
산문
18 
<NA>
15 
시(조)
14 
소설
14 
Other values (3)
42 

Length

Max length4
Median length2
Mean length2.7868852
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
아동문학 19
15.6%
산문 18
14.8%
<NA> 15
12.3%
시(조) 14
11.5%
소설 14
11.5%
평론 14
11.5%
희곡 14
11.5%
기타 14
11.5%

Length

2023-12-12T20:53:04.936352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:53:05.160786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
아동문학 19
15.6%
산문 18
14.8%
na 15
12.3%
시(조 14
11.5%
소설 14
11.5%
평론 14
11.5%
희곡 14
11.5%
기타 14
11.5%

출간(건)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct10
Distinct (%)47.6%
Missing101
Missing (%)82.8%
Infinite0
Infinite (%)0.0%
Mean5.5238095
Minimum0
Maximum26
Zeros5
Zeros (%)4.1%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-12T20:53:05.355864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median3
Q37
95-th percentile19
Maximum26
Range26
Interquartile range (IQR)5

Descriptive statistics

Standard deviation6.6604733
Coefficient of variation (CV)1.2057753
Kurtosis3.9515287
Mean5.5238095
Median Absolute Deviation (MAD)3
Skewness1.9750317
Sum116
Variance44.361905
MonotonicityNot monotonic
2023-12-12T20:53:05.538541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
3 6
 
4.9%
0 5
 
4.1%
7 2
 
1.6%
5 2
 
1.6%
10 1
 
0.8%
13 1
 
0.8%
4 1
 
0.8%
19 1
 
0.8%
26 1
 
0.8%
2 1
 
0.8%
(Missing) 101
82.8%
ValueCountFrequency (%)
0 5
4.1%
2 1
 
0.8%
3 6
4.9%
4 1
 
0.8%
5 2
 
1.6%
7 2
 
1.6%
10 1
 
0.8%
13 1
 
0.8%
19 1
 
0.8%
26 1
 
0.8%
ValueCountFrequency (%)
26 1
 
0.8%
19 1
 
0.8%
13 1
 
0.8%
10 1
 
0.8%
7 2
 
1.6%
5 2
 
1.6%
4 1
 
0.8%
3 6
4.9%
2 1
 
0.8%
0 5
4.1%

발표(건)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct15
Distinct (%)71.4%
Missing101
Missing (%)82.8%
Infinite0
Infinite (%)0.0%
Mean19.619048
Minimum0
Maximum110
Zeros1
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-12T20:53:05.718198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q13
median9
Q322
95-th percentile77
Maximum110
Range110
Interquartile range (IQR)19

Descriptive statistics

Standard deviation27.625127
Coefficient of variation (CV)1.4080768
Kurtosis5.6257581
Mean19.619048
Median Absolute Deviation (MAD)7
Skewness2.3453169
Sum412
Variance763.14762
MonotonicityNot monotonic
2023-12-12T20:53:05.882066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
3 3
 
2.5%
2 3
 
2.5%
9 2
 
1.6%
17 2
 
1.6%
77 1
 
0.8%
11 1
 
0.8%
1 1
 
0.8%
110 1
 
0.8%
35 1
 
0.8%
13 1
 
0.8%
Other values (5) 5
 
4.1%
(Missing) 101
82.8%
ValueCountFrequency (%)
0 1
 
0.8%
1 1
 
0.8%
2 3
2.5%
3 3
2.5%
8 1
 
0.8%
9 2
1.6%
11 1
 
0.8%
13 1
 
0.8%
17 2
1.6%
22 1
 
0.8%
ValueCountFrequency (%)
110 1
0.8%
77 1
0.8%
43 1
0.8%
35 1
0.8%
25 1
0.8%
22 1
0.8%
17 2
1.6%
13 1
0.8%
11 1
0.8%
9 2
1.6%

출간발표_계(건)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct41
Distinct (%)38.3%
Missing15
Missing (%)12.3%
Infinite0
Infinite (%)0.0%
Mean17.186916
Minimum0
Maximum135
Zeros17
Zeros (%)13.9%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-12T20:53:06.058508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11.5
median8
Q317.5
95-th percentile72.7
Maximum135
Range135
Interquartile range (IQR)16

Descriptive statistics

Standard deviation26.116622
Coefficient of variation (CV)1.5195642
Kurtosis5.9214706
Mean17.186916
Median Absolute Deviation (MAD)7
Skewness2.3884737
Sum1839
Variance682.07794
MonotonicityNot monotonic
2023-12-12T20:53:06.223924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
0 17
13.9%
1 10
 
8.2%
2 9
 
7.4%
3 8
 
6.6%
9 5
 
4.1%
11 4
 
3.3%
4 4
 
3.3%
8 4
 
3.3%
16 4
 
3.3%
17 3
 
2.5%
Other values (31) 39
32.0%
(Missing) 15
 
12.3%
ValueCountFrequency (%)
0 17
13.9%
1 10
8.2%
2 9
7.4%
3 8
6.6%
4 4
 
3.3%
5 1
 
0.8%
6 2
 
1.6%
7 1
 
0.8%
8 4
 
3.3%
9 5
 
4.1%
ValueCountFrequency (%)
135 1
0.8%
114 1
0.8%
103 1
0.8%
87 1
0.8%
81 1
0.8%
73 1
0.8%
72 2
1.6%
67 1
0.8%
60 1
0.8%
50 1
0.8%

Interactions

2023-12-12T20:53:03.000916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:00.934432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:01.790411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:02.370990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:03.146596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:01.065012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:01.919006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:02.531854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:03.275447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:01.170841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:02.048335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:02.724911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:03.387896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:01.286664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:02.212047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:53:02.879059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:53:06.334868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
문학단체명사업연도분야출간(건)발표(건)출간발표_계(건)
문학단체명1.0000.3760.0000.0000.2590.223
사업연도0.3761.0000.0000.3410.2060.000
분야0.0000.0001.0000.5170.3980.379
출간(건)0.0000.3410.5171.0000.2990.577
발표(건)0.2590.2060.3980.2991.0000.980
출간발표_계(건)0.2230.0000.3790.5770.9801.000
2023-12-12T20:53:06.456185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분야문학단체명
분야1.0000.000
문학단체명0.0001.000
2023-12-12T20:53:06.550883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업연도출간(건)발표(건)출간발표_계(건)문학단체명분야
사업연도1.0000.0850.1840.2930.2060.000
출간(건)0.0851.0000.1970.5010.0000.145
발표(건)0.1840.1971.0000.9160.0000.172
출간발표_계(건)0.2930.5010.9161.0000.1100.206
문학단체명0.2060.0000.0000.1101.0000.000
분야0.0000.1450.1720.2060.0001.000

Missing values

2023-12-12T20:53:03.564112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:53:03.741014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T20:53:03.916850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

문학단체명사업연도분야출간(건)발표(건)출간발표_계(건)
0*악**원2014<NA><NA><NA><NA>
1*을**집2014<NA><NA><NA><NA>
2*1**학2014<NA><NA><NA><NA>
3*지**단2014<NA><NA><NA><NA>
4*날**날2015<NA><NA><NA><NA>
5*악**원2015<NA><NA><NA><NA>
6*산**꽃2015<NA><NA><NA><NA>
7*을**집2015<NA><NA><NA><NA>
8*1**학2015<NA><NA><NA><NA>
9*지**단2015<NA><NA><NA><NA>
문학단체명사업연도분야출간(건)발표(건)출간발표_계(건)
112*버**집2019평론<NA><NA>72
113*버**집2019희곡<NA><NA>0
114*버**집2019기타<NA><NA>10
115*악**원2019시(조)<NA><NA>72
116*악**원2019산문<NA><NA>2
117*악**원2019소설<NA><NA>17
118*악**원2019아동문학<NA><NA>1
119*악**원2019평론<NA><NA>0
120*악**원2019희곡<NA><NA>0
121*악**원2019기타<NA><NA>4

Duplicate rows

Most frequently occurring

문학단체명사업연도분야출간(건)발표(건)출간발표_계(건)# duplicates
0*1**학2017산문<NA><NA>02