Overview

Dataset statistics

Number of variables6
Number of observations31
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory53.3 B

Variable types

Categorical4
Numeric1
DateTime1

Dataset

DescriptionR&D전문기관인 한국에너지기술평가원에서 담당하는 에너지기술개발사업 중 사업 성격에 따른 과제 평가 결과 및 과제 성공 여부
Author한국에너지기술평가원
URLhttps://www.data.go.kr/data/15104708/fileData.do

Alerts

데이터기준일 has constant value ""Constant
성공과제구분 is highly overall correlated with 평가결과High correlation
평가결과 is highly overall correlated with 과제수(개) and 1 other fieldsHigh correlation
과제수(개) is highly overall correlated with 평가결과High correlation

Reproduction

Analysis started2023-12-12 08:34:29.897341
Analysis finished2023-12-12 08:34:30.655543
Duration0.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

성격
Categorical

Distinct2
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size380.0 B
기술개발
16 
기반조성
15 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기술개발
2nd row기술개발
3rd row기술개발
4th row기술개발
5th row기술개발

Common Values

ValueCountFrequency (%)
기술개발 16
51.6%
기반조성 15
48.4%

Length

2023-12-12T17:34:30.761295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:34:30.927148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기술개발 16
51.6%
기반조성 15
48.4%

평가결과
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)22.6%
Missing0
Missing (%)0.0%
Memory size380.0 B
성실수행
10 
보통
혁신성과
불성실수행
우수
Other values (2)

Length

Max length6
Median length5
Mean length3.483871
Min length2

Unique

Unique2 ?
Unique (%)6.5%

Sample

1st row혁신성과
2nd row혁신성과
3rd row우수
4th row보통
5th row보통

Common Values

ValueCountFrequency (%)
성실수행 10
32.3%
보통 8
25.8%
혁신성과 5
16.1%
불성실수행 4
 
12.9%
우수 2
 
6.5%
완료 1
 
3.2%
보통(완료) 1
 
3.2%

Length

2023-12-12T17:34:31.095273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:34:31.290323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
성실수행 10
32.3%
보통 8
25.8%
혁신성과 5
16.1%
불성실수행 4
 
12.9%
우수 2
 
6.5%
완료 1
 
3.2%
보통(완료 1
 
3.2%

성공과제구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size380.0 B
성공과제
17 
비성공과제
14 

Length

Max length5
Median length4
Mean length4.4516129
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row성공과제
2nd row성공과제
3rd row성공과제
4th row성공과제
5th row성공과제

Common Values

ValueCountFrequency (%)
성공과제 17
54.8%
비성공과제 14
45.2%

Length

2023-12-12T17:34:31.462807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:34:31.937991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
성공과제 17
54.8%
비성공과제 14
45.2%

연도
Categorical

Distinct6
Distinct (%)19.4%
Missing0
Missing (%)0.0%
Memory size380.0 B
2017
2018
2021
2020
2019

Length

Max length5
Median length4
Mean length4.0322581
Min length4

Unique

Unique1 ?
Unique (%)3.2%

Sample

1st row2017
2nd row2018
3rd row2021
4th row2017
5th row2018

Common Values

ValueCountFrequency (%)
2017 7
22.6%
2018 7
22.6%
2021 6
19.4%
2020 6
19.4%
2019 4
12.9%
2020년 1
 
3.2%

Length

2023-12-12T17:34:32.067400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:34:32.206990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 7
22.6%
2018 7
22.6%
2021 6
19.4%
2020 6
19.4%
2019 4
12.9%
2020년 1
 
3.2%

과제수(개)
Real number (ℝ)

HIGH CORRELATION 

Distinct22
Distinct (%)71.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42.580645
Minimum1
Maximum210
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size411.0 B
2023-12-12T17:34:32.384677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median9
Q355
95-th percentile171
Maximum210
Range209
Interquartile range (IQR)52

Descriptive statistics

Standard deviation60.283096
Coefficient of variation (CV)1.4157394
Kurtosis1.8675818
Mean42.580645
Median Absolute Deviation (MAD)8
Skewness1.6658109
Sum1320
Variance3634.0516
MonotonicityNot monotonic
2023-12-12T17:34:32.545804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
1 5
16.1%
3 4
 
12.9%
4 3
 
9.7%
2 1
 
3.2%
9 1
 
3.2%
8 1
 
3.2%
6 1
 
3.2%
84 1
 
3.2%
27 1
 
3.2%
49 1
 
3.2%
Other values (12) 12
38.7%
ValueCountFrequency (%)
1 5
16.1%
2 1
 
3.2%
3 4
12.9%
4 3
9.7%
6 1
 
3.2%
8 1
 
3.2%
9 1
 
3.2%
11 1
 
3.2%
21 1
 
3.2%
27 1
 
3.2%
ValueCountFrequency (%)
210 1
3.2%
199 1
3.2%
143 1
3.2%
141 1
3.2%
131 1
3.2%
84 1
3.2%
62 1
3.2%
61 1
3.2%
49 1
3.2%
48 1
3.2%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size380.0 B
Minimum2022-09-01 00:00:00
Maximum2022-09-01 00:00:00
2023-12-12T17:34:32.690039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:34:32.817419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T17:34:30.200189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:34:32.908215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성격평가결과성공과제구분연도과제수(개)
성격1.0000.1420.0000.0000.322
평가결과0.1421.0001.0000.0000.743
성공과제구분0.0001.0001.0000.0000.372
연도0.0000.0000.0001.0000.000
과제수(개)0.3220.7430.3720.0001.000
2023-12-12T17:34:33.032300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성격성공과제구분연도평가결과
성격1.0000.0000.0000.112
성공과제구분0.0001.0000.0000.910
연도0.0000.0001.0000.000
평가결과0.1120.9100.0001.000
2023-12-12T17:34:33.172240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과제수(개)성격평가결과성공과제구분연도
과제수(개)1.0000.2030.5450.2380.000
성격0.2031.0000.1120.0000.000
평가결과0.5450.1121.0000.9100.000
성공과제구분0.2380.0000.9101.0000.000
연도0.0000.0000.0000.0001.000

Missing values

2023-12-12T17:34:30.366657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:34:30.568834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

성격평가결과성공과제구분연도과제수(개)데이터기준일
0기술개발혁신성과성공과제201742022-09-01
1기술개발혁신성과성공과제201812022-09-01
2기술개발우수성공과제2021112022-09-01
3기술개발보통성공과제20171312022-09-01
4기술개발보통성공과제20181412022-09-01
5기술개발보통성공과제20191432022-09-01
6기술개발보통성공과제20201992022-09-01
7기술개발완료성공과제20212102022-09-01
8기술개발성실수행비성공과제2017212022-09-01
9기술개발성실수행비성공과제2018362022-09-01
성격평가결과성공과제구분연도과제수(개)데이터기준일
21기반조성보통성공과제2018612022-09-01
22기반조성보통성공과제2019492022-09-01
23기반조성보통성공과제2020272022-09-01
24기반조성보통(완료)성공과제2021842022-09-01
25기반조성성실수행비성공과제201762022-09-01
26기반조성성실수행비성공과제201882022-09-01
27기반조성성실수행비성공과제201992022-09-01
28기반조성성실수행비성공과제202042022-09-01
29기반조성성실수행비성공과제202122022-09-01
30기반조성성실수행비성공과제2020년32022-09-01