Overview

Dataset statistics

Number of variables: 5
Number of observations: 188
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 7.8 KiB
Average record size in memory: 42.7 B

Variable types

Numeric: 2
Categorical: 3

Dataset

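Description: Status of cultural heritage repair technician (문화재수리기술자) certifications obtained by students of this university, as compiled by the Career and Startup Center (취창업센터). The data was collected by verbal survey, so some entries may be inaccurate. The columns are "연번" (serial number), "취득년도" (year obtained), "학과" (department), "학년" (school year), and "종목" (certification category).
URL: https://www.data.go.kr/data/15105256/fileData.do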

Alerts

연번 is highly overall correlated with 취득년도 (High correlation)
취득년도 is highly overall correlated with 연번 (High correlation)
학과 is highly overall correlated with 종목 (High correlation)
종목 is highly overall correlated with 학과 (High correlation)
학년 is highly imbalanced (50.4%) (Imbalance)
연번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 04:56:53.587687
Analysis finished: 2023-12-12 04:56:54.442636
Duration: 0.85 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
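
The reported duration follows from the two timestamps. A minimal check, assuming both timestamps are in the same time zone and use the format shown above (the variable names are illustrative):

```python
from datetime import datetime

# Timestamp layout as printed in the Reproduction block (assumed format).
FORMAT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 04:56:53.587687", FORMAT)
finished = datetime.strptime("2023-12-12 04:56:54.442636", FORMAT)

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```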

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 188
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 94.5
Minimum: 1
Maximum: 188
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 10.35
Q1: 47.75
Median: 94.5
Q3: 141.25
95-th percentile: 178.65
Maximum: 188
Range: 187
Interquartile range (IQR): 93.5

Descriptive statistics

Standard deviation: 54.415071
Coefficient of variation (CV): 0.57582086
Kurtosis: -1.2
Mean: 94.5
Median Absolute Deviation (MAD): 47
Skewness: 0
Sum: 17766
Variance: 2961
Monotonicity: Strictly increasing
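
These figures are what one would expect if 연번 is simply the integers 1 through 188. That is an assumption, though it is consistent with the minimum, maximum, distinct count, and strict monotonicity reported here. A minimal sketch using only the Python standard library recomputes the main statistics under that assumption. It uses the sample (n - 1) variance and linear-interpolation quantiles, both chosen because they match the reported values:

```python
import math
import statistics

# Assumption: the serial-number column is exactly 1, 2, ..., 188.
values = list(range(1, 189))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)        # sample variance (divides by n - 1)
std = math.sqrt(variance)
cv = std / mean                               # coefficient of variation
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])

# Quartiles with linear interpolation between data points.
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

print(total, mean, variance, round(std, 6), round(cv, 8), mad, q1, q3, iqr)
```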
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
1                   1      0.5%
131                 1      0.5%
122                 1      0.5%
123                 1      0.5%
124                 1      0.5%
125                 1      0.5%
126                 1      0.5%
127                 1      0.5%
128                 1      0.5%
129                 1      0.5%
Other values (178)  178    94.7%

Value  Count  Frequency (%)
1      1      0.5%
2      1      0.5%
3      1      0.5%
4      1      0.5%
5      1      0.5%
6      1      0.5%
7      1      0.5%
8      1      0.5%
9      1      0.5%
10     1      0.5%

Value  Count  Frequency (%)
188    1      0.5%
187    1      0.5%
186    1      0.5%
185    1      0.5%
184    1      0.5%
183    1      0.5%
182    1      0.5%
181    1      0.5%
180    1      0.5%
179    1      0.5%

취득년도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 19
Distinct (%): 10.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2010.6915
Minimum: 2003
Maximum: 2023
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 2003
5-th percentile: 2005
Q1: 2007
Median: 2010
Q3: 2014
95-th percentile: 2018.65
Maximum: 2023
Range: 20
Interquartile range (IQR): 7

Descriptive statistics

Standard deviation: 4.4458154
Coefficient of variation (CV): 0.0022110878
Kurtosis: -0.28297389
Mean: 2010.6915
Median Absolute Deviation (MAD): 3
Skewness: 0.55595526
Sum: 378010
Variance: 19.765275
Monotonicity: Increasing
Histogram with fixed size bins (bins=19)
Value             Count  Frequency (%)
2010              20     10.6%
2008              17     9.0%
2012              17     9.0%
2009              15     8.0%
2011              14     7.4%
2006              14     7.4%
2007              14     7.4%
2005              13     6.9%
2014              11     5.9%
2018              10     5.3%
Other values (9)  43     22.9%

Value  Count  Frequency (%)
2003   2      1.1%
2004   7      3.7%
2005   13     6.9%
2006   14     7.4%
2007   14     7.4%
2008   17     9.0%
2009   15     8.0%
2010   20     10.6%
2011   14     7.4%
2012   17     9.0%

Value  Count  Frequency (%)
2023   3      1.6%
2020   1      0.5%
2019   6      3.2%
2018   10     5.3%
2017   6      3.2%
2016   5      2.7%
2015   6      3.2%
2014   11     5.9%
2013   7      3.7%
2012   17     9.0%
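
Taken together, the lowest-value and highest-value tables list all 19 distinct years, so the full 취득년도 column can be rebuilt from its counts and the summary statistics cross-checked. The sketch below does this with the standard library. The `year_counts` dictionary is transcribed from the tables above. The sample variance and linear-interpolation percentiles are assumptions about the profiler's conventions, chosen because they reproduce the reported values:

```python
import math
import statistics

# Year -> number of certifications, transcribed from the frequency tables.
year_counts = {
    2003: 2, 2004: 7, 2005: 13, 2006: 14, 2007: 14, 2008: 17, 2009: 15,
    2010: 20, 2011: 14, 2012: 17, 2013: 7, 2014: 11, 2015: 6, 2016: 5,
    2017: 6, 2018: 10, 2019: 6, 2020: 1, 2023: 3,
}

# Expand the counts back into one sorted value per observation.
years = [year for year, count in sorted(year_counts.items()) for _ in range(count)]

n = len(years)
total = sum(years)
mean = total / n
median = statistics.median(years)
variance = statistics.variance(years)   # sample variance (divides by n - 1)
std = math.sqrt(variance)
cv = std / mean

# 99 cut points; index k - 1 holds the k-th percentile.
percentiles = statistics.quantiles(years, n=100, method="inclusive")
p5, q1, q3, p95 = percentiles[4], percentiles[24], percentiles[74], percentiles[94]

print(n, total, round(mean, 4), median, q1, q3, round(p95, 2), round(variance, 6))
```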

학과
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB
보존: 66
건축: 55
조경: 50
미공: 12
수리: 4

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 1
Unique (%): 0.5%

Sample

1st row: 조경
2nd row: 조경
3rd row: 건축
4th row: 조경
5th row: 조경

Common Values

Value  Count  Frequency (%)
보존     66     35.1%
건축     55     29.3%
조경     50     26.6%
미공     12     6.4%
수리     4      2.1%
무형     1      0.5%
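
The frequency column is each count divided by the 188 observations, and the Distinct and Unique figures can be read off the same counts. A minimal sketch, with the counts transcribed from the table above. It assumes "unique" means values that occur exactly once, which is consistent with the report:

```python
# Department -> count, transcribed from the Common Values table.
counts = {"보존": 66, "건축": 55, "조경": 50, "미공": 12, "수리": 4, "무형": 1}

total = sum(counts.values())                          # number of observations
distinct = len(counts)                                # number of different values
unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once

# Share of each value, as a percentage rounded to one decimal place.
percentages = {value: round(100 * c / total, 1) for value, c in counts.items()}

distinct_pct = round(100 * distinct / total, 1)
unique_pct = round(100 * unique / total, 1)

for value, pct in percentages.items():
    print(f"{value}\t{counts[value]}\t{pct}%")
```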

Length

Histogram of lengths of the category

Common Values (Plot)

Plot of the 학과 category frequencies; the values are the same as in the Common Values table.

학년
Categorical

IMBALANCE 

Distinct: 7
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB
졸업: 130
4: 39
3: 6
2: 4
1: 4
Other values (2): 5

Length

Max length: 2
Median length: 2
Mean length: 1.7180851
Min length: 1

Unique

Unique: 1
Unique (%): 0.5%

Sample

1st row: 4
2nd row: 4
3rd row: 3
4th row: 졸업
5th row: 4

Common Values

Value  Count  Frequency (%)
졸업     130    69.1%
4      39     20.7%
3      6      3.2%
2      4      2.1%
1      4      2.1%
석사     4      2.1%
박사     1      0.5%
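
The 50.4% imbalance figure flagged for 학년 can be reproduced from these counts with one minus the normalized Shannon entropy. Treat this as an assumption about how the score is defined: it has only been checked against the numbers in this report. A score of 0 would mean all categories are equally frequent, and a score of 1 would mean a single category holds every row.

```python
import math

# School year -> count, transcribed from the Common Values table.
counts = {"졸업": 130, "4": 39, "3": 6, "2": 4, "1": 4, "석사": 4, "박사": 1}

total = sum(counts.values())
probabilities = [c / total for c in counts.values()]

# Shannon entropy of the observed distribution (natural logarithm).
entropy = -sum(p * math.log(p) for p in probabilities)

# Maximum possible entropy for this many categories (uniform distribution).
max_entropy = math.log(len(counts))

# Assumed definition: 1 - normalized entropy.
imbalance = 1 - entropy / max_entropy
print(f"{imbalance:.1%}")
```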

Length

Histogram of lengths of the category

Common Values (Plot)

Plot of the 학년 category frequencies; the values are the same as in the Common Values table.

종목
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB
보존: 63
보수: 50
조경: 47
단청: 10
건축: 9
Other values (2): 9

Length

Max length: 4
Median length: 2
Mean length: 2.0425532
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 조경
2nd row: 조경
3rd row: 보수
4th row: 조경
5th row: 조경

Common Values

Value  Count  Frequency (%)
보존     63     33.5%
보수     50     26.6%
조경     47     25.0%
단청     10     5.3%
건축     9      4.8%
식물     5      2.7%
보존과학   4      2.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Plot of the 종목 category frequencies; the values are the same as in the Common Values table.

Interactions


Correlations

         연번     취득년도   학과     학년     종목
연번       1.000  0.974  0.517  0.505  0.541
취득년도     0.974  1.000  0.609  0.609  0.526
학과       0.517  0.609  1.000  0.594  0.839
학년       0.505  0.609  0.594  1.000  0.629
종목       0.541  0.526  0.839  0.629  1.000

         학년     학과     종목
학년       1.000  0.405  0.262
학과       0.405  1.000  0.691
종목       0.262  0.691  1.000

         연번     취득년도   학과     학년     종목
연번       1.000  0.997  0.300  0.282  0.308
취득년도     0.997  1.000  0.377  0.368  0.298
학과       0.300  0.377  1.000  0.405  0.691
학년       0.282  0.368  0.405  1.000  0.262
종목       0.308  0.298  0.691  0.262  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

     연번   취득년도  학과  학년  종목
0    1    2003  조경  4   조경
1    2    2003  조경  4   조경
2    3    2004  건축  3   보수
3    4    2004  조경  졸업  조경
4    5    2004  조경  4   조경
5    6    2004  조경  4   조경
6    7    2004  조경  4   조경
7    8    2004  조경  4   조경
8    9    2004  미공  2   단청
9    10   2005  조경  졸업  조경

     연번   취득년도  학과  학년  종목
178  179  2019  수리  석사  조경
179  180  2019  수리  석사  조경
180  181  2019  미공  석사  단청
181  182  2019  보존  졸업  보존
182  183  2019  보존  졸업  보존
183  184  2019  보존  졸업  보존
184  185  2020  보존  2   보존
185  186  2023  무형  4   단청
186  187  2023  수리  2   보존
187  188  2023  건축  졸업  보수