Overview

Dataset statistics

Number of variables: 4
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.1 KiB
Average record size in memory: 38.4 B

Variable types

Numeric: 2
DateTime: 1
Categorical: 1

Dataset

Description: Sample data
Author: 경기도일자리재단
URL: https://www.bigdata-region.kr/#/dataset/104f8960-9f9c-4ba2-99ec-d736553fe2cf

Alerts

청년시리즈신청번호 has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 13:51:19.430832
Analysis finished: 2023-12-10 13:51:20.541220
Duration: 1.11 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
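
The reported duration follows from the two timestamps above. A minimal sketch of that arithmetic, using only the Python standard library (the variable names are illustrative, not part of the report):

```python
from datetime import datetime

# Timestamps copied from the Reproduction block of the report.
started = datetime.fromisoformat("2023-12-10 13:51:19.430832")
finished = datetime.fromisoformat("2023-12-10 13:51:20.541220")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```

Rounded to two decimals, the difference agrees with the Duration field.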

Variables

청년시리즈신청번호
Real number (ℝ)

UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 65.833333
Minimum: 42
Maximum: 91
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 42
5th percentile: 43.45
Q1: 54.25
Median: 63.5
Q3: 77.75
95th percentile: 88.1
Maximum: 91
Range: 49
Interquartile range (IQR): 23.5

Descriptive statistics

Standard deviation: 15.261137
Coefficient of variation (CV): 0.23181474
Kurtosis: -1.2765039
Mean: 65.833333
Median Absolute Deviation (MAD): 13
Skewness: 0.010854192
Sum: 1975
Variance: 232.9023
Monotonicity: Strictly increasing
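
Several of the figures reported for 청년시리즈신청번호 are derived from one another (mean from sum and count, variance from standard deviation, CV from standard deviation and mean, range and IQR from the quantiles). The sketch below checks that the reported values are mutually consistent. All numbers are copied from the report; the tolerance and the variable names are assumptions made for illustration.

```python
import math

# Values copied from the report for 청년시리즈신청번호.
n = 30
total = 1975
mean = 65.833333
std = 15.261137
variance = 232.9023
cv = 0.23181474
minimum, maximum = 42, 91
q1, q3 = 54.25, 77.75
reported_range = 49
reported_iqr = 23.5

# Each check compares a reported figure with the value derived from others.
checks = {
    "mean == sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "variance == std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "cv == std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "range == max - min": maximum - minimum == reported_range,
    "iqr == q3 - q1": math.isclose(q3 - q1, reported_iqr),
}
for name, ok in checks.items():
    print(f"{name}: {'ok' if ok else 'MISMATCH'}")
```

The relative tolerance only absorbs the rounding of the printed figures; it is not a statement about how the profiling tool computes them.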
Histogram with fixed size bins (bins=30)

Common values

Value              Count  Frequency (%)
42                 1      3.3%
71                 1      3.3%
91                 1      3.3%
89                 1      3.3%
87                 1      3.3%
86                 1      3.3%
85                 1      3.3%
81                 1      3.3%
79                 1      3.3%
78                 1      3.3%
Other values (20)  20     66.7%

Minimum 10 values

Value  Count  Frequency (%)
42     1      3.3%
43     1      3.3%
44     1      3.3%
45     1      3.3%
47     1      3.3%
50     1      3.3%
51     1      3.3%
54     1      3.3%
55     1      3.3%
56     1      3.3%

Maximum 10 values

Value  Count  Frequency (%)
91     1      3.3%
89     1      3.3%
87     1      3.3%
86     1      3.3%
85     1      3.3%
81     1      3.3%
79     1      3.3%
78     1      3.3%
77     1      3.3%
76     1      3.3%

고용보험가입일자
DateTime

Distinct: 27
Distinct (%): 90.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2013-01-01 00:00:00
Maximum: 2017-09-11 00:00:00
Histogram with fixed size bins (bins=27)

근무년수
Real number (ℝ)

Distinct: 6
Distinct (%): 20.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.3333333
Minimum: 1
Maximum: 6
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 1
5th percentile: 1
Q1: 1
Median: 2
Q3: 4
95th percentile: 5
Maximum: 6
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.6045911
Coefficient of variation (CV): 0.68768191
Kurtosis: -0.6782013
Mean: 2.3333333
Median Absolute Deviation (MAD): 1
Skewness: 0.86049323
Sum: 70
Variance: 2.5747126
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)

Common values

Value  Count  Frequency (%)
1      14     46.7%
2      6      20.0%
4      5      16.7%
5      3      10.0%
6      1      3.3%
3      1      3.3%

Minimum 10 values

Value  Count  Frequency (%)
1      14     46.7%
2      6      20.0%
3      1      3.3%
4      5      16.7%
5      3      10.0%
6      1      3.3%

Maximum 10 values

Value  Count  Frequency (%)
6      1      3.3%
5      3      10.0%
4      5      16.7%
3      1      3.3%
2      6      20.0%
1      14     46.7%
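
Because 근무년수 takes only six distinct values, the frequency table above determines all 30 observations, so the summary statistics can be recomputed from it. The following is a sketch using Python's `statistics` module. It assumes the reported variance and standard deviation use the sample (n - 1) denominator; the variable names are illustrative.

```python
import statistics

# Value -> count, copied from the frequency table for 근무년수.
counts = {1: 14, 2: 6, 3: 1, 4: 5, 5: 3, 6: 1}

# Expand the table back into the individual observations.
values = [v for v, c in sorted(counts.items()) for _ in range(c)]

n = len(values)
total = sum(values)
mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)          # sample standard deviation

# Relative frequency of each value, in percent.
share = {v: round(100 * c / n, 1) for v, c in counts.items()}

print(n, total, median)
print(round(mean, 7), round(variance, 7), round(std, 7))
print(share)
```

The recomputed count, sum, mean, median, variance, and standard deviation can then be compared with the figures listed under Quantile statistics and Descriptive statistics for this variable.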

데이터기준일자
Categorical

Distinct: 11
Distinct (%): 36.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 4
Unique (%): 13.3%

Sample

1st row: 2018-02-05
2nd row: 2018-01-26
3rd row: 2018-01-25
4th row: 2018-01-23
5th row: 2018-01-27

Common Values

Value       Count  Frequency (%)
2018-01-22  13     43.3%
2018-01-26  3      10.0%
2018-02-05  2      6.7%
2018-01-23  2      6.7%
2018-01-27  2      6.7%
2018-01-29  2      6.7%
2018-01-31  2      6.7%
2018-01-25  1      3.3%
2018-02-01  1      3.3%
2018-01-30  1      3.3%
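
The table lists ten of the eleven distinct dates. The short sketch below works out how many observations the unlisted category must hold, and how that leads to the reported count of four unique values (values that occur exactly once). Counts are copied from the report; the variable names are illustrative.

```python
# Count per date, copied from the Common Values table (10 of 11 categories).
top10 = {
    "2018-01-22": 13, "2018-01-26": 3, "2018-02-05": 2, "2018-01-23": 2,
    "2018-01-27": 2, "2018-01-29": 2, "2018-01-31": 2, "2018-01-25": 1,
    "2018-02-01": 1, "2018-01-30": 1,
}
n_obs = 30       # Number of observations (Overview)
n_distinct = 11  # Distinct values reported for this variable

# Observations and categories not covered by the ten listed dates.
remaining_obs = n_obs - sum(top10.values())
remaining_categories = n_distinct - len(top10)

# Listed categories that occur exactly once.
unique_listed = sum(1 for c in top10.values() if c == 1)

# Every category holds at least one observation, so if the leftover
# observations equal the leftover categories, each of those occurs once.
unique_total = unique_listed + (
    remaining_categories if remaining_obs == remaining_categories else 0
)
unique_pct = round(100 * unique_total / n_obs, 1)

print(remaining_obs, remaining_categories, unique_total, unique_pct)
```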

Length

Histogram of lengths of the category

Interactions

[Figure: four pairwise interaction plots]

Correlations

                    청년시리즈신청번호  고용보험가입일자  근무년수  데이터기준일자
청년시리즈신청번호  1.000               0.932             0.374     0.404
고용보험가입일자    0.932               1.000             1.000     0.854
근무년수            0.374               1.000             1.000     0.000
데이터기준일자      0.404               0.854             0.000     1.000

                    청년시리즈신청번호  근무년수  데이터기준일자
청년시리즈신청번호  1.000               -0.171    0.127
근무년수            -0.171              1.000     0.000
데이터기준일자      0.127               0.000     1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

    청년시리즈신청번호  고용보험가입일자  근무년수  데이터기준일자
0   42                  2015-01-05        4         2018-02-05
1   43                  2016-12-20        2         2018-01-26
2   44                  2013-08-19        5         2018-01-25
3   45                  2015-01-05        4         2018-01-23
4   47                  2014-12-18        4         2018-01-27
5   50                  2017-08-07        1         2018-01-22
6   51                  2014-08-21        4         2018-01-29
7   54                  2014-08-26        4         2018-01-27
8   55                  2017-09-11        1         2018-01-22
9   56                  2017-02-01        1         2018-01-26

Last rows

    청년시리즈신청번호  고용보험가입일자  근무년수  데이터기준일자
20  76                  2016-07-01        2         2018-01-30
21  77                  2017-06-13        1         2018-01-24
22  78                  2014-01-06        5         2018-01-22
23  79                  2016-02-15        2         2018-01-26
24  81                  2017-05-29        1         2018-01-22
25  85                  2013-01-01        6         2018-01-22
26  86                  2017-08-16        1         2018-01-31
27  87                  2015-11-02        3         2018-01-22
28  89                  2016-10-10        2         2018-01-22
29  91                  2017-02-20        1         2018-01-29
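
The report does not define how 근무년수 is derived. In the 20 sample rows shown here, the values are consistent with "completed years between 고용보험가입일자 and 데이터기준일자, plus one", which would also fit the correlation of 1.000 reported between 고용보험가입일자 and 근무년수. The sketch below tests that reading against the sample rows. The rule and the function name `service_year` are assumptions made for illustration, not something the dataset documents, and the ten rows not shown in the sample are not covered.

```python
from datetime import date

# (고용보험가입일자, 근무년수, 데이터기준일자) for the 20 rows in the Sample section.
rows = [
    ("2015-01-05", 4, "2018-02-05"), ("2016-12-20", 2, "2018-01-26"),
    ("2013-08-19", 5, "2018-01-25"), ("2015-01-05", 4, "2018-01-23"),
    ("2014-12-18", 4, "2018-01-27"), ("2017-08-07", 1, "2018-01-22"),
    ("2014-08-21", 4, "2018-01-29"), ("2014-08-26", 4, "2018-01-27"),
    ("2017-09-11", 1, "2018-01-22"), ("2017-02-01", 1, "2018-01-26"),
    ("2016-07-01", 2, "2018-01-30"), ("2017-06-13", 1, "2018-01-24"),
    ("2014-01-06", 5, "2018-01-22"), ("2016-02-15", 2, "2018-01-26"),
    ("2017-05-29", 1, "2018-01-22"), ("2013-01-01", 6, "2018-01-22"),
    ("2017-08-16", 1, "2018-01-31"), ("2015-11-02", 3, "2018-01-22"),
    ("2016-10-10", 2, "2018-01-22"), ("2017-02-20", 1, "2018-01-29"),
]


def service_year(joined: date, reference: date) -> int:
    """Hypothetical rule: completed years between the two dates, plus one."""
    completed = reference.year - joined.year
    # Subtract a year if the anniversary has not yet been reached.
    if (reference.month, reference.day) < (joined.month, joined.day):
        completed -= 1
    return completed + 1


# Rows where the hypothetical rule disagrees with the recorded 근무년수.
mismatches = [
    row for row in rows
    if service_year(date.fromisoformat(row[0]), date.fromisoformat(row[2])) != row[1]
]
print(f"{len(rows) - len(mismatches)} of {len(rows)} sample rows match the rule")
```

A match on the sample rows does not establish how the publisher actually computes the column; it only shows that the rule is not contradicted by the rows visible in this report.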