Overview

Dataset statistics

Number of variables: 4
Number of observations: 389
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.7 KiB
Average record size in memory: 33.3 B

Variable types

Numeric: 1
DateTime: 1
Categorical: 1
Text: 1

Dataset

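Description: Provides the number associated with each narrator record in the 현대한국구술자료관 (an archive of modern Korean oral history), the date and time at which that narrator's information was registered, and related fields.
Author: 한국학중앙연구원 (The Academy of Korean Studies)
URL: https://www.data.go.kr/data/15049079/fileData.do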

Alerts

번호 has unique values (Unique)
구술정보연결번호 has unique values (Unique)

Reproduction

Analysis started: 2023-12-13 00:22:53.042198
Analysis finished: 2023-12-13 00:22:53.342752
Duration: 0.3 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 389
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 329.98458
Minimum: 42
Maximum: 744
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.5 KiB

Quantile statistics

Minimum: 42
5-th percentile: 68.4
Q1: 166
Median: 302
Q3: 456
95-th percentile: 700.6
Maximum: 744
Range: 702
Interquartile range (IQR): 290
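
The Range and IQR rows follow directly from the other quantile rows. A minimal sketch of that arithmetic, using only the values printed above (the variable names are illustrative and are not part of the report):

```python
# Quantile values copied from the 번호 section of this report.
minimum, q1, q3, maximum = 42, 166, 456, 744

# Range: distance between the smallest and largest value.
value_range = maximum - minimum

# Interquartile range: width of the middle 50% of the values.
iqr = q3 - q1

print(f"range={value_range}, iqr={iqr}")
```

Both results agree with the Range and Interquartile range (IQR) rows reported above.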

Descriptive statistics

Standard deviation: 189.89726
Coefficient of variation (CV): 0.57547314
Kurtosis: -0.74793093
Mean: 329.98458
Median Absolute Deviation (MAD): 145
Skewness: 0.42847516
Sum: 128364
Variance: 36060.969
Monotonicity: Not monotonic
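
Several of these figures are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the squared standard deviation, and the CV is the standard deviation divided by the mean. The sketch below recomputes them from the reported values only. It does not have access to the underlying data, so small differences in the last digits are rounding effects of the printed inputs.

```python
# Values copied from this report for the 번호 column.
n_observations = 389
total = 128364
std_dev = 189.89726

# Mean is the sum divided by the number of observations.
mean = total / n_observations

# Variance is the square of the standard deviation.
variance = std_dev ** 2

# Coefficient of variation is the standard deviation relative to the mean.
cv = std_dev / mean

print(f"mean={mean:.5f} variance={variance:.3f} cv={cv:.8f}")
```

The recomputed values agree with the reported Mean, Variance, and Coefficient of variation (CV) to the precision shown in the report.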
Histogram with fixed size bins (bins=50)

Common values

Value  Count  Frequency (%)
150    1      0.3%
493    1      0.3%
363    1      0.3%
406    1      0.3%
401    1      0.3%
429    1      0.3%
348    1      0.3%
368    1      0.3%
350    1      0.3%
400    1      0.3%
Other values (379)  379  97.4%

Minimum 10 values

Value  Count  Frequency (%)
42     1      0.3%
43     1      0.3%
44     1      0.3%
46     1      0.3%
48     1      0.3%
49     1      0.3%
50     1      0.3%
51     1      0.3%
52     1      0.3%
53     1      0.3%

Maximum 10 values

Value  Count  Frequency (%)
744    1      0.3%
743    1      0.3%
742    1      0.3%
740    1      0.3%
738    1      0.3%
735    1      0.3%
733    1      0.3%
732    1      0.3%
724    1      0.3%
723    1      0.3%

등록일
DateTime

Distinct: 375
Distinct (%): 96.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB
Minimum: 2011-02-14 15:15:00
Maximum: 2018-10-23 22:36:00
Histogram with fixed size bins (bins=50)

생산자코드
Categorical

Distinct: 6
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Value     Count
AK-C5001  92
MJ-C2001  80
HF-C1001  77
SE-C3001  71
HD-C4001  68

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 1
Unique (%): 0.3%

Sample

1st row: AK-C5001
2nd row: AK-C5001
3rd row: AK-C5001
4th row: AK-C5001
5th row: MJ-C2001

Common Values

Value     Count  Frequency (%)
AK-C5001  92     23.7%
MJ-C2001  80     20.6%
HF-C1001  77     19.8%
SE-C3001  71     18.3%
HD-C4001  68     17.5%
NO-C9999  1      0.3%
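
Each Frequency (%) figure is the category count divided by the total number of rows. As a quick consistency check, the sketch below recomputes the percentages from the counts listed above. The dictionary is typed in by hand from this table, not read from the original file.

```python
# Category counts copied from the Common Values table for 생산자코드.
counts = {
    "AK-C5001": 92,
    "MJ-C2001": 80,
    "HF-C1001": 77,
    "SE-C3001": 71,
    "HD-C4001": 68,
    "NO-C9999": 1,
}

# The counts should add up to the number of observations in the dataset.
total = sum(counts.values())

# Share of each category, as a percentage rounded to one decimal place.
shares = {code: round(100 * count / total, 1) for code, count in counts.items()}

for code, share in shares.items():
    print(f"{code}  {counts[code]:>3}  {share}%")
```

The counts sum to 389, the number of observations, and the rounded shares agree with the percentages in the table. NO-C9999 is the only code that occurs exactly once, which is why the Unique count for this variable is 1.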

Length

Histogram of lengths of the category

Common Values (Plot)

Value     Count  Frequency (%)
ak-c5001  92     23.7%
mj-c2001  80     20.6%
hf-c1001  77     19.8%
se-c3001  71     18.3%
hd-c4001  68     17.5%
no-c9999  1      0.3%

구술정보연결번호
Text

UNIQUE

Distinct: 389
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Characters and Unicode

Total characters: 3501
Distinct characters: 14
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
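
To illustrate how such character properties can be read, the sketch below uses Python's standard `unicodedata` module to count general categories over the five sample identifiers shown in this report. It is not how the profiling tool itself computes these figures. The mapping from two-letter category codes to readable names covers only the three categories that occur here, and the helper name `category_counts` is made up for this example.

```python
import unicodedata
from collections import Counter

# The five sample identifiers listed in this report.
sample_ids = ["OH-N00093", "OH-N00092", "OH-N00091", "OH-N00090", "OH-N00089"]

# Readable names for the Unicode general categories seen in these identifiers.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Pd": "Dash Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for char in value:
            code = unicodedata.category(char)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


counts = category_counts(sample_ids)
for name, count in counts.most_common():
    print(f"{name}: {count}")
```

Each sample identifier contributes five digits, three uppercase letters, and one dash. That pattern is consistent with the report's totals over all 389 values: 1945 Decimal Number, 1167 Uppercase Letter, and 389 Dash Punctuation characters, 3501 in total.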

Unique

Unique: 389
Unique (%): 100.0%

Sample

1st row: OH-N00093
2nd row: OH-N00092
3rd row: OH-N00091
4th row: OH-N00090
5th row: OH-N00089

Value      Count  Frequency (%)
oh-n00093  1      0.3%
oh-n00183  1      0.3%
oh-n00300  1      0.3%
oh-n00343  1      0.3%
oh-n00338  1      0.3%
oh-n00365  1      0.3%
oh-n00286  1      0.3%
oh-n00305  1      0.3%
oh-n00288  1      0.3%
oh-n00337  1      0.3%
Other values (379)  379  97.4%

Most occurring characters

Value  Count  Frequency (%)
0      945    27.0%
O      389    11.1%
H      389    11.1%
-      389    11.1%
N      389    11.1%
3      160    4.6%
1      144    4.1%
2      141    4.0%
4      130    3.7%
5      112    3.2%
Other values (4)  313  8.9%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    1945   55.6%
Uppercase Letter  1167   33.3%
Dash Punctuation  389    11.1%

Most frequent character per category

Decimal Number

Value  Count  Frequency (%)
0      945    48.6%
3      160    8.2%
1      144    7.4%
2      141    7.2%
4      130    6.7%
5      112    5.8%
6      89     4.6%
9      78     4.0%
7      76     3.9%
8      70     3.6%

Uppercase Letter

Value  Count  Frequency (%)
O      389    33.3%
H      389    33.3%
N      389    33.3%

Dash Punctuation

Value  Count  Frequency (%)
-      389    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  2334   66.7%
Latin   1167   33.3%

Most frequent character per script

Common

Value  Count  Frequency (%)
0      945    40.5%
-      389    16.7%
3      160    6.9%
1      144    6.2%
2      141    6.0%
4      130    5.6%
5      112    4.8%
6      89     3.8%
9      78     3.3%
7      76     3.3%

Latin

Value  Count  Frequency (%)
O      389    33.3%
H      389    33.3%
N      389    33.3%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  3501   100.0%

Most frequent character per block

ASCII

Value  Count  Frequency (%)
0      945    27.0%
O      389    11.1%
H      389    11.1%
-      389    11.1%
N      389    11.1%
3      160    4.6%
1      144    4.1%
2      141    4.0%
4      130    3.7%
5      112    3.2%
Other values (4)  313  8.9%

Interactions


Correlations

            번호     생산자코드
번호          1.000  0.561
생산자코드    0.561  1.000

            번호     생산자코드
번호          1.000  0.336
생산자코드    0.336  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index  번호  등록일             생산자코드  구술정보연결번호
0      150   2011-12-15 11:38   AK-C5001    OH-N00093
1      149   2011-12-15 11:34   AK-C5001    OH-N00092
2      148   2011-12-15 11:32   AK-C5001    OH-N00091
3      147   2011-12-15 11:29   AK-C5001    OH-N00090
4      146   2011-12-10 1:03    MJ-C2001    OH-N00089
5      145   2011-12-09 23:26   MJ-C2001    OH-N00088
6      144   2011-12-09 15:44   MJ-C2001    OH-N00087
7      143   2011-12-09 14:58   MJ-C2001    OH-N00086
8      142   2011-12-09 14:25   MJ-C2001    OH-N00085
9      141   2011-12-09 13:28   MJ-C2001    OH-N00084

Last rows

Index  번호  등록일             생산자코드  구술정보연결번호
379    698   2017-10-26 10:43   HF-C1001    OH-N00582
380    723   2018-04-22 12:30   HF-C1001    OH-N00604
381    700   2017-11-02 9:02    HF-C1001    OH-N00584
382    702   2017-11-02 10:44   HF-C1001    OH-N00586
383    719   2018-03-13 10:59   HF-C1001    OH-N00603
384    701   2017-11-02 9:08    HF-C1001    OH-N00585
385    718   2018-01-19 14:15   MJ-C2001    OH-N00602
386    715   2018-01-19 14:09   MJ-C2001    OH-N00599
387    599   2015-12-21 15:12   SE-C3001    OH-N00493
388    588   2015-11-30 12:39   HF-C1001    OH-N00482