Overview

Dataset statistics

Number of variables: 3
Number of observations: 448
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 15
Duplicate rows (%): 3.3%
Total size in memory: 11.1 KiB
Average record size in memory: 25.3 B
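The percentage rows above follow from the raw counts. A minimal sketch of that arithmetic, assuming each percentage is the count divided by the relevant total and rounded to one decimal place (the helper name `pct` is illustrative, not part of the report):

```python
# Counts copied from the dataset statistics above.
n_variables = 3
n_rows = 448
duplicate_rows = 15
missing_cells = 0


def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Duplicate rows are expressed relative to the number of observations.
duplicate_pct = pct(duplicate_rows, n_rows)

# Missing cells are expressed relative to all cells (rows x variables).
missing_pct = pct(missing_cells, n_rows * n_variables)

print(f"Duplicate rows (%): {duplicate_pct}%")
print(f"Missing cells (%): {missing_pct}%")
```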

Variable types

Text: 1
Numeric: 1
DateTime: 1

Dataset

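Description: Provides information on the notification messages sent each year within the recycling system for electrical and electronic products and automobiles (alert message title, number of alert recipients who confirmed, registration date and time, etc.)
Author: Ministry of Environment (환경부)
URL: https://www.data.go.kr/data/15092257/fileData.do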

Alerts

Dataset has 15 (3.3%) duplicate rows [Duplicates]
알람대상 확인수 (alert recipient confirmation count) has 50 (11.2%) zeros [Zeros]

Reproduction

Analysis started: 2024-04-06 08:08:54.240978
Analysis finished: 2024-04-06 08:08:55.146926
Duration: 0.91 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
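The reported duration can be checked against the two timestamps. A small sketch using only the standard library, assuming the duration is the difference between the finish and start times rounded to two decimal places:

```python
from datetime import datetime

# Timestamp layout used in the Reproduction section above.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-04-06 08:08:54.240978", FMT)
finished = datetime.strptime("2024-04-06 08:08:55.146926", FMT)

# total_seconds() keeps the microsecond part of the difference.
duration_seconds = (finished - started).total_seconds()
duration_label = f"{duration_seconds:.2f} seconds"
print(duration_label)
```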

Variables

알람메시지 제목 (alert message title)
Text

Distinct: 304
Distinct (%): 67.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.6 KiB

Length

Max length: 52
Median length: 42
Mean length: 28.767857
Min length: 3

Characters and Unicode

Total characters: 12888
Distinct characters: 266
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
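As an illustration of these character properties, the sketch below tallies the Unicode general category of each character in one title taken from the sample rows of this report. It uses Python's standard `unicodedata` module, not necessarily the tooling that produced the report. The two-letter codes correspond to the category names used here: `Lo` is Other Letter, `Zs` is Space Separator, `Nd` is Decimal Number, `Ps`/`Pe` are Open/Close Punctuation, `Pd` is Dash Punctuation, and `Lu`/`Ll` are Uppercase/Lowercase Letter.

```python
import unicodedata
from collections import Counter

# One title from the sample rows; NFC keeps each Hangul syllable as a
# single precomposed code point so it is counted once.
title = unicodedata.normalize(
    "NFC", "[부울경] 2020년 자원순환분야 Eco-AS 코칭서비스 안내"
)

# Map every character to its general category and count the categories.
category_counts = Counter(unicodedata.category(ch) for ch in title)

for category, count in category_counts.most_common():
    share = 100 * count / len(title)
    print(f"{category}\t{count}\t{share:.1f}%")
```

Scripts and blocks are separate Unicode properties that `unicodedata` does not expose directly, so they are left out of this sketch.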

Unique

Unique: 232
Unique (%): 51.8%
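The Distinct and Unique figures measure different things. The sketch below assumes the usual reading: Distinct is the number of different values, and Unique is the number of values that occur exactly once. The list `values` is a made-up toy example, not data from this report.

```python
from collections import Counter

# Hypothetical toy column: "b" and "c" repeat, "a" and "d" occur once.
values = ["a", "b", "b", "c", "c", "c", "d"]

counts = Counter(values)

# Distinct: how many different values appear at all.
n_distinct = len(counts)

# Unique: how many values appear exactly once.
n_unique = sum(1 for c in counts.values() if c == 1)

print(n_distinct, n_unique)
```

Under that reading, the 304 distinct and 232 unique titles reported here would mean that 72 titles appear more than once.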

Sample

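1st row: 2019년 폐자동차 EcoAS시스템 관리표 오류관리표 수정안내 (Notice on correcting management-sheet errors in the EcoAS system for end-of-life vehicles, 2019)
2nd row: 2020년 회수의무이행계획서 제출기한 안내 (Notice of the submission deadline for the 2020 take-back obligation fulfillment plan)
3rd row: 판매업자 회수부과금 미납에 따른 납부 안내 (Payment notice for retailers with unpaid take-back charges)
4th row: 공제조합 가입 마감기한 안내 (Notice of the deadline for joining the mutual aid association)
5th row: 환경성보장제 사전예방규정 법령 개정사항 안내 (Notice of legal amendments to the preventive provisions of the Eco-Assurance System)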
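Value | Count | Frequency (%)
안내 (notice) | 241 | 10.6%
제출 (submission) | 149 | 6.6%
폐자동차 (end-of-life vehicle) | 103 | 4.6%
재활용결과보고서 (recycling results report) | 88 | 3.9%
출고수입실적서 (shipment and import performance report) | 66 | 2.9%
환경성보장제 (Eco-Assurance System) | 47 | 2.1%
수정 (correction) | 46 | 2.0%
전기전자제품 (electrical and electronic products) | 42 | 1.9%
2022년 (year 2022) | 36 | 1.6%
사전예방규정 (preventive provisions) | 35 | 1.5%
Other values (399) | 1410 | 62.3%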
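A word-frequency table like the one above can be approximated by splitting each title on whitespace and counting the tokens. This is a minimal sketch run on the five sample titles only. The profiling tool may tokenize differently (for example by lowercasing or stripping punctuation), so this is an assumption rather than its actual method.

```python
from collections import Counter

# The five sample titles listed in this report.
titles = [
    "2019년 폐자동차 EcoAS시스템 관리표 오류관리표 수정안내",
    "2020년 회수의무이행계획서 제출기한 안내",
    "판매업자 회수부과금 미납에 따른 납부 안내",
    "공제조합 가입 마감기한 안내",
    "환경성보장제 사전예방규정 법령 개정사항 안내",
]

# Whitespace tokenization: "수정안내" stays one token and is not counted as "안내".
word_counts = Counter(word for title in titles for word in title.split())
total_words = sum(word_counts.values())

for word, count in word_counts.most_common(3):
    print(f"{word}\t{count}\t{100 * count / total_words:.1f}%")
```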

Most occurring characters

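Value | Count | Frequency (%)
(space) | 1839 | 14.3%
2 | 619 | 4.8%
(character not preserved) | 374 | 2.9%
(character not preserved) | 350 | 2.7%
(character not preserved) | 345 | 2.7%
0 | 327 | 2.5%
(character not preserved) | 316 | 2.5%
(character not preserved) | 310 | 2.4%
(character not preserved) | 296 | 2.3%
(character not preserved) | 282 | 2.2%
Other values (256) | 7830 | 60.8%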

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8696 | 67.5%
Space Separator | 1839 | 14.3%
Decimal Number | 1445 | 11.2%
Open Punctuation | 213 | 1.7%
Close Punctuation | 213 | 1.7%
Other Punctuation | 193 | 1.5%
Lowercase Letter | 135 | 1.0%
Uppercase Letter | 95 | 0.7%
Math Symbol | 39 | 0.3%
Dash Punctuation | 18 | 0.1%

Most frequent character per category

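Other Letter
Value | Count | Frequency (%)
(character not preserved) | 374 | 4.3%
(character not preserved) | 350 | 4.0%
(character not preserved) | 345 | 4.0%
(character not preserved) | 316 | 3.6%
(character not preserved) | 310 | 3.6%
(character not preserved) | 296 | 3.4%
(character not preserved) | 282 | 3.2%
(character not preserved) | 281 | 3.2%
(character not preserved) | 235 | 2.7%
(character not preserved) | 234 | 2.7%
Other values (205) | 5673 | 65.2%

Lowercase Letter
Value | Count | Frequency (%)
o | 34 | 25.2%
c | 34 | 25.2%
s | 12 | 8.9%
n | 10 | 7.4%
g | 10 | 7.4%
i | 10 | 7.4%
w | 7 | 5.2%
a | 6 | 4.4%
e | 5 | 3.7%
t | 5 | 3.7%
Other values (2) | 2 | 1.5%

Other Punctuation
Value | Count | Frequency (%)
. | 79 | 40.9%
& | 27 | 14.0%
# | 26 | 13.5%
; | 26 | 13.5%
· | 18 | 9.3%
" | 6 | 3.1%
' | 4 | 2.1%
/ | 3 | 1.6%
(character not preserved) | 2 | 1.0%
: | 1 | 0.5%

Decimal Number
Value | Count | Frequency (%)
2 | 619 | 42.8%
0 | 327 | 22.6%
1 | 232 | 16.1%
3 | 108 | 7.5%
9 | 66 | 4.6%
4 | 43 | 3.0%
5 | 21 | 1.5%
7 | 16 | 1.1%
6 | 9 | 0.6%
8 | 4 | 0.3%

Uppercase Letter
Value | Count | Frequency (%)
E | 35 | 36.8%
A | 19 | 20.0%
S | 18 | 18.9%
R | 6 | 6.3%
Q | 6 | 6.3%
M | 5 | 5.3%
W | 3 | 3.2%
T | 1 | 1.1%
K | 1 | 1.1%
C | 1 | 1.1%

Open Punctuation
Value | Count | Frequency (%)
( | 144 | 67.6%
[ | 69 | 32.4%

Close Punctuation
Value | Count | Frequency (%)
) | 144 | 67.6%
] | 69 | 32.4%

Space Separator
Value | Count | Frequency (%)
(space) | 1839 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 39 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 18 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 2 | 100.0%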

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8696 | 67.5%
Common | 3962 | 30.7%
Latin | 230 | 1.8%

Most frequent character per script

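Hangul
Value | Count | Frequency (%)
(character not preserved) | 374 | 4.3%
(character not preserved) | 350 | 4.0%
(character not preserved) | 345 | 4.0%
(character not preserved) | 316 | 3.6%
(character not preserved) | 310 | 3.6%
(character not preserved) | 296 | 3.4%
(character not preserved) | 282 | 3.2%
(character not preserved) | 281 | 3.2%
(character not preserved) | 235 | 2.7%
(character not preserved) | 234 | 2.7%
Other values (205) | 5673 | 65.2%

Common
Value | Count | Frequency (%)
(space) | 1839 | 46.4%
2 | 619 | 15.6%
0 | 327 | 8.3%
1 | 232 | 5.9%
( | 144 | 3.6%
) | 144 | 3.6%
3 | 108 | 2.7%
. | 79 | 2.0%
] | 69 | 1.7%
[ | 69 | 1.7%
Other values (19) | 332 | 8.4%

Latin
Value | Count | Frequency (%)
E | 35 | 15.2%
o | 34 | 14.8%
c | 34 | 14.8%
A | 19 | 8.3%
S | 18 | 7.8%
s | 12 | 5.2%
n | 10 | 4.3%
g | 10 | 4.3%
i | 10 | 4.3%
w | 7 | 3.0%
Other values (12) | 41 | 17.8%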

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8695 | 67.5%
ASCII | 4172 | 32.4%
None | 18 | 0.1%
Punctuation | 2 | < 0.1%
Compat Jamo | 1 | < 0.1%

Most frequent character per block

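ASCII
Value | Count | Frequency (%)
(space) | 1839 | 44.1%
2 | 619 | 14.8%
0 | 327 | 7.8%
1 | 232 | 5.6%
( | 144 | 3.5%
) | 144 | 3.5%
3 | 108 | 2.6%
. | 79 | 1.9%
] | 69 | 1.7%
[ | 69 | 1.7%
Other values (39) | 542 | 13.0%

Hangul
Value | Count | Frequency (%)
(character not preserved) | 374 | 4.3%
(character not preserved) | 350 | 4.0%
(character not preserved) | 345 | 4.0%
(character not preserved) | 316 | 3.6%
(character not preserved) | 310 | 3.6%
(character not preserved) | 296 | 3.4%
(character not preserved) | 282 | 3.2%
(character not preserved) | 281 | 3.2%
(character not preserved) | 235 | 2.7%
(character not preserved) | 234 | 2.7%
Other values (204) | 5672 | 65.2%

None
Value | Count | Frequency (%)
· | 18 | 100.0%

Punctuation
Value | Count | Frequency (%)
(character not preserved) | 2 | 100.0%

Compat Jamo
Value | Count | Frequency (%)
(character not preserved) | 1 | 100.0%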

알람대상 확인수 (alert recipient confirmation count)
Real number (ℝ)

ZEROS 

Distinct: 87
Distinct (%): 19.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22.904018
Minimum: 0
Maximum: 557
Zeros: 50
Zeros (%): 11.2%
Negative: 0
Negative (%): 0.0%
Memory size: 4.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
Median: 6
Q3: 30
95-th percentile: 73.6
Maximum: 557
Range: 557
Interquartile range (IQR): 29

Descriptive statistics

Standard deviation: 47.134556
Coefficient of variation (CV): 2.0579165
Kurtosis: 50.889777
Mean: 22.904018
Median Absolute Deviation (MAD): 6
Skewness: 5.9324882
Sum: 10261
Variance: 2221.6664
Monotonicity: Not monotonic
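Several of the descriptive statistics are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The sketch below checks the reported figures against those identities. The variable names are illustrative and the inputs are the rounded values printed in this report.

```python
# Figures as reported for the numeric variable.
n_observations = 448
total = 10261
std_dev = 47.134556
iqr_q1, iqr_q3 = 1, 30

# Mean = sum / number of observations.
mean = total / n_observations

# Coefficient of variation = standard deviation / mean.
cv = std_dev / mean

# Variance = standard deviation squared.
variance = std_dev ** 2

# Interquartile range = Q3 - Q1.
iqr = iqr_q3 - iqr_q1

print(f"mean={mean:.6f} cv={cv:.7f} variance={variance:.4f} iqr={iqr}")
```

A CV above 2 together with a median of 6 and a maximum of 557 is consistent with the strong right skew reported above.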
Histogram with fixed size bins (bins=50)
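With fixed-size bins, the bin width is the value range divided by the number of bins. The sketch below assumes equal-width bins spanning the reported minimum (0) and maximum (557), with the last bin closed on the right. The exact edges used in the original plot are not stated in the report, and `bin_index` is a hypothetical helper.

```python
def bin_index(value: float, lo: float, hi: float, bins: int) -> int:
    """Return the 0-based index of the equal-width bin that contains value."""
    if not lo <= value <= hi:
        raise ValueError("value lies outside the histogram range")
    width = (hi - lo) / bins
    # Clamp so that the maximum falls into the last bin, not one past it.
    return min(int((value - lo) / width), bins - 1)


LO, HI, BINS = 0, 557, 50
bin_width = (HI - LO) / BINS  # 11.14 for this variable

for value in (0, 6, 30, 557):
    print(value, "-> bin", bin_index(value, LO, HI, BINS))
```

Under these assumptions every value from 0 to 11 lands in the first bin, which would hold well over half of the observations given the common values listed in this report.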
Common values
Value | Count | Frequency (%)
1 | 116 | 25.9%
0 | 50 | 11.2%
2 | 25 | 5.6%
5 | 14 | 3.1%
7 | 12 | 2.7%
8 | 12 | 2.7%
9 | 11 | 2.5%
3 | 9 | 2.0%
25 | 8 | 1.8%
6 | 8 | 1.8%
Other values (77) | 183 | 40.8%

Minimum 10 values
Value | Count | Frequency (%)
0 | 50 | 11.2%
1 | 116 | 25.9%
2 | 25 | 5.6%
3 | 9 | 2.0%
4 | 7 | 1.6%
5 | 14 | 3.1%
6 | 8 | 1.8%
7 | 12 | 2.7%
8 | 12 | 2.7%
9 | 11 | 2.5%

Maximum 10 values
Value | Count | Frequency (%)
557 | 1 | 0.2%
411 | 1 | 0.2%
279 | 1 | 0.2%
271 | 1 | 0.2%
208 | 1 | 0.2%
195 | 1 | 0.2%
186 | 1 | 0.2%
182 | 1 | 0.2%
180 | 1 | 0.2%
167 | 1 | 0.2%

등록일 (registration date)
DateTime

Distinct: 265
Distinct (%): 59.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.6 KiB
Minimum: 2020-01-02 00:00:00
Maximum: 2024-01-11 00:00:00
Histogram with fixed size bins (bins=50)

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

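First rows
Index | 알람메시지 제목 (alert message title) | Title (English) | 알람대상 확인수 (confirmation count) | 등록일 (registration date)
0 | 2019년 폐자동차 EcoAS시스템 관리표 오류관리표 수정안내 | Notice on correcting management-sheet errors in the EcoAS system for end-of-life vehicles, 2019 | 55 | 2020-01-09
1 | 2020년 회수의무이행계획서 제출기한 안내 | Notice of the submission deadline for the 2020 take-back obligation fulfillment plan | 2 | 2020-02-28
2 | 판매업자 회수부과금 미납에 따른 납부 안내 | Payment notice for retailers with unpaid take-back charges | 25 | 2020-08-27
3 | 공제조합 가입 마감기한 안내 | Notice of the deadline for joining the mutual aid association | 34 | 2020-12-04
4 | 환경성보장제 사전예방규정 법령 개정사항 안내 | Notice of legal amendments to the preventive provisions of the Eco-Assurance System | 0 | 2021-03-09
5 | 환경성보장제도 사전예방규정 안내 | Notice on the preventive provisions of the Eco-Assurance System | 1 | 2021-04-06
6 | 의무이행21년(출고수입20년) 출고수입실적서 수정 안내 | Notice on correcting the shipment and import performance report (obligation year 2021, shipments and imports of 2020) | 1 | 2021-09-10
7 | 2020년 회수의무이행계획서 제출 안내 | Notice on submitting the 2020 take-back obligation fulfillment plan | 0 | 2020-02-26
8 | 의무이행20년(출고수입19년) 출고수입실적서 관련 유선 요청 | Phone request regarding the shipment and import performance report (obligation year 2020, shipments and imports of 2019) | 1 | 2020-12-24
9 | [부울경] 2020년 자원순환분야 Eco-AS 코칭서비스 안내 | [Busan-Ulsan-Gyeongnam] Notice of the 2020 Eco-AS coaching service for the resource circulation sector | 75 | 2020-03-04

Last rows
Index | 알람메시지 제목 (alert message title) | Title (English) | 알람대상 확인수 (confirmation count) | 등록일 (registration date)
438 | 2023년 3분기 폐자동차 재활용 결과보고서 제출 안내(10/16(월)까지) | Notice on submitting the Q3 2023 end-of-life vehicle recycling results report (by Mon 10/16) | 40 | 2023-10-12
439 | 2023년 3분기 폐자동차 재활용결과보고서 제출 안내 | Notice on submitting the Q3 2023 end-of-life vehicle recycling results report | 18 | 2023-10-13
440 | 2023년 4분기 폐자동차 재활용 결과보고서 제출 안내(~ 1/12 18시) | Notice on submitting the Q4 2023 end-of-life vehicle recycling results report (until 1/12, 18:00) | 84 | 2024-01-03
441 | 2023년 4분기 폐자동차 재활용결과보고서 제출 안내 | Notice on submitting the Q4 2023 end-of-life vehicle recycling results report | 45 | 2024-01-03
442 | 전기전자제품 판매업자 회수의무이행결과보고서 제출 안내 | Notice on submitting the take-back obligation fulfillment results report for retailers of electrical and electronic products | 18 | 2023-04-26
443 | (판매) 의무이행 2022년도 전자제품 회수의무이행결과보고서 제출 안내 | (Sales) Notice on submitting the electronics take-back obligation fulfillment results report for obligation year 2022 | 1 | 2023-05-08
444 | 2023년 수도권동부관할내 폐자동차재활용업체 대상 비대면 교육 안내 | Notice of 2023 remote training for end-of-life vehicle recyclers in the eastern Seoul metropolitan jurisdiction | 25 | 2023-06-02
445 | [EcoAS] 전자제품 매입판매실적서(22년분) 수정 요청 | [EcoAS] Request to correct the electronics purchase and sales performance report (for 2022) | 1 | 2023-06-27
446 | EcoAS 출고수입실적서 제출안내 | Notice on submitting the EcoAS shipment and import performance report | 1 | 2023-06-30
447 | 환경성보장제 사전예방규정 이행 안내 | Notice on complying with the preventive provisions of the Eco-Assurance System | 0 | 2023-09-14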

Duplicate rows

Most frequently occurring

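Index | 알람메시지 제목 (alert message title) | Title (English) | 알람대상 확인수 (confirmation count) | 등록일 (registration date) | # duplicates
13 | 프탈레이트 추가 준수 공표 여부 확인요청 | Request to confirm whether additional phthalate compliance has been announced | 1 | 2021-09-29 | 5
14 | 환경성보장제 사전예방규정 법령 개정사항 안내 | Notice of legal amendments to the preventive provisions of the Eco-Assurance System | 0 | 2021-03-09 | 5
5 | [환경성보장제 사전예방규정 법령 개정사항 안내] | [Notice of legal amendments to the preventive provisions of the Eco-Assurance System] | 0 | 2021-03-04 | 3
0 | 2021년 재활용의무이행결과보고서 제출 안내 | Notice on submitting the 2021 recycling obligation fulfillment results report | 8 | 2022-05-02 | 2
1 | 2022년도 판매업자 회수의무이행결과보고서 미제출 안내 | Notice of non-submission of the 2022 retailer take-back obligation fulfillment results report | 0 | 2023-05-22 | 2
2 | [EcoAS] 전자제품 매입판매실적서(22년분) 수정 요청 | [EcoAS] Request to correct the electronics purchase and sales performance report (for 2022) | 1 | 2023-06-27 | 2
3 | [전기전자제품 판매업자 법정서류 제출안내] | [Notice on submitting statutory documents for retailers of electrical and electronic products] | 0 | 2022-05-02 | 2
4 | [중소기업확인서 발급요청]중소기업 확인서 발급요청 드립니다. | [Request for SME certificate] We request issuance of an SME certificate. | 1 | 2023-08-07 | 2
6 | 매입판매실적 제출 안내 | Notice on submitting purchase and sales performance | 1 | 2021-12-15 | 2
7 | 의무이행 2022년 회수의무이행계획서 수정 제출 안내 | Notice on correcting and submitting the take-back obligation fulfillment plan for obligation year 2022 | 1 | 2022-02-24 | 2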
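A table like this can be rebuilt by treating each row as a tuple and counting identical tuples. The sketch below is one possible approach, not necessarily how the profiling tool computes it. The first two rows and their multiplicities come from the table above; the third row is a hypothetical non-duplicated record added for contrast.

```python
from collections import Counter

# (title, confirmation count, registration date)
rows = (
    [("매입판매실적 제출 안내", 1, "2021-12-15")] * 2
    + [("프탈레이트 추가 준수 공표 여부 확인요청", 1, "2021-09-29")] * 5
    + [("hypothetical single notice", 3, "2022-01-01")]  # illustrative only
)

row_counts = Counter(rows)

# Keep only rows that occur more than once, most frequent first.
duplicate_groups = [
    (row, count) for row, count in row_counts.most_common() if count > 1
]

for (title, confirmed, date), count in duplicate_groups:
    print(f"{title} | {confirmed} | {date} | {count}")
```

Two rows count as duplicates only when the title, the count, and the date all match, which is why the bracketed and unbracketed versions of the same notice appear as separate groups above.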