Overview

Dataset statistics

Number of variables3
Number of observations1020
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory25.0 KiB
Average record size in memory25.1 B

Variable types

Numeric1
Text2

Dataset

Description2023년 8월 기준 환경통계포털에서 제공하는 Open API 기능 활용을 위한 통계표명과 자체 통계표 코드 목록을 제공
URLhttps://www.data.go.kr/data/15105567/fileData.do

Alerts

연번 has unique valuesUnique
통계표코드 has unique valuesUnique

Reproduction

Analysis started2023-12-12 03:29:04.353755
Analysis finished2023-12-12 03:29:05.041301
Duration0.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1020
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean510.5
Minimum1
Maximum1020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.1 KiB
2023-12-12T12:29:05.146275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile51.95
Q1255.75
median510.5
Q3765.25
95-th percentile969.05
Maximum1020
Range1019
Interquartile range (IQR)509.5

Descriptive statistics

Standard deviation294.59294
Coefficient of variation (CV)0.57706746
Kurtosis-1.2
Mean510.5
Median Absolute Deviation (MAD)255
Skewness0
Sum520710
Variance86785
MonotonicityStrictly increasing
2023-12-12T12:29:05.333382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
687 1
 
0.1%
674 1
 
0.1%
675 1
 
0.1%
676 1
 
0.1%
677 1
 
0.1%
678 1
 
0.1%
679 1
 
0.1%
680 1
 
0.1%
681 1
 
0.1%
Other values (1010) 1010
99.0%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1020 1
0.1%
1019 1
0.1%
1018 1
0.1%
1017 1
0.1%
1016 1
0.1%
1015 1
0.1%
1014 1
0.1%
1013 1
0.1%
1012 1
0.1%
1011 1
0.1%
Distinct991
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023-12-12T12:29:05.635498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length66
Median length42
Mean length21.409804
Min length2

Characters and Unicode

Total characters21838
Distinct characters410
Distinct categories14 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique965 ?
Unique (%)94.6%

Sample

1st row환경보호활동별 매출액(조사업체)(2004~2010)
2nd row환경보호활동별 종사자수(2012)
3rd row환경산업분류별(업종별)종사자수(2012)
4th row환경산업분류별(매체별)종사자수(2012)
5th row환경보호활동별 수출액(2004~2012)
ValueCountFrequency (%)
153
 
3.9%
폐기물 127
 
3.2%
79
 
2.0%
발생량 67
 
1.7%
66
 
1.7%
원단위 58
 
1.5%
현황 55
 
1.4%
처리현황 50
 
1.3%
발생원별 50
 
1.3%
발생 47
 
1.2%
Other values (1167) 3185
80.9%
2023-12-12T12:29:06.208557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2920
 
13.4%
697
 
3.2%
565
 
2.6%
) 540
 
2.5%
( 540
 
2.5%
0 521
 
2.4%
473
 
2.2%
418
 
1.9%
409
 
1.9%
2 381
 
1.7%
Other values (400) 14374
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15366
70.4%
Space Separator 2920
 
13.4%
Decimal Number 1691
 
7.7%
Close Punctuation 540
 
2.5%
Open Punctuation 540
 
2.5%
Other Punctuation 232
 
1.1%
Uppercase Letter 200
 
0.9%
Math Symbol 159
 
0.7%
Connector Punctuation 86
 
0.4%
Dash Punctuation 80
 
0.4%
Other values (4) 24
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
697
 
4.5%
565
 
3.7%
473
 
3.1%
418
 
2.7%
409
 
2.7%
370
 
2.4%
368
 
2.4%
367
 
2.4%
338
 
2.2%
303
 
2.0%
Other values (347) 11058
72.0%
Uppercase Letter
ValueCountFrequency (%)
P 52
26.0%
M 37
18.5%
E 19
 
9.5%
H 13
 
6.5%
C 12
 
6.0%
R 11
 
5.5%
O 11
 
5.5%
B 8
 
4.0%
A 8
 
4.0%
S 7
 
3.5%
Other values (5) 22
11.0%
Decimal Number
ValueCountFrequency (%)
0 521
30.8%
2 381
22.5%
1 285
16.9%
9 201
 
11.9%
6 71
 
4.2%
8 70
 
4.1%
4 62
 
3.7%
5 40
 
2.4%
7 33
 
2.0%
3 27
 
1.6%
Other Punctuation
ValueCountFrequency (%)
, 120
51.7%
· 50
21.6%
/ 29
 
12.5%
. 9
 
3.9%
: 9
 
3.9%
& 6
 
2.6%
# 4
 
1.7%
; 4
 
1.7%
% 1
 
0.4%
Lowercase Letter
ValueCountFrequency (%)
p 4
19.0%
t 4
19.0%
x 3
14.3%
n 2
9.5%
z 2
9.5%
o 2
9.5%
c 2
9.5%
e 1
 
4.8%
s 1
 
4.8%
Math Symbol
ValueCountFrequency (%)
~ 158
99.4%
1
 
0.6%
Space Separator
ValueCountFrequency (%)
2920
100.0%
Close Punctuation
ValueCountFrequency (%)
) 540
100.0%
Open Punctuation
ValueCountFrequency (%)
( 540
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 86
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15366
70.4%
Common 6251
28.6%
Latin 221
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
697
 
4.5%
565
 
3.7%
473
 
3.1%
418
 
2.7%
409
 
2.7%
370
 
2.4%
368
 
2.4%
367
 
2.4%
338
 
2.2%
303
 
2.0%
Other values (347) 11058
72.0%
Common
ValueCountFrequency (%)
2920
46.7%
) 540
 
8.6%
( 540
 
8.6%
0 521
 
8.3%
2 381
 
6.1%
1 285
 
4.6%
9 201
 
3.2%
~ 158
 
2.5%
, 120
 
1.9%
_ 86
 
1.4%
Other values (19) 499
 
8.0%
Latin
ValueCountFrequency (%)
P 52
23.5%
M 37
16.7%
E 19
 
8.6%
H 13
 
5.9%
C 12
 
5.4%
R 11
 
5.0%
O 11
 
5.0%
B 8
 
3.6%
A 8
 
3.6%
S 7
 
3.2%
Other values (14) 43
19.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15366
70.4%
ASCII 6418
29.4%
None 51
 
0.2%
Punctuation 2
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2920
45.5%
) 540
 
8.4%
( 540
 
8.4%
0 521
 
8.1%
2 381
 
5.9%
1 285
 
4.4%
9 201
 
3.1%
~ 158
 
2.5%
, 120
 
1.9%
_ 86
 
1.3%
Other values (38) 666
 
10.4%
Hangul
ValueCountFrequency (%)
697
 
4.5%
565
 
3.7%
473
 
3.1%
418
 
2.7%
409
 
2.7%
370
 
2.4%
368
 
2.4%
367
 
2.4%
338
 
2.2%
303
 
2.0%
Other values (347) 11058
72.0%
None
ValueCountFrequency (%)
· 50
98.0%
1
 
2.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%

통계표코드
Text

UNIQUE 

Distinct1020
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023-12-12T12:29:06.539469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length17.285294
Min length11

Characters and Unicode

Total characters17631
Distinct characters25
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1020 ?
Unique (%)100.0%

Sample

1st rowDT_106N_16_0100017
2nd rowDT_106N_16_0200015
3rd rowDT_106N_16_0200014
4th rowDT_106N_16_0200013
5th rowDT_106N_16_0200012
ValueCountFrequency (%)
dt_106n_16_0100017 1
 
0.1%
dt_106t_032528 1
 
0.1%
dt_106t_032408 1
 
0.1%
dt_106t_032313 1
 
0.1%
dt_106t_032354 1
 
0.1%
dt_106t_032359 1
 
0.1%
dt_106t_032370 1
 
0.1%
dt_106t_032373 1
 
0.1%
dt_106t_032661 1
 
0.1%
dt_106t_032785 1
 
0.1%
Other values (1010) 1010
99.0%
2023-12-12T12:29:07.035542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 4728
26.8%
_ 2930
16.6%
1 2080
11.8%
6 1288
 
7.3%
T 1147
 
6.5%
D 1020
 
5.8%
N 863
 
4.9%
9 839
 
4.8%
2 811
 
4.6%
3 662
 
3.8%
Other values (15) 1263
 
7.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 11599
65.8%
Uppercase Letter 3102
 
17.6%
Connector Punctuation 2930
 
16.6%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
T 1147
37.0%
D 1020
32.9%
N 863
27.8%
A 33
 
1.1%
Z 18
 
0.6%
M 6
 
0.2%
L 3
 
0.1%
G 2
 
0.1%
R 2
 
0.1%
B 2
 
0.1%
Other values (4) 6
 
0.2%
Decimal Number
ValueCountFrequency (%)
0 4728
40.8%
1 2080
17.9%
6 1288
 
11.1%
9 839
 
7.2%
2 811
 
7.0%
3 662
 
5.7%
5 367
 
3.2%
4 360
 
3.1%
7 236
 
2.0%
8 228
 
2.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2930
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 14529
82.4%
Latin 3102
 
17.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
T 1147
37.0%
D 1020
32.9%
N 863
27.8%
A 33
 
1.1%
Z 18
 
0.6%
M 6
 
0.2%
L 3
 
0.1%
G 2
 
0.1%
R 2
 
0.1%
B 2
 
0.1%
Other values (4) 6
 
0.2%
Common
ValueCountFrequency (%)
0 4728
32.5%
_ 2930
20.2%
1 2080
14.3%
6 1288
 
8.9%
9 839
 
5.8%
2 811
 
5.6%
3 662
 
4.6%
5 367
 
2.5%
4 360
 
2.5%
7 236
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17631
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 4728
26.8%
_ 2930
16.6%
1 2080
11.8%
6 1288
 
7.3%
T 1147
 
6.5%
D 1020
 
5.8%
N 863
 
4.9%
9 839
 
4.8%
2 811
 
4.6%
3 662
 
3.8%
Other values (15) 1263
 
7.2%

Interactions

2023-12-12T12:29:04.650739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T12:29:04.888243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:29:05.000672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번통계표명통계표코드
01환경보호활동별 매출액(조사업체)(2004~2010)DT_106N_16_0100017
12환경보호활동별 종사자수(2012)DT_106N_16_0200015
23환경산업분류별(업종별)종사자수(2012)DT_106N_16_0200014
34환경산업분류별(매체별)종사자수(2012)DT_106N_16_0200013
45환경보호활동별 수출액(2004~2012)DT_106N_16_0200012
56환경분야별 자격증 소지자수DT_106N_16_0200006
67산업별/종사상지위별 환경부문 종사자수(2004~2008)DT_106N_16_0200005
78환경산업분류별(보호활동) 투자액DT_106N_16_0100061
89환경산업분류별(매체별) 투자액DT_106N_16_0100060
910환경산업분류별(업종별) 투자액DT_106N_16_0100059
연번통계표명통계표코드
10101011요 중 트리클로산 농도(크레아티닌 보정)DT_106N_99_1100082
10111012요 중 트리클로산 농도DT_106N_99_1100081
10121013요 중 프로필파라벤 농도(크레아티닌 보정)DT_106N_99_1100080
10131014요 중 프로필파라벤 농도DT_106N_99_1100079
10141015요 중 에틸파라벤 농도(크레아티닌 보정)DT_106N_99_1100078
10151016요 중 에틸파라벤 농도DT_106N_99_1100077
10161017요 중 메틸파라벤 농도(크레아티닌 보정)DT_106N_99_1100076
10171018요 중 메틸파라벤 농도DT_106N_99_1100075
10181019요 중 비스페놀 S 농도(크레아티닌 보정)DT_106N_99_1100074
10191020요 중 카드뮴 농도DT_106N_99_1100057