Overview

Dataset statistics

Number of variables4
Number of observations1040
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory35.7 KiB
Average record size in memory35.1 B

Variable types

Numeric3
Text1

Dataset

Description공공데이터 제공신청으로 2019.07~2019.12 기간의 대 중국 수출 품목(HS 4단위) 정보를 2020년에 개방했으며, 이후 1년 기간으로 데이터를 제공 : 순번, 품목코드(HS4), 품목명 등
URLhttps://www.data.go.kr/data/15060049/fileData.do

Alerts

순번 is highly overall correlated with 품목코드(HS4)High correlation
품목코드(HS4) is highly overall correlated with 순번High correlation
2022수출액(달러) is highly skewed (γ1 = 28.44106434)Skewed
순번 has unique valuesUnique
품목코드(HS4) has unique valuesUnique
품목명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:08:22.934820
Analysis finished2023-12-12 18:08:24.715325
Duration1.78 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1040
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean520.5
Minimum1
Maximum1040
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.3 KiB
2023-12-13T03:08:24.790743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile52.95
Q1260.75
median520.5
Q3780.25
95-th percentile988.05
Maximum1040
Range1039
Interquartile range (IQR)519.5

Descriptive statistics

Standard deviation300.36644
Coefficient of variation (CV)0.5770729
Kurtosis-1.2
Mean520.5
Median Absolute Deviation (MAD)260
Skewness0
Sum541320
Variance90220
MonotonicityStrictly increasing
2023-12-13T03:08:25.242895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
685 1
 
0.1%
687 1
 
0.1%
688 1
 
0.1%
689 1
 
0.1%
690 1
 
0.1%
691 1
 
0.1%
692 1
 
0.1%
693 1
 
0.1%
694 1
 
0.1%
Other values (1030) 1030
99.0%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1040 1
0.1%
1039 1
0.1%
1038 1
0.1%
1037 1
0.1%
1036 1
0.1%
1035 1
0.1%
1034 1
0.1%
1033 1
0.1%
1032 1
0.1%
1031 1
0.1%

품목코드(HS4)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1040
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5629.0904
Minimum106
Maximum9706
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.3 KiB
2023-12-13T03:08:25.404110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum106
5-th percentile1211.95
Q13204.75
median5956
Q38214.25
95-th percentile9104.05
Maximum9706
Range9600
Interquartile range (IQR)5009.5

Descriptive statistics

Standard deviation2608.2225
Coefficient of variation (CV)0.46334707
Kurtosis-1.2026645
Mean5629.0904
Median Absolute Deviation (MAD)2457.5
Skewness-0.23869979
Sum5854254
Variance6802824.7
MonotonicityStrictly increasing
2023-12-13T03:08:25.584250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
106 1
 
0.1%
7307 1
 
0.1%
7309 1
 
0.1%
7310 1
 
0.1%
7311 1
 
0.1%
7312 1
 
0.1%
7313 1
 
0.1%
7314 1
 
0.1%
7315 1
 
0.1%
7316 1
 
0.1%
Other values (1030) 1030
99.0%
ValueCountFrequency (%)
106 1
0.1%
210 1
0.1%
302 1
0.1%
303 1
0.1%
304 1
0.1%
305 1
0.1%
306 1
0.1%
307 1
0.1%
308 1
0.1%
401 1
0.1%
ValueCountFrequency (%)
9706 1
0.1%
9703 1
0.1%
9702 1
0.1%
9701 1
0.1%
9620 1
0.1%
9619 1
0.1%
9618 1
0.1%
9617 1
0.1%
9616 1
0.1%
9615 1
0.1%

품목명
Text

UNIQUE 

Distinct1040
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size8.3 KiB
2023-12-13T03:08:25.901398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length347
Median length161
Mean length70.803846
Min length1

Characters and Unicode

Total characters73636
Distinct characters1071
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1040 ?
Unique (%)100.0%

Sample

1st row그 밖의 살아 있는 동물
2nd row육과 식용 설육(屑肉)(염장하거나 염수장한 것ㆍ건조하거나 훈제한 것으로 한정한다), 육이나 설육(屑肉)의 식용 고운 가루ㆍ거친 가루
3rd row신선하거나 냉장한 어류[제0304호의 어류의 필레(fillet)와 그 밖의 어육은 제외한다]
4th row냉동어류[제0304호의 어류의 필레(fillet)와 기타 어육은 제외한다]
5th row어류의 필레(fillet)와 그 밖의 어육(잘게 썰었는지에 상관없으며, 신선한 것ㆍ냉장한 것ㆍ냉동한 것으로 한정한다)
ValueCountFrequency (%)
444
 
3.8%
밖의 380
 
3.3%
것으로 300
 
2.6%
한정한다 276
 
2.4%
만든 210
 
1.8%
이와 206
 
1.8%
유사한 200
 
1.7%
제외한다 180
 
1.5%
상관없다 158
 
1.4%
것인지에 152
 
1.3%
Other values (4798) 9162
78.5%
2023-12-13T03:08:26.389312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10628
 
14.4%
) 2077
 
2.8%
( 2077
 
2.8%
1929
 
2.6%
1822
 
2.5%
1383
 
1.9%
1224
 
1.7%
1210
 
1.6%
1043
 
1.4%
973
 
1.3%
Other values (1061) 49270
66.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47966
65.1%
Space Separator 10628
 
14.4%
Lowercase Letter 8118
 
11.0%
Close Punctuation 2421
 
3.3%
Open Punctuation 2420
 
3.3%
Decimal Number 1148
 
1.6%
Other Punctuation 855
 
1.2%
Dash Punctuation 67
 
0.1%
Uppercase Letter 13
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1929
 
4.0%
1822
 
3.8%
1383
 
2.9%
1224
 
2.6%
1210
 
2.5%
1043
 
2.2%
973
 
2.0%
913
 
1.9%
875
 
1.8%
807
 
1.7%
Other values (1008) 35787
74.6%
Lowercase Letter
ValueCountFrequency (%)
e 966
11.9%
a 711
 
8.8%
r 676
 
8.3%
t 635
 
7.8%
i 577
 
7.1%
l 544
 
6.7%
s 527
 
6.5%
o 490
 
6.0%
n 434
 
5.3%
c 416
 
5.1%
Other values (16) 2142
26.4%
Decimal Number
ValueCountFrequency (%)
0 295
25.7%
8 137
11.9%
4 126
11.0%
1 121
10.5%
5 121
10.5%
3 84
 
7.3%
6 84
 
7.3%
2 74
 
6.4%
9 58
 
5.1%
7 48
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
P 4
30.8%
C 3
23.1%
O 2
15.4%
J 1
 
7.7%
F 1
 
7.7%
B 1
 
7.7%
S 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 752
88.0%
: 72
 
8.4%
· 16
 
1.9%
. 15
 
1.8%
Close Punctuation
ValueCountFrequency (%)
) 2077
85.8%
] 344
 
14.2%
Open Punctuation
ValueCountFrequency (%)
( 2077
85.8%
[ 343
 
14.2%
Space Separator
ValueCountFrequency (%)
10628
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 67
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47167
64.1%
Common 17539
 
23.8%
Latin 8131
 
11.0%
Han 799
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1929
 
4.1%
1822
 
3.9%
1383
 
2.9%
1224
 
2.6%
1210
 
2.6%
1043
 
2.2%
973
 
2.1%
913
 
1.9%
875
 
1.9%
807
 
1.7%
Other values (775) 34988
74.2%
Han
ValueCountFrequency (%)
28
 
3.5%
27
 
3.4%
26
 
3.3%
20
 
2.5%
20
 
2.5%
20
 
2.5%
13
 
1.6%
12
 
1.5%
12
 
1.5%
12
 
1.5%
Other values (223) 609
76.2%
Latin
ValueCountFrequency (%)
e 966
11.9%
a 711
 
8.7%
r 676
 
8.3%
t 635
 
7.8%
i 577
 
7.1%
l 544
 
6.7%
s 527
 
6.5%
o 490
 
6.0%
n 434
 
5.3%
c 416
 
5.1%
Other values (23) 2155
26.5%
Common
ValueCountFrequency (%)
10628
60.6%
) 2077
 
11.8%
( 2077
 
11.8%
, 752
 
4.3%
] 344
 
2.0%
[ 343
 
2.0%
0 295
 
1.7%
8 137
 
0.8%
4 126
 
0.7%
1 121
 
0.7%
Other values (10) 639
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45238
61.4%
ASCII 25654
34.8%
Compat Jamo 1929
 
2.6%
CJK 785
 
1.1%
None 16
 
< 0.1%
CJK Compat Ideographs 14
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10628
41.4%
) 2077
 
8.1%
( 2077
 
8.1%
e 966
 
3.8%
, 752
 
2.9%
a 711
 
2.8%
r 676
 
2.6%
t 635
 
2.5%
i 577
 
2.2%
l 544
 
2.1%
Other values (42) 6011
23.4%
Compat Jamo
ValueCountFrequency (%)
1929
100.0%
Hangul
ValueCountFrequency (%)
1822
 
4.0%
1383
 
3.1%
1224
 
2.7%
1210
 
2.7%
1043
 
2.3%
973
 
2.2%
913
 
2.0%
875
 
1.9%
807
 
1.8%
779
 
1.7%
Other values (774) 34209
75.6%
CJK
ValueCountFrequency (%)
28
 
3.6%
27
 
3.4%
26
 
3.3%
20
 
2.5%
20
 
2.5%
20
 
2.5%
13
 
1.7%
12
 
1.5%
12
 
1.5%
12
 
1.5%
Other values (214) 595
75.8%
None
ValueCountFrequency (%)
· 16
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
5
35.7%
2
 
14.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%

2022수출액(달러)
Real number (ℝ)

SKEWED 

Distinct1039
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.4979749 × 108
Minimum1
Maximum4.8812087 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.3 KiB
2023-12-13T03:08:26.530775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2210
Q1223624.75
median2948032
Q331875545
95-th percentile4.1872375 × 108
Maximum4.8812087 × 1010
Range4.8812087 × 1010
Interquartile range (IQR)31651920

Descriptive statistics

Standard deviation1.5787177 × 109
Coefficient of variation (CV)10.539013
Kurtosis871.45913
Mean1.4979749 × 108
Median Absolute Deviation (MAD)2941924
Skewness28.441064
Sum1.5578939 × 1011
Variance2.4923496 × 1018
MonotonicityNot monotonic
2023-12-13T03:08:26.670787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2110 2
 
0.2%
76375 1
 
0.1%
97245726 1
 
0.1%
13194484 1
 
0.1%
114784909 1
 
0.1%
101758609 1
 
0.1%
43758365 1
 
0.1%
1247 1
 
0.1%
1917172 1
 
0.1%
8331047 1
 
0.1%
Other values (1029) 1029
98.9%
ValueCountFrequency (%)
1 1
0.1%
12 1
0.1%
14 1
0.1%
19 1
0.1%
24 1
0.1%
30 1
0.1%
42 1
0.1%
53 1
0.1%
60 1
0.1%
125 1
0.1%
ValueCountFrequency (%)
48812087314 1
0.1%
6291718764 1
0.1%
5876711165 1
0.1%
4520212508 1
0.1%
4188941454 1
0.1%
3873980545 1
0.1%
3828048043 1
0.1%
3451548843 1
0.1%
3069048170 1
0.1%
2695316845 1
0.1%

Interactions

2023-12-13T03:08:24.253356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:08:23.580328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:08:23.923936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:08:24.346454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:08:23.693887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:08:24.059052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:08:24.447461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:08:23.805428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:08:24.153820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:08:26.757029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번품목코드(HS4)2022수출액(달러)
순번1.0000.9880.000
품목코드(HS4)0.9881.0000.000
2022수출액(달러)0.0000.0001.000
2023-12-13T03:08:26.837948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번품목코드(HS4)2022수출액(달러)
순번1.0001.0000.110
품목코드(HS4)1.0001.0000.110
2022수출액(달러)0.1100.1101.000

Missing values

2023-12-13T03:08:24.572182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:08:24.670528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번품목코드(HS4)품목명2022수출액(달러)
01106그 밖의 살아 있는 동물76375
12210육과 식용 설육(屑肉)(염장하거나 염수장한 것ㆍ건조하거나 훈제한 것으로 한정한다), 육이나 설육(屑肉)의 식용 고운 가루ㆍ거친 가루3878
23302신선하거나 냉장한 어류[제0304호의 어류의 필레(fillet)와 그 밖의 어육은 제외한다]4546691
34303냉동어류[제0304호의 어류의 필레(fillet)와 기타 어육은 제외한다]524483534
45304어류의 필레(fillet)와 그 밖의 어육(잘게 썰었는지에 상관없으며, 신선한 것ㆍ냉장한 것ㆍ냉동한 것으로 한정한다)35969524
56305건조한 어류, 염장이나 염수장한 어류, 훈제한 어류(훈제과정 중이나 훈제 전에 조리한 것인지에 상관없다)632450
67306갑각류(껍데기가 붙어 있는 것인지에 상관없으며 살아 있는 것과 신선한 것·냉장한 것·냉동한 것·건조한 것·염장이나 염수장한 것), 훈제한 갑각류(껍데기가 붙어 있는 것인지 또는 훈제 전이나 훈제과정 중에 조리한 것인지에 상관없다), 껍데기가 붙어 있는 상태로 물에 찌거나 삶은 갑각류(냉장한 것·냉동한 것·건조한 것·염장이나 염수장한 것인지에 상관없다)56480666
78307연체동물(껍데기가 붙어 있는지에 상관없으며 살아 있는 것과 신선한 것ㆍ냉장이나 냉동한 것ㆍ건조한 것ㆍ염장이나 염수장한 것), 훈제한 연체동물(껍데기가 붙어 있는 것인지 또는 훈제 전이나 훈제과정 중에 조리한 것인지에 상관없다)68507566
89308수생(水生) 무척추동물(갑각류와 연체동물은 제외하며, 살아 있는 것과 신선한 것·냉장한 것·냉동한 것·건조한 것, 염장이나 염수장한 것), 훈제한 수생(水生) 무척추동물(갑각류와 연체동물은 제외하며, 훈제 전이나 훈제과정 중에 조리한 것인지에 상관없다)20498316
910401밀크와 크림(농축하지 않은 것으로서 설탕이나 그 밖의 감미료를 첨가하지 않은 것으로 한정한다)7414193
순번품목코드(HS4)품목명2022수출액(달러)
103010319615빗ㆍ헤어슬라이드(hair-slide)와 이와 유사한 물품ㆍ머리핀ㆍ컬링핀(curling pin)ㆍ컬링그립(curling grip)ㆍ헤어컬러(hair curler)와 이와 유사한 물품(제8516호에 해당하는 물품은 제외한다)과 이들의 부분품1268975
103110329616향수용 분무기와 이와 유사한 화장용 분무기, 이들의 마운트(mount)와 두부(頭部), 화장용 분첩과 패드3477798
103210339617진공 플라스크와 그 밖의 진공용기(완전한 것으로 한정한다)와 그 부분품(유리로 만든 내부용기는 제외한다)565823
103310349618마네킹 인형과 그 밖의 모델형 인형, 자동인형과 그 밖의 쇼윈도 장식용인 움직이는 전시용품85916
103410359619위생 타월(패드)ㆍ탐폰(tampon), 냅킨(기저귀)ㆍ냅킨라이너(napkin liner)와 이와 유사한 물품(어떤 재질이라도 가능하다)11841987
103510369620일각대ㆍ양각대ㆍ삼각대와 이와 유사한 물품23898
103610379701회화ㆍ데생ㆍ파스텔(손으로 직접 그린 것으로 한정하며, 제4906호의 도안과 손으로 그렸거나 장식한 가공품은 제외한다), 콜라주(collage)ㆍ모자이크와 이와 유사한 장식판4492594
103710389702오리지널 동판화ㆍ목판화ㆍ석판화14720
103810399703오리지널 조각과 조상(彫像)(어떤 재료라도 가능하다)226823
103910409706골동품(제작 후 100년을 초과한 것으로 한정한다)10652