Overview

Dataset statistics

Number of variables: 9
Number of observations: 7880
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 585.0 KiB
Average record size in memory: 76.0 B
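
The average record size can be cross-checked from the total size and the number of observations. This is a minimal sketch, assuming 1 KiB means 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
total_size_kib = 585.0
n_observations = 7880

# Assumption: 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")  # prints "76.0 B"
```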

Variable types

Numeric: 4
Text: 4
Categorical: 1

Dataset

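Description: Dataset (CSV file) of small-business establishment information by traditional market in Gyeongsangbuk-do, covering 7,881 entries. Fields: 순번 (sequence number), 시장 순번 (market sequence number), 시장명 (market name), 상호명 (business name), 시군명 (city/county name), 주소 (address), 종사자 수 (number of employees), 매출등급 (sales grade).
Author: Gyeongsangbuk-do (경상북도)
URL: https://www.data.go.kr/data/15096091/fileData.do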

Alerts

시장 순번 is highly overall correlated with 시군명 (High correlation)
시군명 is highly overall correlated with 시장 순번 (High correlation)
순번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 11:22:05.920563
Analysis finished: 2023-12-12 11:22:11.853468
Duration: 5.93 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
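
The reported duration is the difference between the two timestamps. A small sketch using only the standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
started = datetime.fromisoformat("2023-12-12 11:22:05.920563")
finished = datetime.fromisoformat("2023-12-12 11:22:11.853468")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")  # prints "5.93 seconds"
```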

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 7880
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 101270.64
Minimum: 7
Maximum: 204345
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 69.4 KiB

Quantile statistics

Minimum: 7
5-th percentile: 8347.85
Q1: 50744.75
median: 100821
Q3: 153456.25
95-th percentile: 193259.05
Maximum: 204345
Range: 204338
Interquartile range (IQR): 102711.5

Descriptive statistics

Standard deviation: 59344.707
Coefficient of variation (CV): 0.58600108
Kurtosis: -1.2046918
Mean: 101270.64
Median Absolute Deviation (MAD): 51594
Skewness: 0.00061541435
Sum: 7.9801268 × 10^8
Variance: 3.5217942 × 10^9
Monotonicity: Strictly increasing
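
Several of the figures for 순번 are derived from others, which gives a quick consistency check. The sketch below recomputes them from the rounded values printed in the report, so small differences in the last digits are expected; the variable names are illustrative.

```python
# Values copied from the 순번 statistics above (already rounded by the report).
minimum, maximum = 7, 204345
q1, q3 = 50744.75, 153456.25
mean, std = 101270.64, 59344.707
n = 7880

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range
cv = std / mean                  # Coefficient of variation
variance = std ** 2              # Variance is the squared standard deviation
total = mean * n                 # Sum is the mean times the number of rows

print(value_range, iqr)
print(f"{cv:.6f} {variance:.4e} {total:.4e}")
```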
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
7 | 1 | < 0.1%
136147 | 1 | < 0.1%
136106 | 1 | < 0.1%
136105 | 1 | < 0.1%
136093 | 1 | < 0.1%
136074 | 1 | < 0.1%
136068 | 1 | < 0.1%
136059 | 1 | < 0.1%
136043 | 1 | < 0.1%
136039 | 1 | < 0.1%
Other values (7870) | 7870 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
7 | 1 | < 0.1%
8 | 1 | < 0.1%
16 | 1 | < 0.1%
18 | 1 | < 0.1%
33 | 1 | < 0.1%
34 | 1 | < 0.1%
50 | 1 | < 0.1%
69 | 1 | < 0.1%
135 | 1 | < 0.1%
215 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
204345 | 1 | < 0.1%
204292 | 1 | < 0.1%
204288 | 1 | < 0.1%
204230 | 1 | < 0.1%
204220 | 1 | < 0.1%
204180 | 1 | < 0.1%
204178 | 1 | < 0.1%
204159 | 1 | < 0.1%
204130 | 1 | < 0.1%
204106 | 1 | < 0.1%

시장 순번
Real number (ℝ)

HIGH CORRELATION 

Distinct: 149
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 84.244289
Minimum: 1
Maximum: 199
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 69.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 12
Q1: 35
median: 91
Q3: 118
95-th percentile: 176
Maximum: 199
Range: 198
Interquartile range (IQR): 83

Descriptive statistics

Standard deviation: 50.2018
Coefficient of variation (CV): 0.59590745
Kurtosis: -0.70890229
Mean: 84.244289
Median Absolute Deviation (MAD): 34
Skewness: 0.16215805
Sum: 663845
Variance: 2520.2207
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
23 | 1011 | 12.8%
95 | 401 | 5.1%
98 | 374 | 4.7%
61 | 302 | 3.8%
85 | 287 | 3.6%
106 | 278 | 3.5%
72 | 274 | 3.5%
129 | 247 | 3.1%
1 | 194 | 2.5%
137 | 172 | 2.2%
Other values (139) | 4340 | 55.1%

Minimum 10 values

Value | Count | Frequency (%)
1 | 194 | 2.5%
2 | 80 | 1.0%
4 | 6 | 0.1%
5 | 20 | 0.3%
7 | 17 | 0.2%
9 | 1 | < 0.1%
10 | 24 | 0.3%
11 | 5 | 0.1%
12 | 95 | 1.2%
13 | 106 | 1.3%

Maximum 10 values

Value | Count | Frequency (%)
199 | 25 | 0.3%
197 | 2 | < 0.1%
193 | 118 | 1.5%
192 | 12 | 0.2%
191 | 9 | 0.1%
190 | 47 | 0.6%
189 | 4 | 0.1%
188 | 21 | 0.3%
187 | 1 | < 0.1%
184 | 9 | 0.1%

시장명
Text

Distinct: 146
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 61.7 KiB

Length

Max length: 11
Median length: 9
Mean length: 5.2300761
Min length: 4

Characters and Unicode

Total characters: 41213
Distinct characters: 149
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
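
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own implementation, and the function name `category_counts` is hypothetical. The two input strings are market names that appear in this dataset. The codes `Lo`, `Ps`, `Pe` and `Nd` correspond to the report's "Other Letter", "Open Punctuation", "Close Punctuation" and "Decimal Number".

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over all characters in `values`."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Two market names taken from the dataset.
counts = category_counts(["죽도시장", "(유)중앙시장"])
print(dict(counts))
```

Hangul syllables fall under `Lo`, which is why "Other Letter" dominates the category tables for the text variables.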

Unique

Unique: 7
Unique (%): 0.1%

Sample

1st row: 풍기선비골인삼시장
2nd row: 중앙신시장
3rd row: 영천공설시장
4th row: 죽도시장
5th row: 북정로중심상가시장

Value | Count | Frequency (%)
죽도시장 | 1011 | 12.8%
중앙신시장 | 401 | 5.1%
구미새마을중앙시장 | 374 | 4.7%
중앙시장 | 350 | 4.4%
성동시장 | 302 | 3.8%
안동구시장 | 287 | 3.6%
구미산업유통단지 | 278 | 3.5%
영천공설시장 | 247 | 3.1%
구룡포시장 | 194 | 2.5%
풍물시장 | 172 | 2.2%
Other values (136) | 4264 | 54.1%

Most occurring characters

ValueCountFrequency (%)
7533
18.3%
7430
18.0%
1504
 
3.6%
1367
 
3.3%
1187
 
2.9%
1084
 
2.6%
1050
 
2.5%
828
 
2.0%
812
 
2.0%
682
 
1.7%
Other values (139) 17736
43.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41062
99.6%
Close Punctuation 53
 
0.1%
Open Punctuation 53
 
0.1%
Decimal Number 45
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7533
18.3%
7430
18.1%
1504
 
3.7%
1367
 
3.3%
1187
 
2.9%
1084
 
2.6%
1050
 
2.6%
828
 
2.0%
812
 
2.0%
682
 
1.7%
Other values (135) 17585
42.8%
Decimal Number
ValueCountFrequency (%)
1 31
68.9%
5 14
31.1%
Close Punctuation
ValueCountFrequency (%)
) 53
100.0%
Open Punctuation
ValueCountFrequency (%)
( 53
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41062
99.6%
Common 151
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7533
18.3%
7430
18.1%
1504
 
3.7%
1367
 
3.3%
1187
 
2.9%
1084
 
2.6%
1050
 
2.6%
828
 
2.0%
812
 
2.0%
682
 
1.7%
Other values (135) 17585
42.8%
Common
ValueCountFrequency (%)
) 53
35.1%
( 53
35.1%
1 31
20.5%
5 14
 
9.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41062
99.6%
ASCII 151
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7533
18.3%
7430
18.1%
1504
 
3.7%
1367
 
3.3%
1187
 
2.9%
1084
 
2.6%
1050
 
2.6%
828
 
2.0%
812
 
2.0%
682
 
1.7%
Other values (135) 17585
42.8%
ASCII
ValueCountFrequency (%)
) 53
35.1%
( 53
35.1%
1 31
20.5%
5 14
 
9.3%

상호명
Text

Distinct: 5056
Distinct (%): 64.2%
Missing: 0
Missing (%): 0.0%
Memory size: 61.7 KiB

Length

Max length: 28
Median length: 27
Mean length: 5.3545685
Min length: 2

Characters and Unicode

Total characters: 42194
Distinct characters: 722
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3932
Unique (%): 49.9%

Sample

1st row: 풍기****
2nd row: 중앙***
3rd row: 최옥******
4th row: 수진**
5th row: 라이****

Value | Count | Frequency (%)
시장 | 147 | 1.9%
중앙 | 73 | 0.9%
서울 | 71 | 0.9%
안동 | 69 | 0.9%
제일 | 59 | 0.7%
경북 | 53 | 0.7%
우리 | 53 | 0.7%
현대 | 49 | 0.6%
동해 | 47 | 0.6%
포항 | 46 | 0.6%
Other values (3388) | 7233 | 91.6%

Most occurring characters

ValueCountFrequency (%)
* 26490
62.8%
411
 
1.0%
402
 
1.0%
370
 
0.9%
316
 
0.7%
302
 
0.7%
262
 
0.6%
258
 
0.6%
239
 
0.6%
238
 
0.6%
Other values (712) 12906
30.6%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 26500
62.8%
Other Letter 15374
36.4%
Uppercase Letter 120
 
0.3%
Decimal Number 65
 
0.2%
Open Punctuation 50
 
0.1%
Close Punctuation 32
 
0.1%
Lowercase Letter 26
 
0.1%
Space Separator 20
 
< 0.1%
Dash Punctuation 6
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
411
 
2.7%
402
 
2.6%
370
 
2.4%
316
 
2.1%
302
 
2.0%
262
 
1.7%
258
 
1.7%
239
 
1.6%
238
 
1.5%
232
 
1.5%
Other values (654) 12344
80.3%
Uppercase Letter
ValueCountFrequency (%)
C 9
 
7.5%
M 9
 
7.5%
O 8
 
6.7%
L 8
 
6.7%
K 8
 
6.7%
A 7
 
5.8%
G 7
 
5.8%
S 7
 
5.8%
D 7
 
5.8%
B 7
 
5.8%
Other values (13) 43
35.8%
Lowercase Letter
ValueCountFrequency (%)
k 3
11.5%
n 3
11.5%
o 3
11.5%
e 3
11.5%
r 2
7.7%
a 2
7.7%
y 2
7.7%
h 2
7.7%
i 1
 
3.8%
s 1
 
3.8%
Other values (4) 4
15.4%
Decimal Number
ValueCountFrequency (%)
8 20
30.8%
1 8
 
12.3%
3 7
 
10.8%
6 7
 
10.8%
7 7
 
10.8%
0 4
 
6.2%
2 4
 
6.2%
4 3
 
4.6%
5 3
 
4.6%
9 2
 
3.1%
Other Punctuation
ValueCountFrequency (%)
* 26490
> 99.9%
. 6
 
< 0.1%
& 2
 
< 0.1%
' 1
 
< 0.1%
! 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 46
92.0%
4
 
8.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Space Separator
ValueCountFrequency (%)
20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 26674
63.2%
Hangul 15371
36.4%
Latin 146
 
0.3%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
411
 
2.7%
402
 
2.6%
370
 
2.4%
316
 
2.1%
302
 
2.0%
262
 
1.7%
258
 
1.7%
239
 
1.6%
238
 
1.5%
232
 
1.5%
Other values (651) 12341
80.3%
Latin
ValueCountFrequency (%)
C 9
 
6.2%
M 9
 
6.2%
O 8
 
5.5%
L 8
 
5.5%
K 8
 
5.5%
A 7
 
4.8%
G 7
 
4.8%
S 7
 
4.8%
D 7
 
4.8%
B 7
 
4.8%
Other values (27) 69
47.3%
Common
ValueCountFrequency (%)
* 26490
99.3%
( 46
 
0.2%
) 32
 
0.1%
20
 
0.1%
8 20
 
0.1%
1 8
 
< 0.1%
3 7
 
< 0.1%
6 7
 
< 0.1%
7 7
 
< 0.1%
. 6
 
< 0.1%
Other values (11) 31
 
0.1%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 26816
63.6%
Hangul 15371
36.4%
None 4
 
< 0.1%
CJK 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 26490
98.8%
( 46
 
0.2%
) 32
 
0.1%
20
 
0.1%
8 20
 
0.1%
C 9
 
< 0.1%
M 9
 
< 0.1%
O 8
 
< 0.1%
1 8
 
< 0.1%
L 8
 
< 0.1%
Other values (47) 166
 
0.6%
Hangul
ValueCountFrequency (%)
411
 
2.7%
402
 
2.6%
370
 
2.4%
316
 
2.1%
302
 
2.0%
262
 
1.7%
258
 
1.7%
239
 
1.6%
238
 
1.5%
232
 
1.5%
Other values (651) 12341
80.3%
None
ValueCountFrequency (%)
4
100.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

시군명
Categorical

HIGH CORRELATION 

Distinct: 21
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 61.7 KiB

Value | Count
포항시 북구 | 1423
구미시 | 1054
안동시 | 995
포항시 남구 | 984
경주시 | 866
Other values (16) | 2558

Length

Max length: 6
Median length: 3
Mean length: 3.9163706
Min length: 3

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 영주시
2nd row: 안동시
3rd row: 영천시
4th row: 포항시 북구
5th row: 경주시

Common Values

Value | Count | Frequency (%)
포항시 북구 | 1423 | 18.1%
구미시 | 1054 | 13.4%
안동시 | 995 | 12.6%
포항시 남구 | 984 | 12.5%
경주시 | 866 | 11.0%
영주시 | 816 | 10.4%
영천시 | 343 | 4.4%
김천시 | 266 | 3.4%
상주시 | 238 | 3.0%
문경시 | 152 | 1.9%
Other values (11) | 743 | 9.4%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
포항시 | 2407 | 23.4%
북구 | 1423 | 13.8%
구미시 | 1054 | 10.2%
안동시 | 995 | 9.7%
남구 | 984 | 9.6%
경주시 | 866 | 8.4%
영주시 | 816 | 7.9%
영천시 | 343 | 3.3%
김천시 | 266 | 2.6%
상주시 | 238 | 2.3%
Other values (12) | 895 | 8.7%
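
The word counts above can be reproduced from the category counts by splitting each 시군명 value on whitespace. This is a hedged sketch of that idea, not the profiler's own tokenizer. It uses only three of the categories listed under Common Values, and the variable names are illustrative.

```python
from collections import Counter

# Row counts for three categories, copied from the Common Values table.
category_rows = {"포항시 북구": 1423, "포항시 남구": 984, "구미시": 1054}

# Each whitespace-separated word inherits the row count of its category.
word_counts = Counter()
for category, rows in category_rows.items():
    for word in category.split():
        word_counts[word] += rows

print(word_counts.most_common())
```

The two 포항시 districts contribute 1423 + 984 = 2407 occurrences of the word 포항시, matching the first row of the word table.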

도로명 주소
Text

Distinct: 2903
Distinct (%): 36.8%
Missing: 0
Missing (%): 0.0%
Memory size: 61.7 KiB

Length

Max length: 22
Median length: 19
Mean length: 13.773604
Min length: 9

Characters and Unicode

Total characters: 108536
Distinct characters: 158
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1778
Unique (%): 22.6%

Sample

1st row: 영주시 소백로 2156
2nd row: 안동시 중앙시장4길 26-24
3rd row: 영천시 시장4길 38
4th row: 포항시 북구 죽도시장13길 13
5th row: 경주시 원효로 127-4

Value | Count | Frequency (%)
포항시 | 2407 | 9.2%
북구 | 1423 | 5.5%
구미시 | 1054 | 4.0%
안동시 | 995 | 3.8%
남구 | 984 | 3.8%
경주시 | 866 | 3.3%
영주시 | 816 | 3.1%
영천시 | 343 | 1.3%
11 | 329 | 1.3%
302-7 | 278 | 1.1%
Other values (1292) | 16552 | 63.5%

Most occurring characters

ValueCountFrequency (%)
18167
 
16.7%
9333
 
8.6%
1 7073
 
6.5%
5113
 
4.7%
4743
 
4.4%
2 4253
 
3.9%
4024
 
3.7%
3 3026
 
2.8%
- 2542
 
2.3%
2533
 
2.3%
Other values (148) 47729
44.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 60927
56.1%
Decimal Number 26900
24.8%
Space Separator 18167
 
16.7%
Dash Punctuation 2542
 
2.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9333
 
15.3%
5113
 
8.4%
4743
 
7.8%
4024
 
6.6%
2533
 
4.2%
2407
 
4.0%
2335
 
3.8%
2234
 
3.7%
1809
 
3.0%
1606
 
2.6%
Other values (136) 24790
40.7%
Decimal Number
ValueCountFrequency (%)
1 7073
26.3%
2 4253
15.8%
3 3026
11.2%
4 2303
 
8.6%
6 1987
 
7.4%
5 1860
 
6.9%
7 1729
 
6.4%
9 1711
 
6.4%
0 1548
 
5.8%
8 1410
 
5.2%
Space Separator
ValueCountFrequency (%)
18167
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2542
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 60927
56.1%
Common 47609
43.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9333
 
15.3%
5113
 
8.4%
4743
 
7.8%
4024
 
6.6%
2533
 
4.2%
2407
 
4.0%
2335
 
3.8%
2234
 
3.7%
1809
 
3.0%
1606
 
2.6%
Other values (136) 24790
40.7%
Common
ValueCountFrequency (%)
18167
38.2%
1 7073
 
14.9%
2 4253
 
8.9%
3 3026
 
6.4%
- 2542
 
5.3%
4 2303
 
4.8%
6 1987
 
4.2%
5 1860
 
3.9%
7 1729
 
3.6%
9 1711
 
3.6%
Other values (2) 2958
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 60927
56.1%
ASCII 47609
43.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18167
38.2%
1 7073
 
14.9%
2 4253
 
8.9%
3 3026
 
6.4%
- 2542
 
5.3%
4 2303
 
4.8%
6 1987
 
4.2%
5 1860
 
3.9%
7 1729
 
3.6%
9 1711
 
3.6%
Other values (2) 2958
 
6.2%
Hangul
ValueCountFrequency (%)
9333
 
15.3%
5113
 
8.4%
4743
 
7.8%
4024
 
6.6%
2533
 
4.2%
2407
 
4.0%
2335
 
3.8%
2234
 
3.7%
1809
 
3.0%
1606
 
2.6%
Other values (136) 24790
40.7%

종사자 수
Real number (ℝ)

Distinct: 9
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.617132
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 69.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
median: 1
Q3: 2
95-th percentile: 3
Maximum: 9
Range: 8
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.84018619
Coefficient of variation (CV): 0.51955326
Kurtosis: 7.9722007
Mean: 1.617132
Median Absolute Deviation (MAD): 0
Skewness: 2.0505139
Sum: 12743
Variance: 0.70591283
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=9)
Common values

Value | Count | Frequency (%)
1 | 4282 | 54.3%
2 | 2710 | 34.4%
3 | 617 | 7.8%
4 | 221 | 2.8%
5 | 23 | 0.3%
6 | 12 | 0.2%
8 | 8 | 0.1%
7 | 4 | 0.1%
9 | 3 | < 0.1%

Minimum 10 values

Value | Count | Frequency (%)
1 | 4282 | 54.3%
2 | 2710 | 34.4%
3 | 617 | 7.8%
4 | 221 | 2.8%
5 | 23 | 0.3%
6 | 12 | 0.2%
7 | 4 | 0.1%
8 | 8 | 0.1%
9 | 3 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
9 | 3 | < 0.1%
8 | 8 | 0.1%
7 | 4 | 0.1%
6 | 12 | 0.2%
5 | 23 | 0.3%
4 | 221 | 2.8%
3 | 617 | 7.8%
2 | 2710 | 34.4%
1 | 4282 | 54.3%
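
Because the frequency table for 종사자 수 lists every distinct value, the summary statistics can be recomputed from it directly. The sketch below assumes the report's variance is the sample variance (denominator n - 1), which is what reproduces the printed figure; the variable names are illustrative.

```python
from math import sqrt

# Value -> count, copied from the 종사자 수 frequency table above.
freq = {1: 4282, 2: 2710, 3: 617, 4: 221, 5: 23, 6: 12, 7: 4, 8: 8, 9: 3}

n = sum(freq.values())                             # number of observations
total = sum(value * count for value, count in freq.items())
mean = total / n

# Assumption: sample variance with an (n - 1) denominator.
variance = sum(count * (value - mean) ** 2 for value, count in freq.items()) / (n - 1)
std = sqrt(variance)

print(n, total)
print(f"{mean:.6f} {variance:.6f} {std:.6f}")
```

The counts add up to the 7880 observations and the weighted sum equals the reported Sum of 12743, so the table and the summary statistics agree.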

매출 등급
Real number (ℝ)

Distinct: 37
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.7574873
Minimum: 1
Maximum: 64
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 69.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
median: 1
Q3: 1
95-th percentile: 5
Maximum: 64
Range: 63
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 2.6532376
Coefficient of variation (CV): 1.5096767
Kurtosis: 89.97377
Mean: 1.7574873
Median Absolute Deviation (MAD): 0
Skewness: 7.6121219
Sum: 13849
Variance: 7.0396698
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=37)
Common values

Value | Count | Frequency (%)
1 | 6241 | 79.2%
2 | 674 | 8.6%
3 | 314 | 4.0%
4 | 175 | 2.2%
5 | 105 | 1.3%
6 | 82 | 1.0%
7 | 52 | 0.7%
8 | 44 | 0.6%
9 | 37 | 0.5%
11 | 24 | 0.3%
Other values (27) | 132 | 1.7%

Minimum 10 values

Value | Count | Frequency (%)
1 | 6241 | 79.2%
2 | 674 | 8.6%
3 | 314 | 4.0%
4 | 175 | 2.2%
5 | 105 | 1.3%
6 | 82 | 1.0%
7 | 52 | 0.7%
8 | 44 | 0.6%
9 | 37 | 0.5%
10 | 24 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
64 | 1 | < 0.1%
40 | 1 | < 0.1%
37 | 1 | < 0.1%
36 | 1 | < 0.1%
35 | 2 | < 0.1%
34 | 1 | < 0.1%
32 | 1 | < 0.1%
31 | 2 | < 0.1%
29 | 4 | 0.1%
28 | 2 | < 0.1%

대표제품
Text

Distinct: 444
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 61.7 KiB

Length

Max length: 10
Median length: 8
Mean length: 3.190736
Min length: 1

Characters and Unicode

Total characters: 25143
Distinct characters: 319
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 191
Unique (%): 2.4%

Sample

1st row: 전문 소매
2nd row: 이불
3rd row: 점술
4th row: 수산물
5th row: 골동품

Value | Count | Frequency (%)
채소 | 545 | 5.9%
수산물 | 452 | 4.9%
여성의류 | 448 | 4.9%
과실 | 427 | 4.6%
육류 | 317 | 3.5%
건어물 | 297 | 3.2%
젓갈 | 296 | 3.2%
한식 | 260 | 2.8%
미용 | 255 | 2.8%
의복 | 244 | 2.7%
Other values (437) | 5646 | 61.5%

Most occurring characters

ValueCountFrequency (%)
1307
 
5.2%
1192
 
4.7%
1004
 
4.0%
887
 
3.5%
, 848
 
3.4%
829
 
3.3%
771
 
3.1%
767
 
3.1%
619
 
2.5%
593
 
2.4%
Other values (309) 16326
64.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22832
90.8%
Space Separator 1307
 
5.2%
Other Punctuation 848
 
3.4%
Open Punctuation 78
 
0.3%
Close Punctuation 78
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1192
 
5.2%
1004
 
4.4%
887
 
3.9%
829
 
3.6%
771
 
3.4%
767
 
3.4%
619
 
2.7%
593
 
2.6%
546
 
2.4%
503
 
2.2%
Other values (305) 15121
66.2%
Space Separator
ValueCountFrequency (%)
1307
100.0%
Other Punctuation
ValueCountFrequency (%)
, 848
100.0%
Open Punctuation
ValueCountFrequency (%)
( 78
100.0%
Close Punctuation
ValueCountFrequency (%)
) 78
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22832
90.8%
Common 2311
 
9.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1192
 
5.2%
1004
 
4.4%
887
 
3.9%
829
 
3.6%
771
 
3.4%
767
 
3.4%
619
 
2.7%
593
 
2.6%
546
 
2.4%
503
 
2.2%
Other values (305) 15121
66.2%
Common
ValueCountFrequency (%)
1307
56.6%
, 848
36.7%
( 78
 
3.4%
) 78
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22832
90.8%
ASCII 2311
 
9.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1307
56.6%
, 848
36.7%
( 78
 
3.4%
) 78
 
3.4%
Hangul
ValueCountFrequency (%)
1192
 
5.2%
1004
 
4.4%
887
 
3.9%
829
 
3.6%
771
 
3.4%
767
 
3.4%
619
 
2.7%
593
 
2.6%
546
 
2.4%
503
 
2.2%
Other values (305) 15121
66.2%

Interactions

[Figures: 16 interaction plots between the numeric variables]

Correlations

Variable | 순번 | 시장 순번 | 시군명 | 종사자 수 | 매출 등급
순번 | 1.000 | 0.022 | 0.061 | 0.000 | 0.000
시장 순번 | 0.022 | 1.000 | 0.968 | 0.134 | 0.080
시군명 | 0.061 | 0.968 | 1.000 | 0.197 | 0.075
종사자 수 | 0.000 | 0.134 | 0.197 | 1.000 | 0.329
매출 등급 | 0.000 | 0.080 | 0.075 | 0.329 | 1.000
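
The high-correlation alerts can be related to this matrix by scanning its upper triangle for coefficients at or above a cutoff. The sketch below is illustrative only: the 0.5 threshold is an assumption, not a value stated in the report, and the names `THRESHOLD` and `high_pairs` are hypothetical.

```python
# Correlation matrix copied from the table above.
names = ["순번", "시장 순번", "시군명", "종사자 수", "매출 등급"]
matrix = [
    [1.000, 0.022, 0.061, 0.000, 0.000],
    [0.022, 1.000, 0.968, 0.134, 0.080],
    [0.061, 0.968, 1.000, 0.197, 0.075],
    [0.000, 0.134, 0.197, 1.000, 0.329],
    [0.000, 0.080, 0.075, 0.329, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff for flagging a pair as highly correlated

# Visit each unordered pair once by walking the upper triangle.
high_pairs = [
    (names[i], names[j], matrix[i][j])
    for i in range(len(names))
    for j in range(i + 1, len(names))
    if abs(matrix[i][j]) >= THRESHOLD
]

print(high_pairs)
```

Under that assumed threshold the only flagged pair is 시장 순번 and 시군명 (0.968), the same pair named in the Alerts section.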

Variable | 순번 | 시장 순번 | 종사자 수 | 매출 등급 | 시군명
순번 | 1.000 | -0.006 | 0.002 | -0.009 | 0.022
시장 순번 | -0.006 | 1.000 | 0.019 | -0.011 | 0.826
종사자 수 | 0.002 | 0.019 | 1.000 | 0.340 | 0.077
매출 등급 | -0.009 | -0.011 | 0.340 | 1.000 | 0.031
시군명 | 0.022 | 0.826 | 0.077 | 0.031 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번시장 순번시장명상호명시군명도로명 주소종사자 수매출 등급대표제품
07126풍기선비골인삼시장풍기****영주시영주시 소백로 215622전문 소매
1895중앙신시장중앙***안동시안동시 중앙시장4길 26-2411이불
216129영천공설시장최옥******영천시영천시 시장4길 3811점술
31823죽도시장수진**포항시 북구포항시 북구 죽도시장13길 1321수산물
43376북정로중심상가시장라이****경주시경주시 원효로 127-411골동품
534106구미산업유통단지D.*****구미시구미시 3공단1로 302-712장비
65023죽도시장형제**포항시 북구포항시 북구 죽도시장13길 14-122수산물
76976북정로중심상가시장하나********경주시경주시 동성로 14121여성의류
813523죽도시장동대***포항시 북구포항시 북구 죽도시장길 31316수산물
9215106구미산업유통단지(주)정안****구미시구미시 3공단1로 302-7620토공
순번시장 순번시장명상호명시군명도로명 주소종사자 수매출 등급대표제품
787020410651죽도종합상가(주******포항시 북구포항시 북구 중흥로255번길 1721조명장치
787120413023죽도시장청하**포항시 북구포항시 북구 죽도시장13길 3-234건어물, 젓갈
7872204159138(유)중앙시장대성**문경시문경시 중앙시장1길 6-821주방용품
7873204178199후포시장황지***울진군울진군 울진대게로 2111건강보조식품
7874204180106구미산업유통단지(주)연우***구미시구미시 3공단1로 302-732제어장비
7875204220129영천공설시장시민***영천시영천시 시장3길 511조미료
7876204230107선산봉황시장제*구미시구미시 단계동길 2421의복
787720428883중앙시장한국******김천시김천시 중앙시장3길 511학원
7878204292139중앙시장윤*문경시문경시 중앙로 101-311의복
787920434523죽도시장분식**포항시 북구포항시 북구 죽도시장11길 321국수