Overview

Dataset statistics

Number of variables: 8
Number of observations: 536
Missing cells: 656
Missing cells (%): 15.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 34.7 KiB
Average record size in memory: 66.2 B

Variable types

Numeric: 1
Text: 6
Unsupported: 1

Dataset

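Description: Data on manufacturing companies located in Sacheon-si, Gyeongsangnam-do (company name, telephone number, fax number, address, industry name, products).
Author: Sacheon-si, Gyeongsangnam-do (경상남도 사천시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15034992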

Alerts

전화번호 has 60 (11.2%) missing values (Missing)
팩스번호 has 52 (9.7%) missing values (Missing)
업종명 has 7 (1.3%) missing values (Missing)
Unnamed: 7 has 536 (100.0%) missing values (Missing)
연번 has unique values (Unique)
Unnamed: 7 is an unsupported type; check whether it needs cleaning or further analysis (Unsupported)
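
The headline missing-cell figures can be cross-checked against the per-column counts. The sketch below is only an arithmetic check, not part of the profiling tool: it copies the counts from this report (including the single missing 주소 value listed in that variable's summary) and recomputes the percentages. The helper name `pct` is made up for illustration.

```python
# Missing-value counts per column, copied from this report.
# Columns not listed here (연번, 회사명, 생산품) have no missing values.
N_ROWS = 536
N_COLS = 8
missing_by_column = {
    "전화번호": 60,
    "팩스번호": 52,
    "주소": 1,
    "업종명": 7,
    "Unnamed: 7": 536,
}


def pct(part, whole):
    """Percentage rounded to one decimal place, as displayed in the report."""
    return round(100 * part / whole, 1)


total_missing = sum(missing_by_column.values())
overall_pct = pct(total_missing, N_ROWS * N_COLS)
per_column_pct = {name: pct(n, N_ROWS) for name, n in missing_by_column.items()}

# Share of missing cells once the entirely empty column is left out.
pct_without_empty = pct(
    total_missing - missing_by_column["Unnamed: 7"], N_ROWS * (N_COLS - 1)
)

print(total_missing, overall_pct, per_column_pct, pct_without_empty)
```

Most of the 15.3% comes from the empty `Unnamed: 7` column; without it the share of missing cells is far smaller.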

Reproduction

Analysis started: 2023-12-11 00:09:46.380917
Analysis finished: 2023-12-11 00:09:47.370410
Duration: 0.99 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
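
A report of this kind could be regenerated roughly as follows. This is a minimal sketch, not the exact invocation used here: the function name `build_report`, the file paths, the report title and the default encoding are all assumptions, and the original configuration lives in the `config.json` mentioned above.

```python
def build_report(csv_path, html_path="report.html", encoding="utf-8"):
    """Profile a CSV file and write an HTML report (paths are hypothetical)."""
    # Imported lazily so this module loads even where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Sacheon-si manufacturers")
    report.to_file(html_path)
    return report
```

Calling `build_report("manufacturers.csv")` with a local copy of the dataset would write `report.html` next to the script; the file name is a placeholder.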

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 536
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 268.5
Minimum: 1
Maximum: 536
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 27.75
Q1: 134.75
Median: 268.5
Q3: 402.25
95-th percentile: 509.25
Maximum: 536
Range: 535
Interquartile range (IQR): 267.5

Descriptive statistics

Standard deviation: 154.87414
Coefficient of variation (CV): 0.57681245
Kurtosis: -1.2
Mean: 268.5
Median Absolute Deviation (MAD): 134
Skewness: 0
Sum: 143916
Variance: 23986
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
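
The quantile and descriptive statistics reported for 연번 can be reproduced with the standard library alone. The sketch assumes 연번 is the integer sequence 1..536, which is consistent with the reported minimum, maximum, distinct count and sum. It uses linearly interpolated ("inclusive") quantiles and the sample variance with an n - 1 denominator; both choices are inferred from the reported values, not taken from the tool's documentation.

```python
import math
import statistics

values = list(range(1, 537))  # assumed content of 연번: 1..536 with no gaps

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

# Linearly interpolated quantiles, endpoints included.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std, 5), round(cv, 8))
print(p5, q1, median, q3, p95, q3 - q1, mad)
```

For a run of consecutive integers 1..n the sample variance equals n(n + 1)/12, which gives 536 × 537 / 12 = 23986 here.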

Common values

Value | Count | Frequency (%)
1 | 1 | 0.2%
354 | 1 | 0.2%
368 | 1 | 0.2%
367 | 1 | 0.2%
366 | 1 | 0.2%
365 | 1 | 0.2%
364 | 1 | 0.2%
363 | 1 | 0.2%
362 | 1 | 0.2%
361 | 1 | 0.2%
Other values (526) | 526 | 98.1%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Maximum 10 values

Value | Count | Frequency (%)
536 | 1 | 0.2%
535 | 1 | 0.2%
534 | 1 | 0.2%
533 | 1 | 0.2%
532 | 1 | 0.2%
531 | 1 | 0.2%
530 | 1 | 0.2%
529 | 1 | 0.2%
528 | 1 | 0.2%
527 | 1 | 0.2%

회사명
Text

Distinct: 510
Distinct (%): 95.1%
Missing: 0
Missing (%): 0.0%
Memory size: 4.3 KiB

Length

Max length: 23
Median length: 19
Mean length: 6.9291045
Min length: 2

Characters and Unicode

Total characters: 3714
Distinct characters: 338
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 486
Unique (%): 90.7%

Sample

1st row: (유)한국금속
2nd row: (주 율곡 사천공장
3rd row: (주) 세화 정밀
4th row: (주)경남
5th row: (주)경신금속

Value | Count | Frequency (%)
주식회사 | 35 | 5.7%
사천공장 | 5 | 0.8%
농업회사법인 | 4 | 0.7%
2공장 | 4 | 0.7%
동신금속(주 | 3 | 0.5%
코리아 | 3 | 0.5%
유한책임회사 | 3 | 0.5%
주)에어로코텍 | 3 | 0.5%
한국항공우주산업(주 | 3 | 0.5%
두원중공업(주 | 3 | 0.5%
Other values (517) | 547 | 89.2%

Most occurring characters

ValueCountFrequency (%)
297
 
8.0%
) 257
 
6.9%
( 256
 
6.9%
121
 
3.3%
116
 
3.1%
88
 
2.4%
85
 
2.3%
77
 
2.1%
71
 
1.9%
70
 
1.9%
Other values (328) 2276
61.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3063
82.5%
Close Punctuation 258
 
6.9%
Open Punctuation 257
 
6.9%
Space Separator 77
 
2.1%
Uppercase Letter 41
 
1.1%
Other Punctuation 9
 
0.2%
Decimal Number 8
 
0.2%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
297
 
9.7%
121
 
4.0%
116
 
3.8%
88
 
2.9%
85
 
2.8%
71
 
2.3%
70
 
2.3%
60
 
2.0%
55
 
1.8%
53
 
1.7%
Other values (299) 2047
66.8%
Uppercase Letter
ValueCountFrequency (%)
G 7
17.1%
C 4
9.8%
T 3
 
7.3%
E 3
 
7.3%
M 3
 
7.3%
S 3
 
7.3%
H 3
 
7.3%
N 3
 
7.3%
F 2
 
4.9%
I 2
 
4.9%
Other values (8) 8
19.5%
Decimal Number
ValueCountFrequency (%)
2 5
62.5%
1 2
 
25.0%
3 1
 
12.5%
Close Punctuation
ValueCountFrequency (%)
) 257
99.6%
] 1
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 256
99.6%
[ 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 7
77.8%
& 2
 
22.2%
Space Separator
ValueCountFrequency (%)
77
100.0%
Lowercase Letter
ValueCountFrequency (%)
i 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3063
82.5%
Common 609
 
16.4%
Latin 42
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
297
 
9.7%
121
 
4.0%
116
 
3.8%
88
 
2.9%
85
 
2.8%
71
 
2.3%
70
 
2.3%
60
 
2.0%
55
 
1.8%
53
 
1.7%
Other values (299) 2047
66.8%
Latin
ValueCountFrequency (%)
G 7
16.7%
C 4
9.5%
T 3
 
7.1%
E 3
 
7.1%
M 3
 
7.1%
S 3
 
7.1%
H 3
 
7.1%
N 3
 
7.1%
F 2
 
4.8%
I 2
 
4.8%
Other values (9) 9
21.4%
Common
ValueCountFrequency (%)
) 257
42.2%
( 256
42.0%
77
 
12.6%
. 7
 
1.1%
2 5
 
0.8%
& 2
 
0.3%
1 2
 
0.3%
[ 1
 
0.2%
] 1
 
0.2%
3 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3063
82.5%
ASCII 651
 
17.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
297
 
9.7%
121
 
4.0%
116
 
3.8%
88
 
2.9%
85
 
2.8%
71
 
2.3%
70
 
2.3%
60
 
2.0%
55
 
1.8%
53
 
1.7%
Other values (299) 2047
66.8%
ASCII
ValueCountFrequency (%)
) 257
39.5%
( 256
39.3%
77
 
11.8%
G 7
 
1.1%
. 7
 
1.1%
2 5
 
0.8%
C 4
 
0.6%
T 3
 
0.5%
E 3
 
0.5%
M 3
 
0.5%
Other values (19) 29
 
4.5%

전화번호
Text

MISSING 

Distinct: 430
Distinct (%): 90.3%
Missing: 60
Missing (%): 11.2%
Memory size: 4.3 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.031513
Min length: 11

Characters and Unicode

Total characters: 5727
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 388
Unique (%): 81.5%

Sample

1st row: 055-853-9494
2nd row: 055-275-2911
3rd row: 055-852-1322
4th row: 055-854-7800
5th row: 055-852-7223

Value | Count | Frequency (%)
055-854-7657 | 3 | 0.6%
055-852-9695 | 3 | 0.6%
055-852-1322 | 3 | 0.6%
055-833-2859 | 3 | 0.6%
055-852-6102 | 2 | 0.4%
055-851-6178 | 2 | 0.4%
055-852-4055 | 2 | 0.4%
055-850-6220 | 2 | 0.4%
061-727-8058 | 2 | 0.4%
055-852-6596 | 2 | 0.4%
Other values (420) | 452 | 95.0%

Most occurring characters

ValueCountFrequency (%)
5 1513
26.4%
- 952
16.6%
0 820
14.3%
8 622
10.9%
3 414
 
7.2%
2 311
 
5.4%
4 291
 
5.1%
7 238
 
4.2%
9 200
 
3.5%
1 199
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4775
83.4%
Dash Punctuation 952
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 1513
31.7%
0 820
17.2%
8 622
13.0%
3 414
 
8.7%
2 311
 
6.5%
4 291
 
6.1%
7 238
 
5.0%
9 200
 
4.2%
1 199
 
4.2%
6 167
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 952
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5727
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 1513
26.4%
- 952
16.6%
0 820
14.3%
8 622
10.9%
3 414
 
7.2%
2 311
 
5.4%
4 291
 
5.1%
7 238
 
4.2%
9 200
 
3.5%
1 199
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5727
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 1513
26.4%
- 952
16.6%
0 820
14.3%
8 622
10.9%
3 414
 
7.2%
2 311
 
5.4%
4 291
 
5.1%
7 238
 
4.2%
9 200
 
3.5%
1 199
 
3.5%

팩스번호
Text

MISSING 

Distinct: 427
Distinct (%): 88.2%
Missing: 52
Missing (%): 9.7%
Memory size: 4.3 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.892562
Min length: 2

Characters and Unicode

Total characters: 5756
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 382
Unique (%): 78.9%

Sample

1st row: 055-852-7355
2nd row: 055-275-2921
3rd row: 055-853-0978
4th row: 055-854-8300
5th row: 055-853-2782

Value | Count | Frequency (%)
055-834-8356 | 5 | 1.0%
055 | 5 | 1.0%
055-854-1357 | 3 | 0.6%
055-854-7565 | 3 | 0.6%
055-762-4230 | 3 | 0.6%
055-851-2011 | 3 | 0.6%
055-853-0978 | 3 | 0.6%
055-851-1004 | 3 | 0.6%
055-852-6103 | 2 | 0.4%
055-852-6438 | 2 | 0.4%
Other values (417) | 452 | 93.4%
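
The frequency table for 팩스번호 includes the bare area code `055` five times, and the minimum length of 2 shows that some entries are not complete numbers. A simple format check can flag such values before analysis. The pattern below is an assumption about what a well-formed number looks like (area code, exchange and line number separated by hyphens), and the helper name is hypothetical; the sample values are taken from this report.

```python
import re

# Assumed shape: 2-3 digit area code, 3-4 digit exchange, 4-digit line number.
PHONE_RE = re.compile(r"0\d{1,2}-\d{3,4}-\d{4}")


def is_well_formed(number):
    """Return True when the value matches the assumed hyphenated layout."""
    return PHONE_RE.fullmatch(number.strip()) is not None


# Values that appear in this report.
samples = ["055-852-7355", "055", "055-8544140", "061-727-8058"]
flagged = [s for s in samples if not is_well_formed(s)]
print(flagged)
```

Flagged values would still need a manual decision: a bare area code is probably best treated as missing, while `055-8544140` looks like a number that only lacks a hyphen.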

Most occurring characters

ValueCountFrequency (%)
5 1550
26.9%
- 948
16.5%
0 771
13.4%
8 641
11.1%
3 432
 
7.5%
4 309
 
5.4%
2 291
 
5.1%
9 222
 
3.9%
7 213
 
3.7%
1 197
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4808
83.5%
Dash Punctuation 948
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 1550
32.2%
0 771
16.0%
8 641
13.3%
3 432
 
9.0%
4 309
 
6.4%
2 291
 
6.1%
9 222
 
4.6%
7 213
 
4.4%
1 197
 
4.1%
6 182
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 948
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5756
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 1550
26.9%
- 948
16.5%
0 771
13.4%
8 641
11.1%
3 432
 
7.5%
4 309
 
5.4%
2 291
 
5.1%
9 222
 
3.9%
7 213
 
3.7%
1 197
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5756
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 1550
26.9%
- 948
16.5%
0 771
13.4%
8 641
11.1%
3 432
 
7.5%
4 309
 
5.4%
2 291
 
5.1%
9 222
 
3.9%
7 213
 
3.7%
1 197
 
3.4%

주소
Text

Distinct: 510
Distinct (%): 95.3%
Missing: 1
Missing (%): 0.2%
Memory size: 4.3 KiB

Length

Max length: 33
Median length: 30
Mean length: 22.11215
Min length: 16

Characters and Unicode

Total characters: 11830
Distinct characters: 171
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 486
Unique (%): 90.8%

Sample

1st row: 경상남도 사천시 사남면 공단1로 23-10
2nd row: 경상남도 사천시 용현면 종포산단1길 33
3rd row: 경상남도 사천시 사천읍 두량공단로 27
4th row: 경상남도 사천시 축동면 구호리 산 13-1
5th row: 경상남도 사천시 축동면 두량로 102

Value | Count | Frequency (%)
경상남도 | 535 | 20.0%
사천시 | 535 | 20.0%
사남면 | 114 | 4.3%
축동면 | 111 | 4.2%
사천읍 | 68 | 2.5%
공단1로 | 37 | 1.4%
송포동 | 31 | 1.2%
서삼로 | 30 | 1.1%
향촌동 | 29 | 1.1%
용현면 | 27 | 1.0%
Other values (594) | 1157 | 43.3%

Most occurring characters

ValueCountFrequency (%)
2257
19.1%
739
 
6.2%
657
 
5.6%
631
 
5.3%
546
 
4.6%
541
 
4.6%
538
 
4.5%
536
 
4.5%
1 384
 
3.2%
319
 
2.7%
Other values (161) 4682
39.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7241
61.2%
Space Separator 2257
 
19.1%
Decimal Number 1834
 
15.5%
Dash Punctuation 220
 
1.9%
Close Punctuation 129
 
1.1%
Open Punctuation 129
 
1.1%
Other Punctuation 18
 
0.2%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
739
 
10.2%
657
 
9.1%
631
 
8.7%
546
 
7.5%
541
 
7.5%
538
 
7.4%
536
 
7.4%
319
 
4.4%
303
 
4.2%
244
 
3.4%
Other values (144) 2187
30.2%
Decimal Number
ValueCountFrequency (%)
1 384
20.9%
2 261
14.2%
3 217
11.8%
4 195
10.6%
5 167
9.1%
6 152
 
8.3%
7 135
 
7.4%
0 123
 
6.7%
9 105
 
5.7%
8 95
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
B 1
50.0%
L 1
50.0%
Space Separator
ValueCountFrequency (%)
2257
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 220
100.0%
Close Punctuation
ValueCountFrequency (%)
) 129
100.0%
Open Punctuation
ValueCountFrequency (%)
( 129
100.0%
Other Punctuation
ValueCountFrequency (%)
, 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7241
61.2%
Common 4587
38.8%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
739
 
10.2%
657
 
9.1%
631
 
8.7%
546
 
7.5%
541
 
7.5%
538
 
7.4%
536
 
7.4%
319
 
4.4%
303
 
4.2%
244
 
3.4%
Other values (144) 2187
30.2%
Common
ValueCountFrequency (%)
2257
49.2%
1 384
 
8.4%
2 261
 
5.7%
- 220
 
4.8%
3 217
 
4.7%
4 195
 
4.3%
5 167
 
3.6%
6 152
 
3.3%
7 135
 
2.9%
) 129
 
2.8%
Other values (5) 470
 
10.2%
Latin
ValueCountFrequency (%)
B 1
50.0%
L 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7241
61.2%
ASCII 4589
38.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2257
49.2%
1 384
 
8.4%
2 261
 
5.7%
- 220
 
4.8%
3 217
 
4.7%
4 195
 
4.2%
5 167
 
3.6%
6 152
 
3.3%
7 135
 
2.9%
) 129
 
2.8%
Other values (7) 472
 
10.3%
Hangul
ValueCountFrequency (%)
739
 
10.2%
657
 
9.1%
631
 
8.7%
546
 
7.5%
541
 
7.5%
538
 
7.4%
536
 
7.4%
319
 
4.4%
303
 
4.2%
244
 
3.4%
Other values (144) 2187
30.2%

업종명
Text

MISSING 

Distinct: 207
Distinct (%): 39.1%
Missing: 7
Missing (%): 1.3%
Memory size: 4.3 KiB

Length

Max length: 33
Median length: 27
Mean length: 17.153119
Min length: 6

Characters and Unicode

Total characters: 9074
Distinct characters: 223
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 129
Unique (%): 24.4%

Sample

1st row: 자동차용 신품 동력전달장치 제조업 외 2 종
2nd row: 항공기용 부품 제조업 외 1 종
3rd row: 건설 및 채광용 기계장비 제조업
4th row: 레미콘 제조업 외 2 종
5th row: 타이어 및 튜브 제조업
ValueCountFrequency (%)
제조업 422
 
14.3%
256
 
8.7%
246
 
8.4%
208
 
7.1%
수산동물 101
 
3.4%
1 99
 
3.4%
기타 93
 
3.2%
부품 86
 
2.9%
항공기용 56
 
1.9%
가공 44
 
1.5%
Other values (269) 1330
45.2%

Most occurring characters

ValueCountFrequency (%)
2412
26.6%
601
 
6.6%
562
 
6.2%
504
 
5.6%
264
 
2.9%
259
 
2.9%
256
 
2.8%
248
 
2.7%
210
 
2.3%
188
 
2.1%
Other values (213) 3570
39.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6398
70.5%
Space Separator 2412
 
26.6%
Decimal Number 216
 
2.4%
Other Punctuation 44
 
0.5%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
601
 
9.4%
562
 
8.8%
504
 
7.9%
264
 
4.1%
259
 
4.0%
256
 
4.0%
248
 
3.9%
210
 
3.3%
188
 
2.9%
186
 
2.9%
Other values (201) 3120
48.8%
Decimal Number
ValueCountFrequency (%)
1 105
48.6%
2 43
19.9%
3 30
 
13.9%
4 18
 
8.3%
5 8
 
3.7%
7 6
 
2.8%
6 5
 
2.3%
8 1
 
0.5%
Space Separator
ValueCountFrequency (%)
2412
100.0%
Other Punctuation
ValueCountFrequency (%)
, 44
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6398
70.5%
Common 2676
29.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
601
 
9.4%
562
 
8.8%
504
 
7.9%
264
 
4.1%
259
 
4.0%
256
 
4.0%
248
 
3.9%
210
 
3.3%
188
 
2.9%
186
 
2.9%
Other values (201) 3120
48.8%
Common
ValueCountFrequency (%)
2412
90.1%
1 105
 
3.9%
, 44
 
1.6%
2 43
 
1.6%
3 30
 
1.1%
4 18
 
0.7%
5 8
 
0.3%
7 6
 
0.2%
6 5
 
0.2%
( 2
 
0.1%
Other values (2) 3
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6397
70.5%
ASCII 2676
29.5%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2412
90.1%
1 105
 
3.9%
, 44
 
1.6%
2 43
 
1.6%
3 30
 
1.1%
4 18
 
0.7%
5 8
 
0.3%
7 6
 
0.2%
6 5
 
0.2%
( 2
 
0.1%
Other values (2) 3
 
0.1%
Hangul
ValueCountFrequency (%)
601
 
9.4%
562
 
8.8%
504
 
7.9%
264
 
4.1%
259
 
4.0%
256
 
4.0%
248
 
3.9%
210
 
3.3%
188
 
2.9%
186
 
2.9%
Other values (200) 3119
48.8%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

생산품
Text

Distinct: 467
Distinct (%): 87.1%
Missing: 0
Missing (%): 0.0%
Memory size: 4.3 KiB

Length

Max length: 45
Median length: 30
Mean length: 9.0876866
Min length: 1

Characters and Unicode

Total characters: 4871
Distinct characters: 438
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 439
Unique (%): 81.9%

Sample

1st row: 자동차부품(기어,샤프트)
2nd row: 항공기부품
3rd row: 링크
4th row: 레미콘
5th row: 타이어부품
ValueCountFrequency (%)
부품 35
 
3.7%
30
 
3.2%
항공기부품 19
 
2.0%
자동차 15
 
1.6%
14
 
1.5%
오징어 12
 
1.3%
항공기 12
 
1.3%
농기계부품 10
 
1.1%
자동차부품 9
 
1.0%
쥐포 8
 
0.9%
Other values (626) 776
82.6%

Most occurring characters

ValueCountFrequency (%)
407
 
8.4%
, 262
 
5.4%
219
 
4.5%
196
 
4.0%
167
 
3.4%
113
 
2.3%
89
 
1.8%
88
 
1.8%
75
 
1.5%
74
 
1.5%
Other values (428) 3181
65.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3909
80.3%
Space Separator 407
 
8.4%
Other Punctuation 271
 
5.6%
Uppercase Letter 114
 
2.3%
Lowercase Letter 57
 
1.2%
Open Punctuation 46
 
0.9%
Close Punctuation 45
 
0.9%
Decimal Number 20
 
0.4%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
219
 
5.6%
196
 
5.0%
167
 
4.3%
113
 
2.9%
89
 
2.3%
88
 
2.3%
75
 
1.9%
74
 
1.9%
69
 
1.8%
64
 
1.6%
Other values (373) 2755
70.5%
Uppercase Letter
ValueCountFrequency (%)
S 11
 
9.6%
C 9
 
7.9%
T 9
 
7.9%
A 9
 
7.9%
P 9
 
7.9%
E 8
 
7.0%
R 7
 
6.1%
H 6
 
5.3%
B 5
 
4.4%
G 5
 
4.4%
Other values (12) 36
31.6%
Lowercase Letter
ValueCountFrequency (%)
e 9
15.8%
t 8
14.0%
i 6
10.5%
r 6
10.5%
o 4
7.0%
u 4
7.0%
g 4
7.0%
n 4
7.0%
v 3
 
5.3%
f 2
 
3.5%
Other values (6) 7
12.3%
Decimal Number
ValueCountFrequency (%)
7 7
35.0%
2 3
15.0%
5 3
15.0%
3 2
 
10.0%
0 2
 
10.0%
4 1
 
5.0%
1 1
 
5.0%
8 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 262
96.7%
. 5
 
1.8%
/ 2
 
0.7%
· 1
 
0.4%
: 1
 
0.4%
Space Separator
ValueCountFrequency (%)
407
100.0%
Open Punctuation
ValueCountFrequency (%)
( 46
100.0%
Close Punctuation
ValueCountFrequency (%)
) 45
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3909
80.3%
Common 791
 
16.2%
Latin 171
 
3.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
219
 
5.6%
196
 
5.0%
167
 
4.3%
113
 
2.9%
89
 
2.3%
88
 
2.3%
75
 
1.9%
74
 
1.9%
69
 
1.8%
64
 
1.6%
Other values (373) 2755
70.5%
Latin
ValueCountFrequency (%)
S 11
 
6.4%
e 9
 
5.3%
C 9
 
5.3%
T 9
 
5.3%
A 9
 
5.3%
P 9
 
5.3%
t 8
 
4.7%
E 8
 
4.7%
R 7
 
4.1%
i 6
 
3.5%
Other values (28) 86
50.3%
Common
ValueCountFrequency (%)
407
51.5%
, 262
33.1%
( 46
 
5.8%
) 45
 
5.7%
7 7
 
0.9%
. 5
 
0.6%
2 3
 
0.4%
5 3
 
0.4%
3 2
 
0.3%
0 2
 
0.3%
Other values (7) 9
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3909
80.3%
ASCII 961
 
19.7%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
407
42.4%
, 262
27.3%
( 46
 
4.8%
) 45
 
4.7%
S 11
 
1.1%
e 9
 
0.9%
C 9
 
0.9%
T 9
 
0.9%
A 9
 
0.9%
P 9
 
0.9%
Other values (44) 145
 
15.1%
Hangul
ValueCountFrequency (%)
219
 
5.6%
196
 
5.0%
167
 
4.3%
113
 
2.9%
89
 
2.3%
88
 
2.3%
75
 
1.9%
74
 
1.9%
69
 
1.8%
64
 
1.6%
Other values (373) 2755
70.5%
None
ValueCountFrequency (%)
· 1
100.0%

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 536
Missing (%): 100.0%
Memory size: 4.8 KiB
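
pandas assigns names of the form `Unnamed: N` to columns whose header cell is blank, so an entirely empty `Unnamed: 7` usually points to a stray trailing delimiter in the source file. That cause is an assumption here; the report does not confirm it. The sketch below shows one way to drop columns that contain no data before profiling, using only the `csv` module. The helper name and the three-line excerpt are hypothetical.

```python
import csv
import io


def drop_empty_columns(rows):
    """Remove columns whose every data cell is empty; rows[0] is the header."""
    header, data = rows[0], rows[1:]
    keep = [
        i
        for i in range(len(header))
        if any(i < len(row) and row[i].strip() for row in data)
    ]
    return [[row[i] if i < len(row) else "" for i in keep] for row in rows]


# Hypothetical excerpt in which every line ends with a trailing comma.
raw = "연번,회사명,\n1,(유)한국금속,\n2,(주)경남,\n"
rows = list(csv.reader(io.StringIO(raw)))
cleaned = drop_empty_columns(rows)
print(cleaned)
```

Dropping the column removes the Unsupported alert and keeps the overall missing-cell percentage from being dominated by a column that carries no information.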

Interactions


Missing values

Count: a simple visualization of nullity by column.
Matrix: the nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
Heatmap: the correlation heatmap measures nullity correlation, that is, how strongly the presence or absence of one variable affects the presence of another.
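
Nullity correlation can be understood as the Pearson correlation between the "is missing" indicators of two columns. The sketch below computes it by hand for two toy columns. The data and the function name are hypothetical, and this illustrates the idea only; it is not the plotting library's own implementation.

```python
import math


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the is-missing indicators of two columns."""
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # Undefined when a column is fully present or fully missing.
        return float("nan")
    return cov / math.sqrt(var_a * var_b)


# Hypothetical toy columns; None marks a missing value.
phone = ["055-1", None, "055-2", None]
fax = ["055-3", None, "055-4", "055-5"]
complete = ["a", "b", "c", "d"]

r = nullity_correlation(phone, fax)
print(round(r, 4))
```

A column that is always present or always missing has zero variance in its indicator, so its nullity correlation with any other column is undefined. This applies to the fully empty `Unnamed: 7` as well as to the complete columns.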

Sample

First rows

(index) | 연번 | 회사명 | 전화번호 | 팩스번호 | 주소 | 업종명 | 생산품 | Unnamed: 7
0 | 1 | (유)한국금속 | 055-853-9494 | 055-852-7355 | 경상남도 사천시 사남면 공단1로 23-10 | 자동차용 신품 동력전달장치 제조업 외 2 종 | 자동차부품(기어,샤프트) | <NA>
1 | 2 | (주 율곡 사천공장 | 055-275-2911 | 055-275-2921 | 경상남도 사천시 용현면 종포산단1길 33 | 항공기용 부품 제조업 외 1 종 | 항공기부품 | <NA>
2 | 3 | (주) 세화 정밀 | 055-852-1322 | 055-853-0978 | 경상남도 사천시 사천읍 두량공단로 27 | 건설 및 채광용 기계장비 제조업 | 링크 | <NA>
3 | 4 | (주)경남 | 055-854-7800 | 055-854-8300 | 경상남도 사천시 축동면 구호리 산 13-1 | 레미콘 제조업 외 2 종 | 레미콘 | <NA>
4 | 5 | (주)경신금속 | 055-852-7223 | 055-853-2782 | 경상남도 사천시 축동면 두량로 102 | 타이어 및 튜브 제조업 | 타이어부품 | <NA>
5 | 6 | (주)고센산업 | 055-855-1383 | 055-855-1385 | 경상남도 사천시 곤양면 흥신로 51-10 | 기타 목재가구 제조업 | 가정용가구,사무기기 | <NA>
6 | 7 | (주)금민산업 | 055-855-8611 | 055-855-8612 | 경상남도 사천시 사천읍 장전리 897-1 | 강관 제조업 외 3 종 | 산업용강관, 금속구조물 | <NA>
7 | 8 | (주)금선수산 | 055-832-3216 | 055-832-6829 | 경상남도 사천시 유람선길 42-40 (대방동) | 수산동물 냉동품 제조업 | 어패류가공 | <NA>
8 | 9 | (주)금화중공업 | 055-835-7008 | 055-835-9008 | 경상남도 사천시 대방길 68 (대방동) | 선박 구성 부분품 제조업 | 선박구성부분품 | <NA>
9 | 10 | (주)길보사료산업 | 055-834-3030 | 055-834-3033 | 경상남도 사천시 거북등길 48-11 (송포동) | 배합 사료 제조업 외 1 종 | 어분,여유 사료 | <NA>

Last rows

(index) | 연번 | 회사명 | 전화번호 | 팩스번호 | 주소 | 업종명 | 생산품 | Unnamed: 7
526 | 527 | 인터내셔널돔하우스(주) | 055-853-9696 | 055-853-9697 | 경상남도 사천시 사남면 유천리 892-1 | 기타 건축용 플라스틱 조립제품 제조업 | 건축용 발포스티렌(EPS) | <NA>
527 | 528 | 장안항공산업 | 055-852-4055 | 055-852-4015 | 경상남도 사천시 사남면 방지로 96 | 항공기용 부품 제조업 | 항공기부품(B737,B747기체부품 | <NA>
528 | 529 | 제이에스테크(주) | 055-853-5081 | 055-853-5086 | 경상남도 사천시 사남면 외국기업로 158 | 비금속광물 분쇄물 생산업 외 2 종 | 분체수탁가공업 | <NA>
529 | 530 | 청우중공업(주) | 055-584-0404 | 055-584-0550 | 경상남도 사천시 사남면 공단2로 220 | 육상 금속 골조 구조재 제조업 | 강구조물제조업 | <NA>
530 | 531 | 켄코아에어로스페이스(주) | 055-855-4130 | 055-8544140 | 경상남도 사천시 사남면 외국기업로 152-44 | 항공기용 부품 제조업 | 항공기용 부품 | <NA>
531 | 532 | 하이즈항공(주) | 055-853-8800 | 055-853-8801 | 경상남도 사천시 사남면 방지리 675 | 항공기용 부품 제조업 | 항공기 부품,기계부품가공 | <NA>
532 | 533 | 한국경남태양유전(주) | 055-851-5515 | 055-851-5500 | 경상남도 사천시 사남면 외국기업로 82 | 그 외 기타 전자부품 제조업 외 1 종 | 적층콘덴샤 | <NA>
533 | 534 | 한국앰코스페셜티카톤즈 유한회사 | 055-851-0100 | 055-851-0117 | 경상남도 사천시 사남면 외국기업로 152-52 | 기타 인쇄업 외 1 종 | 담배포장케이스, 소비제품 포장지 | <NA>
534 | 535 | 한국항공우주산업(주) | 055-851-1000 | 055-851-1004 | 경상남도 사천시 사남면 공단1로 78 | 유인항공기, 항공우주선 및 보조장치 제조업 외 3종 | 항공기,우주선 및 부품 | <NA>
535 | 536 | 한국항공우주산업(주) | 055-851-6178 | 055-851-1004 | 경상남도 사천시 사남면 유천리 890 | 유인항공기, 항공우주선 및 보조장치 제조업 외 3종 | 항공기,우주선 및 부품 | <NA>