Overview

Dataset statistics

Number of variables8
Number of observations7982
Missing cells1044
Missing cells (%)1.6%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory514.6 KiB
Average record size in memory66.0 B

Variable types

Text5
Categorical1
Numeric2

Dataset

Description김해시 제조업체 현황에 대한 데이터로 회사명,대표자명,전화번호,읍면동,주소,위도,경도,주력생산품 항목을 제공합니다
URLhttps://www.data.go.kr/data/15033447/fileData.do

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
위도 is highly overall correlated with 읍면동High correlation
읍면동 is highly overall correlated with 위도High correlation
전화번호 has 1044 (13.1%) missing valuesMissing

Reproduction

Analysis started2023-12-12 00:48:24.160007
Analysis finished2023-12-12 00:48:26.661992
Duration2.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct7470
Distinct (%)93.6%
Missing0
Missing (%)0.0%
Memory size62.5 KiB
2023-12-12T09:48:26.949613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length35
Mean length6.1619895
Min length2

Characters and Unicode

Total characters49185
Distinct characters697
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7149 ?
Unique (%)89.6%

Sample

1st rowTKG태광㈜
2nd row(주)넥센
3rd row(주)대흥알앤티
4th row하이에어코리아㈜
5th row(주)유니크
ValueCountFrequency (%)
주식회사 466
 
5.3%
23
 
0.3%
2공장 13
 
0.1%
김해공장 13
 
0.1%
tech 12
 
0.1%
김해지점 11
 
0.1%
테크 8
 
0.1%
대진산업 8
 
0.1%
영진산업 8
 
0.1%
대성산업 8
 
0.1%
Other values (7556) 8204
93.5%
2023-12-12T09:48:27.546745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2809
 
5.7%
) 1981
 
4.0%
( 1970
 
4.0%
1384
 
2.8%
1357
 
2.8%
1242
 
2.5%
1173
 
2.4%
1145
 
2.3%
1108
 
2.3%
987
 
2.0%
Other values (687) 34029
69.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41342
84.1%
Close Punctuation 2036
 
4.1%
Uppercase Letter 2032
 
4.1%
Open Punctuation 2029
 
4.1%
Space Separator 856
 
1.7%
Lowercase Letter 327
 
0.7%
Other Symbol 276
 
0.6%
Other Punctuation 157
 
0.3%
Decimal Number 120
 
0.2%
Dash Punctuation 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2809
 
6.8%
1384
 
3.3%
1357
 
3.3%
1242
 
3.0%
1173
 
2.8%
1145
 
2.8%
1108
 
2.7%
987
 
2.4%
968
 
2.3%
938
 
2.3%
Other values (616) 28231
68.3%
Uppercase Letter
ValueCountFrequency (%)
E 215
 
10.6%
S 197
 
9.7%
C 171
 
8.4%
N 171
 
8.4%
T 158
 
7.8%
G 132
 
6.5%
M 107
 
5.3%
H 103
 
5.1%
J 86
 
4.2%
D 79
 
3.9%
Other values (16) 613
30.2%
Lowercase Letter
ValueCountFrequency (%)
e 41
12.5%
t 34
10.4%
o 34
10.4%
d 24
 
7.3%
c 24
 
7.3%
n 20
 
6.1%
l 20
 
6.1%
r 18
 
5.5%
s 17
 
5.2%
a 17
 
5.2%
Other values (12) 78
23.9%
Decimal Number
ValueCountFrequency (%)
2 69
57.5%
1 21
 
17.5%
3 13
 
10.8%
0 6
 
5.0%
5 5
 
4.2%
4 3
 
2.5%
6 1
 
0.8%
9 1
 
0.8%
1
 
0.8%
Other Punctuation
ValueCountFrequency (%)
. 98
62.4%
& 37
 
23.6%
, 13
 
8.3%
5
 
3.2%
/ 3
 
1.9%
1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 1981
97.3%
55
 
2.7%
Open Punctuation
ValueCountFrequency (%)
( 1970
97.1%
59
 
2.9%
Space Separator
ValueCountFrequency (%)
855
99.9%
  1
 
0.1%
Other Symbol
ValueCountFrequency (%)
276
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41615
84.6%
Common 5208
 
10.6%
Latin 2359
 
4.8%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2809
 
6.7%
1384
 
3.3%
1357
 
3.3%
1242
 
3.0%
1173
 
2.8%
1145
 
2.8%
1108
 
2.7%
987
 
2.4%
968
 
2.3%
938
 
2.3%
Other values (614) 28504
68.5%
Latin
ValueCountFrequency (%)
E 215
 
9.1%
S 197
 
8.4%
C 171
 
7.2%
N 171
 
7.2%
T 158
 
6.7%
G 132
 
5.6%
M 107
 
4.5%
H 103
 
4.4%
J 86
 
3.6%
D 79
 
3.3%
Other values (38) 940
39.8%
Common
ValueCountFrequency (%)
) 1981
38.0%
( 1970
37.8%
855
16.4%
. 98
 
1.9%
2 69
 
1.3%
59
 
1.1%
55
 
1.1%
& 37
 
0.7%
1 21
 
0.4%
3 13
 
0.2%
Other values (12) 50
 
1.0%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41339
84.0%
ASCII 7445
 
15.1%
None 398
 
0.8%
CJK 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2809
 
6.8%
1384
 
3.3%
1357
 
3.3%
1242
 
3.0%
1173
 
2.8%
1145
 
2.8%
1108
 
2.7%
987
 
2.4%
968
 
2.3%
938
 
2.3%
Other values (613) 28228
68.3%
ASCII
ValueCountFrequency (%)
) 1981
26.6%
( 1970
26.5%
855
11.5%
E 215
 
2.9%
S 197
 
2.6%
C 171
 
2.3%
N 171
 
2.3%
T 158
 
2.1%
G 132
 
1.8%
M 107
 
1.4%
Other values (54) 1488
20.0%
None
ValueCountFrequency (%)
276
69.3%
59
 
14.8%
55
 
13.8%
5
 
1.3%
1
 
0.3%
1
 
0.3%
  1
 
0.3%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Distinct6664
Distinct (%)83.5%
Missing0
Missing (%)0.0%
Memory size62.5 KiB
2023-12-12T09:48:28.012694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length3
Mean length3.0538712
Min length2

Characters and Unicode

Total characters24376
Distinct characters338
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5748 ?
Unique (%)72.0%

Sample

1st row박주환
2nd row강호찬
3rd row송영수
4th row김근배
5th row안영구 외 1명
ValueCountFrequency (%)
김성수 12
 
0.1%
김경호 8
 
0.1%
김태형 8
 
0.1%
김종철 8
 
0.1%
김미정 7
 
0.1%
이재식 7
 
0.1%
김영호 7
 
0.1%
김영철 7
 
0.1%
김정숙 7
 
0.1%
김태우 7
 
0.1%
Other values (6672) 7943
99.0%
2023-12-12T09:48:28.788365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1768
 
7.3%
1167
 
4.8%
868
 
3.6%
700
 
2.9%
699
 
2.9%
535
 
2.2%
442
 
1.8%
432
 
1.8%
427
 
1.8%
418
 
1.7%
Other values (328) 16920
69.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24133
99.0%
Uppercase Letter 86
 
0.4%
Space Separator 66
 
0.3%
Decimal Number 65
 
0.3%
Other Punctuation 24
 
0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1768
 
7.3%
1167
 
4.8%
868
 
3.6%
700
 
2.9%
699
 
2.9%
535
 
2.2%
442
 
1.8%
432
 
1.8%
427
 
1.8%
418
 
1.7%
Other values (300) 16677
69.1%
Uppercase Letter
ValueCountFrequency (%)
I 13
15.1%
A 9
10.5%
H 7
 
8.1%
N 7
 
8.1%
T 5
 
5.8%
L 5
 
5.8%
R 5
 
5.8%
U 5
 
5.8%
E 5
 
5.8%
O 4
 
4.7%
Other values (11) 21
24.4%
Decimal Number
ValueCountFrequency (%)
1 63
96.9%
2 2
 
3.1%
Other Punctuation
ValueCountFrequency (%)
, 21
87.5%
. 3
 
12.5%
Space Separator
ValueCountFrequency (%)
66
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24132
99.0%
Common 157
 
0.6%
Latin 86
 
0.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1768
 
7.3%
1167
 
4.8%
868
 
3.6%
700
 
2.9%
699
 
2.9%
535
 
2.2%
442
 
1.8%
432
 
1.8%
427
 
1.8%
418
 
1.7%
Other values (299) 16676
69.1%
Latin
ValueCountFrequency (%)
I 13
15.1%
A 9
10.5%
H 7
 
8.1%
N 7
 
8.1%
T 5
 
5.8%
L 5
 
5.8%
R 5
 
5.8%
U 5
 
5.8%
E 5
 
5.8%
O 4
 
4.7%
Other values (11) 21
24.4%
Common
ValueCountFrequency (%)
66
42.0%
1 63
40.1%
, 21
 
13.4%
. 3
 
1.9%
2 2
 
1.3%
( 1
 
0.6%
) 1
 
0.6%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24132
99.0%
ASCII 243
 
1.0%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1768
 
7.3%
1167
 
4.8%
868
 
3.6%
700
 
2.9%
699
 
2.9%
535
 
2.2%
442
 
1.8%
432
 
1.8%
427
 
1.8%
418
 
1.7%
Other values (299) 16676
69.1%
ASCII
ValueCountFrequency (%)
66
27.2%
1 63
25.9%
, 21
 
8.6%
I 13
 
5.3%
A 9
 
3.7%
H 7
 
2.9%
N 7
 
2.9%
T 5
 
2.1%
L 5
 
2.1%
R 5
 
2.1%
Other values (18) 42
17.3%
CJK
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct6182
Distinct (%)89.1%
Missing1044
Missing (%)13.1%
Memory size62.5 KiB
2023-12-12T09:48:29.121093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.018305
Min length8

Characters and Unicode

Total characters83383
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5613 ?
Unique (%)80.9%

Sample

1st row055-333-7151
2nd row055-723-8526
3rd row055-345-6391
4th row055-345-5000
5th row055-340-2000
ValueCountFrequency (%)
055-310-6874 13
 
0.2%
055-322-8301 13
 
0.2%
055-329-4691 11
 
0.2%
055-345-2278 10
 
0.1%
055-724-0795 9
 
0.1%
055-326-4335 8
 
0.1%
055-000-0000 7
 
0.1%
070-4035-1402 7
 
0.1%
055-329-0490 7
 
0.1%
055-312-7013 6
 
0.1%
Other values (6172) 6847
98.7%
2023-12-12T09:48:29.602392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 17410
20.9%
- 13871
16.6%
3 11864
14.2%
0 10945
13.1%
2 5953
 
7.1%
4 5431
 
6.5%
1 4520
 
5.4%
7 3642
 
4.4%
6 3616
 
4.3%
8 3307
 
4.0%
Other values (2) 2824
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 69504
83.4%
Dash Punctuation 13871
 
16.6%
Connector Punctuation 8
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 17410
25.0%
3 11864
17.1%
0 10945
15.7%
2 5953
 
8.6%
4 5431
 
7.8%
1 4520
 
6.5%
7 3642
 
5.2%
6 3616
 
5.2%
8 3307
 
4.8%
9 2816
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 13871
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 83383
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 17410
20.9%
- 13871
16.6%
3 11864
14.2%
0 10945
13.1%
2 5953
 
7.1%
4 5431
 
6.5%
1 4520
 
5.4%
7 3642
 
4.4%
6 3616
 
4.3%
8 3307
 
4.0%
Other values (2) 2824
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 83383
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 17410
20.9%
- 13871
16.6%
3 11864
14.2%
0 10945
13.1%
2 5953
 
7.1%
4 5431
 
6.5%
1 4520
 
5.4%
7 3642
 
4.4%
6 3616
 
4.3%
8 3307
 
4.0%
Other values (2) 2824
 
3.4%

읍면동
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size62.5 KiB
한림면
1575 
주촌면
1338 
진례면
1209 
진영읍
865 
상동면
824 
Other values (12)
2171 

Length

Max length5
Median length3
Mean length3.0513656
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row삼안동
2nd row삼안동
3rd row진례면
4th row진례면
5th row진영읍

Common Values

ValueCountFrequency (%)
한림면 1575
19.7%
주촌면 1338
16.8%
진례면 1209
15.1%
진영읍 865
10.8%
상동면 824
10.3%
생림면 560
 
7.0%
장유동 387
 
4.8%
삼안동 274
 
3.4%
불암동 233
 
2.9%
활천동 223
 
2.8%
Other values (7) 494
 
6.2%

Length

2023-12-12T09:48:29.810399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한림면 1575
19.7%
주촌면 1338
16.8%
진례면 1209
15.1%
진영읍 865
10.8%
상동면 824
10.3%
생림면 560
 
7.0%
장유동 387
 
4.8%
삼안동 274
 
3.4%
불암동 233
 
2.9%
활천동 223
 
2.8%
Other values (7) 494
 
6.2%

주소
Text

Distinct6638
Distinct (%)83.2%
Missing0
Missing (%)0.0%
Memory size62.5 KiB
2023-12-12T09:48:30.151782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length43
Mean length24.379604
Min length14

Characters and Unicode

Total characters194598
Distinct characters266
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5676 ?
Unique (%)71.1%

Sample

1st row경상남도 김해시 김해대로2635번길 26
2nd row경상남도 김해시 김해대로 2595
3rd row경상남도 김해시 진례면 서부로436번길 70-25
4th row경상남도 김해시 진례면 고모로324번길 204 B동
5th row경상남도 김해시 진영읍 서부로179번길 90
ValueCountFrequency (%)
경상남도 7982
20.3%
김해시 7982
20.3%
한림면 1562
 
4.0%
주촌면 1336
 
3.4%
진례면 1208
 
3.1%
진영읍 864
 
2.2%
상동면 821
 
2.1%
생림면 556
 
1.4%
고모로 227
 
0.6%
서부로1499번길 216
 
0.5%
Other values (4161) 16598
42.2%
2023-12-12T09:48:30.752584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32189
 
16.5%
9195
 
4.7%
9167
 
4.7%
9161
 
4.7%
1 8995
 
4.6%
8034
 
4.1%
7998
 
4.1%
7991
 
4.1%
7990
 
4.1%
7811
 
4.0%
Other values (256) 86067
44.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 114655
58.9%
Decimal Number 43229
 
22.2%
Space Separator 32189
 
16.5%
Dash Punctuation 4053
 
2.1%
Uppercase Letter 405
 
0.2%
Close Punctuation 24
 
< 0.1%
Open Punctuation 24
 
< 0.1%
Other Punctuation 12
 
< 0.1%
Lowercase Letter 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9195
 
8.0%
9167
 
8.0%
9161
 
8.0%
8034
 
7.0%
7998
 
7.0%
7991
 
7.0%
7990
 
7.0%
7811
 
6.8%
5610
 
4.9%
5210
 
4.5%
Other values (220) 36488
31.8%
Uppercase Letter
ValueCountFrequency (%)
B 179
44.2%
A 175
43.2%
C 22
 
5.4%
D 8
 
2.0%
T 3
 
0.7%
F 3
 
0.7%
S 3
 
0.7%
G 3
 
0.7%
K 2
 
0.5%
E 2
 
0.5%
Other values (4) 5
 
1.2%
Decimal Number
ValueCountFrequency (%)
1 8995
20.8%
2 5655
13.1%
3 4943
11.4%
4 4118
9.5%
5 3863
8.9%
6 3713
8.6%
9 3425
 
7.9%
7 3240
 
7.5%
0 2871
 
6.6%
8 2406
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 7
58.3%
. 3
25.0%
& 1
 
8.3%
/ 1
 
8.3%
Lowercase Letter
ValueCountFrequency (%)
c 3
42.9%
b 2
28.6%
e 1
 
14.3%
a 1
 
14.3%
Space Separator
ValueCountFrequency (%)
32189
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4053
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 114654
58.9%
Common 79531
40.9%
Latin 412
 
0.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9195
 
8.0%
9167
 
8.0%
9161
 
8.0%
8034
 
7.0%
7998
 
7.0%
7991
 
7.0%
7990
 
7.0%
7811
 
6.8%
5610
 
4.9%
5210
 
4.5%
Other values (219) 36487
31.8%
Common
ValueCountFrequency (%)
32189
40.5%
1 8995
 
11.3%
2 5655
 
7.1%
3 4943
 
6.2%
4 4118
 
5.2%
- 4053
 
5.1%
5 3863
 
4.9%
6 3713
 
4.7%
9 3425
 
4.3%
7 3240
 
4.1%
Other values (8) 5337
 
6.7%
Latin
ValueCountFrequency (%)
B 179
43.4%
A 175
42.5%
C 22
 
5.3%
D 8
 
1.9%
T 3
 
0.7%
F 3
 
0.7%
S 3
 
0.7%
c 3
 
0.7%
G 3
 
0.7%
K 2
 
0.5%
Other values (8) 11
 
2.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 114654
58.9%
ASCII 79943
41.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
32189
40.3%
1 8995
 
11.3%
2 5655
 
7.1%
3 4943
 
6.2%
4 4118
 
5.2%
- 4053
 
5.1%
5 3863
 
4.8%
6 3713
 
4.6%
9 3425
 
4.3%
7 3240
 
4.1%
Other values (26) 5749
 
7.2%
Hangul
ValueCountFrequency (%)
9195
 
8.0%
9167
 
8.0%
9161
 
8.0%
8034
 
7.0%
7998
 
7.0%
7991
 
7.0%
7990
 
7.0%
7811
 
6.8%
5610
 
4.9%
5210
 
4.5%
Other values (219) 36487
31.8%
CJK
ValueCountFrequency (%)
1
100.0%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct6121
Distinct (%)76.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean128.82856
Minimum128.70585
Maximum129.00154
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size70.3 KiB
2023-12-12T09:48:30.993838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum128.70585
5-th percentile128.74522
Q1128.77814
median128.8182
Q3128.87101
95-th percentile128.94424
Maximum129.00154
Range0.2956912
Interquartile range (IQR)0.09287705

Descriptive statistics

Standard deviation0.062016952
Coefficient of variation (CV)0.00048139133
Kurtosis-0.47079583
Mean128.82856
Median Absolute Deviation (MAD)0.0415256
Skewness0.53056932
Sum1028309.6
Variance0.0038461024
MonotonicityNot monotonic
2023-12-12T09:48:31.196933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
128.7843183 20
 
0.3%
128.8322771 17
 
0.2%
128.7701665 13
 
0.2%
128.7699566 13
 
0.2%
128.8293233 13
 
0.2%
128.8247774 12
 
0.2%
128.8213803 12
 
0.2%
128.8457896 12
 
0.2%
128.8361626 11
 
0.1%
128.8327311 11
 
0.1%
Other values (6111) 7848
98.3%
ValueCountFrequency (%)
128.705852 1
 
< 0.1%
128.7068123 1
 
< 0.1%
128.7075517 3
< 0.1%
128.7079088 2
< 0.1%
128.7079925 1
 
< 0.1%
128.7083436 1
 
< 0.1%
128.7089332 2
< 0.1%
128.7089459 1
 
< 0.1%
128.7089919 1
 
< 0.1%
128.7091531 1
 
< 0.1%
ValueCountFrequency (%)
129.0015432 1
< 0.1%
129.000567 2
< 0.1%
129.0002889 1
< 0.1%
128.9986386 1
< 0.1%
128.9983403 1
< 0.1%
128.9981726 1
< 0.1%
128.9980248 1
< 0.1%
128.9975027 1
< 0.1%
128.9973313 2
< 0.1%
128.9973171 1
< 0.1%

경도
Real number (ℝ)

Distinct6126
Distinct (%)76.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.269805
Minimum35.166573
Maximum35.386106
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size70.3 KiB
2023-12-12T09:48:31.464156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum35.166573
5-th percentile35.213761
Q135.2346
median35.27359
Q335.300558
95-th percentile35.325582
Maximum35.386106
Range0.21953341
Interquartile range (IQR)0.06595825

Descriptive statistics

Standard deviation0.03823448
Coefficient of variation (CV)0.001084057
Kurtosis-0.8982292
Mean35.269805
Median Absolute Deviation (MAD)0.032561565
Skewness-0.026665824
Sum281523.58
Variance0.0014618755
MonotonicityNot monotonic
2023-12-12T09:48:31.656152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35.24536206 20
 
0.3%
35.21497041 17
 
0.2%
35.28357474 13
 
0.2%
35.27556897 13
 
0.2%
35.21719916 13
 
0.2%
35.22060152 12
 
0.2%
35.22140888 12
 
0.2%
35.33659192 12
 
0.2%
35.21532785 11
 
0.1%
35.21362754 11
 
0.1%
Other values (6116) 7848
98.3%
ValueCountFrequency (%)
35.16657304 1
< 0.1%
35.16718538 1
< 0.1%
35.16853281 1
< 0.1%
35.16856681 1
< 0.1%
35.16887231 1
< 0.1%
35.16888464 2
< 0.1%
35.16986874 1
< 0.1%
35.17017556 1
< 0.1%
35.17102199 1
< 0.1%
35.17122743 1
< 0.1%
ValueCountFrequency (%)
35.38610645 1
< 0.1%
35.37806983 1
< 0.1%
35.37655929 1
< 0.1%
35.3756863 1
< 0.1%
35.37565901 1
< 0.1%
35.37551556 1
< 0.1%
35.37546979 1
< 0.1%
35.37524191 1
< 0.1%
35.37512373 1
< 0.1%
35.37490077 1
< 0.1%
Distinct6719
Distinct (%)84.2%
Missing0
Missing (%)0.0%
Memory size62.5 KiB
2023-12-12T09:48:32.185278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length35
Mean length9.0403408
Min length1

Characters and Unicode

Total characters72160
Distinct characters782
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6118 ?
Unique (%)76.6%

Sample

1st row신발
2nd row타이어및 튜브,골프공
3rd row자동차용방진고무,비경화고무
4th row선박용,공조기
5th row유압솔레노이드밸브(동력전달장치)
ValueCountFrequency (%)
제조 237
 
1.9%
자동차 230
 
1.8%
부품 158
 
1.3%
플라스틱 151
 
1.2%
자동차부품 143
 
1.1%
141
 
1.1%
가공 141
 
1.1%
금속 108
 
0.9%
절삭가공 99
 
0.8%
제작 92
 
0.7%
Other values (6887) 11032
88.0%
2023-12-12T09:48:32.811764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4628
 
6.4%
2342
 
3.2%
2243
 
3.1%
1857
 
2.6%
1767
 
2.4%
1723
 
2.4%
) 1548
 
2.1%
( 1546
 
2.1%
1522
 
2.1%
, 1377
 
1.9%
Other values (772) 51607
71.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 60458
83.8%
Space Separator 4628
 
6.4%
Other Punctuation 1561
 
2.2%
Close Punctuation 1548
 
2.1%
Open Punctuation 1546
 
2.1%
Uppercase Letter 1257
 
1.7%
Lowercase Letter 1123
 
1.6%
Decimal Number 23
 
< 0.1%
Dash Punctuation 11
 
< 0.1%
Modifier Symbol 2
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2342
 
3.9%
2243
 
3.7%
1857
 
3.1%
1767
 
2.9%
1723
 
2.8%
1522
 
2.5%
1374
 
2.3%
1287
 
2.1%
1251
 
2.1%
1237
 
2.0%
Other values (702) 43855
72.5%
Uppercase Letter
ValueCountFrequency (%)
C 145
 
11.5%
P 117
 
9.3%
E 105
 
8.4%
A 89
 
7.1%
N 83
 
6.6%
L 82
 
6.5%
T 72
 
5.7%
R 68
 
5.4%
S 62
 
4.9%
D 54
 
4.3%
Other values (16) 380
30.2%
Lowercase Letter
ValueCountFrequency (%)
c 127
11.3%
e 107
 
9.5%
p 104
 
9.3%
a 86
 
7.7%
n 81
 
7.2%
t 68
 
6.1%
s 67
 
6.0%
r 67
 
6.0%
l 56
 
5.0%
i 55
 
4.9%
Other values (15) 305
27.2%
Other Punctuation
ValueCountFrequency (%)
, 1377
88.2%
. 161
 
10.3%
/ 17
 
1.1%
& 3
 
0.2%
: 2
 
0.1%
' 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 9
39.1%
2 7
30.4%
3 3
 
13.0%
4 3
 
13.0%
9 1
 
4.3%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
4628
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1548
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1546
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 60455
83.8%
Common 9322
 
12.9%
Latin 2380
 
3.3%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2342
 
3.9%
2243
 
3.7%
1857
 
3.1%
1767
 
2.9%
1723
 
2.9%
1522
 
2.5%
1374
 
2.3%
1287
 
2.1%
1251
 
2.1%
1237
 
2.0%
Other values (701) 43852
72.5%
Latin
ValueCountFrequency (%)
C 145
 
6.1%
c 127
 
5.3%
P 117
 
4.9%
e 107
 
4.5%
E 105
 
4.4%
p 104
 
4.4%
A 89
 
3.7%
a 86
 
3.6%
N 83
 
3.5%
L 82
 
3.4%
Other values (41) 1335
56.1%
Common
ValueCountFrequency (%)
4628
49.6%
) 1548
 
16.6%
( 1546
 
16.6%
, 1377
 
14.8%
. 161
 
1.7%
/ 17
 
0.2%
- 11
 
0.1%
1 9
 
0.1%
2 7
 
0.1%
3 3
 
< 0.1%
Other values (9) 15
 
0.2%
Han
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 60454
83.8%
ASCII 11701
 
16.2%
CJK 3
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Arrows 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4628
39.6%
) 1548
 
13.2%
( 1546
 
13.2%
, 1377
 
11.8%
. 161
 
1.4%
C 145
 
1.2%
c 127
 
1.1%
P 117
 
1.0%
e 107
 
0.9%
E 105
 
0.9%
Other values (59) 1840
 
15.7%
Hangul
ValueCountFrequency (%)
2342
 
3.9%
2243
 
3.7%
1857
 
3.1%
1767
 
2.9%
1723
 
2.9%
1522
 
2.5%
1374
 
2.3%
1287
 
2.1%
1251
 
2.1%
1237
 
2.0%
Other values (700) 43851
72.5%
CJK
ValueCountFrequency (%)
3
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Arrows
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-12T09:48:26.128299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:48:25.911159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:48:26.229118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:48:26.025402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:48:32.927303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
읍면동위도경도
읍면동1.0000.9200.804
위도0.9201.0000.675
경도0.8040.6751.000
2023-12-12T09:48:33.053933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위도경도읍면동
위도1.000-0.0640.694
경도-0.0641.0000.476
읍면동0.6940.4761.000

Missing values

2023-12-12T09:48:26.409905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:48:26.583787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

회사명대표자명전화번호읍면동주소위도경도주력생산품
0TKG태광㈜박주환055-333-7151삼안동경상남도 김해시 김해대로2635번길 26128.9153235.232074신발
1(주)넥센강호찬055-723-8526삼안동경상남도 김해시 김해대로 2595128.91028635.230922타이어및 튜브,골프공
2(주)대흥알앤티송영수055-345-6391진례면경상남도 김해시 진례면 서부로436번길 70-25128.7441835.268214자동차용방진고무,비경화고무
3하이에어코리아㈜김근배055-345-5000진례면경상남도 김해시 진례면 고모로324번길 204 B동128.78431835.245362선박용,공조기
4(주)유니크안영구 외 1명055-340-2000진영읍경상남도 김해시 진영읍 서부로179번길 90128.77501735.288545유압솔레노이드밸브(동력전달장치)
5(주)동성티씨에스곽경구055-345-6861진례면경상남도 김해시 진례면 고모로134번길 81128.77754635.236969자동차,건설기계,농기계 내외장재
6디케이락주식회사노은식055-338-0114주촌면경상남도 김해시 주촌면 골든루트로129번길 7128.8272535.216402피팅,밸브
7이코리아산업주식회사정태영055-314-2082주촌면경상남도 김해시 주촌면 골든루트로 103-11128.8286135.214058세탁기 플라스틱 부품
8㈜HC컴퍼니박소연055-900-2016주촌면경상남도 김해시 주촌면 골든루트로 104-2128.83067535.21509프라이팬, 냄비
9두리화학㈜최백규055-340-5516생림면경상남도 김해시 생림면 생림대로 826-90128.85759835.334012NC2저소음배관숄루션,EDR 지하횡주관
회사명대표자명전화번호읍면동주소위도경도주력생산품
7972코메스김영신055-346-4977진례면경상남도 김해시 진례면 테크노밸리로 193-38128.77223835.281929산업기계 자동화 설비
7973에스엠.이.엔지최상민<NA>주촌면경상남도 김해시 주촌면 서부로1638번길 65-16128.83103335.231898자동차플라스틱도어 사출금형
7974신아하이테크박병학070-8610-4745진영읍경상남도 김해시 진영읍 서부로396번길 20-35128.75270935.269369금형부품 절삭가공
7975명성엔지니어링하대일055-342-0877진례면경상남도 김해시 진례면 서부로411번길 39128.75478735.265985동주물주조업
7976태산전기정영섭055-346-7872칠산서부동경상남도 김해시 칠산로279번길 12-10128.84725435.198668자동제어판넬
7977동남산업박태준<NA>한림면경상남도 김해시 한림면 김해대로1434번길 10-1128.82818535.276889플라스틱압출성형
7978티엠에스이봉재<NA>주촌면경상남도 김해시 주촌면 서부로1499번길 54128.8145735.232443농기계부품(트렉트플라스틱지붕)
7979한국큐앤테크정현철055-903-0880한림면경상남도 김해시 한림면 김해대로1099번길 124-28128.80210735.300925조선기자재(덴퍼)
7980씨티씨엔지니어링(CTC engineering)변종성055-346-0242한림면경상남도 김해시 한림면 한림로46번길 79128.81669735.295346철도차량부품
7981아민도예천향순055-324-0350상동면경상남도 김해시 상동면 동북로1135번길 32-14128.91695735.34989가정용 도자기

Duplicate rows

Most frequently occurring

회사명대표자명전화번호읍면동주소위도경도주력생산품# duplicates
0강양HTS안승일055-343-2317한림면경상남도 김해시 한림면 가산로 52-16128.75973735.322769기어펌프제조2