Overview

Dataset statistics

Number of variables9
Number of observations85
Missing cells93
Missing cells (%)12.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.3 KiB
Average record size in memory75.6 B

Variable types

Numeric1
Text7
Unsupported1

Dataset

Description하동근 공장등록현황에 대한 데이터로 하동군 회사명, 대표자명, 전화번호, 팩스번호, 공장대표주소, 업종명, 생산품 등의 항목을 제공합니다.
Author경상남도 하동군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15036755

Alerts

전화번호 has 8 (9.4%) missing valuesMissing
Unnamed: 8 has 85 (100.0%) missing valuesMissing
순번 has unique valuesUnique
회사명 has unique valuesUnique
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 23:22:00.134685
Analysis finished2023-12-10 23:22:01.245796
Duration1.11 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct85
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean66.729412
Minimum1
Maximum138
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size897.0 B
2023-12-11T08:22:01.298277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.4
Q130
median65
Q3103
95-th percentile132.4
Maximum138
Range137
Interquartile range (IQR)73

Descriptive statistics

Standard deviation41.300694
Coefficient of variation (CV)0.61892789
Kurtosis-1.2718503
Mean66.729412
Median Absolute Deviation (MAD)36
Skewness0.099081537
Sum5672
Variance1705.7473
MonotonicityStrictly increasing
2023-12-11T08:22:01.404319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.2%
86 1
 
1.2%
101 1
 
1.2%
100 1
 
1.2%
98 1
 
1.2%
94 1
 
1.2%
93 1
 
1.2%
92 1
 
1.2%
91 1
 
1.2%
90 1
 
1.2%
Other values (75) 75
88.2%
ValueCountFrequency (%)
1 1
1.2%
2 1
1.2%
5 1
1.2%
6 1
1.2%
7 1
1.2%
9 1
1.2%
10 1
1.2%
11 1
1.2%
12 1
1.2%
13 1
1.2%
ValueCountFrequency (%)
138 1
1.2%
136 1
1.2%
135 1
1.2%
134 1
1.2%
133 1
1.2%
130 1
1.2%
127 1
1.2%
126 1
1.2%
124 1
1.2%
122 1
1.2%

회사명
Text

UNIQUE 

Distinct85
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size812.0 B
2023-12-11T08:22:01.589133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length14
Mean length8.2352941
Min length3

Characters and Unicode

Total characters700
Distinct characters159
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85 ?
Unique (%)100.0%

Sample

1st row(재)하동녹차연구소 가공2공장
2nd row(재)하동녹차연구소 가공3공장
3rd row(주)금륜
4th row(주)금오레미콘
5th row(주)금오레미콘 2공장
ValueCountFrequency (%)
주식회사 8
 
7.3%
농업회사법인 7
 
6.4%
재)하동녹차연구소 2
 
1.8%
주)금오레미콘 2
 
1.8%
화개농협제다 2
 
1.8%
나노라인 1
 
0.9%
일송제다 1
 
0.9%
유진제재소 1
 
0.9%
옥종양조장 1
 
0.9%
옥종농협공동퇴비제조장 1
 
0.9%
Other values (84) 84
76.4%
2023-12-11T08:22:01.893660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
6.0%
) 32
 
4.6%
( 32
 
4.6%
25
 
3.6%
23
 
3.3%
22
 
3.1%
19
 
2.7%
18
 
2.6%
16
 
2.3%
15
 
2.1%
Other values (149) 456
65.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 604
86.3%
Close Punctuation 32
 
4.6%
Open Punctuation 32
 
4.6%
Space Separator 25
 
3.6%
Decimal Number 5
 
0.7%
Uppercase Letter 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
7.0%
23
 
3.8%
22
 
3.6%
19
 
3.1%
18
 
3.0%
16
 
2.6%
15
 
2.5%
15
 
2.5%
14
 
2.3%
12
 
2.0%
Other values (142) 408
67.5%
Decimal Number
ValueCountFrequency (%)
2 4
80.0%
3 1
 
20.0%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
M 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Space Separator
ValueCountFrequency (%)
25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 604
86.3%
Common 94
 
13.4%
Latin 2
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
7.0%
23
 
3.8%
22
 
3.6%
19
 
3.1%
18
 
3.0%
16
 
2.6%
15
 
2.5%
15
 
2.5%
14
 
2.3%
12
 
2.0%
Other values (142) 408
67.5%
Common
ValueCountFrequency (%)
) 32
34.0%
( 32
34.0%
25
26.6%
2 4
 
4.3%
3 1
 
1.1%
Latin
ValueCountFrequency (%)
S 1
50.0%
M 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 604
86.3%
ASCII 96
 
13.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
7.0%
23
 
3.8%
22
 
3.6%
19
 
3.1%
18
 
3.0%
16
 
2.6%
15
 
2.5%
15
 
2.5%
14
 
2.3%
12
 
2.0%
Other values (142) 408
67.5%
ASCII
ValueCountFrequency (%)
) 32
33.3%
( 32
33.3%
25
26.0%
2 4
 
4.2%
S 1
 
1.0%
M 1
 
1.0%
3 1
 
1.0%
Distinct80
Distinct (%)94.1%
Missing0
Missing (%)0.0%
Memory size812.0 B
2023-12-11T08:22:02.121171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.0117647
Min length2

Characters and Unicode

Total characters256
Distinct characters92
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique76 ?
Unique (%)89.4%

Sample

1st row황인후
2nd row황인후
3rd row김옥례
4th row조효봉
5th row조효봉
ValueCountFrequency (%)
김복환 3
 
3.4%
황인후 2
 
2.3%
조효봉 2
 
2.3%
한춘식 2
 
2.3%
우재학 1
 
1.1%
오천호 1
 
1.1%
김종균 1
 
1.1%
김종수 1
 
1.1%
김영철 1
 
1.1%
김인기 1
 
1.1%
Other values (72) 72
82.8%
2023-12-11T08:22:02.439037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
 
10.9%
15
 
5.9%
13
 
5.1%
8
 
3.1%
8
 
3.1%
6
 
2.3%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (82) 161
62.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 254
99.2%
Space Separator 2
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
28
 
11.0%
15
 
5.9%
13
 
5.1%
8
 
3.1%
8
 
3.1%
6
 
2.4%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (81) 159
62.6%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 254
99.2%
Common 2
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
28
 
11.0%
15
 
5.9%
13
 
5.1%
8
 
3.1%
8
 
3.1%
6
 
2.4%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (81) 159
62.6%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 254
99.2%
ASCII 2
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
28
 
11.0%
15
 
5.9%
13
 
5.1%
8
 
3.1%
8
 
3.1%
6
 
2.4%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (81) 159
62.6%
ASCII
ValueCountFrequency (%)
2
100.0%

전화번호
Text

MISSING 

Distinct72
Distinct (%)93.5%
Missing8
Missing (%)9.4%
Memory size812.0 B
2023-12-11T08:22:02.654752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters924
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67 ?
Unique (%)87.0%

Sample

1st row_055_8831206
2nd row_055_8840693
3rd row_055_8838001
4th row_055_8838001
5th row_055_8835641
ValueCountFrequency (%)
055_8838001 2
 
2.6%
2
 
2.6%
055_8823017 2
 
2.6%
055_8826771 2
 
2.6%
055_8838341 2
 
2.6%
055_8828167 1
 
1.3%
055883_9859 1
 
1.3%
055_8835249 1
 
1.3%
055_8842625 1
 
1.3%
055_8840785 1
 
1.3%
Other values (62) 62
80.5%
2023-12-11T08:22:03.032481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 179
19.4%
5 170
18.4%
8 166
18.0%
0 125
13.5%
3 58
 
6.3%
2 52
 
5.6%
4 48
 
5.2%
1 43
 
4.7%
7 38
 
4.1%
6 28
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 745
80.6%
Connector Punctuation 179
 
19.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 170
22.8%
8 166
22.3%
0 125
16.8%
3 58
 
7.8%
2 52
 
7.0%
4 48
 
6.4%
1 43
 
5.8%
7 38
 
5.1%
6 28
 
3.8%
9 17
 
2.3%
Connector Punctuation
ValueCountFrequency (%)
_ 179
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 924
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
_ 179
19.4%
5 170
18.4%
8 166
18.0%
0 125
13.5%
3 58
 
6.3%
2 52
 
5.6%
4 48
 
5.2%
1 43
 
4.7%
7 38
 
4.1%
6 28
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 924
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 179
19.4%
5 170
18.4%
8 166
18.0%
0 125
13.5%
3 58
 
6.3%
2 52
 
5.6%
4 48
 
5.2%
1 43
 
4.7%
7 38
 
4.1%
6 28
 
3.0%
Distinct47
Distinct (%)55.3%
Missing0
Missing (%)0.0%
Memory size812.0 B
2023-12-11T08:22:03.202990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1020
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)51.8%

Sample

1st row_055_8841599
2nd row_01086650460
3rd row_055_8840680
4th row_055_8838006
5th row____________
ValueCountFrequency (%)
37
43.5%
055 2
 
2.4%
055_8823016 2
 
2.4%
055_8838804 1
 
1.2%
055_8838023 1
 
1.2%
055_8832463 1
 
1.2%
055_8841599 1
 
1.2%
055_8847463 1
 
1.2%
055_8839858 1
 
1.2%
055_8839493 1
 
1.2%
Other values (37) 37
43.5%
2023-12-11T08:22:03.497733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 552
54.1%
8 111
 
10.9%
5 104
 
10.2%
0 73
 
7.2%
3 38
 
3.7%
4 30
 
2.9%
1 28
 
2.7%
6 26
 
2.5%
2 24
 
2.4%
7 22
 
2.2%

Most occurring categories

ValueCountFrequency (%)
Connector Punctuation 552
54.1%
Decimal Number 468
45.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 111
23.7%
5 104
22.2%
0 73
15.6%
3 38
 
8.1%
4 30
 
6.4%
1 28
 
6.0%
6 26
 
5.6%
2 24
 
5.1%
7 22
 
4.7%
9 12
 
2.6%
Connector Punctuation
ValueCountFrequency (%)
_ 552
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1020
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
_ 552
54.1%
8 111
 
10.9%
5 104
 
10.2%
0 73
 
7.2%
3 38
 
3.7%
4 30
 
2.9%
1 28
 
2.7%
6 26
 
2.5%
2 24
 
2.4%
7 22
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1020
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 552
54.1%
8 111
 
10.9%
5 104
 
10.2%
0 73
 
7.2%
3 38
 
3.7%
4 30
 
2.9%
1 28
 
2.7%
6 26
 
2.5%
2 24
 
2.4%
7 22
 
2.2%
Distinct84
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size812.0 B
2023-12-11T08:22:03.789133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length31
Mean length24.117647
Min length19

Characters and Unicode

Total characters2050
Distinct characters139
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)97.6%

Sample

1st row경상남도 하동군 화개면 영당2길 26
2nd row경상남도 하동군 하동읍 섬진강대로 2492
3rd row경상남도 하동군 금성면 경제산업로 507-80
4th row경상남도 하동군 진교면 달구지길 95-47 (진교면)
5th row경상남도 하동군 진교면 달구지길 95-47
ValueCountFrequency (%)
경상남도 85
18.7%
하동군 85
18.7%
화개면 16
 
3.5%
진교면 16
 
3.5%
하동읍 10
 
2.2%
악양면 10
 
2.2%
섬진강대로 8
 
1.8%
적량면 7
 
1.5%
금남면 6
 
1.3%
옥종면 5
 
1.1%
Other values (156) 206
45.4%
2023-12-11T08:22:04.234865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
369
18.0%
103
 
5.0%
97
 
4.7%
95
 
4.6%
89
 
4.3%
88
 
4.3%
86
 
4.2%
85
 
4.1%
77
 
3.8%
2 45
 
2.2%
Other values (129) 916
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1281
62.5%
Space Separator 369
 
18.0%
Decimal Number 300
 
14.6%
Dash Punctuation 41
 
2.0%
Close Punctuation 29
 
1.4%
Open Punctuation 29
 
1.4%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
103
 
8.0%
97
 
7.6%
95
 
7.4%
89
 
6.9%
88
 
6.9%
86
 
6.7%
85
 
6.6%
77
 
6.0%
45
 
3.5%
32
 
2.5%
Other values (114) 484
37.8%
Decimal Number
ValueCountFrequency (%)
2 45
15.0%
1 43
14.3%
3 40
13.3%
6 36
12.0%
4 32
10.7%
5 28
9.3%
9 22
7.3%
0 19
6.3%
8 19
6.3%
7 16
 
5.3%
Space Separator
ValueCountFrequency (%)
369
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%
Close Punctuation
ValueCountFrequency (%)
) 29
100.0%
Open Punctuation
ValueCountFrequency (%)
( 29
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1282
62.5%
Common 768
37.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
103
 
8.0%
97
 
7.6%
95
 
7.4%
89
 
6.9%
88
 
6.9%
86
 
6.7%
85
 
6.6%
77
 
6.0%
45
 
3.5%
32
 
2.5%
Other values (115) 485
37.8%
Common
ValueCountFrequency (%)
369
48.0%
2 45
 
5.9%
1 43
 
5.6%
- 41
 
5.3%
3 40
 
5.2%
6 36
 
4.7%
4 32
 
4.2%
) 29
 
3.8%
( 29
 
3.8%
5 28
 
3.6%
Other values (4) 76
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1281
62.5%
ASCII 768
37.5%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
369
48.0%
2 45
 
5.9%
1 43
 
5.6%
- 41
 
5.3%
3 40
 
5.2%
6 36
 
4.7%
4 32
 
4.2%
) 29
 
3.8%
( 29
 
3.8%
5 28
 
3.6%
Other values (4) 76
 
9.9%
Hangul
ValueCountFrequency (%)
103
 
8.0%
97
 
7.6%
95
 
7.4%
89
 
6.9%
88
 
6.9%
86
 
6.7%
85
 
6.6%
77
 
6.0%
45
 
3.5%
32
 
2.5%
Other values (114) 484
37.8%
None
ValueCountFrequency (%)
1
100.0%
Distinct46
Distinct (%)54.1%
Missing0
Missing (%)0.0%
Memory size812.0 B
2023-12-11T08:22:04.467811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length27
Mean length12.341176
Min length6

Characters and Unicode

Total characters1049
Distinct characters131
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)37.6%

Sample

1st row차류 가공업
2nd row차류 가공업
3rd row비금속류 해체 및 선별업 외 1 종
4th row레미콘 제조업
5th row아스팔트 콘크리트 및 혼합제품 제조업
ValueCountFrequency (%)
제조업 47
 
13.7%
24
 
7.0%
23
 
6.7%
차류 21
 
6.1%
가공업 21
 
6.1%
19
 
5.5%
1 14
 
4.1%
기타 12
 
3.5%
저장 7
 
2.0%
가공 7
 
2.0%
Other values (89) 149
43.3%
2023-12-11T08:22:04.797662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
259
24.7%
92
 
8.8%
58
 
5.5%
50
 
4.8%
32
 
3.1%
30
 
2.9%
29
 
2.8%
24
 
2.3%
24
 
2.3%
22
 
2.1%
Other values (121) 429
40.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 768
73.2%
Space Separator 259
 
24.7%
Decimal Number 19
 
1.8%
Close Punctuation 1
 
0.1%
Open Punctuation 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
92
 
12.0%
58
 
7.6%
50
 
6.5%
32
 
4.2%
30
 
3.9%
29
 
3.8%
24
 
3.1%
24
 
3.1%
22
 
2.9%
21
 
2.7%
Other values (113) 386
50.3%
Decimal Number
ValueCountFrequency (%)
1 14
73.7%
2 3
 
15.8%
3 1
 
5.3%
9 1
 
5.3%
Space Separator
ValueCountFrequency (%)
259
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 768
73.2%
Common 281
 
26.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
92
 
12.0%
58
 
7.6%
50
 
6.5%
32
 
4.2%
30
 
3.9%
29
 
3.8%
24
 
3.1%
24
 
3.1%
22
 
2.9%
21
 
2.7%
Other values (113) 386
50.3%
Common
ValueCountFrequency (%)
259
92.2%
1 14
 
5.0%
2 3
 
1.1%
) 1
 
0.4%
( 1
 
0.4%
3 1
 
0.4%
9 1
 
0.4%
. 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 766
73.0%
ASCII 281
 
26.8%
Compat Jamo 2
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
259
92.2%
1 14
 
5.0%
2 3
 
1.1%
) 1
 
0.4%
( 1
 
0.4%
3 1
 
0.4%
9 1
 
0.4%
. 1
 
0.4%
Hangul
ValueCountFrequency (%)
92
 
12.0%
58
 
7.6%
50
 
6.5%
32
 
4.2%
30
 
3.9%
29
 
3.8%
24
 
3.1%
24
 
3.1%
22
 
2.9%
21
 
2.7%
Other values (112) 384
50.1%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Distinct63
Distinct (%)74.1%
Missing0
Missing (%)0.0%
Memory size812.0 B
2023-12-11T08:22:05.022816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length10
Mean length4.4705882
Min length2

Characters and Unicode

Total characters380
Distinct characters148
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)64.7%

Sample

1st row녹차
2nd row녹차
3rd row레미콘혼합제
4th row레미콘
5th row아스콘
ValueCountFrequency (%)
녹차 15
 
14.6%
막걸리 4
 
3.9%
과일선별기 2
 
1.9%
제다 2
 
1.9%
선박구성부분품 2
 
1.9%
선박구성 2
 
1.9%
재첩국 2
 
1.9%
재첩 2
 
1.9%
레미콘 2
 
1.9%
가공품 2
 
1.9%
Other values (68) 68
66.0%
2023-12-11T08:22:05.358240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
5.0%
18
 
4.7%
18
 
4.7%
10
 
2.6%
9
 
2.4%
8
 
2.1%
8
 
2.1%
7
 
1.8%
7
 
1.8%
7
 
1.8%
Other values (138) 269
70.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 345
90.8%
Space Separator 18
 
4.7%
Lowercase Letter 6
 
1.6%
Other Punctuation 4
 
1.1%
Uppercase Letter 3
 
0.8%
Open Punctuation 2
 
0.5%
Close Punctuation 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
5.5%
18
 
5.2%
10
 
2.9%
9
 
2.6%
8
 
2.3%
8
 
2.3%
7
 
2.0%
7
 
2.0%
7
 
2.0%
6
 
1.7%
Other values (125) 246
71.3%
Lowercase Letter
ValueCountFrequency (%)
f 1
16.7%
i 1
16.7%
l 1
16.7%
t 1
16.7%
e 1
16.7%
r 1
16.7%
Uppercase Letter
ValueCountFrequency (%)
F 1
33.3%
C 1
33.3%
R 1
33.3%
Space Separator
ValueCountFrequency (%)
18
100.0%
Other Punctuation
ValueCountFrequency (%)
. 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 345
90.8%
Common 26
 
6.8%
Latin 9
 
2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
5.5%
18
 
5.2%
10
 
2.9%
9
 
2.6%
8
 
2.3%
8
 
2.3%
7
 
2.0%
7
 
2.0%
7
 
2.0%
6
 
1.7%
Other values (125) 246
71.3%
Latin
ValueCountFrequency (%)
F 1
11.1%
f 1
11.1%
i 1
11.1%
l 1
11.1%
t 1
11.1%
e 1
11.1%
r 1
11.1%
C 1
11.1%
R 1
11.1%
Common
ValueCountFrequency (%)
18
69.2%
. 4
 
15.4%
( 2
 
7.7%
) 2
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 345
90.8%
ASCII 35
 
9.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
19
 
5.5%
18
 
5.2%
10
 
2.9%
9
 
2.6%
8
 
2.3%
8
 
2.3%
7
 
2.0%
7
 
2.0%
7
 
2.0%
6
 
1.7%
Other values (125) 246
71.3%
ASCII
ValueCountFrequency (%)
18
51.4%
. 4
 
11.4%
( 2
 
5.7%
) 2
 
5.7%
F 1
 
2.9%
f 1
 
2.9%
i 1
 
2.9%
l 1
 
2.9%
t 1
 
2.9%
e 1
 
2.9%
Other values (3) 3
 
8.6%

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing85
Missing (%)100.0%
Memory size897.0 B

Interactions

2023-12-11T08:22:00.995265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:22:05.446787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번회사명대표자명전화번호팩스번호공장대표주소업종명생산품
순번1.0001.0000.9870.9750.7310.9160.5940.250
회사명1.0001.0001.0001.0001.0001.0001.0001.000
대표자명0.9871.0001.0000.9990.8550.9960.9930.993
전화번호0.9751.0000.9991.0000.9941.0000.9960.992
팩스번호0.7311.0000.8550.9941.0000.9940.0000.739
공장대표주소0.9161.0000.9961.0000.9941.0001.0000.995
업종명0.5941.0000.9930.9960.0001.0001.0000.998
생산품0.2501.0000.9930.9920.7390.9950.9981.000

Missing values

2023-12-11T08:22:01.094211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:22:01.204759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번회사명대표자명전화번호팩스번호공장대표주소업종명생산품Unnamed: 8
01(재)하동녹차연구소 가공2공장황인후_055_8831206_055_8841599경상남도 하동군 화개면 영당2길 26차류 가공업녹차<NA>
12(재)하동녹차연구소 가공3공장황인후<NA>_01086650460경상남도 하동군 하동읍 섬진강대로 2492차류 가공업녹차<NA>
25(주)금륜김옥례_055_8840693_055_8840680경상남도 하동군 금성면 경제산업로 507-80비금속류 해체 및 선별업 외 1 종레미콘혼합제<NA>
36(주)금오레미콘조효봉_055_8838001_055_8838006경상남도 하동군 진교면 달구지길 95-47 (진교면)레미콘 제조업레미콘<NA>
47(주)금오레미콘 2공장조효봉_055_8838001____________경상남도 하동군 진교면 달구지길 95-47아스팔트 콘크리트 및 혼합제품 제조업아스콘<NA>
59(주)대덕화학이기형_055_8835641_055_8835643경상남도 하동군 고전면 농공단지길 35 (대덕화학)고무패킹류 제조업 외 1 종산업용 고무<NA>
610(주)두양중공업김덕진_055_8825123____________경상남도 하동군 진교면 진양로 258-23산업용 난방보일러 및 방열기 제조업보일러<NA>
711(주)드론고하상균<NA>____________경상남도 하동군 진교면 민다리길 62무인 항공기 및 무인 비행장치 제조업 외 2 종드론<NA>
812(주)디에스아이신 치 덕________________________경상남도 하동군 금남면 해안로 211-134기타 석유정제물 재처리업정제연료유<NA>
913(주)비오투하동김우곤_055_8820660_055_8821294경상남도 하동군 고전면 늘봉길 306배합 사료 제조업 외 1 종단미사료<NA>
순번회사명대표자명전화번호팩스번호공장대표주소업종명생산품Unnamed: 8
75122하동섬진강재첩수산이명환_055_8840094____________경상남도 하동군 적량면 대티길 44-14수산동물 냉동품 제조업재첩 가공품<NA>
76124하동율림영농조합법인최경태<NA>____________경상남도 하동군 하동읍 매화골먹점길 9기타 과실ㆍ채소 가공 및 저장 처리업맛밤<NA>
77126하동전통식품김경연<NA>____________경상남도 하동군 하동읍 섬진강대로 2396수산식물 가공 및 저장 처리업김부각<NA>
78127하동합동양조장양희천_055_8842714____________경상남도 하동군 하동읍 중앙1길 17주정 제조업막걸리<NA>
79130해들가김정희_07077958320____________경상남도 하동군 금남면 경충로 378-65수산식물 가공 및 저장 처리업해초 김<NA>
80133화개농협제다한춘식_055_8838341_055_8832463경상남도 하동군 화개면 화개로 26-1차류 가공업녹차<NA>
81134화개농협제다 2한춘식_055_8838341____________경상남도 하동군 화개면 섬진강대로 3742차류 가공업녹차<NA>
82135화개제다홍소술_055_8832233_055882_8941경상남도 하동군 화개면 섬진강대로 4028 (총 2 필지)차류 가공업녹차<NA>
83136화개합동탁주김진희_055883_2456____________경상남도 하동군 화개면 탑리 678번지탁주 및 약주 제조업막걸리<NA>
84138화창목재강명훈_055_8823306_055_8823066경상남도 하동군 진교면 진교리 67번지일반 제재업각재<NA>