Overview

Dataset statistics

Number of variables17
Number of observations21
Missing cells70
Missing cells (%)19.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.1 KiB
Average record size in memory149.3 B

Variable types

Text7
Categorical2
Numeric4
DateTime1
Unsupported3

Dataset

Description경상남도 하동군에 위치한 건축 관련 제조업을 하는 공장 업체정보입니다. 업체명, 유형, 사업자등록번호, 업종, 사업자 업태업종, 주력사업, 전화번호, 도로명 주소, 팩스, 설립일자, 종업원수, 대표자 이름, 대표자 이메일, 위도, 경도, 회사대표 이메일 정보를 제공합니다.
Author경상남도 하동군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15126777

Alerts

업종 has constant value ""Constant
종업원수 is highly overall correlated with 유형High correlation
유형 is highly overall correlated with 종업원수High correlation
전화번호 has 1 (4.8%) missing valuesMissing
팩스 has 6 (28.6%) missing valuesMissing
Unnamed: 14 has 21 (100.0%) missing valuesMissing
Unnamed: 15 has 21 (100.0%) missing valuesMissing
Unnamed: 16 has 21 (100.0%) missing valuesMissing
업체명 has unique valuesUnique
도로명 주소 has unique valuesUnique
설립일자 has unique valuesUnique
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-13 00:12:11.377744
Analysis finished2024-03-13 00:12:13.519933
Duration2.14 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체명
Text

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
2024-03-13T09:12:13.606731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length7.8571429
Min length4

Characters and Unicode

Total characters165
Distinct characters69
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row(주)남광하동공장
2nd row(주)남양산업
3rd row(주)대덕화학
4th row(주)예미담황토
5th row(주)참솔산업
ValueCountFrequency (%)
주식회사 7
24.1%
케이엠산업 2
 
6.9%
주)남광하동공장 1
 
3.4%
유진제재소 1
 
3.4%
코코세라믹(주 1
 
3.4%
2공장 1
 
3.4%
이노테크원 1
 
3.4%
경보 1
 
3.4%
가온 1
 
3.4%
인포피알 1
 
3.4%
Other values (12) 12
41.4%
2024-03-13T09:12:13.866154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18
 
10.9%
( 11
 
6.7%
) 11
 
6.7%
9
 
5.5%
8
 
4.8%
8
 
4.8%
8
 
4.8%
7
 
4.2%
6
 
3.6%
4
 
2.4%
Other values (59) 75
45.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 134
81.2%
Open Punctuation 11
 
6.7%
Close Punctuation 11
 
6.7%
Space Separator 8
 
4.8%
Decimal Number 1
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
13.4%
9
 
6.7%
8
 
6.0%
8
 
6.0%
7
 
5.2%
6
 
4.5%
4
 
3.0%
3
 
2.2%
3
 
2.2%
2
 
1.5%
Other values (55) 66
49.3%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 134
81.2%
Common 31
 
18.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
13.4%
9
 
6.7%
8
 
6.0%
8
 
6.0%
7
 
5.2%
6
 
4.5%
4
 
3.0%
3
 
2.2%
3
 
2.2%
2
 
1.5%
Other values (55) 66
49.3%
Common
ValueCountFrequency (%)
( 11
35.5%
) 11
35.5%
8
25.8%
2 1
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 134
81.2%
ASCII 31
 
18.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
18
 
13.4%
9
 
6.7%
8
 
6.0%
8
 
6.0%
7
 
5.2%
6
 
4.5%
4
 
3.0%
3
 
2.2%
3
 
2.2%
2
 
1.5%
Other values (55) 66
49.3%
ASCII
ValueCountFrequency (%)
( 11
35.5%
) 11
35.5%
8
25.8%
2 1
 
3.2%

유형
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)9.5%
Missing0
Missing (%)0.0%
Memory size300.0 B
법인
18 
개인

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row법인
2nd row법인
3rd row법인
4th row법인
5th row법인

Common Values

ValueCountFrequency (%)
법인 18
85.7%
개인 3
 
14.3%

Length

2024-03-13T09:12:13.982261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T09:12:14.081900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
법인 18
85.7%
개인 3
 
14.3%

사업자등록번호
Real number (ℝ)

Distinct20
Distinct (%)95.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.0615273 × 109
Minimum2.8885012 × 109
Maximum8.1487025 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size321.0 B
2024-03-13T09:12:14.180867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.8885012 × 109
5-th percentile4.0981911 × 109
Q16.1381227 × 109
median6.1381699 × 109
Q36.1481025 × 109
95-th percentile8.1487025 × 109
Maximum8.1487025 × 109
Range5.2602013 × 109
Interquartile range (IQR)9979843

Descriptive statistics

Standard deviation1.1376562 × 109
Coefficient of variation (CV)0.18768474
Kurtosis2.9245401
Mean6.0615273 × 109
Median Absolute Deviation (MAD)2007612
Skewness-0.89043481
Sum1.2729207 × 1011
Variance1.2942615 × 1018
MonotonicityNot monotonic
2024-03-13T09:12:14.299433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
8148702505 2
 
9.5%
6148501329 1
 
4.8%
6140153055 1
 
4.8%
6132276972 1
 
4.8%
6138153471 1
 
4.8%
6948600376 1
 
4.8%
6138131915 1
 
4.8%
6138172648 1
 
4.8%
6518100035 1
 
4.8%
2888501246 1
 
4.8%
Other values (10) 10
47.6%
ValueCountFrequency (%)
2888501246 1
4.8%
4098191070 1
4.8%
4588701780 1
4.8%
6132276972 1
4.8%
6138119769 1
4.8%
6138122696 1
4.8%
6138131915 1
4.8%
6138152813 1
4.8%
6138153471 1
4.8%
6138168028 1
4.8%
ValueCountFrequency (%)
8148702505 2
9.5%
6948600376 1
4.8%
6518100035 1
4.8%
6148501329 1
4.8%
6148102539 1
4.8%
6140177547 1
4.8%
6140153055 1
4.8%
6138172648 1
4.8%
6138171300 1
4.8%
6138169935 1
4.8%

업종
Categorical

CONSTANT 

Distinct1
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size300.0 B
제조업
21 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제조업
2nd row제조업
3rd row제조업
4th row제조업
5th row제조업

Common Values

ValueCountFrequency (%)
제조업 21
100.0%

Length

2024-03-13T09:12:14.404538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T09:12:14.491102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제조업 21
100.0%
Distinct19
Distinct (%)90.5%
Missing0
Missing (%)0.0%
Memory size300.0 B
2024-03-13T09:12:14.695911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length107
Median length43
Mean length36.52381
Min length6

Characters and Unicode

Total characters767
Distinct characters122
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)81.0%

Sample

1st row코크스 및 관련제품 제조업+석탄화학계 화합물 및 기타 기초 유기화학 물질 제조업+금속류 해체 및 선별업+금속류 원료 재생업
2nd row그 외 기타 콘크리트 제품 및 유사 제품 제조업+그 외 기타 분류 안된 비금속 광물제품 제조업
3rd row고무 패킹류 제조업+ 산업용 그 외 비경화 고무제품 제조업+기어 및 동력전달장치 제조업
4th row점토 벽돌, 블록 및 유사 비내화 요업제품 제조업+포장용 플라스틱 성형용기 제조업
5th row비내화 모르타르 제조업
ValueCountFrequency (%)
26
 
13.0%
제조업 23
 
11.5%
금속 6
 
3.0%
유사 6
 
3.0%
비내화 5
 
2.5%
콘크리트 5
 
2.5%
레미콘 4
 
2.0%
요업제품 4
 
2.0%
기타 4
 
2.0%
제품 4
 
2.0%
Other values (84) 113
56.5%
2024-03-13T09:12:15.064416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
179
23.3%
61
 
8.0%
50
 
6.5%
49
 
6.4%
26
 
3.4%
+ 22
 
2.9%
19
 
2.5%
12
 
1.6%
11
 
1.4%
11
 
1.4%
Other values (112) 327
42.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 556
72.5%
Space Separator 179
 
23.3%
Math Symbol 22
 
2.9%
Other Punctuation 10
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
61
 
11.0%
50
 
9.0%
49
 
8.8%
26
 
4.7%
19
 
3.4%
12
 
2.2%
11
 
2.0%
11
 
2.0%
11
 
2.0%
10
 
1.8%
Other values (109) 296
53.2%
Space Separator
ValueCountFrequency (%)
179
100.0%
Math Symbol
ValueCountFrequency (%)
+ 22
100.0%
Other Punctuation
ValueCountFrequency (%)
, 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 556
72.5%
Common 211
 
27.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
61
 
11.0%
50
 
9.0%
49
 
8.8%
26
 
4.7%
19
 
3.4%
12
 
2.2%
11
 
2.0%
11
 
2.0%
11
 
2.0%
10
 
1.8%
Other values (109) 296
53.2%
Common
ValueCountFrequency (%)
179
84.8%
+ 22
 
10.4%
, 10
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 556
72.5%
ASCII 211
 
27.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
179
84.8%
+ 22
 
10.4%
, 10
 
4.7%
Hangul
ValueCountFrequency (%)
61
 
11.0%
50
 
9.0%
49
 
8.8%
26
 
4.7%
19
 
3.4%
12
 
2.2%
11
 
2.0%
11
 
2.0%
11
 
2.0%
10
 
1.8%
Other values (109) 296
53.2%
Distinct14
Distinct (%)66.7%
Missing0
Missing (%)0.0%
Memory size300.0 B
2024-03-13T09:12:15.212662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length21
Mean length16
Min length6

Characters and Unicode

Total characters336
Distinct characters69
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)42.9%

Sample

1st row코크스 및 연탄 제조업
2nd row그 외 기타 콘크리트 제품 및 유사 제품 제조업
3rd row고무 패킹류 제조업
4th row점토 벽돌, 블록 및 유사 비내화 요업제품 제조업
5th row비내화 모르타르 제조업
ValueCountFrequency (%)
제조업 20
20.0%
10
 
10.0%
레미콘 4
 
4.0%
금속 4
 
4.0%
유사 4
 
4.0%
비내화 4
 
4.0%
제품 3
 
3.0%
콘크리트 3
 
3.0%
구조용 3
 
3.0%
요업제품 3
 
3.0%
Other values (30) 42
42.0%
2024-03-13T09:12:15.463210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
79
23.5%
32
 
9.5%
27
 
8.0%
25
 
7.4%
10
 
3.0%
10
 
3.0%
8
 
2.4%
5
 
1.5%
5
 
1.5%
5
 
1.5%
Other values (59) 130
38.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 253
75.3%
Space Separator 79
 
23.5%
Other Punctuation 4
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
12.6%
27
 
10.7%
25
 
9.9%
10
 
4.0%
10
 
4.0%
8
 
3.2%
5
 
2.0%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (57) 121
47.8%
Space Separator
ValueCountFrequency (%)
79
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 253
75.3%
Common 83
 
24.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
12.6%
27
 
10.7%
25
 
9.9%
10
 
4.0%
10
 
4.0%
8
 
3.2%
5
 
2.0%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (57) 121
47.8%
Common
ValueCountFrequency (%)
79
95.2%
, 4
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 253
75.3%
ASCII 83
 
24.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
79
95.2%
, 4
 
4.8%
Hangul
ValueCountFrequency (%)
32
 
12.6%
27
 
10.7%
25
 
9.9%
10
 
4.0%
10
 
4.0%
8
 
3.2%
5
 
2.0%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (57) 121
47.8%

전화번호
Text

MISSING 

Distinct19
Distinct (%)95.0%
Missing1
Missing (%)4.8%
Memory size300.0 B
2024-03-13T09:12:15.623798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters240
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)90.0%

Sample

1st row055-884-0671
2nd row055-883-7111
3rd row055-883-5641
4th row055-883-5338
5th row055-884-0674
ValueCountFrequency (%)
055-883-8001 2
 
10.0%
055-884-7250 1
 
5.0%
055-883-7111 1
 
5.0%
055-883-6206 1
 
5.0%
055-884-8282 1
 
5.0%
055-884-1930 1
 
5.0%
055-884-0805 1
 
5.0%
055-883-9490 1
 
5.0%
055-884-6560 1
 
5.0%
055-883-1521 1
 
5.0%
Other values (9) 9
45.0%
2024-03-13T09:12:15.896416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 48
20.0%
8 48
20.0%
- 40
16.7%
0 34
14.2%
3 16
 
6.7%
1 15
 
6.2%
2 11
 
4.6%
4 10
 
4.2%
6 8
 
3.3%
9 6
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 200
83.3%
Dash Punctuation 40
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 48
24.0%
8 48
24.0%
0 34
17.0%
3 16
 
8.0%
1 15
 
7.5%
2 11
 
5.5%
4 10
 
5.0%
6 8
 
4.0%
9 6
 
3.0%
7 4
 
2.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 240
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 48
20.0%
8 48
20.0%
- 40
16.7%
0 34
14.2%
3 16
 
6.7%
1 15
 
6.2%
2 11
 
4.6%
4 10
 
4.2%
6 8
 
3.3%
9 6
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 240
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 48
20.0%
8 48
20.0%
- 40
16.7%
0 34
14.2%
3 16
 
6.7%
1 15
 
6.2%
2 11
 
4.6%
4 10
 
4.2%
6 8
 
3.3%
9 6
 
2.5%

도로명 주소
Text

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
2024-03-13T09:12:16.071008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length30
Mean length26.142857
Min length20

Characters and Unicode

Total characters549
Distinct characters74
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row경상남도 하동군 적량면 한옥정길 36-22 ((주)남광)
2nd row경상남도 하동군 양보면 진양로 356-95 외 1필지
3rd row경상남도 하동군 고전면 농공단지길 35 (대덕화학)
4th row경상남도 하동군 옥종면 병천리 957-13번지 외 2필지
5th row경상남도 하동군 적량면 한옥정길 36-22
ValueCountFrequency (%)
경상남도 21
17.2%
하동군 21
17.2%
6
 
4.9%
고전면 6
 
4.9%
진교면 5
 
4.1%
3필지 4
 
3.3%
옥종면 3
 
2.5%
적량면 3
 
2.5%
한옥정길 3
 
2.5%
진양로 2
 
1.6%
Other values (41) 48
39.3%
2024-03-13T09:12:16.352186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
101
 
18.4%
25
 
4.6%
24
 
4.4%
23
 
4.2%
22
 
4.0%
21
 
3.8%
21
 
3.8%
21
 
3.8%
21
 
3.8%
3 15
 
2.7%
Other values (64) 255
46.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 337
61.4%
Space Separator 101
 
18.4%
Decimal Number 84
 
15.3%
Dash Punctuation 13
 
2.4%
Close Punctuation 7
 
1.3%
Open Punctuation 7
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
7.4%
24
 
7.1%
23
 
6.8%
22
 
6.5%
21
 
6.2%
21
 
6.2%
21
 
6.2%
21
 
6.2%
13
 
3.9%
10
 
3.0%
Other values (50) 136
40.4%
Decimal Number
ValueCountFrequency (%)
3 15
17.9%
2 11
13.1%
5 10
11.9%
1 9
10.7%
7 8
9.5%
4 8
9.5%
6 8
9.5%
0 5
 
6.0%
8 5
 
6.0%
9 5
 
6.0%
Space Separator
ValueCountFrequency (%)
101
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 337
61.4%
Common 212
38.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
7.4%
24
 
7.1%
23
 
6.8%
22
 
6.5%
21
 
6.2%
21
 
6.2%
21
 
6.2%
21
 
6.2%
13
 
3.9%
10
 
3.0%
Other values (50) 136
40.4%
Common
ValueCountFrequency (%)
101
47.6%
3 15
 
7.1%
- 13
 
6.1%
2 11
 
5.2%
5 10
 
4.7%
1 9
 
4.2%
7 8
 
3.8%
4 8
 
3.8%
6 8
 
3.8%
) 7
 
3.3%
Other values (4) 22
 
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 337
61.4%
ASCII 212
38.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
101
47.6%
3 15
 
7.1%
- 13
 
6.1%
2 11
 
5.2%
5 10
 
4.7%
1 9
 
4.2%
7 8
 
3.8%
4 8
 
3.8%
6 8
 
3.8%
) 7
 
3.3%
Other values (4) 22
 
10.4%
Hangul
ValueCountFrequency (%)
25
 
7.4%
24
 
7.1%
23
 
6.8%
22
 
6.5%
21
 
6.2%
21
 
6.2%
21
 
6.2%
21
 
6.2%
13
 
3.9%
10
 
3.0%
Other values (50) 136
40.4%

팩스
Text

MISSING 

Distinct15
Distinct (%)100.0%
Missing6
Missing (%)28.6%
Memory size300.0 B
2024-03-13T09:12:16.513688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters180
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)100.0%

Sample

1st row055-884-0673
2nd row055-883-7112
3rd row055-883-5643
4th row055-883-5332
5th row055-883-0673
ValueCountFrequency (%)
055-884-0673 1
 
6.7%
055-883-7112 1
 
6.7%
055-883-5643 1
 
6.7%
055-883-5332 1
 
6.7%
055-883-0673 1
 
6.7%
055-882-7789 1
 
6.7%
055-883-5325 1
 
6.7%
055-883-1523 1
 
6.7%
055-884-7251 1
 
6.7%
055-884-6561 1
 
6.7%
Other values (5) 5
33.3%
2024-03-13T09:12:16.750638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 37
20.6%
8 33
18.3%
- 30
16.7%
0 21
11.7%
3 20
11.1%
4 7
 
3.9%
6 7
 
3.9%
7 7
 
3.9%
1 7
 
3.9%
2 7
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 150
83.3%
Dash Punctuation 30
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 37
24.7%
8 33
22.0%
0 21
14.0%
3 20
13.3%
4 7
 
4.7%
6 7
 
4.7%
7 7
 
4.7%
1 7
 
4.7%
2 7
 
4.7%
9 4
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 180
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 37
20.6%
8 33
18.3%
- 30
16.7%
0 21
11.7%
3 20
11.1%
4 7
 
3.9%
6 7
 
3.9%
7 7
 
3.9%
1 7
 
3.9%
2 7
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 180
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 37
20.6%
8 33
18.3%
- 30
16.7%
0 21
11.7%
3 20
11.1%
4 7
 
3.9%
6 7
 
3.9%
7 7
 
3.9%
1 7
 
3.9%
2 7
 
3.9%

설립일자
Date

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
Minimum1991-06-10 00:00:00
Maximum2020-05-06 00:00:00
2024-03-13T09:12:16.847141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:16.938770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)

종업원수
Real number (ℝ)

HIGH CORRELATION 

Distinct15
Distinct (%)71.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.666667
Minimum1
Maximum40
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size321.0 B
2024-03-13T09:12:17.053092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q16
median9
Q313
95-th percentile23
Maximum40
Range39
Interquartile range (IQR)7

Descriptive statistics

Standard deviation8.6909915
Coefficient of variation (CV)0.81478045
Kurtosis5.8512814
Mean10.666667
Median Absolute Deviation (MAD)4
Skewness2.1147306
Sum224
Variance75.533333
MonotonicityNot monotonic
2024-03-13T09:12:17.172256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
6 3
14.3%
5 2
 
9.5%
10 2
 
9.5%
9 2
 
9.5%
15 2
 
9.5%
8 1
 
4.8%
1 1
 
4.8%
12 1
 
4.8%
13 1
 
4.8%
23 1
 
4.8%
Other values (5) 5
23.8%
ValueCountFrequency (%)
1 1
 
4.8%
2 1
 
4.8%
3 1
 
4.8%
5 2
9.5%
6 3
14.3%
7 1
 
4.8%
8 1
 
4.8%
9 2
9.5%
10 2
9.5%
12 1
 
4.8%
ValueCountFrequency (%)
40 1
4.8%
23 1
4.8%
19 1
4.8%
15 2
9.5%
13 1
4.8%
12 1
4.8%
10 2
9.5%
9 2
9.5%
8 1
4.8%
7 1
4.8%
Distinct20
Distinct (%)95.2%
Missing0
Missing (%)0.0%
Memory size300.0 B
2024-03-13T09:12:17.312543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.1428571
Min length2

Characters and Unicode

Total characters66
Distinct characters33
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)90.5%

Sample

1st row김*완
2nd row강*원
3rd row이*형
4th row강*희
5th row김*
ValueCountFrequency (%)
김*만 2
 
9.5%
김*완 1
 
4.8%
강*원 1
 
4.8%
이*길 1
 
4.8%
박*순 1
 
4.8%
정*화+강*훈 1
 
4.8%
박*주 1
 
4.8%
염*원 1
 
4.8%
홍*렬 1
 
4.8%
김*기 1
 
4.8%
Other values (10) 10
47.6%
2024-03-13T09:12:17.528002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 22
33.3%
5
 
7.6%
4
 
6.1%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
1
 
1.5%
Other values (23) 23
34.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43
65.2%
Other Punctuation 22
33.3%
Math Symbol 1
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
 
11.6%
4
 
9.3%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
1
 
2.3%
1
 
2.3%
1
 
2.3%
Other values (21) 21
48.8%
Other Punctuation
ValueCountFrequency (%)
* 22
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43
65.2%
Common 23
34.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
 
11.6%
4
 
9.3%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
1
 
2.3%
1
 
2.3%
1
 
2.3%
Other values (21) 21
48.8%
Common
ValueCountFrequency (%)
* 22
95.7%
+ 1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43
65.2%
ASCII 23
34.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 22
95.7%
+ 1
 
4.3%
Hangul
ValueCountFrequency (%)
5
 
11.6%
4
 
9.3%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
1
 
2.3%
1
 
2.3%
1
 
2.3%
Other values (21) 21
48.8%

위도
Real number (ℝ)

Distinct19
Distinct (%)90.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.048028
Minimum34.956581
Maximum35.183322
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size321.0 B
2024-03-13T09:12:17.643744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum34.956581
5-th percentile34.987052
Q135.005196
median35.034909
Q335.07642
95-th percentile35.171637
Maximum35.183322
Range0.22674099
Interquartile range (IQR)0.0712239

Descriptive statistics

Standard deviation0.060092237
Coefficient of variation (CV)0.0017145683
Kurtosis0.52930021
Mean35.048028
Median Absolute Deviation (MAD)0.03243054
Skewness1.0089196
Sum736.00858
Variance0.003611077
MonotonicityNot monotonic
2024-03-13T09:12:17.752219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
35.07642038 2
 
9.5%
35.00247896 2
 
9.5%
35.03525011 1
 
4.8%
34.98705151 1
 
4.8%
35.17163722 1
 
4.8%
35.0349095 1
 
4.8%
35.00519648 1
 
4.8%
35.04129426 1
 
4.8%
35.04693082 1
 
4.8%
35.14767171 1
 
4.8%
Other values (9) 9
42.9%
ValueCountFrequency (%)
34.95658101 1
4.8%
34.98705151 1
4.8%
34.9940342 1
4.8%
35.00247896 2
9.5%
35.00519648 1
4.8%
35.01248093 1
4.8%
35.01302294 1
4.8%
35.022788 1
4.8%
35.03431575 1
4.8%
35.0349095 1
4.8%
ValueCountFrequency (%)
35.183322 1
4.8%
35.17163722 1
4.8%
35.14767171 1
4.8%
35.088941 1
4.8%
35.07642038 2
9.5%
35.07535755 1
4.8%
35.04693082 1
4.8%
35.04129426 1
4.8%
35.03525011 1
4.8%
35.0349095 1
4.8%

경도
Real number (ℝ)

Distinct19
Distinct (%)90.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.82642
Minimum127.72791
Maximum127.90139
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size321.0 B
2024-03-13T09:12:17.854631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum127.72791
5-th percentile127.76977
Q1127.78814
median127.81226
Q3127.87941
95-th percentile127.89372
Maximum127.90139
Range0.173483
Interquartile range (IQR)0.0912642

Descriptive statistics

Standard deviation0.052807856
Coefficient of variation (CV)0.00041312162
Kurtosis-1.3150019
Mean127.82642
Median Absolute Deviation (MAD)0.0413665
Skewness0.021576231
Sum2684.3547
Variance0.0027886697
MonotonicityNot monotonic
2024-03-13T09:12:17.949609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
127.7708958 2
 
9.5%
127.8937162 2
 
9.5%
127.874839 1
 
4.8%
127.8260387 1
 
4.8%
127.8824741 1
 
4.8%
127.7911946 1
 
4.8%
127.8111576 1
 
4.8%
127.7890812 1
 
4.8%
127.7881412 1
 
4.8%
127.8794054 1
 
4.8%
Other values (9) 9
42.9%
ValueCountFrequency (%)
127.727905 1
4.8%
127.7697696 1
4.8%
127.7708958 2
9.5%
127.7755106 1
4.8%
127.7881412 1
4.8%
127.7890812 1
4.8%
127.7911946 1
4.8%
127.8111576 1
4.8%
127.8122046 1
4.8%
127.8122623 1
4.8%
ValueCountFrequency (%)
127.901388 1
4.8%
127.8937162 2
9.5%
127.887526 1
4.8%
127.8824741 1
4.8%
127.8794054 1
4.8%
127.8774759 1
4.8%
127.874839 1
4.8%
127.8260387 1
4.8%
127.8191348 1
4.8%
127.8122623 1
4.8%

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)100.0%
Memory size321.0 B

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)100.0%
Memory size321.0 B

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)100.0%
Memory size321.0 B

Interactions

2024-03-13T09:12:12.892530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:11.809783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:12.097870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:12.577443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:12.977416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:11.873042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:12.169724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:12.665996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:13.056487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:11.950432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:12.430609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:12.755895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:13.120762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:12.020365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:12.494736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T09:12:12.818371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-13T09:12:18.053655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명유형사업자등록번호사업자 업태업종주력사업전화번호도로명 주소팩스설립일자종업원수대표자 이름위도경도
업체명1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
유형1.0001.0000.0001.0000.8521.0001.0001.0001.0000.7371.0000.0400.000
사업자등록번호1.0000.0001.0000.8960.8241.0001.0001.0001.0000.0001.0000.0000.099
사업자 업태업종1.0001.0000.8961.0001.0000.9661.0001.0001.0000.7630.9690.0000.763
주력사업1.0000.8520.8241.0001.0000.9491.0001.0001.0000.1720.9460.8520.538
전화번호1.0001.0001.0000.9660.9491.0001.0001.0001.0001.0001.0001.0001.000
도로명 주소1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
팩스1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
설립일자1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
종업원수1.0000.7370.0000.7630.1721.0001.0001.0001.0001.0001.0000.0000.000
대표자 이름1.0001.0001.0000.9690.9461.0001.0001.0001.0001.0001.0001.0001.000
위도1.0000.0400.0000.0000.8521.0001.0001.0001.0000.0001.0001.0000.927
경도1.0000.0000.0990.7630.5381.0001.0001.0001.0000.0001.0000.9271.000
2024-03-13T09:12:18.373916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자등록번호종업원수위도경도유형
사업자등록번호1.0000.019-0.1610.1170.000
종업원수0.0191.000-0.1280.2150.681
위도-0.161-0.1281.000-0.1720.000
경도0.1170.215-0.1721.0000.000
유형0.0000.6810.0000.0001.000

Missing values

2024-03-13T09:12:13.229484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T09:12:13.387925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-13T09:12:13.482275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업체명유형사업자등록번호업종사업자 업태업종주력사업전화번호도로명 주소팩스설립일자종업원수대표자 이름위도경도Unnamed: 14Unnamed: 15Unnamed: 16
0(주)남광하동공장법인6148501329제조업코크스 및 관련제품 제조업+석탄화학계 화합물 및 기타 기초 유기화학 물질 제조업+금속류 해체 및 선별업+금속류 원료 재생업코크스 및 연탄 제조업055-884-0671경상남도 하동군 적량면 한옥정길 36-22 ((주)남광)055-884-06731999-11-025김*완35.07642127.770896<NA><NA><NA>
1(주)남양산업법인4588701780제조업그 외 기타 콘크리트 제품 및 유사 제품 제조업+그 외 기타 분류 안된 비금속 광물제품 제조업그 외 기타 콘크리트 제품 및 유사 제품 제조업055-883-7111경상남도 하동군 양보면 진양로 356-95 외 1필지055-883-71122020-05-066강*원35.03525127.874839<NA><NA><NA>
2(주)대덕화학법인6148102539제조업고무 패킹류 제조업+ 산업용 그 외 비경화 고무제품 제조업+기어 및 동력전달장치 제조업고무 패킹류 제조업055-883-5641경상남도 하동군 고전면 농공단지길 35 (대덕화학)055-883-56431994-07-056이*형35.013023127.812205<NA><NA><NA>
3(주)예미담황토법인6138171300제조업점토 벽돌, 블록 및 유사 비내화 요업제품 제조업+포장용 플라스틱 성형용기 제조업점토 벽돌, 블록 및 유사 비내화 요업제품 제조업055-883-5338경상남도 하동군 옥종면 병천리 957-13번지 외 2필지055-883-53322018-07-068강*희35.183322127.887526<NA><NA><NA>
4(주)참솔산업법인4098191070제조업비내화 모르타르 제조업비내화 모르타르 제조업055-884-0674경상남도 하동군 적량면 한옥정길 36-22055-883-06732017-01-251김*35.07642127.770896<NA><NA><NA>
5(주)창원법인6138168028제조업레미콘 제조업레미콘 제조업055-882-1122경상남도 하동군 금성면 산업로 480 (주)창원 외 3필지055-882-77892006-02-2112양*일34.956581127.775511<NA><NA><NA>
6(주)태봉철강법인6138169935제조업육상 금속 골조 구조재 제조업+수상 금속 골조 구조재 제조업육상 금속 골조 구조재 제조업055-883-9902경상남도 하동군 하동읍 화심리 703-1번지 외 3필지<NA>2015-01-295임*범35.088941127.727905<NA><NA><NA>
7(주)토지법인6138152813제조업플라스터 제품 제조업 콘크리트 관 및 기타 구조용 콘크리트제품 제조업+ 부직포 및 펠트 제조업+표면처리 및 적층 직물 제조업+콘크리트 타일, 기와, 벽돌 및 블록 제조업플라스터 제품 제조업 콘크리트 관 및 기타 구조용 콘크리트제품 제조업055-883-1956경상남도 하동군 진교면 진양로 280-54<NA>2009-12-2910차*정35.034316127.877476<NA><NA><NA>
8대하산업(주)법인6138119769제조업레미콘 제조업+아스팔트 콘크리트 및 혼합제품 제조업레미콘 제조업055-883-5321경상남도 하동군 금남면 섬진강대로 863-10 외 3필지055-883-53252000-06-2113강*환34.994034127.819135<NA><NA><NA>
9부성산업 주식회사법인6138122696제조업일반 제재업+표면 가공목재 및 특정 목적용 제재목 제조업+목재 보존, 방부처리, 도장 및 유사 처리업+강화 및 재생 목재 제조업+기타 플라스틱 제품 제조업+구조용 금속 판제품 및 공작물 제조업일반 제재업055-883-1521경상남도 하동군 적량면 한옥정길 36-11055-883-15232017-09-199최*우35.075358127.76977<NA><NA><NA>
업체명유형사업자등록번호업종사업자 업태업종주력사업전화번호도로명 주소팩스설립일자종업원수대표자 이름위도경도Unnamed: 14Unnamed: 15Unnamed: 16
11유진제재소개인6140153055제조업일반 제재업일반 제재업055-884-6560경상남도 하동군 진교면 진교리 124-24번지055-884-65611998-12-232김*기35.022788127.901388<NA><NA><NA>
12이누스주식회사(하동지점)법인2888501246제조업타일 및 유사 비내화 요업제품 제조업타일 및 유사 비내화 요업제품 제조업055-883-9490경상남도 하동군 옥종면 옥단로 533055-883-94931998-07-1515홍*렬35.147672127.879405<NA><NA><NA>
13인포피알 주식회사법인6518100035제조업구조용 금속 판제품 및 공작물 제조업+금속 문, 창, 셔터 및 관련제품 제조업구조용 금속 판제품 및 공작물 제조업055-884-0805경상남도 하동군 고전면 사막1길 26<NA>2019-05-246염*원35.046931127.788141<NA><NA><NA>
14주식회사 가온법인6138172648제조업구조용 금속 판제품 및 공작물 제조업+산업용 송풍기 및 배기장치 제조업구조용 금속 판제품 및 공작물 제조업055-884-1930경상남도 하동군 고전면 공설운동장로 475-7 외 3필지055-884-19312017-08-027박*주35.041294127.789081<NA><NA><NA>
15주식회사 경보법인6138131915제조업레미콘 제조업+아스팔트 콘크리트 및 혼합제품 제조업레미콘 제조업055-884-8282경상남도 하동군 고전면 하동읍성로 97-26055-884-03231996-11-0819정*화+강*훈35.005196127.811158<NA><NA><NA>
16주식회사 이노테크원법인6948600376제조업육상 금속 골조 구조재 제조업육상 금속 골조 구조재 제조업<NA>경상남도 하동군 고전면 공설운동장로 567<NA>2018-08-0115박*순35.034909127.791195<NA><NA><NA>
17케이엠산업 주식회사법인8148702505제조업레미콘 제조업레미콘 제조업055-883-8001경상남도 하동군 진교면 달구지길 95-47 (진교면)055-883-80061991-06-1010김*만35.002479127.893716<NA><NA><NA>
18케이엠산업 주식회사 2공장법인8148702505제조업아스팔트 콘크리트 및 혼합제품 제조업아스팔트 콘크리트 및 혼합제품 제조업055-883-8001경상남도 하동군 진교면 달구지길 95-47<NA>2000-07-289김*만35.002479127.893716<NA><NA><NA>
19코코세라믹(주)법인6138153471제조업점토 벽돌, 블록 및 유사 비내화 요업제품 제조업+ 타일 및 유사 비내화 요업제품 제조업점토 벽돌, 블록 및 유사 비내화 요업제품 제조업055-883-6206경상남도 하동군 옥종면 옥단로 820 ((주)풍성세라믹)055-883-70682003-05-2140이*길35.171637127.882474<NA><NA><NA>
20홀츠바우개인6132276972제조업놀이터용 장비 제조업+간판 및 광고물 제조업놀이터용 장비 제조업055-882-8138경상남도 하동군 금남면 경제산업로 48<NA>2019-01-113문*상34.987052127.826039<NA><NA><NA>