Overview

Dataset statistics

Number of variables5
Number of observations132
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.4 KiB
Average record size in memory42.0 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description경상남도 남해군에 현재 등록된 숙박업소현황입니다. 숙박업소의 업종명, 업소명, 업소소재지(도로명주소), 전화번호를 포함한 정보입니다.
Author경상남도 남해군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15065507

Alerts

연번 is highly overall correlated with 업종명High correlation
업종명 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
업소명 has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:25:46.974536
Analysis finished2023-12-11 00:25:47.514339
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct132
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean66.5
Minimum1
Maximum132
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-11T09:25:47.608644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.55
Q133.75
median66.5
Q399.25
95-th percentile125.45
Maximum132
Range131
Interquartile range (IQR)65.5

Descriptive statistics

Standard deviation38.249183
Coefficient of variation (CV)0.57517568
Kurtosis-1.2
Mean66.5
Median Absolute Deviation (MAD)33
Skewness0
Sum8778
Variance1463
MonotonicityStrictly increasing
2023-12-11T09:25:47.808573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
85 1
 
0.8%
99 1
 
0.8%
98 1
 
0.8%
97 1
 
0.8%
96 1
 
0.8%
95 1
 
0.8%
94 1
 
0.8%
93 1
 
0.8%
92 1
 
0.8%
Other values (122) 122
92.4%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
132 1
0.8%
131 1
0.8%
130 1
0.8%
129 1
0.8%
128 1
0.8%
127 1
0.8%
126 1
0.8%
125 1
0.8%
124 1
0.8%
123 1
0.8%

업종명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
숙박업(생활)
67 
숙박업(일반)
65 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
숙박업(생활) 67
50.8%
숙박업(일반) 65
49.2%

Length

2023-12-11T09:25:47.970917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:25:48.084987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
숙박업(생활 67
50.8%
숙박업(일반 65
49.2%

업소명
Text

UNIQUE 

Distinct132
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-11T09:25:48.351862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length5.7954545
Min length2

Characters and Unicode

Total characters765
Distinct characters211
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique132 ?
Unique (%)100.0%

Sample

1st row재두장여관
2nd row해주여관
3rd row한려산장
4th rowK모텔
5th row영남장여관
ValueCountFrequency (%)
펜션 2
 
1.3%
모텔 2
 
1.3%
벨비앙펜션 2
 
1.3%
eg미조힐링리조트 2
 
1.3%
9 2
 
1.3%
큰솔펜션 1
 
0.7%
보물섬캠핑장 1
 
0.7%
남해베네치아리조트 1
 
0.7%
남해갯내음펜션 1
 
0.7%
시아도 1
 
0.7%
Other values (137) 137
90.1%
2023-12-11T09:25:48.858285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
49
 
6.4%
44
 
5.8%
40
 
5.2%
40
 
5.2%
24
 
3.1%
23
 
3.0%
20
 
2.6%
19
 
2.5%
16
 
2.1%
14
 
1.8%
Other values (201) 476
62.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 703
91.9%
Space Separator 20
 
2.6%
Lowercase Letter 10
 
1.3%
Uppercase Letter 10
 
1.3%
Decimal Number 9
 
1.2%
Open Punctuation 4
 
0.5%
Close Punctuation 4
 
0.5%
Dash Punctuation 3
 
0.4%
Other Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
7.0%
44
 
6.3%
40
 
5.7%
40
 
5.7%
24
 
3.4%
23
 
3.3%
19
 
2.7%
16
 
2.3%
14
 
2.0%
14
 
2.0%
Other values (177) 420
59.7%
Lowercase Letter
ValueCountFrequency (%)
e 3
30.0%
t 2
20.0%
r 1
 
10.0%
l 1
 
10.0%
s 1
 
10.0%
a 1
 
10.0%
o 1
 
10.0%
Uppercase Letter
ValueCountFrequency (%)
G 2
20.0%
C 2
20.0%
E 2
20.0%
R 1
10.0%
K 1
10.0%
J 1
10.0%
F 1
10.0%
Decimal Number
ValueCountFrequency (%)
2 3
33.3%
9 3
33.3%
1 2
22.2%
5 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
· 1
50.0%
. 1
50.0%
Space Separator
ValueCountFrequency (%)
20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 703
91.9%
Common 42
 
5.5%
Latin 20
 
2.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
7.0%
44
 
6.3%
40
 
5.7%
40
 
5.7%
24
 
3.4%
23
 
3.3%
19
 
2.7%
16
 
2.3%
14
 
2.0%
14
 
2.0%
Other values (177) 420
59.7%
Latin
ValueCountFrequency (%)
e 3
15.0%
t 2
10.0%
G 2
10.0%
C 2
10.0%
E 2
10.0%
r 1
 
5.0%
R 1
 
5.0%
l 1
 
5.0%
K 1
 
5.0%
s 1
 
5.0%
Other values (4) 4
20.0%
Common
ValueCountFrequency (%)
20
47.6%
( 4
 
9.5%
) 4
 
9.5%
- 3
 
7.1%
2 3
 
7.1%
9 3
 
7.1%
1 2
 
4.8%
5 1
 
2.4%
· 1
 
2.4%
. 1
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 703
91.9%
ASCII 61
 
8.0%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
49
 
7.0%
44
 
6.3%
40
 
5.7%
40
 
5.7%
24
 
3.4%
23
 
3.3%
19
 
2.7%
16
 
2.3%
14
 
2.0%
14
 
2.0%
Other values (177) 420
59.7%
ASCII
ValueCountFrequency (%)
20
32.8%
( 4
 
6.6%
) 4
 
6.6%
- 3
 
4.9%
2 3
 
4.9%
e 3
 
4.9%
9 3
 
4.9%
t 2
 
3.3%
G 2
 
3.3%
C 2
 
3.3%
Other values (13) 15
24.6%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct128
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-11T09:25:49.271082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length40
Mean length24.742424
Min length18

Characters and Unicode

Total characters3266
Distinct characters83
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique124 ?
Unique (%)93.9%

Sample

1st row경상남도 남해군 상주면 남해대로 918-6
2nd row경상남도 남해군 상주면 상주로 17-6
3rd row경상남도 남해군 상주면 남해대로 591-56
4th row경상남도 남해군 남해읍 화전로 52-9
5th row경상남도 남해군 남해읍 화전로38번길 28
ValueCountFrequency (%)
경상남도 132
18.8%
남해군 132
18.8%
미조면 24
 
3.4%
창선면 22
 
3.1%
삼동면 20
 
2.8%
남해읍 17
 
2.4%
남해대로 15
 
2.1%
상주면 14
 
2.0%
동부대로 14
 
2.0%
남면 12
 
1.7%
Other values (187) 301
42.8%
2023-12-11T09:25:49.764280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
581
17.8%
334
 
10.2%
172
 
5.3%
151
 
4.6%
1 136
 
4.2%
132
 
4.0%
132
 
4.0%
132
 
4.0%
129
 
3.9%
122
 
3.7%
Other values (73) 1245
38.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1911
58.5%
Decimal Number 669
 
20.5%
Space Separator 581
 
17.8%
Dash Punctuation 73
 
2.2%
Open Punctuation 15
 
0.5%
Close Punctuation 15
 
0.5%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
334
17.5%
172
 
9.0%
151
 
7.9%
132
 
6.9%
132
 
6.9%
132
 
6.9%
129
 
6.8%
122
 
6.4%
78
 
4.1%
56
 
2.9%
Other values (58) 473
24.8%
Decimal Number
ValueCountFrequency (%)
1 136
20.3%
2 119
17.8%
3 88
13.2%
5 63
9.4%
4 52
 
7.8%
9 48
 
7.2%
8 44
 
6.6%
7 43
 
6.4%
6 40
 
6.0%
0 36
 
5.4%
Space Separator
ValueCountFrequency (%)
581
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 73
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1911
58.5%
Common 1355
41.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
334
17.5%
172
 
9.0%
151
 
7.9%
132
 
6.9%
132
 
6.9%
132
 
6.9%
129
 
6.8%
122
 
6.4%
78
 
4.1%
56
 
2.9%
Other values (58) 473
24.8%
Common
ValueCountFrequency (%)
581
42.9%
1 136
 
10.0%
2 119
 
8.8%
3 88
 
6.5%
- 73
 
5.4%
5 63
 
4.6%
4 52
 
3.8%
9 48
 
3.5%
8 44
 
3.2%
7 43
 
3.2%
Other values (5) 108
 
8.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1911
58.5%
ASCII 1355
41.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
581
42.9%
1 136
 
10.0%
2 119
 
8.8%
3 88
 
6.5%
- 73
 
5.4%
5 63
 
4.6%
4 52
 
3.8%
9 48
 
3.5%
8 44
 
3.2%
7 43
 
3.2%
Other values (5) 108
 
8.0%
Hangul
ValueCountFrequency (%)
334
17.5%
172
 
9.0%
151
 
7.9%
132
 
6.9%
132
 
6.9%
132
 
6.9%
129
 
6.8%
122
 
6.4%
78
 
4.1%
56
 
2.9%
Other values (58) 473
24.8%
Distinct107
Distinct (%)81.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-11T09:25:50.065698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.984848
Min length9

Characters and Unicode

Total characters1582
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)78.0%

Sample

1st row055-862-6022
2nd row055-862-6042
3rd row000-000-0000
4th row055-864-2981
5th row055-864-2478
ValueCountFrequency (%)
000-000-0000 23
 
17.4%
055-867-6543 2
 
1.5%
055-863-0807 2
 
1.5%
055-863-0020 2
 
1.5%
055-863-5035 1
 
0.8%
055-867-2288 1
 
0.8%
055-863-5005 1
 
0.8%
055-867-6966 1
 
0.8%
055-867-7792 1
 
0.8%
055-867-2575 1
 
0.8%
Other values (97) 97
73.5%
2023-12-11T09:25:50.496052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 407
25.7%
5 265
16.8%
- 263
16.6%
8 165
10.4%
6 137
 
8.7%
7 98
 
6.2%
3 63
 
4.0%
2 59
 
3.7%
4 48
 
3.0%
1 44
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1319
83.4%
Dash Punctuation 263
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 407
30.9%
5 265
20.1%
8 165
12.5%
6 137
 
10.4%
7 98
 
7.4%
3 63
 
4.8%
2 59
 
4.5%
4 48
 
3.6%
1 44
 
3.3%
9 33
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 263
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1582
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 407
25.7%
5 265
16.8%
- 263
16.6%
8 165
10.4%
6 137
 
8.7%
7 98
 
6.2%
3 63
 
4.0%
2 59
 
3.7%
4 48
 
3.0%
1 44
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1582
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 407
25.7%
5 265
16.8%
- 263
16.6%
8 165
10.4%
6 137
 
8.7%
7 98
 
6.2%
3 63
 
4.0%
2 59
 
3.7%
4 48
 
3.0%
1 44
 
2.8%

Interactions

2023-12-11T09:25:47.218322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:25:50.593566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명
연번1.0001.000
업종명1.0001.000
2023-12-11T09:25:50.689507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명
연번1.0000.954
업종명0.9541.000

Missing values

2023-12-11T09:25:47.363705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:25:47.474965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종명업소명업소소재지전화번호
01숙박업(일반)재두장여관경상남도 남해군 상주면 남해대로 918-6055-862-6022
12숙박업(일반)해주여관경상남도 남해군 상주면 상주로 17-6055-862-6042
23숙박업(일반)한려산장경상남도 남해군 상주면 남해대로 591-56000-000-0000
34숙박업(일반)K모텔경상남도 남해군 남해읍 화전로 52-9055-864-2981
45숙박업(일반)영남장여관경상남도 남해군 남해읍 화전로38번길 28055-864-2478
56숙박업(일반)미송여관경상남도 남해군 미조면 미조로 232-4055-867-6078
67숙박업(일반)진주장여관경상남도 남해군 남해읍 화전로 52055-864-2232
78숙박업(일반)금화여관경상남도 남해군 미조면 미조로 248055-867-7001
89숙박업(일반)J모텔경상남도 남해군 남해읍 화전로38번길 25055-862-1501
910숙박업(일반)남해장여관경상남도 남해군 남해읍 화전로96번길 6-3055-864-2273
연번업종명업소명업소소재지전화번호
122123숙박업(생활)원일펜션경상남도 남해군 상주면 남해대로697번길 38 (2층)000-000-0000
123124숙박업(생활)까미노펜션(스위트)경상남도 남해군 남면 남면로1103번길 33-17 2동000-000-0000
124125숙박업(생활)까미노펜션경상남도 남해군 남면 남면로1103번길 33-25 1동 2동 3동 4동000-000-0000
125126숙박업(생활)양화황토펜션경상남도 남해군 삼동면 양화금로 329-35 (가동 나동 다동 라동 마동)000-000-0000
126127숙박업(생활)벨비앙펜션경상남도 남해군 남면 빛담촌길 22 (1동)000-000-0000
127128숙박업(생활)벨비앙펜션 별관경상남도 남해군 남면 빛담촌길 24 (1동 2동 3동)000-000-0000
128129숙박업(생활)캐슬(Castle)529경상남도 남해군 이동면 성남로 274 1동(1층) 2동 3동 4동000-000-0000
129130숙박업(생활)남해예가경상남도 남해군 미조면 미송로 448 1 2동000-000-0000
130131숙박업(생활)남해스포츠파크호텔경상남도 남해군 서면 스포츠파크길 73 (3층~8층)000-000-0000
131132숙박업(생활)레트로 9 (Retro 9)경상남도 남해군 서면 남서대로 1974-41000-000-0000