Overview

Dataset statistics

Number of variables5
Number of observations108
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.5 KiB
Average record size in memory42.2 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description대구 지역 유원시설업 현황정보에 관한 공공데이터로 업종분류(종합, 일반, 기타), 업체명, 소재지 등의 정보를 제공합니다.
Author대구광역시
URLhttps://www.data.go.kr/data/15054188/fileData.do

Alerts

번호 is highly overall correlated with 구군High correlation
구군 is highly overall correlated with 번호High correlation
업종중분류 is highly imbalanced (52.3%)Imbalance
번호 has unique valuesUnique

Reproduction

Analysis started2024-04-29 22:33:25.897231
Analysis finished2024-04-29 22:33:27.841377
Duration1.94 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct108
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54.5
Minimum1
Maximum108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-04-30T07:33:27.919047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.35
Q127.75
median54.5
Q381.25
95-th percentile102.65
Maximum108
Range107
Interquartile range (IQR)53.5

Descriptive statistics

Standard deviation31.32092
Coefficient of variation (CV)0.57469577
Kurtosis-1.2
Mean54.5
Median Absolute Deviation (MAD)27
Skewness0
Sum5886
Variance981
MonotonicityStrictly increasing
2024-04-30T07:33:28.049424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
70 1
 
0.9%
81 1
 
0.9%
80 1
 
0.9%
79 1
 
0.9%
78 1
 
0.9%
77 1
 
0.9%
76 1
 
0.9%
75 1
 
0.9%
74 1
 
0.9%
Other values (98) 98
90.7%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
108 1
0.9%
107 1
0.9%
106 1
0.9%
105 1
0.9%
104 1
0.9%
103 1
0.9%
102 1
0.9%
101 1
0.9%
100 1
0.9%
99 1
0.9%

구군
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size996.0 B
동구
23 
달서구
21 
달성군
21 
북구
18 
중구
11 
Other values (4)
14 

Length

Max length3
Median length2
Mean length2.4907407
Min length2

Unique

Unique2 ?
Unique (%)1.9%

Sample

1st row중구
2nd row중구
3rd row중구
4th row중구
5th row중구

Common Values

ValueCountFrequency (%)
동구 23
21.3%
달서구 21
19.4%
달성군 21
19.4%
북구 18
16.7%
중구 11
10.2%
수성구 10
9.3%
서구 2
 
1.9%
남구 1
 
0.9%
군위군 1
 
0.9%

Length

2024-04-30T07:33:28.167038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:33:28.277812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
동구 23
21.3%
달서구 21
19.4%
달성군 21
19.4%
북구 18
16.7%
중구 11
10.2%
수성구 10
9.3%
서구 2
 
1.9%
남구 1
 
0.9%
군위군 1
 
0.9%

업종중분류
Categorical

IMBALANCE 

Distinct3
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size996.0 B
기타유원시설업
89 
일반유원시설업
17 
종합유원시설업
 
2

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반유원시설업
2nd row일반유원시설업
3rd row일반유원시설업
4th row일반유원시설업
5th row기타유원시설업

Common Values

ValueCountFrequency (%)
기타유원시설업 89
82.4%
일반유원시설업 17
 
15.7%
종합유원시설업 2
 
1.9%

Length

2024-04-30T07:33:28.398277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:33:28.480881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타유원시설업 89
82.4%
일반유원시설업 17
 
15.7%
종합유원시설업 2
 
1.9%
Distinct106
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size996.0 B
2024-04-30T07:33:28.720378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length15
Mean length8.1666667
Min length2

Characters and Unicode

Total characters882
Distinct characters238
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique104 ?
Unique (%)96.3%

Sample

1st row㈜스파크랜드
2nd rowIn디팡
3rd row대구노리존
4th row크레이지팡팡
5th row와이에스피 테크(팡팡)
ValueCountFrequency (%)
대구점 4
 
2.5%
주식회사 3
 
1.9%
스카이점핑랜드 3
 
1.9%
리틀비틀 2
 
1.3%
대구 2
 
1.3%
키즈카페 2
 
1.3%
맥스라이더 2
 
1.3%
노리파크 2
 
1.3%
대구수성점 2
 
1.3%
대구기상과학관 2
 
1.3%
Other values (135) 135
84.9%
2024-04-30T07:33:29.115541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
51
 
5.8%
40
 
4.5%
35
 
4.0%
29
 
3.3%
24
 
2.7%
22
 
2.5%
20
 
2.3%
19
 
2.2%
19
 
2.2%
19
 
2.2%
Other values (228) 604
68.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 756
85.7%
Space Separator 51
 
5.8%
Lowercase Letter 18
 
2.0%
Uppercase Letter 18
 
2.0%
Open Punctuation 15
 
1.7%
Close Punctuation 15
 
1.7%
Other Symbol 5
 
0.6%
Decimal Number 4
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
 
5.3%
35
 
4.6%
29
 
3.8%
24
 
3.2%
22
 
2.9%
20
 
2.6%
19
 
2.5%
19
 
2.5%
19
 
2.5%
18
 
2.4%
Other values (195) 511
67.6%
Lowercase Letter
ValueCountFrequency (%)
n 4
22.2%
t 2
11.1%
u 2
11.1%
e 1
 
5.6%
a 1
 
5.6%
d 1
 
5.6%
l 1
 
5.6%
h 1
 
5.6%
f 1
 
5.6%
c 1
 
5.6%
Other values (3) 3
16.7%
Uppercase Letter
ValueCountFrequency (%)
O 2
11.1%
A 2
11.1%
D 2
11.1%
R 2
11.1%
I 2
11.1%
S 1
 
5.6%
M 1
 
5.6%
F 1
 
5.6%
J 1
 
5.6%
Z 1
 
5.6%
Other values (3) 3
16.7%
Decimal Number
ValueCountFrequency (%)
4 2
50.0%
2 1
25.0%
5 1
25.0%
Space Separator
ValueCountFrequency (%)
51
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 761
86.3%
Common 85
 
9.6%
Latin 36
 
4.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
 
5.3%
35
 
4.6%
29
 
3.8%
24
 
3.2%
22
 
2.9%
20
 
2.6%
19
 
2.5%
19
 
2.5%
19
 
2.5%
18
 
2.4%
Other values (196) 516
67.8%
Latin
ValueCountFrequency (%)
n 4
 
11.1%
O 2
 
5.6%
A 2
 
5.6%
D 2
 
5.6%
R 2
 
5.6%
I 2
 
5.6%
t 2
 
5.6%
u 2
 
5.6%
e 1
 
2.8%
a 1
 
2.8%
Other values (16) 16
44.4%
Common
ValueCountFrequency (%)
51
60.0%
( 15
 
17.6%
) 15
 
17.6%
4 2
 
2.4%
2 1
 
1.2%
5 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 756
85.7%
ASCII 121
 
13.7%
None 5
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
51
42.1%
( 15
 
12.4%
) 15
 
12.4%
n 4
 
3.3%
O 2
 
1.7%
A 2
 
1.7%
D 2
 
1.7%
4 2
 
1.7%
R 2
 
1.7%
I 2
 
1.7%
Other values (22) 24
19.8%
Hangul
ValueCountFrequency (%)
40
 
5.3%
35
 
4.6%
29
 
3.8%
24
 
3.2%
22
 
2.9%
20
 
2.6%
19
 
2.5%
19
 
2.5%
19
 
2.5%
18
 
2.4%
Other values (195) 511
67.6%
None
ValueCountFrequency (%)
5
100.0%
Distinct104
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Memory size996.0 B
2024-04-30T07:33:29.427081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length34
Mean length28.87037
Min length17

Characters and Unicode

Total characters3118
Distinct characters192
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)93.5%

Sample

1st row대구광역시 중구 동성로6길 61, 7층 (공평동)
2nd row대구광역시 중구 동성로3길 20 (삼덕동1가)
3rd row대구광역시 중구 동성로 36-2 (동성로2가)
4th row대구광역시 중구 진골목길 32 (남일동)
5th row대구광역시 중구 달구벌대로 450길 18 (대봉동)
ValueCountFrequency (%)
대구광역시 108
 
17.3%
동구 23
 
3.7%
달성군 21
 
3.4%
달서구 21
 
3.4%
북구 18
 
2.9%
중구 11
 
1.8%
수성구 10
 
1.6%
4층 9
 
1.4%
2층 7
 
1.1%
현풍읍 7
 
1.1%
Other values (267) 390
62.4%
2024-04-30T07:33:29.936616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
518
 
16.6%
216
 
6.9%
145
 
4.7%
128
 
4.1%
114
 
3.7%
113
 
3.6%
108
 
3.5%
105
 
3.4%
1 103
 
3.3%
, 98
 
3.1%
Other values (182) 1470
47.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1851
59.4%
Space Separator 518
 
16.6%
Decimal Number 493
 
15.8%
Other Punctuation 98
 
3.1%
Close Punctuation 67
 
2.1%
Open Punctuation 67
 
2.1%
Dash Punctuation 16
 
0.5%
Uppercase Letter 6
 
0.2%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
216
 
11.7%
145
 
7.8%
128
 
6.9%
114
 
6.2%
113
 
6.1%
108
 
5.8%
105
 
5.7%
65
 
3.5%
53
 
2.9%
49
 
2.6%
Other values (160) 755
40.8%
Decimal Number
ValueCountFrequency (%)
1 103
20.9%
2 72
14.6%
0 67
13.6%
3 51
10.3%
4 50
10.1%
6 38
 
7.7%
8 33
 
6.7%
9 30
 
6.1%
5 28
 
5.7%
7 21
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
S 1
16.7%
D 1
16.7%
B 1
16.7%
X 1
16.7%
J 1
16.7%
Y 1
16.7%
Space Separator
ValueCountFrequency (%)
518
100.0%
Other Punctuation
ValueCountFrequency (%)
, 98
100.0%
Close Punctuation
ValueCountFrequency (%)
) 67
100.0%
Open Punctuation
ValueCountFrequency (%)
( 67
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1851
59.4%
Common 1261
40.4%
Latin 6
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
216
 
11.7%
145
 
7.8%
128
 
6.9%
114
 
6.2%
113
 
6.1%
108
 
5.8%
105
 
5.7%
65
 
3.5%
53
 
2.9%
49
 
2.6%
Other values (160) 755
40.8%
Common
ValueCountFrequency (%)
518
41.1%
1 103
 
8.2%
, 98
 
7.8%
2 72
 
5.7%
0 67
 
5.3%
) 67
 
5.3%
( 67
 
5.3%
3 51
 
4.0%
4 50
 
4.0%
6 38
 
3.0%
Other values (6) 130
 
10.3%
Latin
ValueCountFrequency (%)
S 1
16.7%
D 1
16.7%
B 1
16.7%
X 1
16.7%
J 1
16.7%
Y 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1851
59.4%
ASCII 1267
40.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
518
40.9%
1 103
 
8.1%
, 98
 
7.7%
2 72
 
5.7%
0 67
 
5.3%
) 67
 
5.3%
( 67
 
5.3%
3 51
 
4.0%
4 50
 
3.9%
6 38
 
3.0%
Other values (12) 136
 
10.7%
Hangul
ValueCountFrequency (%)
216
 
11.7%
145
 
7.8%
128
 
6.9%
114
 
6.2%
113
 
6.1%
108
 
5.8%
105
 
5.7%
65
 
3.5%
53
 
2.9%
49
 
2.6%
Other values (160) 755
40.8%

Interactions

2024-04-30T07:33:27.482774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T07:33:30.043044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호구군업종중분류
번호1.0000.9300.295
구군0.9301.0000.237
업종중분류0.2950.2371.000
2024-04-30T07:33:30.148859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종중분류구군
업종중분류1.0000.100
구군0.1001.000
2024-04-30T07:33:30.234784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호구군업종중분류
번호1.0000.7680.176
구군0.7681.0000.100
업종중분류0.1760.1001.000

Missing values

2024-04-30T07:33:27.680594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T07:33:27.780254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호구군업종중분류업체명소재지
01중구일반유원시설업㈜스파크랜드대구광역시 중구 동성로6길 61, 7층 (공평동)
12중구일반유원시설업In디팡대구광역시 중구 동성로3길 20 (삼덕동1가)
23중구일반유원시설업대구노리존대구광역시 중구 동성로 36-2 (동성로2가)
34중구일반유원시설업크레이지팡팡대구광역시 중구 진골목길 32 (남일동)
45중구기타유원시설업와이에스피 테크(팡팡)대구광역시 중구 달구벌대로 450길 18 (대봉동)
56중구기타유원시설업통콩팡플레이키즈대구광역시 중구 남산로 48, 서진빌딩 4층 (남산동)
67중구기타유원시설업유한회사 스파크대구광역시 중구 동성로6길 61, 5층 (공평동)
78중구기타유원시설업주식회사 짱대구광역시 중구 동성로6길 61, 태왕스파크 701호 (공평동)
89중구기타유원시설업짱오락실대구광역시 중구 동성로6길 36, 1층 (공평동)
910중구기타유원시설업짱오락실(대구점)대구광역시 중구 동성로 35-1 (동성로2가)
번호구군업종중분류업체명소재지
9899달성군기타유원시설업아이방방대구광역시 달성군 유가읍 테크노상업로 98, 901호
99100달성군기타유원시설업키즈몬대구광역시 달성군 현풍읍 테크노상업로 48, 9층
100101달성군기타유원시설업잭슨나인스 대구점대구광역시 달성군 유가읍 테크노상업로 120, 311호
101102달성군기타유원시설업퍼펙트플레이카페대구광역시 달성군 현풍읍 테크노대로 40, 401호
102103달성군기타유원시설업주니어카페대구광역시 달성군 현풍읍 테크노대로 40, 402호
103104달성군기타유원시설업아틀란티스카페대구광역시 달성군 현풍읍 테크노대로 40, 303호
104105달성군기타유원시설업미니특공대카페대구광역시 달성군 현풍읍 테크노대로 40, 404호
105106달성군기타유원시설업나인로드 피제리아(현풍테크노점)대구광역시 달성군 유가읍 테크노순환로 12길 33, 201~2호
106107달성군기타유원시설업네버랜드대구광역시 달성군 유가읍 테크노순환로 2층
107108군위군일반유원시설업해피데이환타지월드대구광역시 군위군 의흥면 일연테마로 100