Overview

Dataset statistics

Number of variables5
Number of observations224
Missing cells107
Missing cells (%)9.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.1 KiB
Average record size in memory41.6 B

Variable types

Numeric1
Text3
DateTime1

Dataset

Description충청북도 영동군 관내에 소재해 있는 담배소매인 지정 현황으로 연번, 업소명, 주소, 전화번호, 지정일자 자료를 제공하고 있습니다.
Author충청북도 영동군
URLhttps://www.data.go.kr/data/15021224/fileData.do

Alerts

전화번호 has 107 (47.8%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:40:46.488642
Analysis finished2023-12-12 08:40:47.147515
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct224
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean112.5
Minimum1
Maximum224
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-12T17:40:47.274553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.15
Q156.75
median112.5
Q3168.25
95-th percentile212.85
Maximum224
Range223
Interquartile range (IQR)111.5

Descriptive statistics

Standard deviation64.807407
Coefficient of variation (CV)0.57606584
Kurtosis-1.2
Mean112.5
Median Absolute Deviation (MAD)56
Skewness0
Sum25200
Variance4200
MonotonicityStrictly increasing
2023-12-12T17:40:47.474169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
114 1
 
0.4%
144 1
 
0.4%
145 1
 
0.4%
146 1
 
0.4%
147 1
 
0.4%
148 1
 
0.4%
149 1
 
0.4%
150 1
 
0.4%
151 1
 
0.4%
Other values (214) 214
95.5%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
224 1
0.4%
223 1
0.4%
222 1
0.4%
221 1
0.4%
220 1
0.4%
219 1
0.4%
218 1
0.4%
217 1
0.4%
216 1
0.4%
215 1
0.4%
Distinct220
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-12T17:40:47.823023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length17
Mean length6.2901786
Min length2

Characters and Unicode

Total characters1409
Distinct characters266
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique216 ?
Unique (%)96.4%

Sample

1st row원룸슈퍼
2nd row아이비스토어
3rd row상촌마트
4th row지에스25 영동부용점
5th row세븐일레븐 영동산업고점
ValueCountFrequency (%)
씨유 9
 
3.3%
세븐일레븐 6
 
2.2%
영동농협 5
 
1.8%
지에스25 3
 
1.1%
신흥상회 2
 
0.7%
종합행정학교 2
 
0.7%
동양상회 2
 
0.7%
주식회사 2
 
0.7%
영동부용점 2
 
0.7%
영동계산점 2
 
0.7%
Other values (234) 240
87.3%
2023-12-12T17:40:48.348252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
 
3.8%
52
 
3.7%
51
 
3.6%
46
 
3.3%
46
 
3.3%
41
 
2.9%
38
 
2.7%
32
 
2.3%
31
 
2.2%
30
 
2.1%
Other values (256) 988
70.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1280
90.8%
Space Separator 52
 
3.7%
Decimal Number 32
 
2.3%
Close Punctuation 12
 
0.9%
Open Punctuation 11
 
0.8%
Uppercase Letter 10
 
0.7%
Lowercase Letter 8
 
0.6%
Dash Punctuation 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
4.2%
51
 
4.0%
46
 
3.6%
46
 
3.6%
41
 
3.2%
38
 
3.0%
32
 
2.5%
31
 
2.4%
30
 
2.3%
27
 
2.1%
Other values (232) 884
69.1%
Lowercase Letter
ValueCountFrequency (%)
f 1
12.5%
l 1
12.5%
o 1
12.5%
t 1
12.5%
r 1
12.5%
a 1
12.5%
m 1
12.5%
e 1
12.5%
Decimal Number
ValueCountFrequency (%)
2 14
43.8%
5 8
25.0%
8 3
 
9.4%
4 3
 
9.4%
1 3
 
9.4%
3 1
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
G 4
40.0%
S 2
20.0%
L 1
 
10.0%
X 1
 
10.0%
U 1
 
10.0%
C 1
 
10.0%
Space Separator
ValueCountFrequency (%)
52
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1280
90.8%
Common 111
 
7.9%
Latin 18
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
4.2%
51
 
4.0%
46
 
3.6%
46
 
3.6%
41
 
3.2%
38
 
3.0%
32
 
2.5%
31
 
2.4%
30
 
2.3%
27
 
2.1%
Other values (232) 884
69.1%
Latin
ValueCountFrequency (%)
G 4
22.2%
S 2
11.1%
L 1
 
5.6%
f 1
 
5.6%
l 1
 
5.6%
o 1
 
5.6%
X 1
 
5.6%
U 1
 
5.6%
C 1
 
5.6%
t 1
 
5.6%
Other values (4) 4
22.2%
Common
ValueCountFrequency (%)
52
46.8%
2 14
 
12.6%
) 12
 
10.8%
( 11
 
9.9%
5 8
 
7.2%
- 4
 
3.6%
8 3
 
2.7%
4 3
 
2.7%
1 3
 
2.7%
3 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1280
90.8%
ASCII 129
 
9.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
 
4.2%
51
 
4.0%
46
 
3.6%
46
 
3.6%
41
 
3.2%
38
 
3.0%
32
 
2.5%
31
 
2.4%
30
 
2.3%
27
 
2.1%
Other values (232) 884
69.1%
ASCII
ValueCountFrequency (%)
52
40.3%
2 14
 
10.9%
) 12
 
9.3%
( 11
 
8.5%
5 8
 
6.2%
- 4
 
3.1%
G 4
 
3.1%
8 3
 
2.3%
4 3
 
2.3%
1 3
 
2.3%
Other values (14) 15
 
11.6%

주소
Text

Distinct223
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-12T17:40:48.872395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length35.5
Mean length23.607143
Min length18

Characters and Unicode

Total characters5288
Distinct characters169
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique222 ?
Unique (%)99.1%

Sample

1st row충청북도 영동군 영동읍 어미실길 20-2
2nd row충청북도 영동군 영동읍 구교로 20
3rd row충청북도 영동군 상촌면 민주지산로 2995
4th row충청북도 영동군 영동읍 학산영동로 1197
5th row충청북도 영동군 영동읍 학산영동로 1211. 상회
ValueCountFrequency (%)
충청북도 224
18.9%
영동군 224
18.9%
영동읍 91
 
7.7%
황간면 24
 
2.0%
심천면 19
 
1.6%
용산면 18
 
1.5%
상촌면 15
 
1.3%
계산리 14
 
1.2%
양강면 13
 
1.1%
학산면 13
 
1.1%
Other values (360) 533
44.9%
2023-12-12T17:40:49.555669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1111
21.0%
356
 
6.7%
345
 
6.5%
225
 
4.3%
224
 
4.2%
224
 
4.2%
224
 
4.2%
224
 
4.2%
1 164
 
3.1%
133
 
2.5%
Other values (159) 2058
38.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3321
62.8%
Space Separator 1111
 
21.0%
Decimal Number 742
 
14.0%
Dash Punctuation 69
 
1.3%
Other Punctuation 23
 
0.4%
Open Punctuation 10
 
0.2%
Close Punctuation 10
 
0.2%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
356
 
10.7%
345
 
10.4%
225
 
6.8%
224
 
6.7%
224
 
6.7%
224
 
6.7%
224
 
6.7%
133
 
4.0%
116
 
3.5%
115
 
3.5%
Other values (143) 1135
34.2%
Decimal Number
ValueCountFrequency (%)
1 164
22.1%
2 94
12.7%
3 77
10.4%
4 72
9.7%
5 65
 
8.8%
9 61
 
8.2%
0 57
 
7.7%
6 52
 
7.0%
8 52
 
7.0%
7 48
 
6.5%
Space Separator
ValueCountFrequency (%)
1111
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 69
100.0%
Other Punctuation
ValueCountFrequency (%)
. 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3321
62.8%
Common 1967
37.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
356
 
10.7%
345
 
10.4%
225
 
6.8%
224
 
6.7%
224
 
6.7%
224
 
6.7%
224
 
6.7%
133
 
4.0%
116
 
3.5%
115
 
3.5%
Other values (143) 1135
34.2%
Common
ValueCountFrequency (%)
1111
56.5%
1 164
 
8.3%
2 94
 
4.8%
3 77
 
3.9%
4 72
 
3.7%
- 69
 
3.5%
5 65
 
3.3%
9 61
 
3.1%
0 57
 
2.9%
6 52
 
2.6%
Other values (6) 145
 
7.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3321
62.8%
ASCII 1967
37.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1111
56.5%
1 164
 
8.3%
2 94
 
4.8%
3 77
 
3.9%
4 72
 
3.7%
- 69
 
3.5%
5 65
 
3.3%
9 61
 
3.1%
0 57
 
2.9%
6 52
 
2.6%
Other values (6) 145
 
7.4%
Hangul
ValueCountFrequency (%)
356
 
10.7%
345
 
10.4%
225
 
6.8%
224
 
6.7%
224
 
6.7%
224
 
6.7%
224
 
6.7%
133
 
4.0%
116
 
3.5%
115
 
3.5%
Other values (143) 1135
34.2%

전화번호
Text

MISSING 

Distinct111
Distinct (%)94.9%
Missing107
Missing (%)47.8%
Memory size1.9 KiB
2023-12-12T17:40:49.861940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1404
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique109 ?
Unique (%)93.2%

Sample

1st row043-744-7338
2nd row043-742-3737
3rd row043-744-2558
4th row043-745-2250
5th row043-744-9508
ValueCountFrequency (%)
042-878-7972 6
 
5.1%
043-744-7338 2
 
1.7%
043-742-4727 1
 
0.9%
043-744-0388 1
 
0.9%
043-744-0007 1
 
0.9%
043-744-0766 1
 
0.9%
043-743-2649 1
 
0.9%
043-744-1688 1
 
0.9%
043-743-3358 1
 
0.9%
043-743-6400 1
 
0.9%
Other values (101) 101
86.3%
2023-12-12T17:40:50.336014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 283
20.2%
- 234
16.7%
0 194
13.8%
3 188
13.4%
7 169
12.0%
2 87
 
6.2%
5 65
 
4.6%
8 57
 
4.1%
9 51
 
3.6%
1 39
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1170
83.3%
Dash Punctuation 234
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 283
24.2%
0 194
16.6%
3 188
16.1%
7 169
14.4%
2 87
 
7.4%
5 65
 
5.6%
8 57
 
4.9%
9 51
 
4.4%
1 39
 
3.3%
6 37
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 234
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1404
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 283
20.2%
- 234
16.7%
0 194
13.8%
3 188
13.4%
7 169
12.0%
2 87
 
6.2%
5 65
 
4.6%
8 57
 
4.1%
9 51
 
3.6%
1 39
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1404
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 283
20.2%
- 234
16.7%
0 194
13.8%
3 188
13.4%
7 169
12.0%
2 87
 
6.2%
5 65
 
4.6%
8 57
 
4.1%
9 51
 
3.6%
1 39
 
2.8%
Distinct202
Distinct (%)90.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
Minimum1975-07-01 00:00:00
Maximum2021-07-15 00:00:00
2023-12-12T17:40:50.490655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:40:50.632614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T17:40:46.792831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T17:40:46.938740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:40:47.100049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업소명주소전화번호지정일자
01원룸슈퍼충청북도 영동군 영동읍 어미실길 20-2043-744-73382021-07-15
12아이비스토어충청북도 영동군 영동읍 구교로 20043-742-37372021-07-07
23상촌마트충청북도 영동군 상촌면 민주지산로 2995<NA>2021-07-05
34지에스25 영동부용점충청북도 영동군 영동읍 학산영동로 1197043-744-25582021-06-21
45세븐일레븐 영동산업고점충청북도 영동군 영동읍 학산영동로 1211. 상회<NA>2021-03-25
56지에스25 영동행복점충청북도 영동군 용산면 용산로 350<NA>2021-02-19
67영동매점충청북도 영동군 용산면 영동산단로 25. 1층<NA>2021-02-15
78천태산 쉬어가는집충청북도 영동군 양산면 천태산진입길 130<NA>2020-11-26
89임산약방충청북도 영동군 상촌면 민주지산로 3006<NA>2020-08-19
910코코마트충청북도 영동군 영동읍 영동시장3길 8. 1층043-745-22502020-07-13
연번업소명주소전화번호지정일자
214215호암슈퍼충청북도 영동군 영동읍 매천리 394-6호<NA>1992-07-21
21521625시마트충청북도 영동군 양산면 송호리 278-19호<NA>1988-07-18
216217시장상회충청북도 영동군 영동읍 계산리 571-7호<NA>1986-05-21
217218앞치휴게소충청북도 영동군 학산면 봉소리 1130-22호<NA>1985-05-17
218219용화상회충청북도 영동군 용화면 민주지산로 97-12<NA>1981-11-27
219220황간상회충청북도 영동군 추풍령면 추풍령리 336-11호<NA>1981-05-19
220221한남슈퍼충청북도 영동군 매곡면 노천리 64호<NA>1980-07-01
221222영흥상회충청북도 영동군 영동읍 계산리 83번지 17호<NA>1980-07-01
222223신흥상회충청북도 영동군 심천면 고당리 458호<NA>1975-10-19
223224고물상충청북도 영동군 상촌면 임산리 349호<NA>1975-07-01