Overview

Dataset statistics

Number of variables4
Number of observations1974
Missing cells11
Missing cells (%)0.1%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory63.7 KiB
Average record size in memory33.1 B

Variable types

Categorical1
Text2
Numeric1

Dataset

Description자치구명,법정동명,업태명,업소수
Author중구
URLhttps://data.seoul.go.kr/dataList/OA-10221/S/1/datasetView.do

Alerts

자치구명 has constant value ""Constant
Dataset has 1 (0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2024-05-11 16:19:54.228477
Analysis finished2024-05-11 16:19:57.384240
Duration3.16 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

자치구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size15.5 KiB
중구
1974 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중구
2nd row중구
3rd row중구
4th row중구
5th row중구

Common Values

ValueCountFrequency (%)
중구 1974
100.0%

Length

2024-05-12T01:19:57.590266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-12T01:19:57.898713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중구 1974
100.0%
Distinct74
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size15.5 KiB
2024-05-12T01:19:58.759690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.1458967
Min length2

Characters and Unicode

Total characters8184
Distinct characters66
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row무교동
2nd row무교동
3rd row무교동
4th row무교동
5th row무교동
ValueCountFrequency (%)
신당동 63
 
3.2%
황학동 47
 
2.4%
을지로6가 45
 
2.3%
중림동 42
 
2.1%
서소문동 41
 
2.1%
흥인동 41
 
2.1%
태평로2가 40
 
2.0%
장충동2가 39
 
2.0%
필동2가 39
 
2.0%
남대문로5가 39
 
2.0%
Other values (64) 1538
77.9%
2024-05-12T01:20:00.150956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1355
16.6%
1205
 
14.7%
619
 
7.6%
2 408
 
5.0%
1 393
 
4.8%
245
 
3.0%
235
 
2.9%
235
 
2.9%
215
 
2.6%
211
 
2.6%
Other values (56) 3063
37.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6979
85.3%
Decimal Number 1205
 
14.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1355
19.4%
1205
17.3%
619
 
8.9%
245
 
3.5%
235
 
3.4%
235
 
3.4%
215
 
3.1%
211
 
3.0%
158
 
2.3%
129
 
1.8%
Other values (49) 2372
34.0%
Decimal Number
ValueCountFrequency (%)
2 408
33.9%
1 393
32.6%
3 145
 
12.0%
4 95
 
7.9%
5 93
 
7.7%
6 45
 
3.7%
7 26
 
2.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6979
85.3%
Common 1205
 
14.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1355
19.4%
1205
17.3%
619
 
8.9%
245
 
3.5%
235
 
3.4%
235
 
3.4%
215
 
3.1%
211
 
3.0%
158
 
2.3%
129
 
1.8%
Other values (49) 2372
34.0%
Common
ValueCountFrequency (%)
2 408
33.9%
1 393
32.6%
3 145
 
12.0%
4 95
 
7.9%
5 93
 
7.7%
6 45
 
3.7%
7 26
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6979
85.3%
ASCII 1205
 
14.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1355
19.4%
1205
17.3%
619
 
8.9%
245
 
3.5%
235
 
3.4%
235
 
3.4%
215
 
3.1%
211
 
3.0%
158
 
2.3%
129
 
1.8%
Other values (49) 2372
34.0%
ASCII
ValueCountFrequency (%)
2 408
33.9%
1 393
32.6%
3 145
 
12.0%
4 95
 
7.9%
5 93
 
7.7%
6 45
 
3.7%
7 26
 
2.2%
Distinct75
Distinct (%)3.8%
Missing11
Missing (%)0.6%
Memory size15.5 KiB
2024-05-12T01:20:01.040625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length5.339786
Min length2

Characters and Unicode

Total characters10482
Distinct characters157
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row한식
2nd row중국식
3rd row경양식
4th row일식
5th row분식
ValueCountFrequency (%)
기타 157
 
7.4%
한식 73
 
3.5%
커피숍 72
 
3.4%
식품등 65
 
3.1%
수입판매업 65
 
3.1%
경양식 65
 
3.1%
분식 64
 
3.0%
영업장판매 64
 
3.0%
편의점 64
 
3.0%
일식 61
 
2.9%
Other values (66) 1362
64.5%
2024-05-12T01:20:02.352050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
790
 
7.5%
657
 
6.3%
492
 
4.7%
485
 
4.6%
323
 
3.1%
286
 
2.7%
265
 
2.5%
240
 
2.3%
212
 
2.0%
206
 
2.0%
Other values (147) 6526
62.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9821
93.7%
Open Punctuation 180
 
1.7%
Close Punctuation 180
 
1.7%
Other Punctuation 152
 
1.5%
Space Separator 149
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
790
 
8.0%
657
 
6.7%
492
 
5.0%
485
 
4.9%
323
 
3.3%
286
 
2.9%
265
 
2.7%
240
 
2.4%
212
 
2.2%
206
 
2.1%
Other values (141) 5865
59.7%
Other Punctuation
ValueCountFrequency (%)
/ 86
56.6%
, 50
32.9%
. 16
 
10.5%
Open Punctuation
ValueCountFrequency (%)
( 180
100.0%
Close Punctuation
ValueCountFrequency (%)
) 180
100.0%
Space Separator
ValueCountFrequency (%)
149
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9821
93.7%
Common 661
 
6.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
790
 
8.0%
657
 
6.7%
492
 
5.0%
485
 
4.9%
323
 
3.3%
286
 
2.9%
265
 
2.7%
240
 
2.4%
212
 
2.2%
206
 
2.1%
Other values (141) 5865
59.7%
Common
ValueCountFrequency (%)
( 180
27.2%
) 180
27.2%
149
22.5%
/ 86
13.0%
, 50
 
7.6%
. 16
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9821
93.7%
ASCII 661
 
6.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
790
 
8.0%
657
 
6.7%
492
 
5.0%
485
 
4.9%
323
 
3.3%
286
 
2.9%
265
 
2.7%
240
 
2.4%
212
 
2.2%
206
 
2.1%
Other values (141) 5865
59.7%
ASCII
ValueCountFrequency (%)
( 180
27.2%
) 180
27.2%
149
22.5%
/ 86
13.0%
, 50
 
7.6%
. 16
 
2.4%

업소수
Real number (ℝ)

Distinct70
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.9761905
Minimum1
Maximum423
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.5 KiB
2024-05-12T01:20:02.773937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q35
95-th percentile22
Maximum423
Range422
Interquartile range (IQR)4

Descriptive statistics

Standard deviation15.046673
Coefficient of variation (CV)2.51777
Kurtosis319.42351
Mean5.9761905
Median Absolute Deviation (MAD)1
Skewness13.808774
Sum11797
Variance226.40237
MonotonicityNot monotonic
2024-05-12T01:20:03.204938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 719
36.4%
2 347
17.6%
3 199
 
10.1%
4 128
 
6.5%
5 101
 
5.1%
6 84
 
4.3%
7 51
 
2.6%
8 43
 
2.2%
9 36
 
1.8%
10 31
 
1.6%
Other values (60) 235
 
11.9%
ValueCountFrequency (%)
1 719
36.4%
2 347
17.6%
3 199
 
10.1%
4 128
 
6.5%
5 101
 
5.1%
6 84
 
4.3%
7 51
 
2.6%
8 43
 
2.2%
9 36
 
1.8%
10 31
 
1.6%
ValueCountFrequency (%)
423 1
0.1%
180 1
0.1%
144 1
0.1%
136 1
0.1%
103 1
0.1%
101 1
0.1%
92 1
0.1%
91 1
0.1%
84 1
0.1%
83 2
0.1%

Interactions

2024-05-12T01:19:56.636918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-12T01:20:03.461511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법정동명업태명업소수
법정동명1.0000.0000.000
업태명0.0001.0000.178
업소수0.0000.1781.000

Missing values

2024-05-12T01:19:56.969997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-12T01:19:57.257137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자치구명법정동명업태명업소수
0중구무교동한식26
1중구무교동중국식1
2중구무교동경양식10
3중구무교동일식5
4중구무교동분식5
5중구무교동호프/통닭1
6중구무교동통닭(치킨)1
7중구무교동까페1
8중구무교동식육(숯불구이)1
9중구무교동외국음식전문점(인도,태국등)1
자치구명법정동명업태명업소수
1964중구만리동2가기타 휴게음식점1
1965중구만리동2가학교2
1966중구만리동2가산업체1
1967중구만리동2가어린이집1
1968중구만리동2가즉석판매제조가공업5
1969중구만리동2가식품등 수입판매업3
1970중구만리동2가위탁급식영업1
1971중구만리동2가제과점영업2
1972중구만리동2가영업장판매1
1973중구만리동2가전자상거래(통신판매업)3

Duplicate rows

Most frequently occurring

자치구명법정동명업태명업소수# duplicates
0중구남대문로2가패스트푸드12