Overview

Dataset statistics

Number of variables4
Number of observations286
Missing cells5
Missing cells (%)0.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.3 KiB
Average record size in memory33.5 B

Variable types

Categorical2
Text1
Numeric1

Dataset

Description자치구명,법정동명,업태명,업소수
Author노원구
URLhttps://data.seoul.go.kr/dataList/OA-10991/S/1/datasetView.do

Alerts

자치구명 has constant value ""Constant
업태명 has 5 (1.7%) missing valuesMissing

Reproduction

Analysis started2024-05-18 06:17:30.877391
Analysis finished2024-05-18 06:17:32.262652
Duration1.39 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

자치구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
노원구
286 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row노원구
2nd row노원구
3rd row노원구
4th row노원구
5th row노원구

Common Values

ValueCountFrequency (%)
노원구 286
100.0%

Length

2024-05-18T15:17:32.492735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-18T15:17:32.800923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
노원구 286
100.0%

법정동명
Categorical

Distinct7
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
상계동
68 
공릉동
58 
월계동
56 
중계동
53 
하계동
49 
Other values (2)
 
2

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique2 ?
Unique (%)0.7%

Sample

1st row월계동
2nd row월계동
3rd row월계동
4th row월계동
5th row월계동

Common Values

ValueCountFrequency (%)
상계동 68
23.8%
공릉동 58
20.3%
월계동 56
19.6%
중계동 53
18.5%
하계동 49
17.1%
신흥동 1
 
0.3%
대월면 1
 
0.3%

Length

2024-05-18T15:17:33.125126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-18T15:17:33.476941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상계동 68
23.8%
공릉동 58
20.3%
월계동 56
19.6%
중계동 53
18.5%
하계동 49
17.1%
신흥동 1
 
0.3%
대월면 1
 
0.3%

업태명
Text

MISSING 

Distinct71
Distinct (%)25.3%
Missing5
Missing (%)1.7%
Memory size2.4 KiB
2024-05-18T15:17:33.940941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length5.6192171
Min length2

Characters and Unicode

Total characters1579
Distinct characters155
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)4.3%

Sample

1st row한식
2nd row중국식
3rd row경양식
4th row일식
5th row분식
ValueCountFrequency (%)
기타 18
 
5.9%
패스트푸드 10
 
3.3%
식품제조가공업 9
 
3.0%
집단급식소 7
 
2.3%
일반조리판매 6
 
2.0%
제과점영업 6
 
2.0%
식품등 5
 
1.6%
학교 5
 
1.6%
병원 5
 
1.6%
사회복지시설 5
 
1.6%
Other values (61) 228
75.0%
2024-05-18T15:17:34.876399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
104
 
6.6%
88
 
5.6%
69
 
4.4%
65
 
4.1%
47
 
3.0%
45
 
2.8%
33
 
2.1%
) 31
 
2.0%
( 31
 
2.0%
31
 
2.0%
Other values (145) 1035
65.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1474
93.4%
Close Punctuation 31
 
2.0%
Open Punctuation 31
 
2.0%
Space Separator 23
 
1.5%
Other Punctuation 20
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
104
 
7.1%
88
 
6.0%
69
 
4.7%
65
 
4.4%
47
 
3.2%
45
 
3.1%
33
 
2.2%
31
 
2.1%
26
 
1.8%
23
 
1.6%
Other values (140) 943
64.0%
Other Punctuation
ValueCountFrequency (%)
/ 15
75.0%
, 5
 
25.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Space Separator
ValueCountFrequency (%)
23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1474
93.4%
Common 105
 
6.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
104
 
7.1%
88
 
6.0%
69
 
4.7%
65
 
4.4%
47
 
3.2%
45
 
3.1%
33
 
2.2%
31
 
2.1%
26
 
1.8%
23
 
1.6%
Other values (140) 943
64.0%
Common
ValueCountFrequency (%)
) 31
29.5%
( 31
29.5%
23
21.9%
/ 15
14.3%
, 5
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1474
93.4%
ASCII 105
 
6.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
104
 
7.1%
88
 
6.0%
69
 
4.7%
65
 
4.4%
47
 
3.2%
45
 
3.1%
33
 
2.2%
31
 
2.1%
26
 
1.8%
23
 
1.6%
Other values (140) 943
64.0%
ASCII
ValueCountFrequency (%)
) 31
29.5%
( 31
29.5%
23
21.9%
/ 15
14.3%
, 5
 
4.8%

업소수
Real number (ℝ)

Distinct82
Distinct (%)28.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29.076923
Minimum1
Maximum736
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2024-05-18T15:17:35.361044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median7
Q327
95-th percentile125
Maximum736
Range735
Interquartile range (IQR)24

Descriptive statistics

Standard deviation64.022537
Coefficient of variation (CV)2.2018333
Kurtosis55.830609
Mean29.076923
Median Absolute Deviation (MAD)6
Skewness6.2052385
Sum8316
Variance4098.8853
MonotonicityNot monotonic
2024-05-18T15:17:36.086133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 45
 
15.7%
2 24
 
8.4%
3 23
 
8.0%
4 16
 
5.6%
6 15
 
5.2%
5 14
 
4.9%
11 8
 
2.8%
9 8
 
2.8%
18 7
 
2.4%
7 7
 
2.4%
Other values (72) 119
41.6%
ValueCountFrequency (%)
1 45
15.7%
2 24
8.4%
3 23
8.0%
4 16
 
5.6%
5 14
 
4.9%
6 15
 
5.2%
7 7
 
2.4%
8 6
 
2.1%
9 8
 
2.8%
10 4
 
1.4%
ValueCountFrequency (%)
736 1
0.3%
336 1
0.3%
321 1
0.3%
272 1
0.3%
228 1
0.3%
220 1
0.3%
199 1
0.3%
196 1
0.3%
185 1
0.3%
170 1
0.3%

Interactions

2024-05-18T15:17:31.311296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-18T15:17:36.470150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법정동명업태명업소수
법정동명1.0000.0000.000
업태명0.0001.0000.000
업소수0.0000.0001.000
2024-05-18T15:17:36.901466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소수법정동명
업소수1.0000.000
법정동명0.0001.000

Missing values

2024-05-18T15:17:31.824707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-18T15:17:32.149814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자치구명법정동명업태명업소수
0노원구월계동한식170
1노원구월계동중국식20
2노원구월계동경양식20
3노원구월계동일식22
4노원구월계동분식40
5노원구월계동뷔페식5
6노원구월계동정종/대포집/소주방1
7노원구월계동패스트푸드7
8노원구월계동호프/통닭62
9노원구월계동통닭(치킨)17
자치구명법정동명업태명업소수
276노원구중계동건강기능식품수입업4
277노원구중계동영업장판매82
278노원구중계동방문판매27
279노원구중계동전자상거래(통신판매업)107
280노원구중계동<NA>9
281노원구중계동다단계판매7
282노원구중계동기타 건강기능식품일반판매업1
283노원구중계동건강기능식품유통전문판매업1
284노원구신흥동일반조리판매1
285노원구대월면제과점영업1