Overview

Dataset statistics

Number of variables5
Number of observations43
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory45.1 B

Variable types

Numeric1
Text2
Categorical1
DateTime1

Dataset

Description어린이 등하교 안심보행을 위한 영등포구 횡단보도옐로우카펫 설치현황입니다. - 학교명, 설치주소, 설치연도, 데이터기준일자
URLhttps://www.data.go.kr/data/15034325/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 설치연도High correlation
설치연도 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
설치주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:53:03.487513
Analysis finished2023-12-12 12:53:03.951754
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22
Minimum1
Maximum43
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-12T21:53:04.328275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.1
Q111.5
median22
Q332.5
95-th percentile40.9
Maximum43
Range42
Interquartile range (IQR)21

Descriptive statistics

Standard deviation12.556539
Coefficient of variation (CV)0.57075176
Kurtosis-1.2
Mean22
Median Absolute Deviation (MAD)11
Skewness0
Sum946
Variance157.66667
MonotonicityStrictly increasing
2023-12-12T21:53:04.460351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
1 1
 
2.3%
2 1
 
2.3%
25 1
 
2.3%
26 1
 
2.3%
27 1
 
2.3%
28 1
 
2.3%
29 1
 
2.3%
30 1
 
2.3%
31 1
 
2.3%
32 1
 
2.3%
Other values (33) 33
76.7%
ValueCountFrequency (%)
1 1
2.3%
2 1
2.3%
3 1
2.3%
4 1
2.3%
5 1
2.3%
6 1
2.3%
7 1
2.3%
8 1
2.3%
9 1
2.3%
10 1
2.3%
ValueCountFrequency (%)
43 1
2.3%
42 1
2.3%
41 1
2.3%
40 1
2.3%
39 1
2.3%
38 1
2.3%
37 1
2.3%
36 1
2.3%
35 1
2.3%
34 1
2.3%
Distinct23
Distinct (%)53.5%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-12T21:53:04.657977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length4
Mean length4.2325581
Min length4

Characters and Unicode

Total characters182
Distinct characters34
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)20.9%

Sample

1st row영림초교
2nd row대방초교
3rd row대동초교
4th row문래초교
5th row영동초교
ValueCountFrequency (%)
영동초교 4
 
9.3%
선유초교 4
 
9.3%
영림초교 3
 
7.0%
성미유치원 3
 
7.0%
영원초교 2
 
4.7%
대방초교 2
 
4.7%
영등포초교 2
 
4.7%
신대림초교 2
 
4.7%
신영초교 2
 
4.7%
당산초교 2
 
4.7%
Other values (13) 17
39.5%
2023-12-12T21:53:05.013886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39
21.4%
39
21.4%
17
 
9.3%
9
 
4.9%
7
 
3.8%
7
 
3.8%
6
 
3.3%
6
 
3.3%
5
 
2.7%
4
 
2.2%
Other values (24) 43
23.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 182
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
21.4%
39
21.4%
17
 
9.3%
9
 
4.9%
7
 
3.8%
7
 
3.8%
6
 
3.3%
6
 
3.3%
5
 
2.7%
4
 
2.2%
Other values (24) 43
23.6%

Most occurring scripts

ValueCountFrequency (%)
Hangul 182
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
21.4%
39
21.4%
17
 
9.3%
9
 
4.9%
7
 
3.8%
7
 
3.8%
6
 
3.3%
6
 
3.3%
5
 
2.7%
4
 
2.2%
Other values (24) 43
23.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 182
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
39
21.4%
39
21.4%
17
 
9.3%
9
 
4.9%
7
 
3.8%
7
 
3.8%
6
 
3.3%
6
 
3.3%
5
 
2.7%
4
 
2.2%
Other values (24) 43
23.6%

설치주소
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-12T21:53:05.290654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length29
Mean length20.418605
Min length13

Characters and Unicode

Total characters878
Distinct characters120
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row영등포구 대림동 1005-3
2nd row영등포구 여의대방로35가길 19 GATE2 앞
3rd row영등포구 대림로21길 6 대림로방면 횡단보도
4th row영등포구 문래로 104 정문앞
5th row영등포구 영중로 122 마리아요양병원
ValueCountFrequency (%)
영등포구 43
21.2%
25
 
12.3%
정문 10
 
4.9%
횡단보도 5
 
2.5%
14 5
 
2.5%
신길로 5
 
2.5%
후문 4
 
2.0%
선유로43가길 3
 
1.5%
10 2
 
1.0%
대방천로 2
 
1.0%
Other values (88) 99
48.8%
2023-12-12T21:53:05.746006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
160
18.2%
52
 
5.9%
45
 
5.1%
44
 
5.0%
44
 
5.0%
42
 
4.8%
1 32
 
3.6%
27
 
3.1%
3 26
 
3.0%
26
 
3.0%
Other values (110) 380
43.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 554
63.1%
Space Separator 160
 
18.2%
Decimal Number 149
 
17.0%
Uppercase Letter 9
 
1.0%
Dash Punctuation 3
 
0.3%
Open Punctuation 1
 
0.1%
Math Symbol 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
52
 
9.4%
45
 
8.1%
44
 
7.9%
44
 
7.9%
42
 
7.6%
27
 
4.9%
26
 
4.7%
18
 
3.2%
17
 
3.1%
12
 
2.2%
Other values (87) 227
41.0%
Decimal Number
ValueCountFrequency (%)
1 32
21.5%
3 26
17.4%
2 20
13.4%
9 14
9.4%
0 13
8.7%
5 12
 
8.1%
4 12
 
8.1%
7 8
 
5.4%
6 7
 
4.7%
8 5
 
3.4%
Uppercase Letter
ValueCountFrequency (%)
G 2
22.2%
E 1
11.1%
L 1
11.1%
P 1
11.1%
T 1
11.1%
A 1
11.1%
S 1
11.1%
K 1
11.1%
Space Separator
ValueCountFrequency (%)
160
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 554
63.1%
Common 315
35.9%
Latin 9
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
9.4%
45
 
8.1%
44
 
7.9%
44
 
7.9%
42
 
7.6%
27
 
4.9%
26
 
4.7%
18
 
3.2%
17
 
3.1%
12
 
2.2%
Other values (87) 227
41.0%
Common
ValueCountFrequency (%)
160
50.8%
1 32
 
10.2%
3 26
 
8.3%
2 20
 
6.3%
9 14
 
4.4%
0 13
 
4.1%
5 12
 
3.8%
4 12
 
3.8%
7 8
 
2.5%
6 7
 
2.2%
Other values (5) 11
 
3.5%
Latin
ValueCountFrequency (%)
G 2
22.2%
E 1
11.1%
L 1
11.1%
P 1
11.1%
T 1
11.1%
A 1
11.1%
S 1
11.1%
K 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 554
63.1%
ASCII 324
36.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
160
49.4%
1 32
 
9.9%
3 26
 
8.0%
2 20
 
6.2%
9 14
 
4.3%
0 13
 
4.0%
5 12
 
3.7%
4 12
 
3.7%
7 8
 
2.5%
6 7
 
2.2%
Other values (13) 20
 
6.2%
Hangul
ValueCountFrequency (%)
52
 
9.4%
45
 
8.1%
44
 
7.9%
44
 
7.9%
42
 
7.6%
27
 
4.9%
26
 
4.7%
18
 
3.2%
17
 
3.1%
12
 
2.2%
Other values (87) 227
41.0%

설치연도
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)9.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
2022
22 
2023
11 
2021
2020
 
2

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2022 22
51.2%
2023 11
25.6%
2021 8
 
18.6%
2020 2
 
4.7%

Length

2023-12-12T21:53:05.893664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:53:05.993945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 22
51.2%
2023 11
25.6%
2021 8
 
18.6%
2020 2
 
4.7%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
Minimum2023-05-03 00:00:00
Maximum2023-05-03 00:00:00
2023-12-12T21:53:06.083852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:53:06.167235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T21:53:03.681396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:53:06.228951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번학교명설치주소설치연도
연번1.0000.9351.0000.886
학교명0.9351.0001.0000.843
설치주소1.0001.0001.0001.000
설치연도0.8860.8431.0001.000
2023-12-12T21:53:06.309415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설치연도
연번1.0000.694
설치연도0.6941.000

Missing values

2023-12-12T21:53:03.802310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:53:03.913326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번학교명설치주소설치연도데이터기준일자
01영림초교영등포구 대림동 1005-320202023-05-03
12대방초교영등포구 여의대방로35가길 19 GATE2 앞20202023-05-03
23대동초교영등포구 대림로21길 6 대림로방면 횡단보도20212023-05-03
34문래초교영등포구 문래로 104 정문앞20212023-05-03
45영동초교영등포구 영중로 122 마리아요양병원20212023-05-03
56영동초교영등포구 국회대로 657 영등포평생학습관 앞 횡단보도20212023-05-03
67영동초교영등포구 버드나루로 103 브라보 앞20212023-05-03
78성미유치원영등포구 신길로 107-120212023-05-03
89성미유치원영등포구 신길로 93 정문20212023-05-03
910성미유치원영등포구 신길로 93 차량출입구20212023-05-03
연번학교명설치주소설치연도데이터기준일자
3334신대림초교영등포구 신길로 29 아델포레아파트 109동 앞20232023-05-03
3435신영초교영등포구 대림로39길 15 신영초 후문 앞20232023-05-03
3536신영초교영등포구 도신로4길 27 맞은편20232023-05-03
3637여의도초교영등포구 여의대방로 439 원효대교남단교차부 하부20232023-05-03
3738영등포초교영등포구 도림로 379 도림고가차도 하부 LPG판매소 앞20232023-05-03
3839영등포초교영등포구 도영로 55 앞20232023-05-03
3940영림초교영등포구 시흥대로173길 14 정문 앞20232023-05-03
4041영림초교영등포구 시흥대로173길 14 후문 앞20232023-05-03
4142영중초교영등포구 양산로 185 정문 앞20232023-05-03
4243영중초교영등포구 양산로 185 쪽문(동북측)20232023-05-03