Overview

Dataset statistics

Number of variables4
Number of observations147
Missing cells3
Missing cells (%)0.5%
Duplicate rows1
Duplicate rows (%)0.7%
Total size in memory4.7 KiB
Average record size in memory32.9 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시영도구_평생학습추천사이트_20230620
Author부산광역시 영도구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15045452

Alerts

Dataset has 1 (0.7%) duplicate rowsDuplicates
사이트 안내 has 3 (2.0%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:08:40.465156
Analysis finished2023-12-10 16:08:41.044716
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct20
Distinct (%)13.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
어린이 학습
16 
한국전통문화
15 
체육
14 
인터넷 박물관
11 
음악
11 
Other values (15)
80 

Length

Max length7
Median length2
Mean length3.7346939
Min length2

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row어린이 학습
2nd row어린이 학습
3rd row어린이 학습
4th row어린이 학습
5th row어린이 학습

Common Values

ValueCountFrequency (%)
어린이 학습 16
10.9%
한국전통문화 15
10.2%
체육 14
 
9.5%
인터넷 박물관 11
 
7.5%
음악 11
 
7.5%
식물 10
 
6.8%
영어 9
 
6.1%
환경 9
 
6.1%
도덕 8
 
5.4%
과학 8
 
5.4%
Other values (10) 36
24.5%

Length

2023-12-11T01:08:41.119749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
어린이 16
 
9.2%
학습 16
 
9.2%
한국전통문화 15
 
8.6%
체육 14
 
8.0%
인터넷 11
 
6.3%
박물관 11
 
6.3%
음악 11
 
6.3%
식물 10
 
5.7%
영어 9
 
5.2%
환경 9
 
5.2%
Other values (12) 52
29.9%
Distinct142
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-11T01:08:41.414847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length15
Mean length7.585034
Min length2

Characters and Unicode

Total characters1115
Distinct characters278
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique137 ?
Unique (%)93.2%

Sample

1st row에듀넷
2nd row푸르넷
3rd row서울시교육과학연구원
4th row부산시교육과학연구원
5th row전라북도교육정보과학원
ValueCountFrequency (%)
사이버 3
 
1.4%
어린이 3
 
1.4%
한국의 3
 
1.4%
한국과학문화재단 2
 
0.9%
사이언스올 2
 
0.9%
연구소 2
 
0.9%
소리 2
 
0.9%
한자 2
 
0.9%
2
 
0.9%
2
 
0.9%
Other values (189) 195
89.4%
2023-12-11T01:08:41.880543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
71
 
6.4%
42
 
3.8%
34
 
3.0%
30
 
2.7%
27
 
2.4%
26
 
2.3%
26
 
2.3%
25
 
2.2%
20
 
1.8%
17
 
1.5%
Other values (268) 797
71.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 957
85.8%
Space Separator 71
 
6.4%
Lowercase Letter 56
 
5.0%
Uppercase Letter 21
 
1.9%
Other Punctuation 4
 
0.4%
Dash Punctuation 3
 
0.3%
Math Symbol 1
 
0.1%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
4.4%
34
 
3.6%
30
 
3.1%
27
 
2.8%
26
 
2.7%
26
 
2.7%
25
 
2.6%
20
 
2.1%
17
 
1.8%
17
 
1.8%
Other values (233) 693
72.4%
Lowercase Letter
ValueCountFrequency (%)
l 7
12.5%
i 7
12.5%
a 6
10.7%
e 6
10.7%
d 4
 
7.1%
n 4
 
7.1%
r 3
 
5.4%
s 3
 
5.4%
g 3
 
5.4%
t 2
 
3.6%
Other values (8) 11
19.6%
Uppercase Letter
ValueCountFrequency (%)
O 4
19.0%
K 4
19.0%
S 3
14.3%
C 3
14.3%
M 2
9.5%
W 2
9.5%
F 1
 
4.8%
G 1
 
4.8%
L 1
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 2
50.0%
? 1
25.0%
. 1
25.0%
Space Separator
ValueCountFrequency (%)
71
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Math Symbol
ValueCountFrequency (%)
| 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 957
85.8%
Common 81
 
7.3%
Latin 77
 
6.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
4.4%
34
 
3.6%
30
 
3.1%
27
 
2.8%
26
 
2.7%
26
 
2.7%
25
 
2.6%
20
 
2.1%
17
 
1.8%
17
 
1.8%
Other values (233) 693
72.4%
Latin
ValueCountFrequency (%)
l 7
 
9.1%
i 7
 
9.1%
a 6
 
7.8%
e 6
 
7.8%
d 4
 
5.2%
n 4
 
5.2%
O 4
 
5.2%
K 4
 
5.2%
r 3
 
3.9%
s 3
 
3.9%
Other values (17) 29
37.7%
Common
ValueCountFrequency (%)
71
87.7%
- 3
 
3.7%
, 2
 
2.5%
? 1
 
1.2%
. 1
 
1.2%
| 1
 
1.2%
( 1
 
1.2%
) 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 957
85.8%
ASCII 158
 
14.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
71
44.9%
l 7
 
4.4%
i 7
 
4.4%
a 6
 
3.8%
e 6
 
3.8%
d 4
 
2.5%
n 4
 
2.5%
O 4
 
2.5%
K 4
 
2.5%
r 3
 
1.9%
Other values (25) 42
26.6%
Hangul
ValueCountFrequency (%)
42
 
4.4%
34
 
3.6%
30
 
3.1%
27
 
2.8%
26
 
2.7%
26
 
2.7%
25
 
2.6%
20
 
2.1%
17
 
1.8%
17
 
1.8%
Other values (233) 693
72.4%

사이트 안내
Text

MISSING 

Distinct143
Distinct (%)99.3%
Missing3
Missing (%)2.0%
Memory size1.3 KiB
2023-12-11T01:08:42.300267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length162
Median length75
Mean length49.951389
Min length10

Characters and Unicode

Total characters7193
Distinct characters465
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique142 ?
Unique (%)98.6%

Sample

1st row교육정보화동향,교육종합상담,에듀넷칼럼니스트,교육과정정보서비스, 교육정보화연수등 제공
2nd row아동 교과목 학습, 공부방
3rd row자료실, 탐구학습관, 전자게시판, 추천사이트등 제공
4th row교육연구 개발 활동과 교단지원 센타로서 DB검색, 자료실, 도서실 제공
5th row교육정보과학원, 교육자료, 행정자료, 전북문화광장, 사이버장학등 제공
ValueCountFrequency (%)
제공 68
 
3.8%
소개 38
 
2.1%
정보 27
 
1.5%
23
 
1.3%
사이트 20
 
1.1%
과학 19
 
1.1%
관련 19
 
1.1%
17
 
1.0%
안내 15
 
0.8%
대한 14
 
0.8%
Other values (1006) 1518
85.4%
2023-12-11T01:08:42.892000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1634
 
22.7%
, 460
 
6.4%
113
 
1.6%
98
 
1.4%
95
 
1.3%
94
 
1.3%
89
 
1.2%
86
 
1.2%
83
 
1.2%
79
 
1.1%
Other values (455) 4362
60.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5000
69.5%
Space Separator 1634
 
22.7%
Other Punctuation 495
 
6.9%
Decimal Number 21
 
0.3%
Uppercase Letter 19
 
0.3%
Lowercase Letter 13
 
0.2%
Open Punctuation 4
 
0.1%
Close Punctuation 4
 
0.1%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
113
 
2.3%
98
 
2.0%
95
 
1.9%
94
 
1.9%
89
 
1.8%
86
 
1.7%
83
 
1.7%
79
 
1.6%
71
 
1.4%
71
 
1.4%
Other values (419) 4121
82.4%
Lowercase Letter
ValueCountFrequency (%)
o 3
23.1%
g 1
 
7.7%
l 1
 
7.7%
i 1
 
7.7%
n 1
 
7.7%
k 1
 
7.7%
r 1
 
7.7%
y 1
 
7.7%
t 1
 
7.7%
h 1
 
7.7%
Uppercase Letter
ValueCountFrequency (%)
S 3
15.8%
B 3
15.8%
D 3
15.8%
E 3
15.8%
Q 2
10.5%
A 2
10.5%
K 1
 
5.3%
G 1
 
5.3%
F 1
 
5.3%
Decimal Number
ValueCountFrequency (%)
0 7
33.3%
2 5
23.8%
3 3
14.3%
1 2
 
9.5%
8 2
 
9.5%
4 1
 
4.8%
7 1
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 460
92.9%
. 24
 
4.8%
/ 8
 
1.6%
& 2
 
0.4%
· 1
 
0.2%
Space Separator
ValueCountFrequency (%)
1634
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5000
69.5%
Common 2161
30.0%
Latin 32
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
113
 
2.3%
98
 
2.0%
95
 
1.9%
94
 
1.9%
89
 
1.8%
86
 
1.7%
83
 
1.7%
79
 
1.6%
71
 
1.4%
71
 
1.4%
Other values (419) 4121
82.4%
Latin
ValueCountFrequency (%)
S 3
 
9.4%
B 3
 
9.4%
D 3
 
9.4%
o 3
 
9.4%
E 3
 
9.4%
Q 2
 
6.2%
A 2
 
6.2%
g 1
 
3.1%
l 1
 
3.1%
i 1
 
3.1%
Other values (10) 10
31.2%
Common
ValueCountFrequency (%)
1634
75.6%
, 460
 
21.3%
. 24
 
1.1%
/ 8
 
0.4%
0 7
 
0.3%
2 5
 
0.2%
( 4
 
0.2%
) 4
 
0.2%
3 3
 
0.1%
- 3
 
0.1%
Other values (6) 9
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5000
69.5%
ASCII 2192
30.5%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1634
74.5%
, 460
 
21.0%
. 24
 
1.1%
/ 8
 
0.4%
0 7
 
0.3%
2 5
 
0.2%
( 4
 
0.2%
) 4
 
0.2%
S 3
 
0.1%
B 3
 
0.1%
Other values (25) 40
 
1.8%
Hangul
ValueCountFrequency (%)
113
 
2.3%
98
 
2.0%
95
 
1.9%
94
 
1.9%
89
 
1.8%
86
 
1.7%
83
 
1.7%
79
 
1.6%
71
 
1.4%
71
 
1.4%
Other values (419) 4121
82.4%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct141
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-11T01:08:43.127115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length44
Mean length25.489796
Min length12

Characters and Unicode

Total characters3747
Distinct characters44
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)91.8%

Sample

1st rowhttp://www.edunet4u.net/main/html/index.html
2nd rowhttp://www.kumsungedu.com
3rd rowhttp://www.sesri.re.kr
4th rowhttp://www.pise.re.kr
5th rowhttp://www.cein.or.kr
ValueCountFrequency (%)
http://www.ecoguide.or.kr 2
 
1.4%
http://www.nfm.go.kr 2
 
1.4%
http://www.kfem.or.kr 2
 
1.4%
http://www.koreanfolk.co.kr 2
 
1.4%
http://www.scienceall.com 2
 
1.4%
http://www.moca.go.kr 2
 
1.4%
http://www.hansory.or.kr 1
 
0.7%
http://www.worldvillage.com/kidz/index.html 1
 
0.7%
http://www.kmusic.org 1
 
0.7%
http://www.dongchosori.co.kr 1
 
0.7%
Other values (131) 131
89.1%
2023-12-11T01:08:43.555561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 402
 
10.7%
w 397
 
10.6%
t 359
 
9.6%
/ 331
 
8.8%
r 226
 
6.0%
o 220
 
5.9%
p 193
 
5.2%
h 186
 
5.0%
k 168
 
4.5%
e 161
 
4.3%
Other values (34) 1104
29.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2837
75.7%
Other Punctuation 883
 
23.6%
Decimal Number 12
 
0.3%
Uppercase Letter 8
 
0.2%
Dash Punctuation 5
 
0.1%
Math Symbol 2
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 397
14.0%
t 359
12.7%
r 226
 
8.0%
o 220
 
7.8%
p 193
 
6.8%
h 186
 
6.6%
k 168
 
5.9%
e 161
 
5.7%
c 116
 
4.1%
n 107
 
3.8%
Other values (16) 704
24.8%
Other Punctuation
ValueCountFrequency (%)
. 402
45.5%
/ 331
37.5%
: 146
 
16.5%
% 3
 
0.3%
? 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
7 5
41.7%
5 2
 
16.7%
2 2
 
16.7%
4 2
 
16.7%
9 1
 
8.3%
Uppercase Letter
ValueCountFrequency (%)
E 4
50.0%
M 1
 
12.5%
A 1
 
12.5%
W 1
 
12.5%
K 1
 
12.5%
Math Symbol
ValueCountFrequency (%)
= 1
50.0%
~ 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2845
75.9%
Common 902
 
24.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 397
14.0%
t 359
12.6%
r 226
 
7.9%
o 220
 
7.7%
p 193
 
6.8%
h 186
 
6.5%
k 168
 
5.9%
e 161
 
5.7%
c 116
 
4.1%
n 107
 
3.8%
Other values (21) 712
25.0%
Common
ValueCountFrequency (%)
. 402
44.6%
/ 331
36.7%
: 146
 
16.2%
- 5
 
0.6%
7 5
 
0.6%
% 3
 
0.3%
5 2
 
0.2%
2 2
 
0.2%
4 2
 
0.2%
= 1
 
0.1%
Other values (3) 3
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3747
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 402
 
10.7%
w 397
 
10.6%
t 359
 
9.6%
/ 331
 
8.8%
r 226
 
6.0%
o 220
 
5.9%
p 193
 
5.2%
h 186
 
5.0%
k 168
 
4.5%
e 161
 
4.3%
Other values (34) 1104
29.5%

Missing values

2023-12-11T01:08:40.897543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:08:41.003622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분사이트명사이트 안내홈페이지
0어린이 학습에듀넷교육정보화동향,교육종합상담,에듀넷칼럼니스트,교육과정정보서비스, 교육정보화연수등 제공http://www.edunet4u.net/main/html/index.html
1어린이 학습푸르넷아동 교과목 학습, 공부방http://www.kumsungedu.com
2어린이 학습서울시교육과학연구원자료실, 탐구학습관, 전자게시판, 추천사이트등 제공http://www.sesri.re.kr
3어린이 학습부산시교육과학연구원교육연구 개발 활동과 교단지원 센타로서 DB검색, 자료실, 도서실 제공http://www.pise.re.kr
4어린이 학습전라북도교육정보과학원교육정보과학원, 교육자료, 행정자료, 전북문화광장, 사이버장학등 제공http://www.cein.or.kr
5어린이 학습경기도교육정보과학원교육자료실, 일반자료실등 교육에 관한 다양한 정보와 자료 제공http://www.kerinet.re.kr
6어린이 학습충청북도교육과학연구원과학전시관, 열린마당, 사이트맵, 추천사이트 제공http://www.cbesr.or.kr
7어린이 학습경상북도교육과학연구원교육 자료실, 현장 지원 Q/A, 인터넷 공부방, 정보 교환실, 현장 지원 자료실 제공http://www.kbise.or.kr
8어린이 학습경상남도교육과학연구원교육연구부, 과학교육부, 정보지원부, 교육공학부, 수행평가, 인터넷 방송 제공http://muhak.gnise.re.kr
9어린이 학습제주도교육과학연구원제주의 야생란, 제주바다물고기, 태양계여행, 과학탐구자료, 학습자료 제공http://www.cisec.or.kr
구분사이트명사이트 안내홈페이지
137한국전통문화한국문화재보호재단문화재를 발굴, 조사, 보존하고 있으며 박물관을 운영하고 있는 곳으로 관련 소장품과 자료 수록http://www.fpcp.or.kr
138한국전통문화고구려 연구회잊혀져 가는 고구려와 발해를 알고 배우기 위해 만들어진 곳. 문헌, 사진자료, 역사답사자료가 풍부http://www.koguryo.org
139한국전통문화한국의 도자기한국의 전통도자기를 소개하고, 전통작가의 작품 및 활동을 인터넷을 통해 전 세계에 홍보하는 사이트. 한국의 도자기, 대표작가, 한국전통공예, 한국문화를 찾아서 등의 메뉴로 구성http://www.koreafolkart.com
140한자맛있는 한자한자공부 사이트, 생활 실용한자, 게임, 한자능력 검정시험, 일본어, 중국어 제공http://www.yamhanja.com
141한자어린이 한자 공부 사이트어린이 한자 학습 사이트, 부수해설, 교훈, 학년별 학습자료, 동영상 등 수록http://www.primary75.pe.kr
142한자박병구의 열린 한문 교실<NA>http://www.openhanmoon.pe.kr
143한자한국사이버서당<NA>http://www.kr-seodang.com
144평생학습경기도 온라인 평생학습 서비스지식 캠퍼트 GSEEKhttps://www.gseek.kr/main/intro
145평생학습| K-MOOC한국형 온라인 공개강좌 K-MOOC 안내.<NA>www.kmooc.kr
146자격취득비전큐민간 자격증 취득가능, 무료 강좌http://v-q.co.kr

Duplicate rows

Most frequently occurring

구분사이트명사이트 안내홈페이지# duplicates
0별자리/우주한국과학문화재단, 사이언스올별자리, 사이버 낚시, 만화로 보는 과학의 역사, 백두대간 탐험, 과학 배낭여행, 퀴즈대회http://www.scienceall.com2