Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 360 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 38 |
Duplicate rows (%) | 10.6% |
Total size in memory | 8.6 KiB |
Average record size in memory | 24.4 B |
Variable types
Text | 3 |
---|
Dataset
Description | 한국토지주택공사가 개발 조성한 전국 여러 지역에서 출토되어 토지주택박물관이 현재 소장중인 주요 유물 데이터를 제공합니다. |
---|---|
Author | 한국토지주택공사 |
URL | https://www.data.go.kr/data/15088290/fileData.do |
Dataset has 38 (10.6%) duplicate rows | Duplicates |
Reproduction
Analysis started | 2023-12-12 13:17:04.016781 |
---|---|
Analysis finished | 2023-12-12 13:17:04.440945 |
Duration | 0.42 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
국문 유물명
Text
Distinct | 178 |
---|---|
Distinct (%) | 49.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 KiB |
Value | Count | Frequency (%) |
토지매매문서 | 75 | 19.5% |
호적 | 34 | 8.9% |
토지매매문기 | 11 | 2.9% |
분재기 | 11 | 2.9% |
소송문기 | 8 | 2.1% |
임명장 | 5 | 1.3% |
소지 | 4 | 1.0% |
저울추 | 4 | 1.0% |
수키와 | 4 | 1.0% |
소송문서 | 3 | 0.8% |
Other values (181) | 225 |
Most occurring characters
Value | Count | Frequency (%) |
363 | ||
매 | 182 | 9.3% |
문 | 127 | 6.5% |
지 | 107 | 5.4% |
서 | 105 | 5.3% |
토 | 104 | 5.3% |
기 | 67 | 3.4% |
호 | 52 | 2.6% |
적 | 40 | 2.0% |
와 | 21 | 1.1% |
Other values (216) | 799 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1574 | |
Space Separator | 363 | 18.5% |
Decimal Number | 18 | 0.9% |
Open Punctuation | 6 | 0.3% |
Close Punctuation | 6 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
매 | 182 | 11.6% |
문 | 127 | 8.1% |
지 | 107 | 6.8% |
서 | 105 | 6.7% |
토 | 104 | 6.6% |
기 | 67 | 4.3% |
호 | 52 | 3.3% |
적 | 40 | 2.5% |
와 | 21 | 1.3% |
소 | 18 | 1.1% |
Other values (206) | 751 |
Decimal Number
Value | Count | Frequency (%) |
1 | 4 | |
5 | 4 | |
8 | 3 | |
7 | 3 | |
9 | 2 | |
3 | 1 | 5.6% |
6 | 1 | 5.6% |
Space Separator
Value | Count | Frequency (%) |
363 |
Open Punctuation
Value | Count | Frequency (%) |
( | 6 |
Close Punctuation
Value | Count | Frequency (%) |
) | 6 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1568 | |
Common | 393 | 20.0% |
Han | 6 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
매 | 182 | 11.6% |
문 | 127 | 8.1% |
지 | 107 | 6.8% |
서 | 105 | 6.7% |
토 | 104 | 6.6% |
기 | 67 | 4.3% |
호 | 52 | 3.3% |
적 | 40 | 2.6% |
와 | 21 | 1.3% |
소 | 18 | 1.1% |
Other values (202) | 745 |
Common
Value | Count | Frequency (%) |
363 | ||
( | 6 | 1.5% |
) | 6 | 1.5% |
1 | 4 | 1.0% |
5 | 4 | 1.0% |
8 | 3 | 0.8% |
7 | 3 | 0.8% |
9 | 2 | 0.5% |
3 | 1 | 0.3% |
6 | 1 | 0.3% |
Han
Value | Count | Frequency (%) |
帖 | 2 | |
差 | 2 | |
下 | 1 | |
定 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1568 | |
ASCII | 393 | 20.0% |
CJK | 6 | 0.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
363 | ||
( | 6 | 1.5% |
) | 6 | 1.5% |
1 | 4 | 1.0% |
5 | 4 | 1.0% |
8 | 3 | 0.8% |
7 | 3 | 0.8% |
9 | 2 | 0.5% |
3 | 1 | 0.3% |
6 | 1 | 0.3% |
Hangul
Value | Count | Frequency (%) |
매 | 182 | 11.6% |
문 | 127 | 8.1% |
지 | 107 | 6.8% |
서 | 105 | 6.7% |
토 | 104 | 6.6% |
기 | 67 | 4.3% |
호 | 52 | 3.3% |
적 | 40 | 2.6% |
와 | 21 | 1.3% |
소 | 18 | 1.1% |
Other values (202) | 745 |
CJK
Value | Count | Frequency (%) |
帖 | 2 | |
差 | 2 | |
下 | 1 | |
定 | 1 |
한문 유물명
Text
Distinct | 150 |
---|---|
Distinct (%) | 41.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 KiB |
Value | Count | Frequency (%) |
明文 | 85 | |
없음 | 25 | 6.9% |
準戶口 | 20 | 5.6% |
戶籍單子 | 14 | 3.9% |
所志 | 12 | 3.3% |
和會文記 | 7 | 1.9% |
瓦 | 6 | 1.7% |
議送 | 5 | 1.4% |
牌旨 | 5 | 1.4% |
戶口單子 | 5 | 1.4% |
Other values (140) | 176 |
Most occurring characters
Value | Count | Frequency (%) |
文 | 114 | 10.1% |
明 | 85 | 7.5% |
戶 | 43 | 3.8% |
口 | 29 | 2.6% |
없 | 25 | 2.2% |
음 | 25 | 2.2% |
瓦 | 22 | 2.0% |
準 | 21 | 1.9% |
子 | 21 | 1.9% |
記 | 19 | 1.7% |
Other values (307) | 724 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1103 | |
Decimal Number | 13 | 1.2% |
Open Punctuation | 5 | 0.4% |
Close Punctuation | 5 | 0.4% |
Other Punctuation | 1 | 0.1% |
Space Separator | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
文 | 114 | 10.3% |
明 | 85 | 7.7% |
戶 | 43 | 3.9% |
口 | 29 | 2.6% |
없 | 25 | 2.3% |
음 | 25 | 2.3% |
瓦 | 22 | 2.0% |
準 | 21 | 1.9% |
子 | 21 | 1.9% |
記 | 19 | 1.7% |
Other values (296) | 699 |
Decimal Number
Value | Count | Frequency (%) |
1 | 3 | |
7 | 3 | |
5 | 2 | |
8 | 2 | |
6 | 1 | 7.7% |
9 | 1 | 7.7% |
3 | 1 | 7.7% |
Open Punctuation
Value | Count | Frequency (%) |
( | 5 |
Close Punctuation
Value | Count | Frequency (%) |
) | 5 |
Other Punctuation
Value | Count | Frequency (%) |
, | 1 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Han | 1051 | |
Hangul | 52 | 4.6% |
Common | 25 | 2.2% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
文 | 114 | 10.8% |
明 | 85 | 8.1% |
戶 | 43 | 4.1% |
口 | 29 | 2.8% |
瓦 | 22 | 2.1% |
準 | 21 | 2.0% |
子 | 21 | 2.0% |
記 | 19 | 1.8% |
單 | 19 | 1.8% |
籍 | 18 | 1.7% |
Other values (292) | 660 |
Common
Value | Count | Frequency (%) |
( | 5 | |
) | 5 | |
1 | 3 | |
7 | 3 | |
5 | 2 | 8.0% |
8 | 2 | 8.0% |
, | 1 | 4.0% |
1 | 4.0% | |
6 | 1 | 4.0% |
9 | 1 | 4.0% |
Hangul
Value | Count | Frequency (%) |
없 | 25 | |
음 | 25 | |
의 | 1 | 1.9% |
과 | 1 | 1.9% |
Most occurring blocks
Value | Count | Frequency (%) |
CJK | 1018 | |
Hangul | 52 | 4.6% |
CJK Compat Ideographs | 33 | 2.9% |
ASCII | 25 | 2.2% |
Most frequent character per block
CJK
Value | Count | Frequency (%) |
文 | 114 | 11.2% |
明 | 85 | 8.3% |
戶 | 43 | 4.2% |
口 | 29 | 2.8% |
瓦 | 22 | 2.2% |
準 | 21 | 2.1% |
子 | 21 | 2.1% |
記 | 19 | 1.9% |
單 | 19 | 1.9% |
籍 | 18 | 1.8% |
Other values (280) | 627 |
Hangul
Value | Count | Frequency (%) |
없 | 25 | |
음 | 25 | |
의 | 1 | 1.9% |
과 | 1 | 1.9% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
金 | 9 | |
李 | 4 | |
量 | 3 | 9.1% |
蓮 | 3 | 9.1% |
立 | 3 | 9.1% |
兩 | 3 | 9.1% |
龍 | 2 | 6.1% |
狀 | 2 | 6.1% |
林 | 1 | 3.0% |
柳 | 1 | 3.0% |
Other values (2) | 2 | 6.1% |
ASCII
Value | Count | Frequency (%) |
( | 5 | |
) | 5 | |
1 | 3 | |
7 | 3 | |
5 | 2 | 8.0% |
8 | 2 | 8.0% |
, | 1 | 4.0% |
1 | 4.0% | |
6 | 1 | 4.0% |
9 | 1 | 4.0% |
영문 유물명
Text
Distinct | 122 |
---|---|
Distinct (%) | 33.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 KiB |
Length
Max length | 66 |
---|---|
Median length | 2 |
Mean length | 11.022222 |
Min length | 2 |
Characters and Unicode
Total characters | 3968 |
---|---|
Distinct characters | 57 |
Distinct categories | 8 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 102 ? |
---|---|
Unique (%) | 28.3% |
Sample
1st row | Diverse Plain Pottery Vessels |
---|---|
2nd row | Gilt bronze ornamental shoes |
3rd row | Nine-storied bronze pagoda |
4th row | Certificate of Redemption |
5th row | Land Bond |
Value | Count | Frequency (%) |
없음 | 214 | |
of | 31 | 4.2% |
tile | 22 | 3.0% |
with | 16 | 2.2% |
eaves | 15 | 2.0% |
land | 9 | 1.2% |
family | 9 | 1.2% |
register | 8 | 1.1% |
pattern | 8 | 1.1% |
jar | 8 | 1.1% |
Other values (232) | 402 |
Most occurring characters
Value | Count | Frequency (%) |
382 | 9.6% | |
e | 349 | 8.8% |
o | 268 | 6.8% |
t | 231 | 5.8% |
a | 225 | 5.7% |
i | 223 | 5.6% |
n | 218 | 5.5% |
없 | 214 | 5.4% |
음 | 214 | 5.4% |
r | 188 | 4.7% |
Other values (47) | 1456 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 2656 | |
Uppercase Letter | 462 | 11.6% |
Other Letter | 428 | 10.8% |
Space Separator | 382 | 9.6% |
Dash Punctuation | 19 | 0.5% |
Other Punctuation | 17 | 0.4% |
Open Punctuation | 2 | 0.1% |
Close Punctuation | 2 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 349 | |
o | 268 | |
t | 231 | 8.7% |
a | 225 | 8.5% |
i | 223 | 8.4% |
n | 218 | 8.2% |
r | 188 | 7.1% |
l | 123 | 4.6% |
s | 112 | 4.2% |
d | 94 | 3.5% |
Other values (15) | 625 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 52 | |
B | 46 | 10.0% |
C | 40 | 8.7% |
P | 35 | 7.6% |
T | 35 | 7.6% |
L | 31 | 6.7% |
F | 27 | 5.8% |
R | 26 | 5.6% |
M | 23 | 5.0% |
E | 22 | 4.8% |
Other values (13) | 125 |
Other Punctuation
Value | Count | Frequency (%) |
' | 15 | |
, | 1 | 5.9% |
. | 1 | 5.9% |
Other Letter
Value | Count | Frequency (%) |
없 | 214 | |
음 | 214 |
Space Separator
Value | Count | Frequency (%) |
382 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 19 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 3118 | |
Hangul | 428 | 10.8% |
Common | 422 | 10.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 349 | 11.2% |
o | 268 | 8.6% |
t | 231 | 7.4% |
a | 225 | 7.2% |
i | 223 | 7.2% |
n | 218 | 7.0% |
r | 188 | 6.0% |
l | 123 | 3.9% |
s | 112 | 3.6% |
d | 94 | 3.0% |
Other values (38) | 1087 |
Common
Value | Count | Frequency (%) |
382 | ||
- | 19 | 4.5% |
' | 15 | 3.6% |
( | 2 | 0.5% |
) | 2 | 0.5% |
, | 1 | 0.2% |
. | 1 | 0.2% |
Hangul
Value | Count | Frequency (%) |
없 | 214 | |
음 | 214 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3540 | |
Hangul | 428 | 10.8% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
382 | 10.8% | |
e | 349 | 9.9% |
o | 268 | 7.6% |
t | 231 | 6.5% |
a | 225 | 6.4% |
i | 223 | 6.3% |
n | 218 | 6.2% |
r | 188 | 5.3% |
l | 123 | 3.5% |
s | 112 | 3.2% |
Other values (45) | 1221 |
Hangul
Value | Count | Frequency (%) |
없 | 214 | |
음 | 214 |
국문 유물명 | 한문 유물명 | 영문 유물명 | |
---|---|---|---|
0 | 여러가지 무문토기 | 各種無文土器 | Diverse Plain Pottery Vessels |
1 | 금동신발 | 金銅飾履 | Gilt bronze ornamental shoes |
2 | 청동9층탑 | 靑銅九層塔 | Nine-storied bronze pagoda |
3 | 상환증서 | 相換證書 | Certificate of Redemption |
4 | 지가증권 | 地價證券 | Land Bond |
5 | 토지측량도 | 土地測量圖 | Land Survey Map |
6 | 측량학교 졸업증서 | 測量學校卒業證書 | Certificate of the Completion of a Survey School |
7 | 측량기사임명장 | 測量技士任命狀 | Appointment Letter of Surveying Engineer |
8 | 목판채색지도 | 木板彩色地圖 | Block-Printed Colored Map |
9 | 소송문서 | 訴訟文書 | Records of a Lawsuit |
국문 유물명 | 한문 유물명 | 영문 유물명 | |
---|---|---|---|
350 | 긁개 | 없음 | 없음 |
351 | 철촉 | 없음 | 없음 |
352 | 긁개 | 없음 | 없음 |
353 | 홍날 | 없음 | 없음 |
354 | 긁개 | 없음 | 없음 |
355 | 찍개 | 없음 | 없음 |
356 | 마제석촉 | 없음 | 없음 |
357 | 찍개 | 없음 | 없음 |
358 | 마제석창 | 없음 | 없음 |
359 | 빗살무늬토기 | 없음 | 없음 |
Most frequently occurring
국문 유물명 | 한문 유물명 | 영문 유물명 | # duplicates | |
---|---|---|---|---|
31 | 토지매매문서 | 明文 | 없음 | 69 |
37 | 호적 | 準戶口 | 없음 | 16 |
36 | 호적 | 戶籍單子 | 없음 | 14 |
29 | 토지매매문기 | 明文 | 없음 | 9 |
10 | 분재기 | 和會文記 | 없음 | 5 |
14 | 소송문기 | 議送 | 없음 | 4 |
16 | 소지 | 所志 | 없음 | 4 |
1 | 긁개 | 없음 | 없음 | 3 |
13 | 소송문기 | 所志 | 없음 | 3 |
17 | 수키와 | 瓦 | Convex Roof Tile | 3 |