Overview

Dataset statistics

Number of variables11
Number of observations43
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.9 KiB
Average record size in memory93.1 B

Variable types

Numeric2
Categorical6
Text3

Dataset

Description담도암 라이브러리 담도암_병리_외과 메타정보( 제공 되어질 데이터 항목, 타입, 사이즈, 항목설명, 항목별건수, 표시형식 등)를 제공
Author국립암센터
URLhttps://www.data.go.kr/data/15095026/fileData.do

Alerts

분류아이디 has constant value ""Constant
분류명 has constant value ""Constant
테이블아이디 has constant value ""Constant
테이블명 has constant value ""Constant
데이터타입 is highly overall correlated with 표시형식High correlation
표시형식 is highly overall correlated with 데이터타입High correlation
순번 has unique valuesUnique
컬럼아이디 has unique valuesUnique
컬럼명 has unique valuesUnique
컬럼데이터수 has 6 (14.0%) zerosZeros

Reproduction

Analysis started2023-12-12 23:40:15.872067
Analysis finished2023-12-12 23:40:16.725536
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22
Minimum1
Maximum43
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-13T08:40:16.787197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.1
Q111.5
median22
Q332.5
95-th percentile40.9
Maximum43
Range42
Interquartile range (IQR)21

Descriptive statistics

Standard deviation12.556539
Coefficient of variation (CV)0.57075176
Kurtosis-1.2
Mean22
Median Absolute Deviation (MAD)11
Skewness0
Sum946
Variance157.66667
MonotonicityStrictly increasing
2023-12-13T08:40:17.159849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
1 1
 
2.3%
2 1
 
2.3%
25 1
 
2.3%
26 1
 
2.3%
27 1
 
2.3%
28 1
 
2.3%
29 1
 
2.3%
30 1
 
2.3%
31 1
 
2.3%
32 1
 
2.3%
Other values (33) 33
76.7%
ValueCountFrequency (%)
1 1
2.3%
2 1
2.3%
3 1
2.3%
4 1
2.3%
5 1
2.3%
6 1
2.3%
7 1
2.3%
8 1
2.3%
9 1
2.3%
10 1
2.3%
ValueCountFrequency (%)
43 1
2.3%
42 1
2.3%
41 1
2.3%
40 1
2.3%
39 1
2.3%
38 1
2.3%
37 1
2.3%
36 1
2.3%
35 1
2.3%
34 1
2.3%

분류아이디
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
PTH
43 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPTH
2nd rowPTH
3rd rowPTH
4th rowPTH
5th rowPTH

Common Values

ValueCountFrequency (%)
PTH 43
100.0%

Length

2023-12-13T08:40:17.285619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:40:17.364626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pth 43
100.0%

분류명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
병리
43 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row병리
2nd row병리
3rd row병리
4th row병리
5th row병리

Common Values

ValueCountFrequency (%)
병리 43
100.0%

Length

2023-12-13T08:40:17.448106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:40:17.529821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
병리 43
100.0%

테이블아이디
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
BLDT_PTH_SRGC
43 

Length

Max length13
Median length13
Mean length13
Min length13

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowBLDT_PTH_SRGC
2nd rowBLDT_PTH_SRGC
3rd rowBLDT_PTH_SRGC
4th rowBLDT_PTH_SRGC
5th rowBLDT_PTH_SRGC

Common Values

ValueCountFrequency (%)
BLDT_PTH_SRGC 43
100.0%

Length

2023-12-13T08:40:17.616662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:40:17.703983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
bldt_pth_srgc 43
100.0%

테이블명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
담도암_병리_외과
43 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담도암_병리_외과
2nd row담도암_병리_외과
3rd row담도암_병리_외과
4th row담도암_병리_외과
5th row담도암_병리_외과

Common Values

ValueCountFrequency (%)
담도암_병리_외과 43
100.0%

Length

2023-12-13T08:40:17.794306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:40:17.893133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
담도암_병리_외과 43
100.0%

컬럼아이디
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-13T08:40:18.091140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length21
Mean length17.534884
Min length7

Characters and Unicode

Total characters754
Distinct characters25
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st rowCENTER_CD
2nd rowIRB_APRV_NO
3rd rowPT_SBST_NO
4th rowSGPT_ACPT_YMD
5th rowSGPT_SEQ
ValueCountFrequency (%)
center_cd 1
 
2.3%
sgpt_srs_inva_cont 1
 
2.3%
sgpt_srmv_ln_totl_cnt 1
 
2.3%
sgpt_srmv_ln_mtst_cnt 1
 
2.3%
sgpt_srmv_ln_cont 1
 
2.3%
sgpt_inhp_mtst_cont 1
 
2.3%
sgpt_asso_find_cont 1
 
2.3%
sgpt_rslt_exnt_cont 1
 
2.3%
sgpt_patl_add_find_cont 1
 
2.3%
sgpt_cell_diff_cont 1
 
2.3%
Other values (33) 33
76.7%
2023-12-13T08:40:18.475156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 127
16.8%
T 108
14.3%
N 64
8.5%
S 63
8.4%
P 56
 
7.4%
G 47
 
6.2%
C 45
 
6.0%
O 34
 
4.5%
R 27
 
3.6%
A 26
 
3.4%
Other values (15) 157
20.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 627
83.2%
Connector Punctuation 127
 
16.8%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
T 108
17.2%
N 64
10.2%
S 63
10.0%
P 56
8.9%
G 47
 
7.5%
C 45
 
7.2%
O 34
 
5.4%
R 27
 
4.3%
A 26
 
4.1%
V 26
 
4.1%
Other values (14) 131
20.9%
Connector Punctuation
ValueCountFrequency (%)
_ 127
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 627
83.2%
Common 127
 
16.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
T 108
17.2%
N 64
10.2%
S 63
10.0%
P 56
8.9%
G 47
 
7.5%
C 45
 
7.2%
O 34
 
5.4%
R 27
 
4.3%
A 26
 
4.1%
V 26
 
4.1%
Other values (14) 131
20.9%
Common
ValueCountFrequency (%)
_ 127
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 754
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 127
16.8%
T 108
14.3%
N 64
8.5%
S 63
8.4%
P 56
 
7.4%
G 47
 
6.2%
C 45
 
6.0%
O 34
 
4.5%
R 27
 
3.6%
A 26
 
3.4%
Other values (15) 157
20.8%

컬럼명
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-13T08:40:18.680914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length10.27907
Min length4

Characters and Unicode

Total characters442
Distinct characters101
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row센터코드
2nd rowIRB승인번호
3rd row환자대체번호
4th row외과병리접수일자
5th row외과병리순번
ValueCountFrequency (%)
센터코드 1
 
2.3%
외과병리장막침윤내용 1
 
2.3%
외과병리절제림프절총수 1
 
2.3%
외과병리절제림프절전이수 1
 
2.3%
외과병리절제림프절내용 1
 
2.3%
외과병리간내전이내용 1
 
2.3%
외과병리동반발견내용 1
 
2.3%
외과병리결과비고내용 1
 
2.3%
외과병리병리학적추가발견내용 1
 
2.3%
외과병리세포분화내용 1
 
2.3%
Other values (33) 33
76.7%
2023-12-13T08:40:19.001159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
47
 
10.6%
42
 
9.5%
38
 
8.6%
37
 
8.4%
29
 
6.6%
28
 
6.3%
13
 
2.9%
12
 
2.7%
9
 
2.0%
8
 
1.8%
Other values (91) 179
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 436
98.6%
Uppercase Letter 6
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
10.8%
42
 
9.6%
38
 
8.7%
37
 
8.5%
29
 
6.7%
28
 
6.4%
13
 
3.0%
12
 
2.8%
9
 
2.1%
8
 
1.8%
Other values (85) 173
39.7%
Uppercase Letter
ValueCountFrequency (%)
T 1
16.7%
N 1
16.7%
M 1
16.7%
I 1
16.7%
R 1
16.7%
B 1
16.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 436
98.6%
Latin 6
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
10.8%
42
 
9.6%
38
 
8.7%
37
 
8.5%
29
 
6.7%
28
 
6.4%
13
 
3.0%
12
 
2.8%
9
 
2.1%
8
 
1.8%
Other values (85) 173
39.7%
Latin
ValueCountFrequency (%)
T 1
16.7%
N 1
16.7%
M 1
16.7%
I 1
16.7%
R 1
16.7%
B 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 436
98.6%
ASCII 6
 
1.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
47
 
10.8%
42
 
9.6%
38
 
8.7%
37
 
8.5%
29
 
6.7%
28
 
6.4%
13
 
3.0%
12
 
2.8%
9
 
2.1%
8
 
1.8%
Other values (85) 173
39.7%
ASCII
ValueCountFrequency (%)
T 1
16.7%
N 1
16.7%
M 1
16.7%
I 1
16.7%
R 1
16.7%
B 1
16.7%

데이터타입
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
VARCHAR(8000)
27 
VARCHAR(20)
VARCHAR(100)
VARCHAR(200)
VARCHAR(8)
 
2
Other values (5)

Length

Max length13
Median length13
Mean length12.232558
Min length8

Unique

Unique5 ?
Unique (%)11.6%

Sample

1st rowVARCHAR(20)
2nd rowVARCHAR(50)
3rd rowVARCHAR(10)
4th rowVARCHAR(8)
5th rowNUMBER(3)

Common Values

ValueCountFrequency (%)
VARCHAR(8000) 27
62.8%
VARCHAR(20) 3
 
7.0%
VARCHAR(100) 3
 
7.0%
VARCHAR(200) 3
 
7.0%
VARCHAR(8) 2
 
4.7%
VARCHAR(50) 1
 
2.3%
VARCHAR(10) 1
 
2.3%
NUMBER(3) 1
 
2.3%
VARCHAR(40) 1
 
2.3%
DATETIME 1
 
2.3%

Length

2023-12-13T08:40:19.118218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:40:19.232324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
varchar(8000 27
62.8%
varchar(20 3
 
7.0%
varchar(100 3
 
7.0%
varchar(200 3
 
7.0%
varchar(8 2
 
4.7%
varchar(50 1
 
2.3%
varchar(10 1
 
2.3%
number(3 1
 
2.3%
varchar(40 1
 
2.3%
datetime 1
 
2.3%
Distinct42
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-13T08:40:19.476524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length345
Median length45
Mean length41.651163
Min length3

Characters and Unicode

Total characters1791
Distinct characters155
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)95.3%

Sample

1st row센터코드 (5자리 : XXXXX) / 00030 : 국립암센터 예) 00030
2nd row센터별 기준에 따라 생성
3rd row개인고유번호(10자리) / 센터별 별도부여 예) RN12345678
4th row외과병리접수일자 / YYYYMMDD
5th row외과병리접수일자별 순번
ValueCountFrequency (%)
29
 
10.6%
free 18
 
6.6%
text 18
 
6.6%
11
 
4.0%
예)present 6
 
2.2%
margin 5
 
1.8%
예)absent 5
 
1.8%
ln 5
 
1.8%
y 4
 
1.5%
fatty 4
 
1.5%
Other values (138) 168
61.5%
2023-12-13T08:40:19.843365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
232
 
13.0%
e 144
 
8.0%
t 106
 
5.9%
i 80
 
4.5%
a 79
 
4.4%
r 77
 
4.3%
n 64
 
3.6%
s 62
 
3.5%
o 54
 
3.0%
) 46
 
2.6%
Other values (145) 847
47.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 970
54.2%
Other Letter 306
 
17.1%
Space Separator 232
 
13.0%
Uppercase Letter 92
 
5.1%
Other Punctuation 71
 
4.0%
Decimal Number 56
 
3.1%
Close Punctuation 46
 
2.6%
Open Punctuation 8
 
0.4%
Dash Punctuation 5
 
0.3%
Math Symbol 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
11.4%
21
 
6.9%
20
 
6.5%
19
 
6.2%
19
 
6.2%
18
 
5.9%
18
 
5.9%
11
 
3.6%
10
 
3.3%
6
 
2.0%
Other values (78) 129
42.2%
Lowercase Letter
ValueCountFrequency (%)
e 144
14.8%
t 106
10.9%
i 80
 
8.2%
a 79
 
8.1%
r 77
 
7.9%
n 64
 
6.6%
s 62
 
6.4%
o 54
 
5.6%
f 44
 
4.5%
c 41
 
4.2%
Other values (13) 219
22.6%
Uppercase Letter
ValueCountFrequency (%)
Y 12
13.0%
N 12
13.0%
C 9
9.8%
L 7
 
7.6%
P 7
 
7.6%
A 6
 
6.5%
X 5
 
5.4%
D 5
 
5.4%
M 5
 
5.4%
R 4
 
4.3%
Other values (10) 20
21.7%
Decimal Number
ValueCountFrequency (%)
0 19
33.9%
1 9
16.1%
2 7
 
12.5%
3 7
 
12.5%
5 7
 
12.5%
6 2
 
3.6%
7 2
 
3.6%
9 1
 
1.8%
4 1
 
1.8%
8 1
 
1.8%
Other Punctuation
ValueCountFrequency (%)
/ 27
38.0%
, 21
29.6%
: 13
18.3%
. 4
 
5.6%
" 2
 
2.8%
% 2
 
2.8%
' 1
 
1.4%
# 1
 
1.4%
Space Separator
ValueCountFrequency (%)
232
100.0%
Close Punctuation
ValueCountFrequency (%)
) 46
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Math Symbol
ValueCountFrequency (%)
| 4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1062
59.3%
Common 423
 
23.6%
Hangul 306
 
17.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
11.4%
21
 
6.9%
20
 
6.5%
19
 
6.2%
19
 
6.2%
18
 
5.9%
18
 
5.9%
11
 
3.6%
10
 
3.3%
6
 
2.0%
Other values (78) 129
42.2%
Latin
ValueCountFrequency (%)
e 144
13.6%
t 106
 
10.0%
i 80
 
7.5%
a 79
 
7.4%
r 77
 
7.3%
n 64
 
6.0%
s 62
 
5.8%
o 54
 
5.1%
f 44
 
4.1%
c 41
 
3.9%
Other values (33) 311
29.3%
Common
ValueCountFrequency (%)
232
54.8%
) 46
 
10.9%
/ 27
 
6.4%
, 21
 
5.0%
0 19
 
4.5%
: 13
 
3.1%
1 9
 
2.1%
( 8
 
1.9%
2 7
 
1.7%
3 7
 
1.7%
Other values (14) 34
 
8.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1485
82.9%
Hangul 306
 
17.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
232
15.6%
e 144
 
9.7%
t 106
 
7.1%
i 80
 
5.4%
a 79
 
5.3%
r 77
 
5.2%
n 64
 
4.3%
s 62
 
4.2%
o 54
 
3.6%
) 46
 
3.1%
Other values (57) 541
36.4%
Hangul
ValueCountFrequency (%)
35
 
11.4%
21
 
6.9%
20
 
6.5%
19
 
6.2%
19
 
6.2%
18
 
5.9%
18
 
5.9%
11
 
3.6%
10
 
3.3%
6
 
2.0%
Other values (78) 129
42.2%

컬럼데이터수
Real number (ℝ)

ZEROS 

Distinct28
Distinct (%)65.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean275.60465
Minimum0
Maximum571
Zeros6
Zeros (%)14.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-13T08:40:19.949233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1122.5
median155
Q3493.5
95-th percentile571
Maximum571
Range571
Interquartile range (IQR)371

Descriptive statistics

Standard deviation215.06357
Coefficient of variation (CV)0.78033361
Kurtosis-1.6143125
Mean275.60465
Median Absolute Deviation (MAD)155
Skewness0.1400973
Sum11851
Variance46252.34
MonotonicityNot monotonic
2023-12-13T08:40:20.047552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
571 7
 
16.3%
0 6
 
14.0%
141 2
 
4.7%
137 2
 
4.7%
494 2
 
4.7%
142 2
 
4.7%
108 1
 
2.3%
8 1
 
2.3%
376 1
 
2.3%
302 1
 
2.3%
Other values (18) 18
41.9%
ValueCountFrequency (%)
0 6
14.0%
1 1
 
2.3%
8 1
 
2.3%
47 1
 
2.3%
108 1
 
2.3%
113 1
 
2.3%
132 1
 
2.3%
137 2
 
4.7%
138 1
 
2.3%
139 1
 
2.3%
ValueCountFrequency (%)
571 7
16.3%
513 1
 
2.3%
511 1
 
2.3%
494 2
 
4.7%
493 1
 
2.3%
487 1
 
2.3%
485 1
 
2.3%
465 1
 
2.3%
435 1
 
2.3%
376 1
 
2.3%

표시형식
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)18.6%
Missing0
Missing (%)0.0%
Memory size476.0 B
Free 텍스트
29 
텍스트
YYYYMMDD
 
2
Y 여 | N 부
 
2
문자(5) : XXXXX
 
1
Other values (3)

Length

Max length19
Median length8
Mean length7.8372093
Min length2

Unique

Unique4 ?
Unique (%)9.3%

Sample

1st row문자(5) : XXXXX
2nd row텍스트
3rd row문자(10) : XXXXXXXXXX
4th rowYYYYMMDD
5th row숫자

Common Values

ValueCountFrequency (%)
Free 텍스트 29
67.4%
텍스트 6
 
14.0%
YYYYMMDD 2
 
4.7%
Y 여 | N 부 2
 
4.7%
문자(5) : XXXXX 1
 
2.3%
문자(10) : XXXXXXXXXX 1
 
2.3%
숫자 1
 
2.3%
YYYY-MM-DD HH:MI:SS 1
 
2.3%

Length

2023-12-13T08:40:20.177338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:40:20.317276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
텍스트 35
41.2%
free 29
34.1%
4
 
4.7%
yyyymmdd 2
 
2.4%
y 2
 
2.4%
2
 
2.4%
n 2
 
2.4%
2
 
2.4%
문자(5 1
 
1.2%
xxxxx 1
 
1.2%
Other values (5) 5
 
5.9%

Interactions

2023-12-13T08:40:16.372095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:40:16.223857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:40:16.438040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:40:16.307979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:40:20.442840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번컬럼아이디컬럼명데이터타입컬럼설명컬럼데이터수표시형식
순번1.0001.0001.0000.6290.9190.6690.451
컬럼아이디1.0001.0001.0001.0001.0001.0001.000
컬럼명1.0001.0001.0001.0001.0001.0001.000
데이터타입0.6291.0001.0001.0001.0000.5280.965
컬럼설명0.9191.0001.0001.0001.0001.0001.000
컬럼데이터수0.6691.0001.0000.5281.0001.0000.328
표시형식0.4511.0001.0000.9651.0000.3281.000
2023-12-13T08:40:20.552488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
데이터타입표시형식
데이터타입1.0000.865
표시형식0.8651.000
2023-12-13T08:40:20.628042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번컬럼데이터수데이터타입표시형식
순번1.000-0.4570.2470.219
컬럼데이터수-0.4571.0000.2670.092
데이터타입0.2470.2671.0000.865
표시형식0.2190.0920.8651.000

Missing values

2023-12-13T08:40:16.542878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:40:16.672846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번분류아이디분류명테이블아이디테이블명컬럼아이디컬럼명데이터타입컬럼설명컬럼데이터수표시형식
01PTH병리BLDT_PTH_SRGC담도암_병리_외과CENTER_CD센터코드VARCHAR(20)센터코드 (5자리 : XXXXX) / 00030 : 국립암센터 예) 00030571문자(5) : XXXXX
12PTH병리BLDT_PTH_SRGC담도암_병리_외과IRB_APRV_NOIRB승인번호VARCHAR(50)센터별 기준에 따라 생성571텍스트
23PTH병리BLDT_PTH_SRGC담도암_병리_외과PT_SBST_NO환자대체번호VARCHAR(10)개인고유번호(10자리) / 센터별 별도부여 예) RN12345678571문자(10) : XXXXXXXXXX
34PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_ACPT_YMD외과병리접수일자VARCHAR(8)외과병리접수일자 / YYYYMMDD571YYYYMMDD
45PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_SEQ외과병리순번NUMBER(3)외과병리접수일자별 순번571숫자
56PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_READ_YMD외과병리판독일자VARCHAR(8)외과병리의 판독일자 / YYYYMMDD571YYYYMMDD
67PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_HTLG_TYPE_CONT외과병리조직학적유형내용VARCHAR(8000)외과병리조직학적유형내용 / free text 예) Classical adenocarcinoma155Free 텍스트
78PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_TUMR_MAX_SIZE_CONT외과병리종양최대크기내용VARCHAR(8000)외과병리종양최대크기내용 / free text 예)2.5511Free 텍스트
89PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_LESN_TYPE_CONT외과병리병변유형내용VARCHAR(8000)외과병리병변유형내용 / free text 예)mass forming type141Free 텍스트
910PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_HG_CONT외과병리조직학적등급내용VARCHAR(40)외과병리조직학적등급내용 / free text 예)The worst differentiation: poorly differentiated The major differentiation: moderately differentiated137Free 텍스트
순번분류아이디분류명테이블아이디테이블명컬럼아이디컬럼명데이터타입컬럼설명컬럼데이터수표시형식
3334PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_DSNT_MTST_YN외과병리원격전이여부VARCHAR(20)Y, Y |N, N0Y 여 | N 부
3435PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_DSNT_MTST_SITE_CONT외과병리원격전이부위내용VARCHAR(8000)예) Lung0Free 텍스트
3536PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_CRTV_SRMV_DGRE_CONT외과병리완치절제정도내용VARCHAR(8000)R0 | R1 | R20Free 텍스트
3637PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_HPRC_INVA_YN외과병리간실질침윤여부VARCHAR(20)Y, Y |N, N0Y 여 | N 부
3738PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_HPRC_INVA_CONT외과병리간실질침윤내용VARCHAR(8000)예) 숫자(%)0Free 텍스트
3839PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_PATL_STAG_VL외과병리병리학적병기값VARCHAR(100)예) ypT3N1494텍스트
3940PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_PATL_T_STAG_VL외과병리병리학적T병기값VARCHAR(200)예) 3494텍스트
4041PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_PATL_N_STAG_VL외과병리병리학적N병기값VARCHAR(200)예) 1376텍스트
4142PTH병리BLDT_PTH_SRGC담도암_병리_외과SGPT_PATL_M_STAG_VL외과병리병리학적M병기값VARCHAR(200)예) 08텍스트
4243PTH병리BLDT_PTH_SRGC담도암_병리_외과CRTN_DT생성일시DATETIME생성일시 DEFAULT current_timestamp()571YYYY-MM-DD HH:MI:SS