Overview

Dataset statistics

Number of variables4
Number of observations27
Missing cells10
Missing cells (%)9.3%
Duplicate rows1
Duplicate rows (%)3.7%
Total size in memory996.0 B
Average record size in memory36.9 B

Variable types

Unsupported1
Text2
Categorical1

Dataset

Description대전예술의전당 무대기계 장비 현황입니다. 아트홀 상.하부 무대기계, 앙상블홀 상.하부 무대기계 수량 및 제원에 대한 정보입니다.
URLhttps://www.data.go.kr/data/15081768/fileData.do

Alerts

Dataset has 1 (3.7%) duplicate rowsDuplicates
NO has 4 (14.8%) missing valuesMissing
조 물 명 has 5 (18.5%) missing valuesMissing
구 동 부 has 1 (3.7%) missing valuesMissing
NO is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 05:47:06.629885
Analysis finished2023-12-12 05:47:07.094878
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

NO
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)14.8%
Memory size348.0 B

조 물 명
Text

MISSING 

Distinct21
Distinct (%)95.5%
Missing5
Missing (%)18.5%
Memory size348.0 B
2023-12-12T14:47:07.260443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length24
Mean length21.090909
Min length12

Characters and Unicode

Total characters464
Distinct characters34
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)90.9%

Sample

1st rowAPRON STAGE LIGHT BATTEN NO.1
2nd rowAPRON STAGE LIGHT BATTEN NO.2
3rd rowPLACARD BATTEN
4th rowSAFETY CURTAIN
5th rowMAIN CURTAIN
ValueCountFrequency (%)
batten 7
 
10.0%
light 5
 
7.1%
stage 4
 
5.7%
curtain 4
 
5.7%
reflection 4
 
5.7%
board 4
 
5.7%
sound 4
 
5.7%
side 3
 
4.3%
apron 2
 
2.9%
horizont 2
 
2.9%
Other values (28) 31
44.3%
2023-12-12T14:47:07.693353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
48
 
10.3%
T 44
 
9.5%
E 38
 
8.2%
N 38
 
8.2%
A 33
 
7.1%
O 33
 
7.1%
R 32
 
6.9%
I 31
 
6.7%
S 22
 
4.7%
C 17
 
3.7%
Other values (24) 128
27.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 402
86.6%
Space Separator 48
 
10.3%
Other Punctuation 5
 
1.1%
Other Letter 5
 
1.1%
Decimal Number 2
 
0.4%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
T 44
10.9%
E 38
 
9.5%
N 38
 
9.5%
A 33
 
8.2%
O 33
 
8.2%
R 32
 
8.0%
I 31
 
7.7%
S 22
 
5.5%
C 17
 
4.2%
D 16
 
4.0%
Other values (12) 98
24.4%
Other Letter
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Other Punctuation
ValueCountFrequency (%)
. 4
80.0%
/ 1
 
20.0%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Space Separator
ValueCountFrequency (%)
48
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 402
86.6%
Common 57
 
12.3%
Hangul 5
 
1.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
T 44
10.9%
E 38
 
9.5%
N 38
 
9.5%
A 33
 
8.2%
O 33
 
8.2%
R 32
 
8.0%
I 31
 
7.7%
S 22
 
5.5%
C 17
 
4.2%
D 16
 
4.0%
Other values (12) 98
24.4%
Common
ValueCountFrequency (%)
48
84.2%
. 4
 
7.0%
2 1
 
1.8%
( 1
 
1.8%
/ 1
 
1.8%
) 1
 
1.8%
1 1
 
1.8%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 459
98.9%
Hangul 5
 
1.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
48
 
10.5%
T 44
 
9.6%
E 38
 
8.3%
N 38
 
8.3%
A 33
 
7.2%
O 33
 
7.2%
R 32
 
7.0%
I 31
 
6.8%
S 22
 
4.8%
C 17
 
3.7%
Other values (19) 123
26.8%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

수 량
Categorical

Distinct9
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size348.0 B
1조
12 
2조
<NA>
3조
48조
 
1
Other values (4)

Length

Max length4
Median length2
Mean length2.3333333
Min length2

Unique

Unique5 ?
Unique (%)18.5%

Sample

1st row1조
2nd row1조
3rd row1조
4th row1조
5th row1조

Common Values

ValueCountFrequency (%)
1조 12
44.4%
2조 5
18.5%
<NA> 3
 
11.1%
3조 2
 
7.4%
48조 1
 
3.7%
12조 1
 
3.7%
4조 1
 
3.7%
6조 1
 
3.7%
98조 1
 
3.7%

Length

2023-12-12T14:47:07.862908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:47:07.999354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1조 12
44.4%
2조 5
18.5%
na 3
 
11.1%
3조 2
 
7.4%
48조 1
 
3.7%
12조 1
 
3.7%
4조 1
 
3.7%
6조 1
 
3.7%
98조 1
 
3.7%

구 동 부
Text

MISSING 

Distinct16
Distinct (%)61.5%
Missing1
Missing (%)3.7%
Memory size348.0 B
2023-12-12T14:47:08.135807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length23.5
Mean length14.423077
Min length9

Characters and Unicode

Total characters375
Distinct characters32
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)46.2%

Sample

1st row5.5 kw * 6p
2nd row3.7 kw * 6p
3rd row3.7 kw * 6p
4th row7.5 kw * 4p, 8p
5th rowA.C 18.5kw * 4p
ValueCountFrequency (%)
30
25.6%
kw 24
20.5%
6p 16
13.7%
3.7 8
 
6.8%
4p 7
 
6.0%
2.2 4
 
3.4%
a.c 4
 
3.4%
up/down 3
 
2.6%
open/close 2
 
1.7%
18.5 2
 
1.7%
Other values (13) 17
14.5%
2023-12-12T14:47:08.470022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
92
24.5%
. 31
 
8.3%
w 26
 
6.9%
k 26
 
6.9%
* 24
 
6.4%
p 22
 
5.9%
6 16
 
4.3%
5 15
 
4.0%
7 13
 
3.5%
2 9
 
2.4%
Other values (22) 101
26.9%

Most occurring categories

ValueCountFrequency (%)
Space Separator 92
24.5%
Decimal Number 87
23.2%
Lowercase Letter 74
19.7%
Other Punctuation 69
18.4%
Uppercase Letter 53
14.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
P 7
13.2%
O 7
13.2%
C 6
11.3%
N 6
11.3%
A 4
7.5%
E 4
7.5%
D 3
5.7%
U 3
5.7%
L 3
5.7%
W 3
5.7%
Other values (4) 7
13.2%
Decimal Number
ValueCountFrequency (%)
6 16
18.4%
5 15
17.2%
7 13
14.9%
2 9
10.3%
4 9
10.3%
3 9
10.3%
1 6
 
6.9%
0 6
 
6.9%
8 4
 
4.6%
Other Punctuation
ValueCountFrequency (%)
. 31
44.9%
* 24
34.8%
: 6
 
8.7%
/ 5
 
7.2%
, 3
 
4.3%
Lowercase Letter
ValueCountFrequency (%)
w 26
35.1%
k 26
35.1%
p 22
29.7%
Space Separator
ValueCountFrequency (%)
92
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 248
66.1%
Latin 127
33.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 26
20.5%
k 26
20.5%
p 22
17.3%
P 7
 
5.5%
O 7
 
5.5%
C 6
 
4.7%
N 6
 
4.7%
A 4
 
3.1%
E 4
 
3.1%
D 3
 
2.4%
Other values (7) 16
12.6%
Common
ValueCountFrequency (%)
92
37.1%
. 31
 
12.5%
* 24
 
9.7%
6 16
 
6.5%
5 15
 
6.0%
7 13
 
5.2%
2 9
 
3.6%
4 9
 
3.6%
3 9
 
3.6%
: 6
 
2.4%
Other values (5) 24
 
9.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 375
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
92
24.5%
. 31
 
8.3%
w 26
 
6.9%
k 26
 
6.9%
* 24
 
6.4%
p 22
 
5.9%
6 16
 
4.3%
5 15
 
4.0%
7 13
 
3.5%
2 9
 
2.4%
Other values (22) 101
26.9%

Correlations

2023-12-12T14:47:08.573400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
조 물 명수 량구 동 부
조 물 명1.0001.0001.000
수 량1.0001.0000.000
구 동 부1.0000.0001.000

Missing values

2023-12-12T14:47:06.804109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:47:06.906988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T14:47:07.018873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

NO조 물 명수 량구 동 부
01APRON STAGE LIGHT BATTEN NO.11조5.5 kw * 6p
12APRON STAGE LIGHT BATTEN NO.21조3.7 kw * 6p
23PLACARD BATTEN1조3.7 kw * 6p
34SAFETY CURTAIN1조7.5 kw * 4p, 8p
45MAIN CURTAIN1조A.C 18.5kw * 4p
56CAPTION SCREEN BATTEN1조2.2 kw * 6p
67HOUSE CURTAIN1조UP/DOWN : A.C 18.5 kw
7NaN<NA><NA>OPEN/CLOSE : 1.5 kw * 4p
88PROSCENIUM LIGHT BRIDGE1조7.5 kw * 6p
99PROSCENIUM TOWER2조0.75 kw * 6p
NO조 물 명수 량구 동 부
1715TOP LIGHT BRIDGE3조3.7 kw * 6p
1816DRAW CURTAIN4조UP/DOWN : 3.7 kw * 6p
19NaN<NA><NA>OPEN/CLOSE : 0.4 kw * 4P
2017UPPER HORIZONT LIGHT BATTEN1조2.2 kw * 6p
2118HORIZONT CURTAIN(블랙/화이트)2조2.2 kw * 6p
2219FRONT SOUND REFLECTION BOARD1조5.5 kw * 6p
2320REAR STAGE SET BATTEN6조3.7 kw * 6p
2421SIDE STAGE CRANE HOIST1조2.5 kw,0.75 kw, 0.5 * 3
2522SMOKING DOOR SYSTEM1조0.75 kw * 4p
26합 계<NA>98조<NA>

Duplicate rows

Most frequently occurring

조 물 명수 량구 동 부# duplicates
0SIDE SOUND REFLECTION BOARD2조3.7 kw * 6p2