Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory546.9 KiB
Average record size in memory56.0 B

Variable types

Text2
Categorical3
DateTime1

Dataset

Descriptionㅇ 시험인증이란 제조, 수입 및 판매를 목적으로 하는 기자재에 요구되는 기술시준 또는 규정 등에 적합한지 여부를 평가하고, 안전성 및 신뢰성 등을 확보하는 제도 장치입니다. 이러한 시험 인증을 통해 소비자는 기기안전성을 보장받고, 판매자는 기업과 기기의 신뢰성을 보장 받으며 시장 전체적으로는 불량품 양산 축소 및 기기의 고품질 성장화를 보장 받습니다.ㅇ 제품의 제조, 수입 및 판매에 시험인증이 필수적으로 진행되어야 하기 때문에, 시험인증 접수 데이터를 통해 다양한 산업분야의 동향을 파악할 수 있습니다.
Author한국산업기술시험원
URLhttps://www.data.go.kr/data/15124882/fileData.do

Alerts

사업구분명 has constant value ""Constant
단위사업중분류명 is highly overall correlated with 단위사업소분류명High correlation
단위사업소분류명 is highly overall correlated with 단위사업중분류명High correlation
단위사업중분류명 is highly imbalanced (80.1%)Imbalance
개별접수번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 15:29:51.194183
Analysis finished2023-12-12 15:29:51.842444
Duration0.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개별접수번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:29:51.980004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length15
Mean length15
Min length15

Characters and Unicode

Total characters150000
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st row0610-12-0091020
2nd row0606-12-0060262
3rd row0610-12-0092751
4th row0602-12-0028701
5th row0603-12-0033468
ValueCountFrequency (%)
0610-12-0091020 1
 
< 0.1%
0610-12-0093591 1
 
< 0.1%
0610-12-0094647 1
 
< 0.1%
0608-12-0076770 1
 
< 0.1%
0602-12-0030055 1
 
< 0.1%
0602-12-0024880 1
 
< 0.1%
0604-12-0041930 1
 
< 0.1%
0607-12-0065442 1
 
< 0.1%
0609-12-0088307 1
 
< 0.1%
0609-12-0086107 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-13T00:29:52.357573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 44600
29.7%
- 20000
13.3%
1 17317
 
11.5%
2 15647
 
10.4%
6 13709
 
9.1%
8 7762
 
5.2%
7 7674
 
5.1%
3 6102
 
4.1%
5 6022
 
4.0%
4 5848
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 130000
86.7%
Dash Punctuation 20000
 
13.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 44600
34.3%
1 17317
 
13.3%
2 15647
 
12.0%
6 13709
 
10.5%
8 7762
 
6.0%
7 7674
 
5.9%
3 6102
 
4.7%
5 6022
 
4.6%
4 5848
 
4.5%
9 5319
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 20000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 150000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 44600
29.7%
- 20000
13.3%
1 17317
 
11.5%
2 15647
 
10.4%
6 13709
 
9.1%
8 7762
 
5.2%
7 7674
 
5.1%
3 6102
 
4.1%
5 6022
 
4.0%
4 5848
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 150000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 44600
29.7%
- 20000
13.3%
1 17317
 
11.5%
2 15647
 
10.4%
6 13709
 
9.1%
8 7762
 
5.2%
7 7674
 
5.1%
3 6102
 
4.1%
5 6022
 
4.0%
4 5848
 
3.9%

사업구분명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
표준
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row표준
2nd row표준
3rd row표준
4th row표준
5th row표준

Common Values

ValueCountFrequency (%)
표준 10000
100.0%

Length

2023-12-13T00:29:52.501246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:29:52.607052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
표준 10000
100.0%
Distinct318
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2006-01-27 00:00:00
Maximum2008-08-29 00:00:00
2023-12-13T00:29:52.800087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:29:53.001679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct56
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:29:53.235425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length1
Mean length1.3082
Min length1

Characters and Unicode

Total characters13082
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)0.1%

Sample

1st row5
2nd row5
3rd row8
4th row
5th row13
ValueCountFrequency (%)
8 770
9.7%
9 732
9.2%
10 725
9.1%
7 717
9.0%
6 666
 
8.4%
11 632
 
7.9%
5 624
 
7.8%
4 589
 
7.4%
12 422
 
5.3%
3 386
 
4.9%
Other values (45) 1689
21.2%
2023-12-13T00:29:53.660520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 3581
27.4%
2048
15.7%
2 913
 
7.0%
8 855
 
6.5%
4 844
 
6.5%
0 838
 
6.4%
5 825
 
6.3%
7 822
 
6.3%
9 790
 
6.0%
3 789
 
6.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 11034
84.3%
Space Separator 2048
 
15.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 3581
32.5%
2 913
 
8.3%
8 855
 
7.7%
4 844
 
7.6%
0 838
 
7.6%
5 825
 
7.5%
7 822
 
7.4%
9 790
 
7.2%
3 789
 
7.2%
6 777
 
7.0%
Space Separator
ValueCountFrequency (%)
2048
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 13082
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 3581
27.4%
2048
15.7%
2 913
 
7.0%
8 855
 
6.5%
4 844
 
6.5%
0 838
 
6.4%
5 825
 
6.3%
7 822
 
6.3%
9 790
 
6.0%
3 789
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13082
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 3581
27.4%
2048
15.7%
2 913
 
7.0%
8 855
 
6.5%
4 844
 
6.5%
0 838
 
6.4%
5 825
 
6.3%
7 822
 
6.3%
9 790
 
6.0%
3 789
 
6.0%

단위사업중분류명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
KOLAS교정
9432 
KTL시험
 
567
환경측정기 정도검사
 
1

Length

Max length10
Median length7
Mean length6.8869
Min length5

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowKTL시험
2nd rowKOLAS교정
3rd rowKOLAS교정
4th rowKOLAS교정
5th rowKOLAS교정

Common Values

ValueCountFrequency (%)
KOLAS교정 9432
94.3%
KTL시험 567
 
5.7%
환경측정기 정도검사 1
 
< 0.1%

Length

2023-12-13T00:29:53.820599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:29:53.952337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kolas교정 9432
94.3%
ktl시험 567
 
5.7%
환경측정기 1
 
< 0.1%
정도검사 1
 
< 0.1%

단위사업소분류명
Categorical

HIGH CORRELATION 

Distinct36
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
기타 길이 관련량(KOLAS교정)
2437 
비접촉식 온도(KOLAS교정)
1078 
교류 및 교류전력(KOLAS교정)
509 
직류(KOLAS교정)
 
476
압력(KOLAS교정)
 
475
Other values (31)
5025 

Length

Max length21
Median length19
Mean length14.7491
Min length10

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row힘토크 및 관련량 시험(KTL시험)
2nd row비접촉식 온도(KOLAS교정)
3rd row비접촉식 온도(KOLAS교정)
4th row밀도(KOLAS교정)
5th row기타 길이 관련량(KOLAS교정)

Common Values

ValueCountFrequency (%)
기타 길이 관련량(KOLAS교정) 2437
24.4%
비접촉식 온도(KOLAS교정) 1078
 
10.8%
교류 및 교류전력(KOLAS교정) 509
 
5.1%
직류(KOLAS교정) 476
 
4.8%
압력(KOLAS교정) 475
 
4.8%
토크(KOLAS교정) 442
 
4.4%
저항 용량 및 인덕턴스(KOLAS교정) 396
 
4.0%
열 및 온도(KTL시험) 391
 
3.9%
습도(KOLAS교정) 362
 
3.6%
힘(KOLAS교정) 316
 
3.2%
Other values (26) 3118
31.2%

Length

2023-12-13T00:29:54.103427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
길이 2456
 
11.9%
기타 2437
 
11.8%
관련량(kolas교정 2437
 
11.8%
2138
 
10.3%
비접촉식 1078
 
5.2%
온도(kolas교정 1078
 
5.2%
교류 509
 
2.5%
교류전력(kolas교정 509
 
2.5%
직류(kolas교정 476
 
2.3%
압력(kolas교정 475
 
2.3%
Other values (41) 7128
34.4%

Correlations

2023-12-13T00:29:54.189503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리일수단위사업중분류명단위사업소분류명
처리일수1.0000.3560.535
단위사업중분류명0.3561.0001.000
단위사업소분류명0.5351.0001.000
2023-12-13T00:29:54.582529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단위사업중분류명단위사업소분류명
단위사업중분류명1.0000.998
단위사업소분류명0.9981.000
2023-12-13T00:29:54.709926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단위사업중분류명단위사업소분류명
단위사업중분류명1.0000.998
단위사업소분류명0.9981.000

Missing values

2023-12-13T00:29:51.630947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:29:51.772549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

개별접수번호사업구분명접수일자처리일수단위사업중분류명단위사업소분류명
910190610-12-0091020표준2006-10-255KTL시험힘토크 및 관련량 시험(KTL시험)
602610606-12-0060262표준2006-06-125KOLAS교정비접촉식 온도(KOLAS교정)
927500610-12-0092751표준2006-10-138KOLAS교정비접촉식 온도(KOLAS교정)
287000602-12-0028701표준2006-02-22KOLAS교정밀도(KOLAS교정)
334670603-12-0033468표준2006-03-2113KOLAS교정기타 길이 관련량(KOLAS교정)
173300711-12-0017331표준2007-11-15KOLAS교정저항 용량 및 인덕턴스(KOLAS교정)
192330711-12-0019234표준2007-11-2314KOLAS교정기타 길이 관련량(KOLAS교정)
200430711-12-0020044표준2007-11-1917KOLAS교정부피(KOLAS교정)
794510608-12-0079452표준2006-08-226KOLAS교정부피(KOLAS교정)
795040608-12-0079505표준2006-08-109KOLAS교정광원 및 검출기(KOLAS교정)
개별접수번호사업구분명접수일자처리일수단위사업중분류명단위사업소분류명
134340807-12-0013435표준2008-07-146KOLAS교정유체유동(KOLAS교정)
345370603-12-0034538표준2006-03-3113KTL시험열 및 온도(KTL시험)
597240606-12-0059725표준2006-06-029KOLAS교정비접촉식 온도(KOLAS교정)
144500711-12-0014451표준2007-11-0511KOLAS교정비접촉식 온도(KOLAS교정)
638080606-12-0063809표준2006-06-02KOLAS교정기타 길이 관련량(KOLAS교정)
541510605-12-0054152표준2006-05-08KOLAS교정기타 길이 관련량(KOLAS교정)
625800606-12-0062581표준2006-06-0510KOLAS교정유체유동(KOLAS교정)
683770607-12-0068377표준2006-07-133KOLAS교정토크(KOLAS교정)
25390710-12-0002540표준2007-10-017KOLAS교정교류 및 교류전력(KOLAS교정)
108060807-12-0010807표준2008-07-2310KOLAS교정비접촉식 온도(KOLAS교정)