Overview

Dataset statistics

Number of variables: 4
Number of observations: 1833
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 57.4 KiB
Average record size in memory: 32.1 B
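
The average record size follows from the total size and the row count. A minimal sketch of that arithmetic, assuming the report divides total bytes by the number of observations (the variable names are illustrative, not taken from the profiling tool):

```python
# Figures copied from the dataset statistics above.
total_size_kib = 57.4   # "Total size in memory"
n_observations = 1833   # "Number of observations"

# 1 KiB = 1024 bytes, so convert before dividing by the row count.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")  # prints "32.1 B"
```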

Variable types

Text: 3
Categorical: 1

Dataset

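Description: Public occupational classification data from Gyeongnam Provincial Geochang College (경남도립거창대학). It provides four fields: classification code (분류코드), major category (대분류), middle category (중분류), and occupation name (직업명).
URL: https://www.data.go.kr/data/15097845/fileData.do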

Alerts

분류코드 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 16:37:00.370139
Analysis finished: 2023-12-12 16:37:00.853453
Duration: 0.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
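
The reported duration can be checked from the two timestamps. A small sketch using only the standard library; the format string is an assumption that matches the timestamps as printed above:

```python
from datetime import datetime

# Format assumed from the timestamps shown in this report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 16:37:00.370139", FMT)
finished = datetime.strptime("2023-12-12 16:37:00.853453", FMT)

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")  # prints "0.48 seconds"
```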

Variables

분류코드
Text

UNIQUE

Distinct: 1833
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB

Length

Max length: 5
Median length: 5
Mean length: 4.5177305
Min length: 1
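
As an illustration of how these length statistics arise, the sketch below computes them for the five sample values listed for 분류코드 in this report. Because it uses five values rather than all 1833, its results differ from the figures above. The last line checks the reported mean against the total character count (8281) given under "Characters and Unicode".

```python
from statistics import mean, median

# The five sample values shown for 분류코드; not the full column.
codes = ["11", "111", "1110", "11101", "11102"]

lengths = [len(code) for code in codes]
stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(stats)

# Mean length for the full column = total characters / observations.
report_mean = 8281 / 1833
print(round(report_mean, 7))
```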

Characters and Unicode

Total characters: 8281
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
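
A minimal sketch of counting Unicode general categories with Python's standard `unicodedata` module, using a handful of codes that appear in this report. The standard library exposes general categories (`Nd` is Decimal Number, `Lu` is Uppercase Letter) but not scripts or blocks, so those would need another data source. This is an illustration, not necessarily how the profiling tool computes its figures.

```python
import unicodedata
from collections import Counter

# A few codes that appear in this report (first rows plus one "A" code).
codes = ["11", "111", "1110", "11101", "11102", "A1"]

# Count the general category of every character in every code.
category_counts = Counter(
    unicodedata.category(ch) for code in codes for ch in code
)
print(category_counts)
```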

Unique

Unique: 1833
Unique (%): 100.0%

Sample

1st row: 11
2nd row: 111
3rd row: 1110
4th row: 11101
5th row: 11102

| Value | Count | Frequency (%) |
| 11 | 1 | 0.1% |
| 75333 | 1 | 0.1% |
| 75331 | 1 | 0.1% |
| 7533 | 1 | 0.1% |
| 75323 | 1 | 0.1% |
| 75322 | 1 | 0.1% |
| 75321 | 1 | 0.1% |
| 7532 | 1 | 0.1% |
| 75319 | 1 | 0.1% |
| 75315 | 1 | 0.1% |
| Other values (1823) | 1823 | 99.5% |

Most occurring characters

| Value | Count | Frequency (%) |
| 2 | 1977 | 23.9% |
| 1 | 1593 | 19.2% |
| 3 | 1024 | 12.4% |
| 4 | 756 | 9.1% |
| 9 | 660 | 8.0% |
| 7 | 560 | 6.8% |
| 8 | 520 | 6.3% |
| 5 | 488 | 5.9% |
| 0 | 454 | 5.5% |
| 6 | 240 | 2.9% |

Most occurring categories

| Value | Count | Frequency (%) |
| Decimal Number | 8272 | 99.9% |
| Uppercase Letter | 9 | 0.1% |

Most frequent character per category

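Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1977 | 23.9% |
| 1 | 1593 | 19.3% |
| 3 | 1024 | 12.4% |
| 4 | 756 | 9.1% |
| 9 | 660 | 8.0% |
| 7 | 560 | 6.8% |
| 8 | 520 | 6.3% |
| 5 | 488 | 5.9% |
| 0 | 454 | 5.5% |
| 6 | 240 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 9 | 100.0% |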

Most occurring scripts

| Value | Count | Frequency (%) |
| Common | 8272 | 99.9% |
| Latin | 9 | 0.1% |

Most frequent character per script

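Common
| Value | Count | Frequency (%) |
| 2 | 1977 | 23.9% |
| 1 | 1593 | 19.3% |
| 3 | 1024 | 12.4% |
| 4 | 756 | 9.1% |
| 9 | 660 | 8.0% |
| 7 | 560 | 6.8% |
| 8 | 520 | 6.3% |
| 5 | 488 | 5.9% |
| 0 | 454 | 5.5% |
| 6 | 240 | 2.9% |
Latin
| Value | Count | Frequency (%) |
| A | 9 | 100.0% |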

Most occurring blocks

| Value | Count | Frequency (%) |
| ASCII | 8281 | 100.0% |

Most frequent character per block

ASCII
| Value | Count | Frequency (%) |
| 2 | 1977 | 23.9% |
| 1 | 1593 | 19.2% |
| 3 | 1024 | 12.4% |
| 4 | 756 | 9.1% |
| 9 | 660 | 8.0% |
| 7 | 560 | 6.8% |
| 8 | 520 | 6.3% |
| 5 | 488 | 5.9% |
| 0 | 454 | 5.5% |
| 6 | 240 | 2.9% |

대분류
Categorical

Distinct: 10
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB

2.전문가 및 관련 종사자: 647
8.장치,기계조작 및 조립종사자: 340
7.기능원 및 관련 기능 종사자: 303
1.관리자: 121
4.서비스 종사자: 120
Other values (5): 302

Length

Max length: 17
Median length: 14
Mean length: 13.35461
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1.관리자
2nd row: 1.관리자
3rd row: 1.관리자
4th row: 1.관리자
5th row: 1.관리자

Common Values

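| Value | English | Count | Frequency (%) |
| 2.전문가 및 관련 종사자 | Professionals and related workers | 647 | 35.3% |
| 8.장치,기계조작 및 조립종사자 | Equipment and machine operators and assemblers | 340 | 18.5% |
| 7.기능원 및 관련 기능 종사자 | Craft and related trades workers | 303 | 16.5% |
| 1.관리자 | Managers | 121 | 6.6% |
| 4.서비스 종사자 | Service workers | 120 | 6.5% |
| 3.사무 종사자 | Clerical workers | 96 | 5.2% |
| 9.단순노무 종사자 | Elementary workers | 90 | 4.9% |
| 5.판매 종사자 | Sales workers | 58 | 3.2% |
| 6.농림어업 숙련 종사자 | Skilled agricultural, forestry and fishery workers | 49 | 2.7% |
| A.군인 | Armed forces | 9 | 0.5% |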
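
The frequency column is each count divided by the 1833 observations. A short sketch that recomputes the percentages from the counts listed in this section (the dictionary is typed in by hand from the table, not loaded from the dataset):

```python
# Counts copied from the Common Values table for 대분류.
counts = {
    "2.전문가 및 관련 종사자": 647,
    "8.장치,기계조작 및 조립종사자": 340,
    "7.기능원 및 관련 기능 종사자": 303,
    "1.관리자": 121,
    "4.서비스 종사자": 120,
    "3.사무 종사자": 96,
    "9.단순노무 종사자": 90,
    "5.판매 종사자": 58,
    "6.농림어업 숙련 종사자": 49,
    "A.군인": 9,
}

total = sum(counts.values())  # equals the number of observations
percentages = {
    name: round(100 * count / total, 1) for name, count in counts.items()
}

for name, pct in percentages.items():
    print(f"{name}: {pct}%")
```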

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

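| Value | Count | Frequency (%) |
| 종사자 | 1363 | 22.2% |
| 및 | 1290 | 21.1% |
| 관련 | 950 | 15.5% |
| 2.전문가 | 647 | 10.6% |
| 8.장치,기계조작 | 340 | 5.5% |
| 조립종사자 | 340 | 5.5% |
| 7.기능원 | 303 | 4.9% |
| 기능 | 303 | 4.9% |
| 1.관리자 | 121 | 2.0% |
| 4.서비스 | 120 | 2.0% |
| Other values (6) | 351 | 5.7% |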
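
The word frequencies above are consistent with splitting each 대분류 value on whitespace and weighting every word by the number of rows carrying that value. The sketch below assumes that tokenization (the profiling tool's own tokenizer is not shown in this report) and reproduces the three largest counts, including 1290 for the single-character word 및.

```python
from collections import Counter

# Row counts per 대분류 value, copied from the Common Values table.
counts = {
    "2.전문가 및 관련 종사자": 647,
    "8.장치,기계조작 및 조립종사자": 340,
    "7.기능원 및 관련 기능 종사자": 303,
    "1.관리자": 121,
    "4.서비스 종사자": 120,
    "3.사무 종사자": 96,
    "9.단순노무 종사자": 90,
    "5.판매 종사자": 58,
    "6.농림어업 숙련 종사자": 49,
    "A.군인": 9,
}

# Assumed tokenization: split on whitespace, weight by row count.
word_counts = Counter()
for name, n_rows in counts.items():
    for word in name.split():
        word_counts[word] += n_rows

total_words = sum(word_counts.values())
for word, n in word_counts.most_common(3):
    print(word, n, f"{100 * n / total_words:.1f}%")
```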

중분류
Text

Distinct: 53
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB

Length

Max length: 43
Median length: 38
Mean length: 21.666121
Min length: 5

Characters and Unicode

Total characters: 39714
Distinct characters: 124
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 11.공공 및 기업 고위직
2nd row: 11.공공 및 기업 고위직
3rd row: 11.공공 및 기업 고위직
4th row: 11.공공 및 기업 고위직
5th row: 11.공공 및 기업 고위직

| Value | Count | Frequency (%) |
|  | 1989 | 22.0% |
| 관련 | 710 | 7.9% |
| 전문가 | 534 | 5.9% |
| 관련직 | 444 | 4.9% |
| 처리관련 | 299 | 3.3% |
| 재활용 | 299 | 3.3% |
| 88.상하수도 | 299 | 3.3% |
| 기계조작직 | 283 | 3.1% |
| 기능직 | 269 | 3.0% |
| 채굴 | 254 | 2.8% |
| Other values (91) | 3654 | 40.4% |

Most occurring characters

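| Value | Count | Frequency (%) |
| (space) | 7309 | 18.4% |
|  | 2319 | 5.8% |
| . | 2303 | 5.8% |
|  | 1989 | 5.0% |
|  | 1794 | 4.5% |
|  | 1735 | 4.4% |
|  | 1590 | 4.0% |
| 2 | 894 | 2.3% |
|  | 806 | 2.0% |
| 8 | 787 | 2.0% |
| Other values (114) | 18188 | 45.8% |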

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 25969 | 65.4% |
| Space Separator | 7309 | 18.4% |
| Decimal Number | 4124 | 10.4% |
| Other Punctuation | 2303 | 5.8% |
| Uppercase Letter | 9 | < 0.1% |

Most frequent character per category

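Other Letter
| Value | Count | Frequency (%) |
|  | 2319 | 8.9% |
|  | 1989 | 7.7% |
|  | 1794 | 6.9% |
|  | 1735 | 6.7% |
|  | 1590 | 6.1% |
|  | 806 | 3.1% |
|  | 747 | 2.9% |
|  | 742 | 2.9% |
|  | 693 | 2.7% |
|  | 690 | 2.7% |
| Other values (102) | 12864 | 49.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 894 | 21.7% |
| 8 | 787 | 19.1% |
| 7 | 687 | 16.7% |
| 3 | 406 | 9.8% |
| 1 | 369 | 8.9% |
| 4 | 362 | 8.8% |
| 5 | 274 | 6.6% |
| 9 | 215 | 5.2% |
| 6 | 130 | 3.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7309 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2303 | 100.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 9 | 100.0% |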

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 25969 | 65.4% |
| Common | 13736 | 34.6% |
| Latin | 9 | < 0.1% |

Most frequent character per script

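Hangul
| Value | Count | Frequency (%) |
|  | 2319 | 8.9% |
|  | 1989 | 7.7% |
|  | 1794 | 6.9% |
|  | 1735 | 6.7% |
|  | 1590 | 6.1% |
|  | 806 | 3.1% |
|  | 747 | 2.9% |
|  | 742 | 2.9% |
|  | 693 | 2.7% |
|  | 690 | 2.7% |
| Other values (102) | 12864 | 49.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 7309 | 53.2% |
| . | 2303 | 16.8% |
| 2 | 894 | 6.5% |
| 8 | 787 | 5.7% |
| 7 | 687 | 5.0% |
| 3 | 406 | 3.0% |
| 1 | 369 | 2.7% |
| 4 | 362 | 2.6% |
| 5 | 274 | 2.0% |
| 9 | 215 | 1.6% |
Latin
| Value | Count | Frequency (%) |
| A | 9 | 100.0% |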

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 25969 | 65.4% |
| ASCII | 13745 | 34.6% |

Most frequent character per block

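ASCII
| Value | Count | Frequency (%) |
| (space) | 7309 | 53.2% |
| . | 2303 | 16.8% |
| 2 | 894 | 6.5% |
| 8 | 787 | 5.7% |
| 7 | 687 | 5.0% |
| 3 | 406 | 3.0% |
| 1 | 369 | 2.7% |
| 4 | 362 | 2.6% |
| 5 | 274 | 2.0% |
| 9 | 215 | 1.6% |
| Other values (2) | 139 | 1.0% |
Hangul
| Value | Count | Frequency (%) |
|  | 2319 | 8.9% |
|  | 1989 | 7.7% |
|  | 1794 | 6.9% |
|  | 1735 | 6.7% |
|  | 1590 | 6.1% |
|  | 806 | 3.1% |
|  | 747 | 2.9% |
|  | 742 | 2.9% |
|  | 693 | 2.7% |
|  | 690 | 2.7% |
| Other values (102) | 12864 | 49.5% |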

직업명
Text

Distinct: 1668
Distinct (%): 91.0%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB

Length

Max length: 28
Median length: 22
Mean length: 9.9367158
Min length: 2

Characters and Unicode

Total characters: 18214
Distinct characters: 412
Distinct categories: 4
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1508
Unique (%): 82.3%
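
"Distinct" and "Unique" differ for this variable: 1668 distinct values, of which 1508 occur exactly once. A minimal sketch of that distinction, using the five sample rows listed for this variable (so the counts here are for five rows, not 1833):

```python
from collections import Counter

# The five sample rows shown for 직업명 in this report.
names = [
    "공공 및 기업 고위직",
    "의회의원고위공무원 및 공공단체임원",
    "의회의원고위공무원 및 공공단체임원",
    "국회의원",
    "지방의회의원 및 교육위원",
]

freq = Counter(names)
n_distinct = len(freq)                              # different values
n_unique = sum(1 for c in freq.values() if c == 1)  # values seen once

print(n_distinct, n_unique)  # prints "4 3"

# The same ratios for the full column, from the figures reported above.
print(round(100 * 1668 / 1833, 1), round(100 * 1508 / 1833, 1))
```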

Sample

1st row: 공공 및 기업 고위직
2nd row: 의회의원고위공무원 및 공공단체임원
3rd row: 의회의원고위공무원 및 공공단체임원
4th row: 국회의원
5th row: 지방의회의원 및 교육위원

| Value | Count | Frequency (%) |
|  | 622 | 11.5% |
| 조작원 | 189 | 3.5% |
|  | 171 | 3.1% |
|  | 171 | 3.1% |
| 관련 | 101 | 1.9% |
| 관리자 | 100 | 1.8% |
| 연구원 | 97 | 1.8% |
| 기술자 | 86 | 1.6% |
| 종사원 | 84 | 1.5% |
| 사무원 | 69 | 1.3% |
| Other values (1434) | 3742 | 68.9% |

Most occurring characters

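| Value | Count | Frequency (%) |
| (space) | 3600 | 19.8% |
|  | 1115 | 6.1% |
|  | 640 | 3.5% |
|  | 622 | 3.4% |
|  | 573 | 3.1% |
|  | 469 | 2.6% |
|  | 439 | 2.4% |
|  | 399 | 2.2% |
|  | 282 | 1.5% |
|  | 256 | 1.4% |
| Other values (402) | 9819 | 53.9% |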

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 14607 | 80.2% |
| Space Separator | 3600 | 19.8% |
| Uppercase Letter | 4 | < 0.1% |
| Decimal Number | 3 | < 0.1% |

Most frequent character per category

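Other Letter
| Value | Count | Frequency (%) |
|  | 1115 | 7.6% |
|  | 640 | 4.4% |
|  | 622 | 4.3% |
|  | 573 | 3.9% |
|  | 469 | 3.2% |
|  | 439 | 3.0% |
|  | 399 | 2.7% |
|  | 282 | 1.9% |
|  | 256 | 1.8% |
|  | 251 | 1.7% |
| Other values (397) | 9561 | 65.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 | 50.0% |
| C | 2 | 50.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | 66.7% |
| 9 | 1 | 33.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3600 | 100.0% |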

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 14607 | 80.2% |
| Common | 3603 | 19.8% |
| Latin | 4 | < 0.1% |

Most frequent character per script

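Hangul
| Value | Count | Frequency (%) |
|  | 1115 | 7.6% |
|  | 640 | 4.4% |
|  | 622 | 4.3% |
|  | 573 | 3.9% |
|  | 469 | 3.2% |
|  | 439 | 3.0% |
|  | 399 | 2.7% |
|  | 282 | 1.9% |
|  | 256 | 1.8% |
|  | 251 | 1.7% |
| Other values (397) | 9561 | 65.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 3600 | 99.9% |
| 1 | 2 | 0.1% |
| 9 | 1 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| P | 2 | 50.0% |
| C | 2 | 50.0% |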

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 14606 | 80.2% |
| ASCII | 3607 | 19.8% |
| Compat Jamo | 1 | < 0.1% |

Most frequent character per block

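ASCII
| Value | Count | Frequency (%) |
| (space) | 3600 | 99.8% |
| P | 2 | 0.1% |
| C | 2 | 0.1% |
| 1 | 2 | 0.1% |
| 9 | 1 | < 0.1% |
Hangul
| Value | Count | Frequency (%) |
|  | 1115 | 7.6% |
|  | 640 | 4.4% |
|  | 622 | 4.3% |
|  | 573 | 3.9% |
|  | 469 | 3.2% |
|  | 439 | 3.0% |
|  | 399 | 2.7% |
|  | 282 | 1.9% |
|  | 256 | 1.8% |
|  | 251 | 1.7% |
| Other values (396) | 9560 | 65.5% |
Compat Jamo
| Value | Count | Frequency (%) |
|  | 1 | 100.0% |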

Correlations

|  | 대분류 | 중분류 |
| 대분류 | 1.000 | 1.000 |
| 중분류 | 1.000 | 1.000 |
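
The report does not state which association measure produced these values. Assuming it is Cramér's V, a common choice for pairs of categorical columns, a value of 1.000 is what one would see when every 중분류 value belongs to exactly one 대분류 value. The sketch below implements the plain (uncorrected) statistic on hypothetical toy labels; `cramers_v` and the label lists are illustrative names, not part of the profiling tool.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V for two equal-length label lists."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        raise ValueError("each variable needs at least two categories")
    return sqrt(chi2 / (n * k))


# Hypothetical toy labels: each middle label sits under one major label.
major = ["1", "1", "A", "A"]
middle = ["11", "12", "A1", "A2"]

v = cramers_v(major, middle)
print(f"{v:.3f}")  # prints "1.000"
```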

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
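
A minimal sketch of a per-column nullity count, using two rows taken from this report's sample. Treating `None` and the empty string as missing is an assumption made for this illustration; a profiling tool may use a different definition.

```python
# Two rows copied from the sample tables in this report.
rows = [
    {"분류코드": "11", "대분류": "1.관리자",
     "중분류": "11.공공 및 기업 고위직", "직업명": "공공 및 기업 고위직"},
    {"분류코드": "A1", "대분류": "A.군인",
     "중분류": "A1.군인", "직업명": "군인"},
]


def is_missing(value):
    # Assumption: None and "" both count as missing.
    return value is None or value == ""


missing_per_column = {
    col: sum(is_missing(row[col]) for row in rows) for col in rows[0]
}
total_missing = sum(missing_per_column.values())

print(missing_per_column, total_missing)
```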

Sample

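First rows

| Index | 분류코드 | 대분류 | 중분류 | 직업명 |
| 0 | 11 | 1.관리자 | 11.공공 및 기업 고위직 | 공공 및 기업 고위직 |
| 1 | 111 | 1.관리자 | 11.공공 및 기업 고위직 | 의회의원고위공무원 및 공공단체임원 |
| 2 | 1110 | 1.관리자 | 11.공공 및 기업 고위직 | 의회의원고위공무원 및 공공단체임원 |
| 3 | 11101 | 1.관리자 | 11.공공 및 기업 고위직 | 국회의원 |
| 4 | 11102 | 1.관리자 | 11.공공 및 기업 고위직 | 지방의회의원 및 교육위원 |
| 5 | 11103 | 1.관리자 | 11.공공 및 기업 고위직 | 중앙정부 고위공무원 |
| 6 | 11104 | 1.관리자 | 11.공공 및 기업 고위직 | 지방정부 고위공무원 |
| 7 | 11105 | 1.관리자 | 11.공공 및 기업 고위직 | 공공기관 임원 |
| 8 | 11106 | 1.관리자 | 11.공공 및 기업 고위직 | 정당 및 특수단체 임원 |
| 9 | 112 | 1.관리자 | 11.공공 및 기업 고위직 | 기업고위임원 |

Last rows

| Index | 분류코드 | 대분류 | 중분류 | 직업명 |
| 1823 | 99999 | 9.단순노무 종사자 | 99.농림어업 및 기타 서비스 단순노무직 | 그 외 서비스관련 단순 종사원 |
| 1824 | A1 | A.군인 | A1.군인 | 군인 |
| 1825 | A11 | A.군인 | A1.군인 | 장교 |
| 1826 | A111 | A.군인 | A1.군인 | 영관급 이상 |
| 1827 | A1110 | A.군인 | A1.군인 | 영관급 이상 장교 |
| 1828 | A112 | A.군인 | A1.군인 | 위관급 |
| 1829 | A1120 | A.군인 | A1.군인 | 위관급 장교 |
| 1830 | A12 | A.군인 | A1.군인 | 장기 부사관 및 준위 |
| 1831 | A120 | A.군인 | A1.군인 | 장기 부사관 및 준위 |
| 1832 | A1200 | A.군인 | A1.군인 | 장기 부사관 및 준위 |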
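
In the sample rows shown, the first character of 분류코드 matches the prefix of 대분류 and the first two characters match the prefix of 중분류. The sketch below checks that pattern on a few of those rows. Whether it holds for all 1833 rows is an assumption this report cannot confirm, and the helper name `prefix_consistent` is illustrative.

```python
# (분류코드, 대분류, 중분류) triples copied from the sample rows.
rows = [
    ("11101", "1.관리자", "11.공공 및 기업 고위직"),
    ("112", "1.관리자", "11.공공 및 기업 고위직"),
    ("99999", "9.단순노무 종사자", "99.농림어업 및 기타 서비스 단순노무직"),
    ("A1110", "A.군인", "A1.군인"),
]


def prefix_consistent(code, major, middle):
    # The text before the first "." is the category's own code.
    major_key = major.split(".", 1)[0]
    middle_key = middle.split(".", 1)[0]
    return code[:1] == major_key and code[:2] == middle_key


results = [prefix_consistent(*row) for row in rows]
print(results)  # prints "[True, True, True, True]"
```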