Overview

Dataset statistics

Number of variables: 4
Number of observations: 8417
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 622
Duplicate rows (%): 7.4%
Total size in memory: 263.2 KiB
Average record size in memory: 32.0 B
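
The two derived figures in these statistics (duplicate percentage and average record size) follow from the raw counts. A quick arithmetic check, assuming that KiB here means 1024 bytes:

```python
# Figures copied from the dataset statistics.
n_rows = 8417
n_duplicates = 622
total_kib = 263.2  # assumption: kibibytes, i.e. 1 KiB = 1024 bytes

duplicate_pct = 100 * n_duplicates / n_rows
avg_record_bytes = total_kib * 1024 / n_rows

print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both values round to the figures reported above (7.4% and 32.0 B).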

Variable types

Categorical: 3
Text: 1

Dataset

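Description: Location data for safety accidents that occurred in Daejeon Metropolitan City (대전광역시) in 2020. Safety accidents were extracted mainly on the basis of injuries, falls, falls from height, lacerations, and other categories.
Author: 대전광역시 (Daejeon Metropolitan City)
URL: https://www.data.go.kr/data/15091990/fileData.do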

Alerts

has constant value "" (Constant)
Dataset has 622 (7.4%) duplicate rows (Duplicates)
사고유형 is highly imbalanced (52.6%) (Imbalance)
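
The report does not define how the imbalance percentage is calculated. A minimal sketch, assuming the score is one minus the Shannon entropy of the class frequencies normalized by the maximum possible entropy (this definition is an assumption, and the labels below are toy data, not values from the dataset):

```python
import math
from collections import Counter


def imbalance_score(values):
    """Return 1 - H / log2(k): 0 for a uniform column, close to 1 when one class dominates."""
    counts = Counter(values)
    k = len(counts)
    if k <= 1:
        return 0.0  # a single class has no balance to measure
    n = sum(counts.values())
    entropy = -sum((c / n) * math.log2(c / n) for c in counts.values())
    return 1.0 - entropy / math.log2(k)


# Hypothetical columns for illustration only.
balanced = ["a", "b", "c", "d"] * 25
skewed = ["a"] * 90 + ["b"] * 10

print(round(imbalance_score(balanced), 3))
print(round(imbalance_score(skewed), 3))
```

Under this definition a column whose largest class holds well over half of the rows, as 사고유형 does, scores noticeably above zero.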

Reproduction

Analysis started: 2023-12-12 23:47:32.548106
Analysis finished: 2023-12-12 23:47:32.914167
Duration: 0.37 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
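
A comparable report could be regenerated along the following lines. This is a sketch only: the CSV path, the `cp949` encoding, and the report title are hypothetical, since the source file name, encoding, and configuration are not given here.

```python
def build_report(csv_path, encoding="cp949"):
    """Profile a CSV file and return the ydata-profiling report object.

    csv_path and the cp949 default are assumptions, not taken from this report.
    """
    # Imported lazily so the function can be defined without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    return ProfileReport(df, title="Profiling Report")


# Usage (hypothetical file names):
# build_report("accidents_2020.csv").to_file("report.html")
```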

Variables


Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 65.9 KiB

Value | Count
대전광역시 | 8417

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대전광역시
2nd row: 대전광역시
3rd row: 대전광역시
4th row: 대전광역시
5th row: 대전광역시

Common Values

Value | Count | Frequency (%)
대전광역시 | 8417 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the values tabulated under Common Values]


Categorical

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 65.9 KiB

Value | Count
서구 | 2233
유성구 | 1735
동구 | 1680
중구 | 1519
대덕구 | 1250

Length

Max length: 3
Median length: 2
Mean length: 2.3546394
Min length: 2
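
The length statistics for this variable can be recomputed from its value counts. A small sketch, assuming length is measured in Unicode code points on precomposed (NFC) Hangul, which is how Python's `len` treats the strings below:

```python
from statistics import median

# Value counts copied from the table for this variable.
counts = {"서구": 2233, "유성구": 1735, "동구": 1680, "중구": 1519, "대덕구": 1250}

# Expand to one length per observation.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

mean_length = sum(lengths) / len(lengths)
print(max(lengths), median(lengths), round(mean_length, 7), min(lengths))
```

The mean works out to 19819 / 8417, which agrees with the reported mean length to the precision shown.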

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서구
2nd row: 서구
3rd row: 서구
4th row: 서구
5th row: 서구

Common Values

Value | Count | Frequency (%)
서구 | 2233 | 26.5%
유성구 | 1735 | 20.6%
동구 | 1680 | 20.0%
중구 | 1519 | 18.0%
대덕구 | 1250 | 14.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the values tabulated under Common Values]


Text

Distinct: 162
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 65.9 KiB

Length

Max length: 4
Median length: 3
Mean length: 2.9196863
Min length: 2

Characters and Unicode

Total characters: 24575
Distinct characters: 119
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
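
As an illustration of such character properties, the sketch below uses Python's standard `unicodedata` module on the five sample values listed for this variable. The standard library exposes the general category and character name but not the script or block, so only those two properties are shown.

```python
import unicodedata
from collections import Counter

# The five sample rows shown for this variable.
sample = ["갈마동", "둔산동", "변동", "둔산동", "둔산동"]

# Character frequencies and general-category frequencies over all characters.
chars = Counter(ch for value in sample for ch in value)
categories = Counter(unicodedata.category(ch) for value in sample for ch in value)

print(chars.most_common(1))
print(dict(categories))
print(unicodedata.name("동"))
```

The category code `Lo` is the Unicode abbreviation for "Other Letter", the only category the report finds in this column.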

Unique

Unique: 7
Unique (%): 0.1%

Sample

1st row: 갈마동
2nd row: 둔산동
3rd row: 변동
4th row: 둔산동
5th row: 둔산동

Value | Count | Frequency (%)
둔산동 | 316 | 3.8%
봉명동 | 267 | 3.2%
월평동 | 242 | 2.9%
도마동 | 227 | 2.7%
갈마동 | 226 | 2.7%
가양동 | 226 | 2.7%
관저동 | 225 | 2.7%
법동 | 192 | 2.3%
판암동 | 183 | 2.2%
문화동 | 174 | 2.1%
Other values (152) | 6139 | 72.9%

Most occurring characters

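Value | Count | Frequency (%)
(character missing) | 8417 | 34.3%
(character missing) | 591 | 2.4%
(character missing) | 541 | 2.2%
(character missing) | 524 | 2.1%
(character missing) | 515 | 2.1%
(character missing) | 455 | 1.9%
(character missing) | 431 | 1.8%
(character missing) | 428 | 1.7%
(character missing) | 423 | 1.7%
(character missing) | 412 | 1.7%
Other values (109) | 11838 | 48.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 24575 | 100.0%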

Most frequent character per category

Other Letter: identical to the table under "Most occurring characters", since all 24575 characters fall in this category.

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 24575 | 100.0%

Most frequent character per script

Hangul: identical to the table under "Most occurring characters", since all 24575 characters are in this script.

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 24575 | 100.0%

Most frequent character per block

Hangul: identical to the table under "Most occurring characters", since all 24575 characters are in this block.

사고유형 (accident type)
Categorical

IMBALANCE 

Distinct: 20
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 65.9 KiB

Value | Count
낙상 | 5013
열상 | 1170
상해 | 882
추락 | 400
기타 둔상 | 383
Other values (15) | 569

Length

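Max length: 5
Median length: 2
Mean length: 2.1936557
Min length: 1

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: 상해
2nd row: 상해
3rd row: 기타 둔상
4th row: 낙상
5th row: 상해

Common Values

Value | Count | Frequency (%)
낙상 (fall) | 5013 | 59.6%
열상 (laceration) | 1170 | 13.9%
상해 (injury) | 882 | 10.5%
추락 (fall from height) | 400 | 4.8%
기타 둔상 (other blunt trauma) | 383 | 4.6%
동물/곤충 (animal/insect) | 161 | 1.9%
기계 (machinery) | 141 | 1.7%
(value missing) | 79 | 0.9%
자상 (stab wound) | 70 | 0.8%
화염 (flame) | 43 | 0.5%
Other values (10) | 75 | 0.9%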

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
낙상 | 5013 | 57.0%
열상 | 1170 | 13.3%
상해 | 882 | 10.0%
추락 | 400 | 4.5%
기타 | 383 | 4.4%
둔상 | 383 | 4.4%
동물/곤충 | 161 | 1.8%
기계 | 141 | 1.6%
(value missing) | 79 | 0.9%
자상 | 70 | 0.8%
Other values (11) | 118 | 1.3%

Correlations

[Figure: correlation heatmap]

 | (name missing) | 사고유형
(name missing) | 1.000 | 0.114
사고유형 | 0.114 | 1.000

[Figure: correlation heatmap]

 | (name missing) | 사고유형
(name missing) | 1.000 | 0.049
사고유형 | 0.049 | 1.000

[Figure: correlation heatmap]

 | (name missing) | 사고유형
(name missing) | 1.000 | 0.049
사고유형 | 0.049 | 1.000
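
The report does not name the association measure behind these matrices. For pairs of categorical columns, Cramér's V is a common choice, so the sketch below shows the plain (not bias-corrected) form of that statistic. Treating it as the measure used here is an assumption, and the labels are toy data rather than values from the dataset.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row = Counter(xs)
    col = Counter(ys)
    k = min(len(row), len(col))
    if n == 0 or k < 2:
        return 0.0  # undefined when a variable is constant; reported here as 0
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * (k - 1)))


# Toy labels, not taken from the dataset.
group = ["A", "A", "B", "B"]
print(cramers_v(group, ["x", "x", "y", "y"]))  # perfectly associated columns
print(cramers_v(group, ["x", "y", "x", "y"]))  # independent columns
```

The statistic ranges from 0 (no association) to 1 (one column determines the other), so values such as 0.114 and 0.049 would indicate a weak association under this reading.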

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

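(index) | (name missing) | (name missing) | (name missing) | 사고유형
0 | 대전광역시 | 서구 | 갈마동 | 상해
1 | 대전광역시 | 서구 | 둔산동 | 상해
2 | 대전광역시 | 서구 | 변동 | 기타 둔상
3 | 대전광역시 | 서구 | 둔산동 | 낙상
4 | 대전광역시 | 서구 | 둔산동 | 상해
5 | 대전광역시 | 서구 | 갈마동 | 열상
6 | 대전광역시 | 서구 | 괴정동 | 기타 둔상
7 | 대전광역시 | 서구 | 월평동 | 낙상
8 | 대전광역시 | 서구 | 괴정동 | 낙상
9 | 대전광역시 | 서구 | 내동 | 추락

(index) | (name missing) | (name missing) | (name missing) | 사고유형
8407 | 대전광역시 | 동구 | 용운동 | 열상
8408 | 대전광역시 | 동구 | 가양동 | 상해
8409 | 대전광역시 | 대덕구 | 덕암동 | 열상
8410 | 대전광역시 | 중구 | 대흥동 | 낙상
8411 | 대전광역시 | 유성구 | 봉명동 | 낙상
8412 | 대전광역시 | 동구 | 대동 | 낙상
8413 | 대전광역시 | 중구 | 용두동 | 낙상
8414 | 대전광역시 | 유성구 | 송강동 | 낙상
8415 | 대전광역시 | 대덕구 | 평촌동 | 화염
8416 | 대전광역시 | 중구 | 대사동 | 낙상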

Duplicate rows

Most frequently occurring

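(index) | (name missing) | (name missing) | (name missing) | 사고유형 | # duplicates
285 | 대전광역시 | 서구 | 둔산동 | 낙상 | 192
319 | 대전광역시 | 서구 | 월평동 | 낙상 | 158
410 | 대전광역시 | 유성구 | 봉명동 | 낙상 | 147
247 | 대전광역시 | 서구 | 관저동 | 낙상 | 141
269 | 대전광역시 | 서구 | 도마동 | 낙상 | 141
26 | 대전광역시 | 대덕구 | 법동 | 낙상 | 136
99 | 대전광역시 | 동구 | 가양동 | 낙상 | 132
212 | 대전광역시 | 동구 | 판암동 | 낙상 | 127
537 | 대전광역시 | 중구 | 문화동 | 낙상 | 121
239 | 대전광역시 | 서구 | 갈마동 | 낙상 | 115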
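
Duplicate groups like these can be found by counting identical rows. A minimal sketch using the first ten rows from the Sample section, with the constant first column omitted. The variable names are illustrative, and how the report itself defines "# duplicates" is not stated, so the count here is simply the number of occurrences of each repeated row.

```python
from collections import Counter

# First ten sample rows: (second column, third column, 사고유형).
rows = [
    ("서구", "갈마동", "상해"),
    ("서구", "둔산동", "상해"),
    ("서구", "변동", "기타 둔상"),
    ("서구", "둔산동", "낙상"),
    ("서구", "둔산동", "상해"),
    ("서구", "갈마동", "열상"),
    ("서구", "괴정동", "기타 둔상"),
    ("서구", "월평동", "낙상"),
    ("서구", "괴정동", "낙상"),
    ("서구", "내동", "추락"),
]

counts = Counter(rows)
# Keep only rows that occur more than once.
duplicated = {row: n for row, n in counts.items() if n > 1}
print(duplicated)
```

Among these ten rows only the combination 서구 / 둔산동 / 상해 repeats (rows 1 and 4 of the sample).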