Overview

Dataset statistics

Number of variables: 7
Number of observations: 594
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 32.6 KiB
Average record size in memory: 56.2 B
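
The average record size follows from the other figures in this block. A minimal sketch of the arithmetic, using only the numbers reported above (the variable names are illustrative and not part of the report):

```python
# Figures copied from the Dataset statistics block.
total_size_kib = 32.6   # Total size in memory (KiB)
n_observations = 594    # Number of observations
n_variables = 7         # Number of variables

# 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

# Every row holds one cell per variable.
total_cells = n_observations * n_variables

print(f"average record size: {avg_record_bytes:.1f} B")
print(f"total cells: {total_cells}")
```

The rounded result should agree with the 56.2 B shown in the report, since it is computed from the already rounded 32.6 KiB.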

Variable types

Categorical: 5
DateTime: 1
Text: 1

Dataset

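Description: Swimming pool water quality data from the Ulsan Facilities Corporation (울산시설공단), provided with fields such as facility name, year and month, test item, standard, unit, and test result.
URL: https://www.data.go.kr/data/15089189/fileData.do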

Alerts

기준일자 (reference date) has constant value "2023-07-17" [Constant]
단위 (unit) is highly overall correlated with 검사항목 (test item) and 1 other field [High correlation]
기준 (standard) is highly overall correlated with 검사항목 (test item) and 1 other field [High correlation]
검사항목 (test item) is highly overall correlated with 기준 (standard) and 1 other field [High correlation]

Reproduction

Analysis started: 2023-12-12 23:17:35.136935
Analysis finished: 2023-12-12 23:17:35.529296
Duration: 0.39 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
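
The reported duration is the difference between the two timestamps. A small sketch that recomputes it, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-12 23:17:35.136935")
finished = datetime.fromisoformat("2023-12-12 23:17:35.529296")

# Subtracting two datetimes yields a timedelta.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```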

Variables

시설명 (facility name)
Categorical

Distinct: 3
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Value | Count
노동자종합복지회관 | 252
동천국민체육센터 | 198
문수실내수영장 | 144

Length

Max length: 9
Median length: 8
Mean length: 8.1818182
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 노동자종합복지회관
2nd row: 노동자종합복지회관
3rd row: 노동자종합복지회관
4th row: 노동자종합복지회관
5th row: 노동자종합복지회관

Common Values

Value | Count | Frequency (%)
노동자종합복지회관 (Workers' Welfare Center) | 252 | 42.4%
동천국민체육센터 (Dongcheon National Sports Center) | 198 | 33.3%
문수실내수영장 (Munsu Indoor Swimming Pool) | 144 | 24.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

일자 (date)
Date

Distinct: 55
Distinct (%): 9.3%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB
Minimum: 2021-01-21 00:00:00
Maximum: 2023-04-19 00:00:00

[Figure: Histogram with fixed size bins (bins=50)]

검사항목 (test item)
Categorical

HIGH CORRELATION

Distinct: 9
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Value | Count
총대장균군 | 66
과망간산칼륨소비량 | 66
탁도 | 66
결합잔류염소 | 66
비소 | 66
Other values (4) | 264

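Length

Max length: 9
Median length: 6
Mean length: 4.6666667
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 총대장균군
2nd row: 과망간산칼륨소비량
3rd row: 탁도
4th row: 결합잔류염소
5th row: 비소

Common Values

Value | Count | Frequency (%)
총대장균군 (total coliforms) | 66 | 11.1%
과망간산칼륨소비량 (potassium permanganate consumption) | 66 | 11.1%
탁도 (turbidity) | 66 | 11.1%
결합잔류염소 (combined residual chlorine) | 66 | 11.1%
비소 (arsenic) | 66 | 11.1%
수은 (mercury) | 66 | 11.1%
수소이온농도 (hydrogen ion concentration, pH) | 66 | 11.1%
유리잔류염소 (free residual chlorine) | 66 | 11.1%
알루미늄 (aluminum) | 66 | 11.1%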

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

기준 (standard)
Categorical

HIGH CORRELATION

Distinct: 8
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Value | Count
0.5 mg/L이하 | 132
10mL 5개중 양성이 2개 이하 | 66
12 mg/L이하 | 66
1.5 NTU이하 | 66
0.05 mg/L이하 | 66
Other values (3) | 198

Length

Max length: 18
Median length: 12
Mean length: 11.111111
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 10mL 5개중 양성이 2개 이하
2nd row: 12 mg/L이하
3rd row: 1.5 NTU이하
4th row: 0.5 mg/L이하
5th row: 0.05 mg/L이하

Common Values

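Value | Count | Frequency (%)
0.5 mg/L이하 | 132 | 22.2%
10mL 5개중 양성이 2개 이하 | 66 | 11.1%
12 mg/L이하 | 66 | 11.1%
1.5 NTU이하 | 66 | 11.1%
0.05 mg/L이하 | 66 | 11.1%
0.007 mg/L이하 | 66 | 11.1%
5.8~8.6 | 66 | 11.1%
0.4~1.0 mg/L이하 | 66 | 11.1%

In these values, 이하 means "or less", and 10mL 5개중 양성이 2개 이하 means "no more than 2 positives among five 10 mL samples".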
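
Because 기준 stores each limit as free text, comparing it with a measured value first requires parsing. The following is a minimal sketch, not part of the report, that handles the three shapes seen above: an upper bound with a unit, a range, and the count-based coliform rule (left unparsed). The names `Limit` and `parse_limit` are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class Limit(NamedTuple):
    low: Optional[float]   # lower bound, if the standard is a range
    high: Optional[float]  # upper bound
    unit: Optional[str]    # unit text such as "mg/L" or "NTU", if present


# "5.8~8.6" or "0.4~1.0 mg/L이하": a range, with optional unit and suffix.
_RANGE = re.compile(r"^(\d+(?:\.\d+)?)~(\d+(?:\.\d+)?)\s*([A-Za-z/]+)?(?:이하)?$")
# "0.5 mg/L이하" or "1.5 NTU이하": a single upper bound followed by a unit.
_UPPER = re.compile(r"^(\d+(?:\.\d+)?)\s*([A-Za-z/]+)이하$")


def parse_limit(text: str) -> Optional[Limit]:
    """Return the numeric limit encoded in a 기준 string, or None if not numeric."""
    text = text.strip()
    match = _RANGE.match(text)
    if match:
        return Limit(float(match.group(1)), float(match.group(2)), match.group(3))
    match = _UPPER.match(text)
    if match:
        return Limit(None, float(match.group(1)), match.group(2))
    return None  # e.g. the count-based coliform criterion


for value in ["0.5 mg/L이하", "5.8~8.6", "0.4~1.0 mg/L이하", "10mL 5개중 양성이 2개 이하"]:
    print(value, "->", parse_limit(value))
```

Returning `None` for the coliform rule keeps the sketch honest about what it cannot express as a numeric bound; a real pipeline would need a separate rule for that item.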

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

Value | Count | Frequency (%)
mg/l이하 | 396 | 30.0%
0.5 | 132 | 10.0%
10ml | 66 | 5.0%
5개중 | 66 | 5.0%
양성이 | 66 | 5.0%
2개 | 66 | 5.0%
이하 | 66 | 5.0%
12 | 66 | 5.0%
1.5 | 66 | 5.0%
ntu이하 | 66 | 5.0%
Other values (4) | 264 | 20.0%

단위 (unit)
Categorical

HIGH CORRELATION

Distinct: 4
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Value | Count
mg/L | 396
(blank) | 66
NTU | 66
<NA> | 66

Length

Max length: 4
Median length: 4
Mean length: 3.5555556
Min length: 1
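
The length figures for 단위 can be checked against its value counts. A minimal sketch, assuming the blank category is a single space character and that <NA> is stored as the literal four-character string; both are assumptions inferred from the minimum length of 1 and the maximum length of 4, not something the report states:

```python
# Value -> count for 단위. The " " key stands in for the blank category
# (assumed to be one space); "<NA>" is assumed to be a literal string.
counts = {"mg/L": 396, " ": 66, "NTU": 66, "<NA>": 66}

total = sum(counts.values())
# Count-weighted mean of the string lengths.
mean_length = sum(len(value) * n for value, n in counts.items()) / total
lengths = [len(value) for value in counts]

print(total, round(mean_length, 7), min(lengths), max(lengths))
```

Under these assumptions the mean matches the reported 3.5555556, which suggests the placeholder values are counted as ordinary strings rather than as missing cells.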

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: (blank)
2nd row: mg/L
3rd row: NTU
4th row: mg/L
5th row: mg/L

Common Values

Value | Count | Frequency (%)
mg/L | 396 | 66.7%
(blank) | 66 | 11.1%
NTU | 66 | 11.1%
<NA> | 66 | 11.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

Value | Count | Frequency (%)
mg/l | 396 | 66.7%
(blank) | 66 | 11.1%
ntu | 66 | 11.1%
na | 66 | 11.1%

검사결과 (test result)
Text

Distinct: 108
Distinct (%): 18.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 4
Median length: 3
Mean length: 3.1767677
Min length: 1

Characters and Unicode

Total characters: 1887
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
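
As an illustration of these character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below is not how the report was produced; it only shows how the three category names used in this section can be derived. The mapping `CATEGORY_NAMES` and the function `category_counts` are hypothetical helpers.

```python
import unicodedata
from collections import Counter

# Two-letter general category codes mapped to the long names used in this
# section. Only the three codes that occur in this column are listed.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Lo": "Other Letter",
    "Po": "Other Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# A few values taken from the 검사결과 column.
sample = ["불검출", "휴장기간", "0.15", "7"]
print(category_counts(sample))
```

Hangul syllables fall under "Other Letter", digits under "Decimal Number", and the decimal point under "Other Punctuation". Script and block properties are not exposed by `unicodedata`, so this sketch covers categories only.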

Unique

Unique: 33
Unique (%): 5.6%

Sample

1st row: 휴장기간
2nd row: 휴장기간
3rd row: 휴장기간
4th row: 휴장기간
5th row: 휴장기간

Value | Count | Frequency (%)
불검출 (not detected) | 152 | 25.6%
휴장기간 (closure period) | 63 | 10.6%
0 | 59 | 9.9%
0.15 | 11 | 1.9%
0.02 | 11 | 1.9%
0.03 | 10 | 1.7%
7 | 10 | 1.7%
0.14 | 10 | 1.7%
6.7 | 9 | 1.5%
0.13 | 8 | 1.3%
Other values (98) | 251 | 42.3%
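
The 검사결과 column mixes numeric strings with two status markers, 불검출 ("not detected") and 휴장기간 ("closure period"), so it cannot be cast to a number directly. A minimal sketch of one way to separate the cases; the function name and the returned labels are hypothetical:

```python
from typing import Optional, Tuple

NOT_DETECTED = "불검출"  # "not detected"
CLOSED = "휴장기간"      # "closure period": no measurement was taken


def classify_result(raw: str) -> Tuple[str, Optional[float]]:
    """Return (kind, value), where value is set only for numeric results."""
    text = raw.strip()
    if text == NOT_DETECTED:
        return ("not_detected", None)
    if text == CLOSED:
        return ("closed", None)
    try:
        return ("numeric", float(text))
    except ValueError:
        # Anything else is kept visible instead of being silently dropped.
        return ("unknown", None)


for raw in ["불검출", "휴장기간", "0", "0.15", "6.7"]:
    print(raw, "->", classify_result(raw))
```

Whether "not detected" should later be treated as zero or as a value below a detection limit is a modelling decision that the report does not settle.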

Most occurring characters

Value | Count | Frequency (%)
0 | 322 | 17.1%
. | 305 | 16.2%
불 | 152 | 8.1%
검 | 152 | 8.1%
출 | 152 | 8.1%
1 | 116 | 6.1%
6 | 79 | 4.2%
2 | 77 | 4.1%
7 | 68 | 3.6%
(one character of 휴장기간) | 63 | 3.3%
Other values (8) | 401 | 21.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 874 | 46.3%
Other Letter | 708 | 37.5%
Other Punctuation | 305 | 16.2%

Most frequent character per category

Decimal Number

Value | Count | Frequency (%)
0 | 322 | 36.8%
1 | 116 | 13.3%
6 | 79 | 9.0%
2 | 77 | 8.8%
7 | 68 | 7.8%
4 | 48 | 5.5%
3 | 46 | 5.3%
9 | 46 | 5.3%
5 | 36 | 4.1%
8 | 36 | 4.1%

Other Letter

Value | Count | Frequency (%)
불 | 152 | 21.5%
검 | 152 | 21.5%
출 | 152 | 21.5%
휴 | 63 | 8.9%
장 | 63 | 8.9%
기 | 63 | 8.9%
간 | 63 | 8.9%

Other Punctuation

Value | Count | Frequency (%)
. | 305 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1179 | 62.5%
Hangul | 708 | 37.5%

Most frequent character per script

Common

Value | Count | Frequency (%)
0 | 322 | 27.3%
. | 305 | 25.9%
1 | 116 | 9.8%
6 | 79 | 6.7%
2 | 77 | 6.5%
7 | 68 | 5.8%
4 | 48 | 4.1%
3 | 46 | 3.9%
9 | 46 | 3.9%
5 | 36 | 3.1%

Hangul

Value | Count | Frequency (%)
불 | 152 | 21.5%
검 | 152 | 21.5%
출 | 152 | 21.5%
휴 | 63 | 8.9%
장 | 63 | 8.9%
기 | 63 | 8.9%
간 | 63 | 8.9%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1179 | 62.5%
Hangul | 708 | 37.5%

Most frequent character per block

ASCII

Value | Count | Frequency (%)
0 | 322 | 27.3%
. | 305 | 25.9%
1 | 116 | 9.8%
6 | 79 | 6.7%
2 | 77 | 6.5%
7 | 68 | 5.8%
4 | 48 | 4.1%
3 | 46 | 3.9%
9 | 46 | 3.9%
5 | 36 | 3.1%

Hangul

Value | Count | Frequency (%)
불 | 152 | 21.5%
검 | 152 | 21.5%
출 | 152 | 21.5%
휴 | 63 | 8.9%
장 | 63 | 8.9%
기 | 63 | 8.9%
간 | 63 | 8.9%

기준일자 (reference date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Value | Count
2023-07-17 | 594

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-07-17
2nd row: 2023-07-17
3rd row: 2023-07-17
4th row: 2023-07-17
5th row: 2023-07-17

Common Values

Value | Count | Frequency (%)
2023-07-17 | 594 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

Correlations

Variable | 시설명 | 일자 | 검사항목 | 기준 | 단위
시설명 | 1.000 | 0.958 | 0.000 | 0.000 | 0.000
일자 | 0.958 | 1.000 | 0.000 | 0.000 | 0.000
검사항목 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000
기준 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000
단위 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000

Variable | 시설명 | 단위 | 기준 | 검사항목
시설명 | 1.000 | 0.000 | 0.000 | 0.000
단위 | 0.000 | 1.000 | 0.996 | 0.995
기준 | 0.000 | 0.996 | 1.000 | 0.999
검사항목 | 0.000 | 0.995 | 0.999 | 1.000

Variable | 시설명 | 검사항목 | 기준 | 단위
시설명 | 1.000 | 0.000 | 0.000 | 0.000
검사항목 | 0.000 | 1.000 | 0.999 | 0.995
기준 | 0.000 | 0.999 | 1.000 | 0.996
단위 | 0.000 | 0.995 | 0.996 | 1.000
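
The near-perfect association among 검사항목, 기준, and 단위 is what one would expect when each test item always carries the same standard and unit. The report text does not state which coefficient each matrix holds, so the sketch below uses plain (uncorrected) Cramér's V purely as an illustration of how association between two categorical columns can be measured; it is an assumption, not the report's method, and `cramers_v` is a hypothetical helper.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long label sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed counts per (x, y) pair
    x_totals = Counter(xs)
    y_totals = Counter(ys)
    k = min(len(x_totals), len(y_totals)) - 1
    if n == 0 or k == 0:
        return 0.0  # undefined when a column is empty or constant
    chi2 = 0.0
    for x, nx in x_totals.items():
        for y, ny in y_totals.items():
            expected = nx * ny / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * k))


# Each test item always has the same unit: a functional dependency.
items = ["탁도", "비소", "수은", "탁도", "비소", "수은"]
units = ["NTU", "mg/L", "mg/L", "NTU", "mg/L", "mg/L"]
print(cramers_v(items, units))

# Facility and item are fully crossed, so they carry no association.
facilities = ["A", "A", "B", "B"]
crossed_items = ["x", "y", "x", "y"]
print(cramers_v(facilities, crossed_items))
```

The first call should come out at, or within floating-point error of, 1, and the second at 0, mirroring the pattern in the matrices above.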

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

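First rows

Index | 시설명 (facility name) | 일자 (date) | 검사항목 (test item) | 기준 (standard) | 단위 (unit) | 검사결과 (test result) | 기준일자 (reference date)
0 | 노동자종합복지회관 | 2021-01-21 | 총대장균군 | 10mL 5개중 양성이 2개 이하 | (blank) | 휴장기간 | 2023-07-17
1 | 노동자종합복지회관 | 2021-01-21 | 과망간산칼륨소비량 | 12 mg/L이하 | mg/L | 휴장기간 | 2023-07-17
2 | 노동자종합복지회관 | 2021-01-21 | 탁도 | 1.5 NTU이하 | NTU | 휴장기간 | 2023-07-17
3 | 노동자종합복지회관 | 2021-01-21 | 결합잔류염소 | 0.5 mg/L이하 | mg/L | 휴장기간 | 2023-07-17
4 | 노동자종합복지회관 | 2021-01-21 | 비소 | 0.05 mg/L이하 | mg/L | 휴장기간 | 2023-07-17
5 | 노동자종합복지회관 | 2021-01-21 | 수은 | 0.007 mg/L이하 | mg/L | 휴장기간 | 2023-07-17
6 | 노동자종합복지회관 | 2021-01-21 | 수소이온농도 | 5.8~8.6 | <NA> | 휴장기간 | 2023-07-17
7 | 노동자종합복지회관 | 2021-01-21 | 유리잔류염소 | 0.4~1.0 mg/L이하 | mg/L | 휴장기간 | 2023-07-17
8 | 노동자종합복지회관 | 2021-01-21 | 알루미늄 | 0.5 mg/L이하 | mg/L | 휴장기간 | 2023-07-17
9 | 노동자종합복지회관 | 2021-02-21 | 총대장균군 | 10mL 5개중 양성이 2개 이하 | (blank) | 0 | 2023-07-17

Last rows

Index | 시설명 | 일자 | 검사항목 | 기준 | 단위 | 검사결과 | 기준일자
584 | 동천국민체육센터 | 2023-03-08 | 알루미늄 | 0.5 mg/L이하 | mg/L | 0.03 | 2023-07-17
585 | 동천국민체육센터 | 2023-04-13 | 총대장균군 | 10mL 5개중 양성이 2개 이하 | (blank) | 0 | 2023-07-17
586 | 동천국민체육센터 | 2023-04-13 | 과망간산칼륨소비량 | 12 mg/L이하 | mg/L | 0.8 | 2023-07-17
587 | 동천국민체육센터 | 2023-04-13 | 탁도 | 1.5 NTU이하 | NTU | 0.13 | 2023-07-17
588 | 동천국민체육센터 | 2023-04-13 | 결합잔류염소 | 0.5 mg/L이하 | mg/L | 0.16 | 2023-07-17
589 | 동천국민체육센터 | 2023-04-13 | 비소 | 0.05 mg/L이하 | mg/L | 불검출 | 2023-07-17
590 | 동천국민체육센터 | 2023-04-13 | 수은 | 0.007 mg/L이하 | mg/L | 불검출 | 2023-07-17
591 | 동천국민체육센터 | 2023-04-13 | 수소이온농도 | 5.8~8.6 | <NA> | 7 | 2023-07-17
592 | 동천국민체육센터 | 2023-04-13 | 유리잔류염소 | 0.4~1.0 mg/L이하 | mg/L | 0.49 | 2023-07-17
593 | 동천국민체육센터 | 2023-04-13 | 알루미늄 | 0.5 mg/L이하 | mg/L | 0.02 | 2023-07-17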