Overview

Dataset statistics

Number of variables: 5
Number of observations: 326
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 98
Duplicate rows (%): 30.1%
Total size in memory: 12.9 KiB
Average record size in memory: 40.4 B

Variable types

Categorical: 3
Text: 2

Dataset

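Description: Status of village lawyers in Chungcheongnam-do. The village lawyer program provides free legal consultations to residents of villages that have no law office at all (lawyer-less villages).
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=401&beforeMenuCd=DOM_000000201001001000&publicdatapk=15017573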

Alerts

시도 has constant value "충남" (Constant)
Dataset has 98 (30.1%) duplicate rows (Duplicates)
시군구 마을변호사 담당기관 연락처 is highly overall correlated with 시군구 (High correlation)
시군구 is highly overall correlated with 시군구 마을변호사 담당기관 연락처 (High correlation)

Reproduction

Analysis started: 2024-01-09 20:48:00.335772
Analysis finished: 2024-01-09 20:48:00.634375
Duration: 0.3 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
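
The reported duration can be checked from the two timestamps above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of this report.
started = datetime.fromisoformat("2024-01-09 20:48:00.335772")
finished = datetime.fromisoformat("2024-01-09 20:48:00.634375")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.1f} seconds")
```

Rounded to one decimal place, the difference agrees with the Duration value listed above.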

Variables

시도 (province)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

충남: 326

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 충남
2nd row: 충남
3rd row: 충남
4th row: 충남
5th row: 충남

Common Values

Value | Count | Frequency (%)
충남 | 326 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

시군구 (city, county, or district)
Categorical

HIGH CORRELATION

Distinct: 15
Distinct (%): 4.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

아산시: 37
부여군: 35
공주시: 29
천안시: 29
당진시: 25
Other values (10): 171

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 논산시
2nd row: 홍성군
3rd row: 홍성군
4th row: 홍성군
5th row: 논산시

Common Values

Value | Count | Frequency (%)
아산시 | 37 | 11.3%
부여군 | 35 | 10.7%
공주시 | 29 | 8.9%
천안시 | 29 | 8.9%
당진시 | 25 | 7.7%
청양군 | 21 | 6.4%
보령시 | 21 | 6.4%
논산시 | 19 | 5.8%
서산시 | 18 | 5.5%
서천군 | 18 | 5.5%
Other values (5) | 74 | 22.7%

Length

[Figure: Histogram of lengths of the category]

읍면동 (town, township, or neighborhood)
Text

Distinct: 159
Distinct (%): 48.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

Length

Max length: 4
Median length: 3
Mean length: 2.9785276
Min length: 2

Characters and Unicode

Total characters: 971
Distinct characters: 129
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 60
Unique (%): 18.4%
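
The report lists Distinct and Unique as separate figures. A minimal sketch of the difference, assuming Distinct counts the different values present and Unique counts the values that occur exactly once; the input is the 읍면동 column of the first ten sample rows shown later in this report, and the variable names are illustrative.

```python
from collections import Counter

# 읍면동 values from the first ten rows of the Sample section.
values = [
    "가야곡면", "갈산면", "갈산면", "갈산면", "강경읍",
    "강경읍", "결성면", "계룡면", "계룡면", "계룡면",
]

counts = Counter(values)

# Distinct: how many different values appear at all.
n_distinct = len(counts)

# Unique: how many values appear exactly once.
n_unique = sum(1 for n in counts.values() if n == 1)

print(n_distinct, n_unique)
```

Under the same assumed definitions, the full column has 159 distinct values of which 60 occur only once, so 99 values repeat.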

Sample

1st row: 가야곡면
2nd row: 갈산면
3rd row: 갈산면
4th row: 갈산면
5th row: 강경읍

Value | Count | Frequency (%)
탕정면 | 8 | 2.5%
추부면 | 6 | 1.8%
장암면 | 6 | 1.8%
배방읍 | 5 | 1.5%
성환읍 | 5 | 1.5%
태안읍 | 5 | 1.5%
대산읍 | 5 | 1.5%
은산면 | 5 | 1.5%
신창면 | 4 | 1.2%
엄사면 | 4 | 1.2%
Other values (149) | 273 | 83.7%

Most occurring characters

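Value | Count | Frequency (%)
[character not preserved] | 263 | 27.1%
[character not preserved] | 64 | 6.6%
[character not preserved] | 45 | 4.6%
[character not preserved] | 25 | 2.6%
[character not preserved] | 23 | 2.4%
[character not preserved] | 17 | 1.8%
[character not preserved] | 16 | 1.6%
[character not preserved] | 15 | 1.5%
[character not preserved] | 15 | 1.5%
[character not preserved] | 14 | 1.4%
Other values (119) | 474 | 48.8%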

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 971 | 100.0%

Most frequent character per category

Other Letter: all 971 characters fall in this category, so the counts and frequencies are identical to the Most occurring characters table.

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 971 | 100.0%

Most frequent character per script

Hangul: all 971 characters belong to this script, so the counts and frequencies are identical to the Most occurring characters table.

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 971 | 100.0%

Most frequent character per block

Hangul: all 971 characters belong to this block, so the counts and frequencies are identical to the Most occurring characters table.
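
시군구 마을변호사 담당기관 연락처 (contact number of the city, county, or district office in charge of village lawyers)
Categorical

HIGH CORRELATION

Distinct: 19
Distinct (%): 5.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

041-540-2236: 37
041-830-2046: 31
041-840-2034: 29
041-521-5321: 29
041-350-3236: 25
Other values (14): 175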

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Unique

Unique: 4
Unique (%): 1.2%

Sample

1st row: 041-746-5212
2nd row: 041-630-1782
3rd row: 041-630-1782
4th row: 041-630-1782
5th row: 041-746-5212

Common Values

Value | Count | Frequency (%)
041-540-2236 | 37 | 11.3%
041-830-2046 | 31 | 9.5%
041-840-2034 | 29 | 8.9%
041-521-5321 | 29 | 8.9%
041-350-3236 | 25 | 7.7%
041-930-3221 | 21 | 6.4%
041-940-2756 | 21 | 6.4%
041-746-5212 | 19 | 5.8%
041-660-2207 | 18 | 5.5%
041-950-4616 | 18 | 5.5%
Other values (9) | 78 | 23.9%

Length

[Figure: Histogram of lengths of the category]

읍면동 마을변호사 담당기관 연락처 (contact number of the town, township, or neighborhood office in charge of village lawyers)
Text

Distinct: 159
Distinct (%): 48.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.052147
Min length: 12

Characters and Unicode

Total characters: 3929
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
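
As a rough illustration of how such character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and the trailing space on the second value is an assumption made only to show the Space Separator category; the report does not say where the 17 spaces occur.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

# Two phone numbers from the Sample rows; the trailing space on the
# second one is hypothetical and added only for illustration.
phones = ["041-746-8822", "041-630-9464 "]
counts = category_counts(phones)

# Nd = Decimal Number, Pd = Dash Punctuation, Zs = Space Separator
print(counts)
```

The same call applied to Hangul text such as the 읍면동 values returns the category `Lo` (Other Letter) for every syllable.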

Unique

Unique: 60
Unique (%): 18.4%

Sample

1st row: 041-746-8822
2nd row: 041-630-9464
3rd row: 041-630-9464
4th row: 041-630-9464
5th row: 041-746-8502

Value | Count | Frequency (%)
041-537-3073 | 8 | 2.5%
041-750-3107 | 6 | 1.8%
041-830-6695 | 6 | 1.8%
041-530-6686 | 5 | 1.5%
041-521-6755 | 5 | 1.5%
041-670-5502 | 5 | 1.5%
041-660-3703 | 5 | 1.5%
041-830-6392 | 5 | 1.5%
041-537-3134 | 4 | 1.2%
042-840-3114 | 4 | 1.2%
Other values (149) | 273 | 83.7%

Most occurring characters

Value | Count | Frequency (%)
- | 652 | 16.6%
0 | 642 | 16.3%
4 | 571 | 14.5%
1 | 426 | 10.8%
3 | 373 | 9.5%
6 | 286 | 7.3%
5 | 259 | 6.6%
8 | 226 | 5.8%
7 | 189 | 4.8%
2 | 161 | 4.1%
Other values (2) | 144 | 3.7%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 3260 | 83.0%
Dash Punctuation | 652 | 16.6%
Space Separator | 17 | 0.4%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 642 | 19.7%
4 | 571 | 17.5%
1 | 426 | 13.1%
3 | 373 | 11.4%
6 | 286 | 8.8%
5 | 259 | 7.9%
8 | 226 | 6.9%
7 | 189 | 5.8%
2 | 161 | 4.9%
9 | 127 | 3.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 652 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 17 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 3929 | 100.0%

Most frequent character per script

Common: all 3929 characters belong to this script, so the counts and frequencies are identical to the Most occurring characters table.

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 3929 | 100.0%

Most frequent character per block

ASCII: all 3929 characters belong to this block, so the counts and frequencies are identical to the Most occurring characters table.

Correlations

[Figure: correlation heatmap]

 | 시군구 | 시군구 마을변호사 담당기관 연락처
시군구 | 1.000 | 1.000
시군구 마을변호사 담당기관 연락처 | 1.000 | 1.000

[Figure: correlation heatmap]

 | 시군구 마을변호사 담당기관 연락처 | 시군구
시군구 마을변호사 담당기관 연락처 | 1.000 | 0.994
시군구 | 0.994 | 1.000

[Figure: correlation heatmap]

 | 시군구 | 시군구 마을변호사 담당기관 연락처
시군구 | 1.000 | 0.994
시군구 마을변호사 담당기관 연락처 | 0.994 | 1.000
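
One common association measure for two categorical columns is Cramér's V. Which measure and which correction this report actually used is not stated in the text, so the sketch below is only an assumption-laden illustration: it computes plain (uncorrected) Cramér's V with the standard library, using the 시군구 and district-office phone values from the first ten sample rows. The function name `cramers_v` is hypothetical.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Plain Cramér's V (no bias correction) for two categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_n in row_totals.items():
        for y, col_n in col_totals.items():
            expected = row_n * col_n / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return sqrt(chi2 / (n * k))

# 시군구 and 시군구 마을변호사 담당기관 연락처 from the first ten sample rows.
districts = ["논산시", "홍성군", "홍성군", "홍성군", "논산시",
             "논산시", "홍성군", "공주시", "공주시", "공주시"]
phones = ["041-746-5212", "041-630-1782", "041-630-1782", "041-630-1782",
          "041-746-5212", "041-746-5212", "041-630-1782", "041-840-2034",
          "041-840-2034", "041-840-2034"]

v = cramers_v(districts, phones)
print(round(v, 3))
```

In these ten rows each district maps to exactly one phone number, so the measure reaches its maximum. That is in line with the near-1 values in the matrices above, although the report's exact figures may come from a different or bias-corrected estimator.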

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

 | 시도 | 시군구 | 읍면동 | 시군구 마을변호사 담당기관 연락처 | 읍면동 마을변호사 담당기관 연락처
0 | 충남 | 논산시 | 가야곡면 | 041-746-5212 | 041-746-8822
1 | 충남 | 홍성군 | 갈산면 | 041-630-1782 | 041-630-9464
2 | 충남 | 홍성군 | 갈산면 | 041-630-1782 | 041-630-9464
3 | 충남 | 홍성군 | 갈산면 | 041-630-1782 | 041-630-9464
4 | 충남 | 논산시 | 강경읍 | 041-746-5212 | 041-746-8502
5 | 충남 | 논산시 | 강경읍 | 041-746-5212 | 041-746-8502
6 | 충남 | 홍성군 | 결성면 | 041-630-1782 | 041-630-9424
7 | 충남 | 공주시 | 계룡면 | 041-840-2034 | 041-840-8856
8 | 충남 | 공주시 | 계룡면 | 041-840-2034 | 041-840-8856
9 | 충남 | 공주시 | 계룡면 | 041-840-2034 | 041-840-8856

Last rows

 | 시도 | 시군구 | 읍면동 | 시군구 마을변호사 담당기관 연락처 | 읍면동 마을변호사 담당기관 연락처
316 | 충남 | 서산시 | 해미면 | 041-660-2207 | 041-660-3550
317 | 충남 | 서산시 | 해미면 | 041-660-2207 | 041-660-3550
318 | 충남 | 홍성군 | 홍동면 | 041-630-1782 | 041-630-9363
319 | 충남 | 홍성군 | 홍북면 | 041-630-1782 | 041-630-9319
320 | 충남 | 부여군 | 홍산면 | 041-830-2046 | 041-830-6513
321 | 충남 | 홍성군 | 홍성읍 | 041-630-1782 | 041-630-9162
322 | 충남 | 홍성군 | 홍성읍 | 041-630-1782 | 041-630-9162
323 | 충남 | 청양군 | 화성면 | 041-940-2756 | 041-940-4345
324 | 충남 | 청양군 | 화성면 | 041-940-2756 | 041-940-4345
325 | 충남 | 서천군 | 화양면 | 041-950-4616 | 041-950-6384

Duplicate rows

Most frequently occurring

 | 시도 | 시군구 | 읍면동 | 시군구 마을변호사 담당기관 연락처 | 읍면동 마을변호사 담당기관 연락처 | # duplicates
64 | 충남 | 아산시 | 탕정면 | 041-540-2236 | 041-537-3073 | 8
15 | 충남 | 금산군 | 추부면 | 041-750-2246 | 041-750-3107 | 6
42 | 충남 | 부여군 | 장암면 | 041-830-2046 | 041-830-6695 | 6
45 | 충남 | 서산시 | 대산읍 | 041-660-2207 | 041-660-3703 | 5
56 | 충남 | 아산시 | 배방읍 | 041-540-2236 | 041-530-6686 | 5
73 | 충남 | 천안시 | 성환읍 | 041-521-5321 | 041-521-6755 | 5
92 | 충남 | 태안군 | 태안읍 | 041-670-2238 | 041-670-5502 | 5
0 | 충남 | 계룡시 | 두마면 | 042-840-2106 | 042-840-3314 | 4
1 | 충남 | 계룡시 | 엄사면 | 042-840-2106 | 042-840-3114 | 4
9 | 충남 | 공주시 | 이인면 | 041-840-2034 | 041-840-2734 | 4
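
A table like the one above can be approximated by grouping identical rows and keeping the groups that occur more than once. The sketch below does this with the standard library on the first ten sample rows. It assumes, without confirmation from the report itself, that the headline "Duplicate rows" figure counts distinct repeated row patterns rather than surplus copies; the helper name `duplicate_groups` is hypothetical.

```python
from collections import Counter

# First ten rows from the Sample section, one tuple per record.
rows = [
    ("충남", "논산시", "가야곡면", "041-746-5212", "041-746-8822"),
    ("충남", "홍성군", "갈산면", "041-630-1782", "041-630-9464"),
    ("충남", "홍성군", "갈산면", "041-630-1782", "041-630-9464"),
    ("충남", "홍성군", "갈산면", "041-630-1782", "041-630-9464"),
    ("충남", "논산시", "강경읍", "041-746-5212", "041-746-8502"),
    ("충남", "논산시", "강경읍", "041-746-5212", "041-746-8502"),
    ("충남", "홍성군", "결성면", "041-630-1782", "041-630-9424"),
    ("충남", "공주시", "계룡면", "041-840-2034", "041-840-8856"),
    ("충남", "공주시", "계룡면", "041-840-2034", "041-840-8856"),
    ("충남", "공주시", "계룡면", "041-840-2034", "041-840-8856"),
]

def duplicate_groups(records):
    """Map each row that occurs more than once to its occurrence count."""
    counts = Counter(records)
    return {row: n for row, n in counts.items() if n > 1}

groups = duplicate_groups(rows)

# Most frequent groups first, mirroring the "# duplicates" column.
for row, n in sorted(groups.items(), key=lambda item: -item[1]):
    print(n, row[2])

# Share of duplicate groups relative to the number of rows, in percent.
share = round(len(groups) / len(rows) * 100, 1)
```

Applied to the headline figures under the same assumption, 98 groups out of 326 rows gives 98 / 326 ≈ 30.1%, which matches the reported percentage.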