Overview

Dataset statistics

Number of variables6
Number of observations2192
Missing cells5
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory107.2 KiB
Average record size in memory50.1 B

Variable types

Numeric2
Text3
Categorical1

Dataset

Description공공데이터 중장기 개방계획에 따라 공개하는 경상남도 하천관리 시스템의 데이터 입니다. 하천관리시스템의 표석 정보를 포함하고있습니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15093540

Alerts

구분코드 has constant value ""Constant
일련번호 is highly skewed (γ1 = 32.00584694)Skewed
공간아이디 has unique valuesUnique

Reproduction

Analysis started2023-12-10 23:07:42.048550
Analysis finished2023-12-10 23:07:43.227451
Duration1.18 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공간아이디
Real number (ℝ)

UNIQUE 

Distinct2192
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1096.5
Minimum1
Maximum2192
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size19.4 KiB
2023-12-11T08:07:43.318220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile110.55
Q1548.75
median1096.5
Q31644.25
95-th percentile2082.45
Maximum2192
Range2191
Interquartile range (IQR)1095.5

Descriptive statistics

Standard deviation632.92022
Coefficient of variation (CV)0.57721862
Kurtosis-1.2
Mean1096.5
Median Absolute Deviation (MAD)548
Skewness0
Sum2403528
Variance400588
MonotonicityStrictly increasing
2023-12-11T08:07:43.497897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
1466 1
 
< 0.1%
1460 1
 
< 0.1%
1461 1
 
< 0.1%
1462 1
 
< 0.1%
1463 1
 
< 0.1%
1464 1
 
< 0.1%
1465 1
 
< 0.1%
1467 1
 
< 0.1%
1441 1
 
< 0.1%
Other values (2182) 2182
99.5%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2192 1
< 0.1%
2191 1
< 0.1%
2190 1
< 0.1%
2189 1
< 0.1%
2188 1
< 0.1%
2187 1
< 0.1%
2186 1
< 0.1%
2185 1
< 0.1%
2184 1
< 0.1%
2183 1
< 0.1%
Distinct525
Distinct (%)24.0%
Missing0
Missing (%)0.0%
Memory size17.3 KiB
2023-12-11T08:07:43.734528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length19
Mean length19
Min length19

Characters and Unicode

Total characters41648
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)1.5%

Sample

1st row20129502012F02Q0101
2nd row20129502012F02Q0101
3rd row20129502012F02Q0101
4th row20129502012F02Q0101
5th row20129502012F02Q0101
ValueCountFrequency (%)
20246902020f02q0101 43
 
2.0%
20228802004f01q0101 21
 
1.0%
20263602019f02q0101 19
 
0.9%
20249602010f02q0101 19
 
0.9%
20272002012f02q0101 17
 
0.8%
27209902014f02q0101 16
 
0.7%
20250302004f02q0101 15
 
0.7%
20268002012f02q0101 14
 
0.6%
20227802010f01q0101 14
 
0.6%
20255602014f02q0101 13
 
0.6%
Other values (515) 2001
91.3%
2023-12-11T08:07:44.061070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 14534
34.9%
2 8219
19.7%
1 7781
18.7%
F 2192
 
5.3%
Q 2192
 
5.3%
7 1239
 
3.0%
4 1081
 
2.6%
9 1042
 
2.5%
6 963
 
2.3%
5 916
 
2.2%
Other values (2) 1489
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 37264
89.5%
Uppercase Letter 4384
 
10.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 14534
39.0%
2 8219
22.1%
1 7781
20.9%
7 1239
 
3.3%
4 1081
 
2.9%
9 1042
 
2.8%
6 963
 
2.6%
5 916
 
2.5%
3 794
 
2.1%
8 695
 
1.9%
Uppercase Letter
ValueCountFrequency (%)
F 2192
50.0%
Q 2192
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 37264
89.5%
Latin 4384
 
10.5%

Most frequent character per script

Common
ValueCountFrequency (%)
0 14534
39.0%
2 8219
22.1%
1 7781
20.9%
7 1239
 
3.3%
4 1081
 
2.9%
9 1042
 
2.8%
6 963
 
2.6%
5 916
 
2.5%
3 794
 
2.1%
8 695
 
1.9%
Latin
ValueCountFrequency (%)
F 2192
50.0%
Q 2192
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 41648
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 14534
34.9%
2 8219
19.7%
1 7781
18.7%
F 2192
 
5.3%
Q 2192
 
5.3%
7 1239
 
3.0%
4 1081
 
2.6%
9 1042
 
2.5%
6 963
 
2.3%
5 916
 
2.2%
Other values (2) 1489
 
3.6%

구분코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.3 KiB
E04
2192 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowE04
2nd rowE04
3rd rowE04
4th rowE04
5th rowE04

Common Values

ValueCountFrequency (%)
E04 2192
100.0%

Length

2023-12-11T08:07:44.186868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:07:44.265928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
e04 2192
100.0%

일련번호
Real number (ℝ)

SKEWED 

Distinct47
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.8777372
Minimum1
Maximum1002
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size19.4 KiB
2023-12-11T08:07:44.361239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile11
Maximum1002
Range1001
Interquartile range (IQR)3

Descriptive statistics

Standard deviation30.463057
Coefficient of variation (CV)6.2453255
Kurtosis1045.6004
Mean4.8777372
Median Absolute Deviation (MAD)1
Skewness32.005847
Sum10692
Variance927.99782
MonotonicityNot monotonic
2023-12-11T08:07:44.469006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
1 500
22.8%
2 478
21.8%
3 390
17.8%
4 272
12.4%
5 162
 
7.4%
6 100
 
4.6%
7 64
 
2.9%
8 49
 
2.2%
9 35
 
1.6%
10 24
 
1.1%
Other values (37) 118
 
5.4%
ValueCountFrequency (%)
1 500
22.8%
2 478
21.8%
3 390
17.8%
4 272
12.4%
5 162
 
7.4%
6 100
 
4.6%
7 64
 
2.9%
8 49
 
2.2%
9 35
 
1.6%
10 24
 
1.1%
ValueCountFrequency (%)
1002 1
< 0.1%
1001 1
< 0.1%
49 1
< 0.1%
48 1
< 0.1%
43 1
< 0.1%
42 1
< 0.1%
41 1
< 0.1%
40 1
< 0.1%
39 1
< 0.1%
38 1
< 0.1%
Distinct774
Distinct (%)35.4%
Missing5
Missing (%)0.2%
Memory size17.3 KiB
2023-12-11T08:07:44.716260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length4.3177869
Min length1

Characters and Unicode

Total characters9443
Distinct characters120
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique595 ?
Unique (%)27.2%

Sample

1st row표석좌0
2nd row표석우1
3rd row표석좌2
4th row표석우3
5th row표석좌4
ValueCountFrequency (%)
표석 207
 
8.4%
표석1 103
 
4.2%
표석2 102
 
4.1%
표석3 85
 
3.4%
우1 64
 
2.6%
표석4 59
 
2.4%
좌1 51
 
2.1%
좌2 48
 
1.9%
no.02 39
 
1.6%
no.01 38
 
1.5%
Other values (687) 1672
67.7%
2023-12-11T08:07:45.066735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1169
 
12.4%
1159
 
12.3%
1 708
 
7.5%
2 562
 
6.0%
. 512
 
5.4%
0 431
 
4.6%
3 423
 
4.5%
N 417
 
4.4%
412
 
4.4%
403
 
4.3%
Other values (110) 3247
34.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4235
44.8%
Decimal Number 2826
29.9%
Uppercase Letter 1097
 
11.6%
Other Punctuation 533
 
5.6%
Space Separator 284
 
3.0%
Close Punctuation 176
 
1.9%
Open Punctuation 176
 
1.9%
Dash Punctuation 48
 
0.5%
Lowercase Letter 48
 
0.5%
Math Symbol 20
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1169
27.6%
1159
27.4%
412
 
9.7%
403
 
9.5%
308
 
7.3%
104
 
2.5%
99
 
2.3%
69
 
1.6%
25
 
0.6%
21
 
0.5%
Other values (73) 466
 
11.0%
Uppercase Letter
ValueCountFrequency (%)
N 417
38.0%
O 368
33.5%
S 54
 
4.9%
C 37
 
3.4%
M 32
 
2.9%
A 29
 
2.6%
G 25
 
2.3%
Y 25
 
2.3%
J 23
 
2.1%
P 20
 
1.8%
Other values (8) 67
 
6.1%
Decimal Number
ValueCountFrequency (%)
1 708
25.1%
2 562
19.9%
0 431
15.3%
3 423
15.0%
4 275
 
9.7%
5 159
 
5.6%
6 102
 
3.6%
7 71
 
2.5%
8 54
 
1.9%
9 41
 
1.5%
Other Punctuation
ValueCountFrequency (%)
. 512
96.1%
# 13
 
2.4%
, 8
 
1.5%
Space Separator
ValueCountFrequency (%)
284
100.0%
Close Punctuation
ValueCountFrequency (%)
) 176
100.0%
Open Punctuation
ValueCountFrequency (%)
( 176
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 48
100.0%
Lowercase Letter
ValueCountFrequency (%)
o 48
100.0%
Math Symbol
ValueCountFrequency (%)
+ 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4235
44.8%
Common 4063
43.0%
Latin 1145
 
12.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1169
27.6%
1159
27.4%
412
 
9.7%
403
 
9.5%
308
 
7.3%
104
 
2.5%
99
 
2.3%
69
 
1.6%
25
 
0.6%
21
 
0.5%
Other values (73) 466
 
11.0%
Latin
ValueCountFrequency (%)
N 417
36.4%
O 368
32.1%
S 54
 
4.7%
o 48
 
4.2%
C 37
 
3.2%
M 32
 
2.8%
A 29
 
2.5%
G 25
 
2.2%
Y 25
 
2.2%
J 23
 
2.0%
Other values (9) 87
 
7.6%
Common
ValueCountFrequency (%)
1 708
17.4%
2 562
13.8%
. 512
12.6%
0 431
10.6%
3 423
10.4%
284
7.0%
4 275
 
6.8%
) 176
 
4.3%
( 176
 
4.3%
5 159
 
3.9%
Other values (8) 357
8.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5208
55.2%
Hangul 4235
44.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1169
27.6%
1159
27.4%
412
 
9.7%
403
 
9.5%
308
 
7.3%
104
 
2.5%
99
 
2.3%
69
 
1.6%
25
 
0.6%
21
 
0.5%
Other values (73) 466
 
11.0%
ASCII
ValueCountFrequency (%)
1 708
13.6%
2 562
10.8%
. 512
9.8%
0 431
8.3%
3 423
8.1%
N 417
 
8.0%
O 368
 
7.1%
284
 
5.5%
4 275
 
5.3%
) 176
 
3.4%
Other values (27) 1052
20.2%
Distinct2188
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size17.3 KiB
2023-12-11T08:07:45.239826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length45
Mean length44.515055
Min length40

Characters and Unicode

Total characters97577
Distinct characters19
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2184 ?
Unique (%)99.6%

Sample

1st rowPOINT (1038562.8812543752 1743923.5931866781)
2nd rowPOINT (1037698.2055845917 1743265.7596758264)
3rd rowPOINT (1036886.5516819841 1743187.3684825476)
4th rowPOINT (1035750.4832009944 1743109.0749452352)
5th rowPOINT (1034953.7046493429 1743714.1785205759)
ValueCountFrequency (%)
point 2192
33.3%
1713971.6424225557 2
 
< 0.1%
1151656.1491842486 2
 
< 0.1%
1714486.9951140685 2
 
< 0.1%
1151529.7176264662 2
 
< 0.1%
1713338.401496462 2
 
< 0.1%
1152188.0198497097 2
 
< 0.1%
1712778.78053249 2
 
< 0.1%
1151155.6640729688 2
 
< 0.1%
1720861.1439704355 1
 
< 0.1%
Other values (4367) 4367
66.4%
2023-12-11T08:07:45.567018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 11135
11.4%
7 7890
 
8.1%
0 7886
 
8.1%
6 7407
 
7.6%
8 6657
 
6.8%
2 6630
 
6.8%
3 6614
 
6.8%
5 6588
 
6.8%
4 6457
 
6.6%
9 6201
 
6.4%
Other values (9) 24112
24.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 73465
75.3%
Uppercase Letter 10960
 
11.2%
Other Punctuation 4384
 
4.5%
Space Separator 4384
 
4.5%
Open Punctuation 2192
 
2.2%
Close Punctuation 2192
 
2.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 11135
15.2%
7 7890
10.7%
0 7886
10.7%
6 7407
10.1%
8 6657
9.1%
2 6630
9.0%
3 6614
9.0%
5 6588
9.0%
4 6457
8.8%
9 6201
8.4%
Uppercase Letter
ValueCountFrequency (%)
P 2192
20.0%
O 2192
20.0%
T 2192
20.0%
N 2192
20.0%
I 2192
20.0%
Other Punctuation
ValueCountFrequency (%)
. 4384
100.0%
Space Separator
ValueCountFrequency (%)
4384
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2192
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2192
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 86617
88.8%
Latin 10960
 
11.2%

Most frequent character per script

Common
ValueCountFrequency (%)
1 11135
12.9%
7 7890
9.1%
0 7886
9.1%
6 7407
8.6%
8 6657
7.7%
2 6630
7.7%
3 6614
7.6%
5 6588
7.6%
4 6457
7.5%
9 6201
7.2%
Other values (4) 13152
15.2%
Latin
ValueCountFrequency (%)
P 2192
20.0%
O 2192
20.0%
T 2192
20.0%
N 2192
20.0%
I 2192
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 97577
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 11135
11.4%
7 7890
 
8.1%
0 7886
 
8.1%
6 7407
 
7.6%
8 6657
 
6.8%
2 6630
 
6.8%
3 6614
 
6.8%
5 6588
 
6.8%
4 6457
 
6.6%
9 6201
 
6.4%
Other values (9) 24112
24.7%

Interactions

2023-12-11T08:07:42.570680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:42.326380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:42.673077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:42.440205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:07:45.652090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공간아이디일련번호
공간아이디1.0000.084
일련번호0.0841.000
2023-12-11T08:07:45.716223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공간아이디일련번호
공간아이디1.000-0.008
일련번호-0.0081.000

Missing values

2023-12-11T08:07:42.807004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:07:43.174672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공간아이디하천관리코드구분코드일련번호표석명공간정보
0120129502012F02Q0101E041표석좌0POINT (1038562.8812543752 1743923.5931866781)
1220129502012F02Q0101E042표석우1POINT (1037698.2055845917 1743265.7596758264)
2320129502012F02Q0101E043표석좌2POINT (1036886.5516819841 1743187.3684825476)
3420129502012F02Q0101E044표석우3POINT (1035750.4832009944 1743109.0749452352)
4520129502012F02Q0101E045표석좌4POINT (1034953.7046493429 1743714.1785205759)
5620129502012F02Q0101E046표석우5POINT (1034192.4243100947 1743837.2814876295)
6720140402015F02Q0101E041표석1POINT (1026316.137697911 1732628.390798123)
7820140402015F02Q0101E042표석2POINT (1027057.7332822798 1733305.0192609734)
8920140402015F02Q0101E043표석3POINT (1026922.5334316847 1734165.1543737496)
91020140402015F02Q0101E044표석4POINT (1027384.4087096464 1734980.9424424327)
공간아이디하천관리코드구분코드일련번호표석명공간정보
2182218320254902021F02Q0101E041Y01POINT (1063962.2373871957 1674125.4927812687)
2183218420254902021F02Q0101E0410Y12POINT (1065823.9986468102 1668384.9966540884)
2184218520254902021F02Q0101E049Y11POINT (1065477.079375801 1668935.9284990556)
2185218620254902021F02Q0101E048Y09POINT (1065126.9157876016 1669635.341231667)
2186218720254902021F02Q0101E047Y08POINT (1065216.0955426027 1670329.3246639373)
2187218820254902021F02Q0101E046Y06POINT (1065884.9869519768 1670996.3360531242)
2188218920254902021F02Q0101E045Y05POINT (1065856.3399723698 1671788.185409334)
2189219020254902021F02Q0101E044Y04POINT (1065367.143416362 1672505.713987792)
2190219120254902021F02Q0101E043Y03POINT (1064943.5121360968 1673187.114273084)
2191219220254902021F02Q0101E042Y02POINT (1064119.5784825284 1673420.4894579344)