Overview

Dataset statistics

Number of variables5
Number of observations26
Missing cells9
Missing cells (%)6.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory46.1 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description익산시에 위치한 염소농장의 현황(사업장명칭, 주사육업종, 지번소재지, 도로명소재지)을 기록한 파일데이터입니다.
Author전북특별자치도 익산시
URLhttps://www.data.go.kr/data/15127785/fileData.do

Alerts

연번 is highly overall correlated with 주사육업종High correlation
주사육업종 is highly overall correlated with 연번High correlation
사업장소재지(도로명) has 9 (34.6%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-29 23:18:43.630076
Analysis finished2024-04-29 23:18:46.620148
Duration2.99 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.5
Minimum1
Maximum26
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2024-04-30T08:18:46.706895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.25
Q17.25
median13.5
Q319.75
95-th percentile24.75
Maximum26
Range25
Interquartile range (IQR)12.5

Descriptive statistics

Standard deviation7.6485293
Coefficient of variation (CV)0.56655772
Kurtosis-1.2
Mean13.5
Median Absolute Deviation (MAD)6.5
Skewness0
Sum351
Variance58.5
MonotonicityStrictly increasing
2024-04-30T08:18:46.882784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
1 1
 
3.8%
15 1
 
3.8%
26 1
 
3.8%
25 1
 
3.8%
24 1
 
3.8%
23 1
 
3.8%
22 1
 
3.8%
21 1
 
3.8%
20 1
 
3.8%
19 1
 
3.8%
Other values (16) 16
61.5%
ValueCountFrequency (%)
1 1
3.8%
2 1
3.8%
3 1
3.8%
4 1
3.8%
5 1
3.8%
6 1
3.8%
7 1
3.8%
8 1
3.8%
9 1
3.8%
10 1
3.8%
ValueCountFrequency (%)
26 1
3.8%
25 1
3.8%
24 1
3.8%
23 1
3.8%
22 1
3.8%
21 1
3.8%
20 1
3.8%
19 1
3.8%
18 1
3.8%
17 1
3.8%
Distinct25
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size340.0 B
2024-04-30T08:18:47.143318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length4
Mean length4.7692308
Min length3

Characters and Unicode

Total characters124
Distinct characters52
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)92.3%

Sample

1st row송종섭 농장
2nd row김단오농장
3rd row정민농장
4th row염소울
5th row염소우리
ValueCountFrequency (%)
황등목장 2
 
7.1%
농장 2
 
7.1%
송종섭 1
 
3.6%
우리흑염소 1
 
3.6%
염소농장 1
 
3.6%
왕춘농장 1
 
3.6%
박은영농장 1
 
3.6%
창평농장 1
 
3.6%
거대농장 1
 
3.6%
월성농장 1
 
3.6%
Other values (16) 16
57.1%
2024-04-30T08:18:47.501186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20
 
16.1%
20
 
16.1%
9
 
7.3%
8
 
6.5%
3
 
2.4%
3
 
2.4%
2
 
1.6%
2
 
1.6%
2
 
1.6%
2
 
1.6%
Other values (42) 53
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 122
98.4%
Space Separator 2
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
16.4%
20
 
16.4%
9
 
7.4%
8
 
6.6%
3
 
2.5%
3
 
2.5%
2
 
1.6%
2
 
1.6%
2
 
1.6%
2
 
1.6%
Other values (41) 51
41.8%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 122
98.4%
Common 2
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
16.4%
20
 
16.4%
9
 
7.4%
8
 
6.6%
3
 
2.5%
3
 
2.5%
2
 
1.6%
2
 
1.6%
2
 
1.6%
2
 
1.6%
Other values (41) 51
41.8%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 122
98.4%
ASCII 2
 
1.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
20
 
16.4%
20
 
16.4%
9
 
7.4%
8
 
6.6%
3
 
2.5%
3
 
2.5%
2
 
1.6%
2
 
1.6%
2
 
1.6%
2
 
1.6%
Other values (41) 51
41.8%
ASCII
ValueCountFrequency (%)
2
100.0%

주사육업종
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size340.0 B
염소
17 
산양

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row염소
2nd row염소
3rd row염소
4th row산양
5th row산양

Common Values

ValueCountFrequency (%)
염소 17
65.4%
산양 9
34.6%

Length

2024-04-30T08:18:47.637116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T08:18:47.734777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
염소 17
65.4%
산양 9
34.6%
Distinct24
Distinct (%)92.3%
Missing0
Missing (%)0.0%
Memory size340.0 B
2024-04-30T08:18:47.883909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length19
Mean length18.846154
Min length15

Characters and Unicode

Total characters490
Distinct characters56
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)84.6%

Sample

1st row전북특별자치도 익산시 왕궁면 구덕리
2nd row전북특별자치도 익산시 황등면 황등리
3rd row전북특별자치도 익산시 삼기면 서두리
4th row전북특별자치도 익산시 함열읍 흘산리
5th row전북특별자치도 익산시 황등면 신성리
ValueCountFrequency (%)
전북특별자치도 26
25.2%
익산시 26
25.2%
황등면 5
 
4.9%
왕궁면 5
 
4.9%
삼기면 4
 
3.9%
낭산면 3
 
2.9%
흥암리 2
 
1.9%
웅포면 2
 
1.9%
용안면 2
 
1.9%
서두리 2
 
1.9%
Other values (25) 26
25.2%
2024-04-30T08:18:48.222708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
77
15.7%
32
 
6.5%
27
 
5.5%
26
 
5.3%
26
 
5.3%
26
 
5.3%
26
 
5.3%
26
 
5.3%
26
 
5.3%
26
 
5.3%
Other values (46) 172
35.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 413
84.3%
Space Separator 77
 
15.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
7.7%
27
 
6.5%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
Other values (45) 146
35.4%
Space Separator
ValueCountFrequency (%)
77
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 413
84.3%
Common 77
 
15.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
7.7%
27
 
6.5%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
Other values (45) 146
35.4%
Common
ValueCountFrequency (%)
77
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 413
84.3%
ASCII 77
 
15.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
77
100.0%
Hangul
ValueCountFrequency (%)
32
 
7.7%
27
 
6.5%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
26
 
6.3%
Other values (45) 146
35.4%
Distinct16
Distinct (%)94.1%
Missing9
Missing (%)34.6%
Memory size340.0 B
2024-04-30T08:18:48.407299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length19
Mean length19.176471
Min length15

Characters and Unicode

Total characters326
Distinct characters49
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)88.2%

Sample

1st row전북특별자치도 익산시 왕궁면 이탄1길
2nd row전북특별자치도 익산시 황등면 화강암로
3rd row전북특별자치도 익산시 삼기면 황금로
4th row전북특별자치도 익산시 망성면 어량말길
5th row전북특별자치도 익산시 삼기면 연동길
ValueCountFrequency (%)
전북특별자치도 17
25.4%
익산시 17
25.4%
왕궁면 4
 
6.0%
황금로 3
 
4.5%
삼기면 3
 
4.5%
황등면 3
 
4.5%
용안면 2
 
3.0%
춘포면 2
 
3.0%
강경길 1
 
1.5%
이탄1길 1
 
1.5%
Other values (14) 14
20.9%
2024-04-30T08:18:48.740399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
50
15.3%
18
 
5.5%
18
 
5.5%
17
 
5.2%
17
 
5.2%
17
 
5.2%
17
 
5.2%
17
 
5.2%
17
 
5.2%
17
 
5.2%
Other values (39) 121
37.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 273
83.7%
Space Separator 50
 
15.3%
Decimal Number 3
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
6.6%
18
 
6.6%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
Other values (37) 101
37.0%
Space Separator
ValueCountFrequency (%)
50
100.0%
Decimal Number
ValueCountFrequency (%)
1 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 273
83.7%
Common 53
 
16.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
6.6%
18
 
6.6%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
Other values (37) 101
37.0%
Common
ValueCountFrequency (%)
50
94.3%
1 3
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 273
83.7%
ASCII 53
 
16.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
50
94.3%
1 3
 
5.7%
Hangul
ValueCountFrequency (%)
18
 
6.6%
18
 
6.6%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
17
 
6.2%
Other values (37) 101
37.0%

Interactions

2024-04-30T08:18:46.251529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T08:18:48.842431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업장명칭주사육업종사업장소재지(지번)사업장소재지(도로명)
연번1.0001.0000.9120.9680.909
사업장명칭1.0001.0001.0000.9790.988
주사육업종0.9121.0001.0000.6390.000
사업장소재지(지번)0.9680.9790.6391.0001.000
사업장소재지(도로명)0.9090.9880.0001.0001.000
2024-04-30T08:18:48.942809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주사육업종
연번1.0000.585
주사육업종0.5851.000

Missing values

2024-04-30T08:18:46.440993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T08:18:46.558902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업장명칭주사육업종사업장소재지(지번)사업장소재지(도로명)
01송종섭 농장염소전북특별자치도 익산시 왕궁면 구덕리전북특별자치도 익산시 왕궁면 이탄1길
12김단오농장염소전북특별자치도 익산시 황등면 황등리전북특별자치도 익산시 황등면 화강암로
23정민농장염소전북특별자치도 익산시 삼기면 서두리전북특별자치도 익산시 삼기면 황금로
34염소울산양전북특별자치도 익산시 함열읍 흘산리<NA>
45염소우리산양전북특별자치도 익산시 황등면 신성리<NA>
56검소한흑염소농장산양전북특별자치도 익산시 망성면 어량리전북특별자치도 익산시 망성면 어량말길
67김월중 농장산양전북특별자치도 익산시 삼기면 연동리전북특별자치도 익산시 삼기면 연동길
78한성영농염소전북특별자치도 익산시 황등면 용산리전북특별자치도 익산시 황등면 황교1길
89임마누엘고센농장산양전북특별자치도 익산시 웅포면 송천리<NA>
910팔봉농원염소전북특별자치도 익산시 삼기면 용연리<NA>
연번사업장명칭주사육업종사업장소재지(지번)사업장소재지(도로명)
1617또박이농장염소전북특별자치도 익산시 왕궁면 흥암리<NA>
1718분홍염소염소전북특별자치도 익산시 황등면 구자리<NA>
1819동산흑염소농장염소전북특별자치도 익산시 웅포면 제성리<NA>
1920월성농장염소전북특별자치도 익산시 월성동전북특별자치도 익산시 용연길
2021거대농장염소전북특별자치도 익산시 춘포면 삼포리전북특별자치도 익산시 춘포면 강경길
2122창평농장염소전북특별자치도 익산시 춘포면 창평리전북특별자치도 익산시 춘포면 창평길
2223박은영농장염소전북특별자치도 익산시 용안면 용두리전북특별자치도 익산시 용안면 용북로
2324왕춘농장염소전북특별자치도 익산시 왕궁면 쌍제리전북특별자치도 익산시 왕궁면 왕춘길
2425염소농장염소전북특별자치도 익산시 용안면 칠목리전북특별자치도 익산시 용안면 칠목학동길
2526이삭농장염소전북특별자치도 익산시 왕궁면 온수리전북특별자치도 익산시 왕궁면 학호1길