Overview

Dataset statistics

Number of variables6
Number of observations613
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory29.5 KiB
Average record size in memory49.2 B

Variable types

Numeric1
Text2
Categorical1
DateTime2

Dataset

Description산업단지 관련하여 진행중이거나 마감 완료된 분양공고관련 정보를 제공합니다. (공고명, 사업지구, 공고일, 마감일, 담당자 등)
Author한국수자원공사
URLhttps://www.data.go.kr/data/15054526/fileData.do

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 16:03:25.141995
Analysis finished2023-12-12 16:03:25.718150
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct613
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean307
Minimum1
Maximum613
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.5 KiB
2023-12-13T01:03:25.798641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile31.6
Q1154
median307
Q3460
95-th percentile582.4
Maximum613
Range612
Interquartile range (IQR)306

Descriptive statistics

Standard deviation177.10214
Coefficient of variation (CV)0.57687992
Kurtosis-1.2
Mean307
Median Absolute Deviation (MAD)153
Skewness0
Sum188191
Variance31365.167
MonotonicityStrictly increasing
2023-12-13T01:03:25.977890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
413 1
 
0.2%
406 1
 
0.2%
407 1
 
0.2%
408 1
 
0.2%
409 1
 
0.2%
410 1
 
0.2%
411 1
 
0.2%
412 1
 
0.2%
414 1
 
0.2%
Other values (603) 603
98.4%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
613 1
0.2%
612 1
0.2%
611 1
0.2%
610 1
0.2%
609 1
0.2%
608 1
0.2%
607 1
0.2%
606 1
0.2%
605 1
0.2%
604 1
0.2%
Distinct485
Distinct (%)79.1%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
2023-12-13T01:03:26.241459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length39
Mean length26.280587
Min length12

Characters and Unicode

Total characters16110
Distinct characters195
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique399 ?
Unique (%)65.1%

Sample

1st row부산에코델타시티 공동주택용지 분양공고
2nd row부산에코델타시티 도시지원시설용지 분양공고(1순위)
3rd row부산에코델타시티 점포겸용 2차 이주자택지 공급공고
4th row[정정공고] 부산에코델타시티 단독주택용지(주거전용) 분양 정정공고
5th row부산에코델타시티 단독주택용지(주거전용) 분양공고
ValueCountFrequency (%)
분양공고 258
 
9.8%
공고 162
 
6.2%
물류단지 106
 
4.0%
수의분양 104
 
4.0%
지원시설용지 84
 
3.2%
시화mtv 74
 
2.8%
구미국가산업단지 74
 
2.8%
확장단지 63
 
2.4%
구미 62
 
2.4%
인천터미널 58
 
2.2%
Other values (347) 1586
60.3%
2023-12-13T01:03:26.815779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2018
 
12.5%
1215
 
7.5%
808
 
5.0%
712
 
4.4%
592
 
3.7%
567
 
3.5%
566
 
3.5%
564
 
3.5%
542
 
3.4%
380
 
2.4%
Other values (185) 8146
50.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12631
78.4%
Space Separator 2018
 
12.5%
Close Punctuation 380
 
2.4%
Open Punctuation 380
 
2.4%
Decimal Number 339
 
2.1%
Uppercase Letter 259
 
1.6%
Other Punctuation 90
 
0.6%
Dash Punctuation 12
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1215
 
9.6%
808
 
6.4%
712
 
5.6%
592
 
4.7%
567
 
4.5%
566
 
4.5%
564
 
4.5%
542
 
4.3%
380
 
3.0%
294
 
2.3%
Other values (157) 6391
50.6%
Decimal Number
ValueCountFrequency (%)
2 80
23.6%
1 76
22.4%
0 60
17.7%
9 37
10.9%
4 28
 
8.3%
5 25
 
7.4%
8 15
 
4.4%
3 10
 
2.9%
7 7
 
2.1%
6 1
 
0.3%
Uppercase Letter
ValueCountFrequency (%)
M 84
32.4%
T 84
32.4%
V 83
32.0%
L 2
 
0.8%
B 2
 
0.8%
S 2
 
0.8%
G 1
 
0.4%
C 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
, 71
78.9%
· 15
 
16.7%
* 4
 
4.4%
Close Punctuation
ValueCountFrequency (%)
) 311
81.8%
] 69
 
18.2%
Open Punctuation
ValueCountFrequency (%)
( 311
81.8%
[ 69
 
18.2%
Space Separator
ValueCountFrequency (%)
2018
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12631
78.4%
Common 3220
 
20.0%
Latin 259
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1215
 
9.6%
808
 
6.4%
712
 
5.6%
592
 
4.7%
567
 
4.5%
566
 
4.5%
564
 
4.5%
542
 
4.3%
380
 
3.0%
294
 
2.3%
Other values (157) 6391
50.6%
Common
ValueCountFrequency (%)
2018
62.7%
) 311
 
9.7%
( 311
 
9.7%
2 80
 
2.5%
1 76
 
2.4%
, 71
 
2.2%
[ 69
 
2.1%
] 69
 
2.1%
0 60
 
1.9%
9 37
 
1.1%
Other values (10) 118
 
3.7%
Latin
ValueCountFrequency (%)
M 84
32.4%
T 84
32.4%
V 83
32.0%
L 2
 
0.8%
B 2
 
0.8%
S 2
 
0.8%
G 1
 
0.4%
C 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12609
78.3%
ASCII 3464
 
21.5%
Compat Jamo 22
 
0.1%
None 15
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2018
58.3%
) 311
 
9.0%
( 311
 
9.0%
M 84
 
2.4%
T 84
 
2.4%
V 83
 
2.4%
2 80
 
2.3%
1 76
 
2.2%
, 71
 
2.0%
[ 69
 
2.0%
Other values (17) 277
 
8.0%
Hangul
ValueCountFrequency (%)
1215
 
9.6%
808
 
6.4%
712
 
5.6%
592
 
4.7%
567
 
4.5%
566
 
4.5%
564
 
4.5%
542
 
4.3%
380
 
3.0%
294
 
2.3%
Other values (156) 6369
50.5%
Compat Jamo
ValueCountFrequency (%)
22
100.0%
None
ValueCountFrequency (%)
· 15
100.0%

사업지구
Categorical

Distinct16
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
인천물류단지
143 
시화MTV
114 
구미확장단지
90 
김포물류단지
80 
부산에코델타시티
41 
Other values (11)
145 

Length

Max length8
Median length6
Mean length5.995106
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산에코델타시티
2nd row부산에코델타시티
3rd row부산에코델타시티
4th row부산에코델타시티
5th row부산에코델타시티

Common Values

ValueCountFrequency (%)
인천물류단지 143
23.3%
시화MTV 114
18.6%
구미확장단지 90
14.7%
김포물류단지 80
13.1%
부산에코델타시티 41
 
6.7%
구미하이테크밸리 36
 
5.9%
송산그린시티 31
 
5.1%
구미제4단지 22
 
3.6%
시화지구 22
 
3.6%
나주노안지구 10
 
1.6%
Other values (6) 24
 
3.9%

Length

2023-12-13T01:03:27.003090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인천물류단지 143
23.3%
시화mtv 114
18.6%
구미확장단지 90
14.7%
김포물류단지 80
13.1%
부산에코델타시티 41
 
6.7%
구미하이테크밸리 36
 
5.9%
송산그린시티 31
 
5.1%
구미제4단지 22
 
3.6%
시화지구 22
 
3.6%
나주노안지구 10
 
1.6%
Other values (6) 24
 
3.9%
Distinct406
Distinct (%)66.2%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
Minimum2008-07-07 00:00:00
Maximum2021-12-13 00:00:00
2023-12-13T01:03:27.164831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:03:27.325328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct407
Distinct (%)66.4%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
Minimum2008-08-08 00:00:00
Maximum2022-01-07 00:00:00
2023-12-13T01:03:27.491848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:03:27.968252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct61
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
2023-12-13T01:03:28.261265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9983687
Min length2

Characters and Unicode

Total characters1838
Distinct characters73
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)1.6%

Sample

1st row김진환
2nd row김진환
3rd row박준현
4th row김진환
5th row김진환
ValueCountFrequency (%)
김영선 64
 
10.4%
김도형 40
 
6.5%
임지희 39
 
6.4%
김영민 27
 
4.4%
김정호 27
 
4.4%
조경진 27
 
4.4%
이건영 26
 
4.2%
오진우 18
 
2.9%
공윤희 17
 
2.8%
심은정 17
 
2.8%
Other values (51) 311
50.7%
2023-12-13T01:03:28.645621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
224
 
12.2%
139
 
7.6%
106
 
5.8%
92
 
5.0%
85
 
4.6%
73
 
4.0%
71
 
3.9%
56
 
3.0%
55
 
3.0%
40
 
2.2%
Other values (63) 897
48.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1838
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
224
 
12.2%
139
 
7.6%
106
 
5.8%
92
 
5.0%
85
 
4.6%
73
 
4.0%
71
 
3.9%
56
 
3.0%
55
 
3.0%
40
 
2.2%
Other values (63) 897
48.8%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1838
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
224
 
12.2%
139
 
7.6%
106
 
5.8%
92
 
5.0%
85
 
4.6%
73
 
4.0%
71
 
3.9%
56
 
3.0%
55
 
3.0%
40
 
2.2%
Other values (63) 897
48.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1838
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
224
 
12.2%
139
 
7.6%
106
 
5.8%
92
 
5.0%
85
 
4.6%
73
 
4.0%
71
 
3.9%
56
 
3.0%
55
 
3.0%
40
 
2.2%
Other values (63) 897
48.8%

Interactions

2023-12-13T01:03:25.445008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:03:28.746130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호사업지구담당자
번호1.0000.6650.965
사업지구0.6651.0000.971
담당자0.9650.9711.000
2023-12-13T01:03:28.903522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호사업지구
번호1.0000.332
사업지구0.3321.000

Missing values

2023-12-13T01:03:25.574770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:03:25.675436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호공고명사업지구공고일마감일담당자
01부산에코델타시티 공동주택용지 분양공고부산에코델타시티2021-12-132021-12-23김진환
12부산에코델타시티 도시지원시설용지 분양공고(1순위)부산에코델타시티2021-12-092021-12-22김진환
23부산에코델타시티 점포겸용 2차 이주자택지 공급공고부산에코델타시티2021-12-082021-12-20박준현
34[정정공고] 부산에코델타시티 단독주택용지(주거전용) 분양 정정공고부산에코델타시티2021-12-012021-12-08김진환
45부산에코델타시티 단독주택용지(주거전용) 분양공고부산에코델타시티2021-12-012021-12-08김진환
56부산에코델타시티 전용주거 이주자택지 공급공고부산에코델타시티2021-11-182021-11-29박준현
67(나주노안지구) 블록형단독주택용지 수의분양 공고나주노안지구2021-11-152021-12-02정두원
78부산에코델타시티 산업시설(도시형공장)용지 분양공고(1차,명지동)부산에코델타시티2021-11-102021-11-25김진환
89나주노안지구 블록형 단독주택용지 분양공고나주노안지구2021-10-252021-11-04정두원
910부산에코델타시티 연구시설용지 분양공고(2차)부산에코델타시티2021-09-162021-09-30김진환
번호공고명사업지구공고일마감일담당자
603604화성 전곡항 토지 공개분양 공고시화지구2009-04-232009-05-14하정미
6046052008년 구미국가산업단지 제4단지 산업시설용지 분양공고구미제4단지2008-11-252008-12-12이광호
605606구미 국가산업단지 지원시설용지 수의분양 공고구미제4단지2008-10-232009-09-30조세정
606607안산신도시 고잔지구 (종합의료시설) 공개 분양안산2단계2008-10-022009-06-30김상숙
607608화성 전곡항 토지 공개분양 공고시화지구2008-09-302008-10-24하정미
6086092008년도 제2차 구미 국가산업단지 지원시설용지 공개분양구미제4단지2008-09-192008-10-10조세정
609610상업용지 및 주차장용지 분양공고구미제4단지2008-09-012008-09-26이광호
610611여수국가산업단지확장단지산업시설용지분양공고여수확장단지2008-07-252008-08-12이홍용
611612구미 국가산업단지 제4단지 지원시설용지 공개분양구미제4단지2008-07-152008-08-08조세정
612613구미 국가산업단지 제4단지 임대전용 산업단지 임대 및 청약구미임대전용2008-07-072008-09-08조세정