Overview

Dataset statistics

Number of variables4
Number of observations28
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)7.1%
Total size in memory1.1 KiB
Average record size in memory39.7 B

Variable types

Text1
Numeric2
Categorical1

Dataset

Description한국지역난방공사에서 제공하는 계약이행각서 관련 정보입니다(2016년 대상, 계약명, 계약금액, 계약보증금율, 보증금액 등)
Author한국지역난방공사
URLhttps://www.data.go.kr/data/15124152/fileData.do

Alerts

Dataset has 2 (7.1%) duplicate rowsDuplicates
계약금액 is highly overall correlated with 보증금High correlation
보증금 is highly overall correlated with 계약금액High correlation

Reproduction

Analysis started2023-12-12 14:27:27.961169
Analysis finished2023-12-12 14:27:28.712803
Duration0.75 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct23
Distinct (%)82.1%
Missing0
Missing (%)0.0%
Memory size356.0 B
2023-12-12T23:27:28.988206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length30
Mean length24.678571
Min length11

Characters and Unicode

Total characters691
Distinct characters176
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)75.0%

Sample

1st row2016년 고양사업소 Critical Point 설비 유지보수용역
2nd row청탁금지법 등 청렴교육을 위한 청렴연극
3rd row배전야적장 목재휀스 설치공사
4th row대외업무 및 공사 내방객 응대를 위한 홍보기념품 구매
5th row황산 및 황연저감제 피해예측 기술용역
ValueCountFrequency (%)
10
 
6.7%
구매 7
 
4.7%
2016년 7
 
4.7%
광교지사 6
 
4.0%
설비 6
 
4.0%
교체 6
 
4.0%
순수생산 5
 
3.3%
membrane 5
 
3.3%
구매설치 4
 
2.7%
용역 3
 
2.0%
Other values (77) 91
60.7%
2023-12-12T23:27:29.441852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
123
 
17.8%
21
 
3.0%
18
 
2.6%
16
 
2.3%
16
 
2.3%
14
 
2.0%
13
 
1.9%
12
 
1.7%
12
 
1.7%
1 11
 
1.6%
Other values (166) 435
63.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 438
63.4%
Space Separator 123
 
17.8%
Lowercase Letter 59
 
8.5%
Decimal Number 40
 
5.8%
Uppercase Letter 26
 
3.8%
Open Punctuation 2
 
0.3%
Close Punctuation 2
 
0.3%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
4.8%
18
 
4.1%
16
 
3.7%
16
 
3.7%
14
 
3.2%
13
 
3.0%
12
 
2.7%
12
 
2.7%
11
 
2.5%
11
 
2.5%
Other values (133) 294
67.1%
Lowercase Letter
ValueCountFrequency (%)
e 10
16.9%
a 8
13.6%
r 7
11.9%
n 6
10.2%
m 5
8.5%
t 5
8.5%
b 5
8.5%
c 3
 
5.1%
i 3
 
5.1%
u 2
 
3.4%
Other values (4) 5
8.5%
Uppercase Letter
ValueCountFrequency (%)
M 5
19.2%
E 4
15.4%
C 4
15.4%
D 3
11.5%
S 3
11.5%
L 2
 
7.7%
B 2
 
7.7%
A 1
 
3.8%
P 1
 
3.8%
T 1
 
3.8%
Decimal Number
ValueCountFrequency (%)
1 11
27.5%
2 10
25.0%
0 9
22.5%
6 9
22.5%
7 1
 
2.5%
Space Separator
ValueCountFrequency (%)
123
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 438
63.4%
Common 168
 
24.3%
Latin 85
 
12.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
 
4.8%
18
 
4.1%
16
 
3.7%
16
 
3.7%
14
 
3.2%
13
 
3.0%
12
 
2.7%
12
 
2.7%
11
 
2.5%
11
 
2.5%
Other values (133) 294
67.1%
Latin
ValueCountFrequency (%)
e 10
 
11.8%
a 8
 
9.4%
r 7
 
8.2%
n 6
 
7.1%
m 5
 
5.9%
t 5
 
5.9%
M 5
 
5.9%
b 5
 
5.9%
E 4
 
4.7%
C 4
 
4.7%
Other values (14) 26
30.6%
Common
ValueCountFrequency (%)
123
73.2%
1 11
 
6.5%
2 10
 
6.0%
0 9
 
5.4%
6 9
 
5.4%
( 2
 
1.2%
) 2
 
1.2%
7 1
 
0.6%
~ 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 438
63.4%
ASCII 253
36.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
123
48.6%
1 11
 
4.3%
e 10
 
4.0%
2 10
 
4.0%
0 9
 
3.6%
6 9
 
3.6%
a 8
 
3.2%
r 7
 
2.8%
n 6
 
2.4%
m 5
 
2.0%
Other values (23) 55
21.7%
Hangul
ValueCountFrequency (%)
21
 
4.8%
18
 
4.1%
16
 
3.7%
16
 
3.7%
14
 
3.2%
13
 
3.0%
12
 
2.7%
12
 
2.7%
11
 
2.5%
11
 
2.5%
Other values (133) 294
67.1%

계약금액
Real number (ℝ)

HIGH CORRELATION 

Distinct23
Distinct (%)82.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17292325
Minimum1298000
Maximum47009870
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size384.0 B
2023-12-12T23:27:29.616760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1298000
5-th percentile2659250
Q19206725
median17837239
Q327295248
95-th percentile30155846
Maximum47009870
Range45711870
Interquartile range (IQR)18088522

Descriptive statistics

Standard deviation11251274
Coefficient of variation (CV)0.6506513
Kurtosis0.040074228
Mean17292325
Median Absolute Deviation (MAD)8663789
Skewness0.6177926
Sum4.841851 × 108
Variance1.2659116 × 1014
MonotonicityNot monotonic
2023-12-12T23:27:29.768866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
29980990 5
 
17.9%
19322630 2
 
7.1%
10143800 1
 
3.6%
2695000 1
 
3.6%
8673500 1
 
3.6%
30250000 1
 
3.6%
26400000 1
 
3.6%
17421000 1
 
3.6%
18253478 1
 
3.6%
1298000 1
 
3.6%
Other values (13) 13
46.4%
ValueCountFrequency (%)
1298000 1
3.6%
2640000 1
3.6%
2695000 1
3.6%
2915000 1
3.6%
8580000 1
3.6%
8673500 1
3.6%
9106900 1
3.6%
9240000 1
3.6%
9405000 1
3.6%
9537000 1
3.6%
ValueCountFrequency (%)
47009870 1
 
3.6%
30250000 1
 
3.6%
29980990 5
17.9%
26400000 1
 
3.6%
23265000 1
 
3.6%
19322630 2
 
7.1%
18947500 1
 
3.6%
18273817 1
 
3.6%
18253478 1
 
3.6%
17421000 1
 
3.6%
Distinct2
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Memory size356.0 B
15
16 
10
12 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row15
2nd row15
3rd row15
4th row10
5th row15

Common Values

ValueCountFrequency (%)
15 16
57.1%
10 12
42.9%

Length

2023-12-12T23:27:29.923899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:27:30.039458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
15 16
57.1%
10 12
42.9%

보증금
Real number (ℝ)

HIGH CORRELATION 

Distinct23
Distinct (%)82.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2152429.6
Minimum194700
Maximum7051480
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size384.0 B
2023-12-12T23:27:30.133106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum194700
5-th percentile398887.5
Q11087751.5
median1913506.5
Q32998099
95-th percentile4335375
Maximum7051480
Range6856780
Interquartile range (IQR)1910347.5

Descriptive statistics

Standard deviation1498121.4
Coefficient of variation (CV)0.69601415
Kurtosis2.8497854
Mean2152429.6
Median Absolute Deviation (MAD)1007656.5
Skewness1.3172647
Sum60268028
Variance2.2443678 × 1012
MonotonicityNot monotonic
2023-12-12T23:27:30.253380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
2998099 5
 
17.9%
1932263 2
 
7.1%
1521570 1
 
3.6%
404250 1
 
3.6%
1301025 1
 
3.6%
4537500 1
 
3.6%
3960000 1
 
3.6%
2613150 1
 
3.6%
2738022 1
 
3.6%
194700 1
 
3.6%
Other values (13) 13
46.4%
ValueCountFrequency (%)
194700 1
3.6%
396000 1
3.6%
404250 1
3.6%
437250 1
3.6%
858000 1
3.6%
953700 1
3.6%
1061500 1
3.6%
1096502 1
3.6%
1301025 1
3.6%
1366035 1
3.6%
ValueCountFrequency (%)
7051480 1
 
3.6%
4537500 1
 
3.6%
3960000 1
 
3.6%
3489750 1
 
3.6%
2998099 5
17.9%
2741073 1
 
3.6%
2738022 1
 
3.6%
2613150 1
 
3.6%
1932263 2
 
7.1%
1894750 1
 
3.6%

Interactions

2023-12-12T23:27:28.296270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:27:28.144529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:27:28.394258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:27:28.214933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:27:30.335231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계약명계약금액계약보증금율보증금
계약명1.0001.0001.0001.000
계약금액1.0001.0000.6430.976
계약보증금율1.0000.6431.0000.716
보증금1.0000.9760.7161.000
2023-12-12T23:27:30.445539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계약금액보증금계약보증금율
계약금액1.0000.9370.447
보증금0.9371.0000.488
계약보증금율0.4470.4881.000

Missing values

2023-12-12T23:27:28.557154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:27:28.674736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

계약명계약금액계약보증금율보증금
02016년 고양사업소 Critical Point 설비 유지보수용역10143800151521570
1청탁금지법 등 청렴교육을 위한 청렴연극269500015404250
2배전야적장 목재휀스 설치공사47009870157051480
3대외업무 및 공사 내방객 응대를 위한 홍보기념품 구매18947500101894750
4황산 및 황연저감제 피해예측 기술용역291500015437250
5화성동탄2 집단에너지시설 건설공사 조명기구 1차 구매(LED방폭등)10965020101096502
6CTBD 재생설비 염산주입펌프 교체공사9405000151410750
72016년 홍보용 브로슈어(국문) 수정 제작 구매953700010953700
8노후 폐수처리조 바탕면 보수공사9106900151366035
92016년도 고양사업소 사무실 실내 공기질 측정 용역264000015396000
계약명계약금액계약보증금율보증금
18광교지사 순수생산 설비 Membrane 구매 및 교체29980990102998099
19광교지사 순수생산 설비 Membrane 구매 및 교체29980990102998099
20광교지사 순수생산 설비 Membrane 구매 및 교체29980990102998099
21광교지사 순수생산 설비 Membrane 구매 및 교체29980990102998099
22광교지사 순수생산 설비 Membrane 구매 및 교체29980990102998099
232016년 용인지사 소방설비 점검용역18253478152738022
24활선엘보연결분리 및 무정전 변압기 교체 공사17421000152613150
252016년 고양CES 터보냉동기 유지보수용역26400000153960000
262016년 고양CES 흡수식냉동기 유지보수용역30250000154537500
27관리동 옥상 누수공사8673500151301025

Duplicate rows

Most frequently occurring

계약명계약금액계약보증금율보증금# duplicates
0광교지사 순수생산 설비 Membrane 구매 및 교체299809901029980995
1염산 및 가성소다 누출감지기 구매설치193226301019322632