Overview

Dataset statistics

Number of variables7
Number of observations911
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory50.8 KiB
Average record size in memory57.1 B

Variable types

Categorical3
Text2
DateTime1
Numeric1

Dataset

Description인천광역시 연수구 계약현황 정보로 계약부서, 계약명, 계약종류, 계약유형, 계약일자, 계약금액, 계약상대자 등이 포함되어 있습니다.
Author인천광역시 연수구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15063886&srcSe=7661IVAWM27C61E190

Alerts

계약부서 has constant value ""Constant
계약유형 is highly overall correlated with 계약종류High correlation
계약종류 is highly overall correlated with 계약유형High correlation

Reproduction

Analysis started2024-04-29 13:37:19.146825
Analysis finished2024-04-29 13:37:21.091778
Duration1.94 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

계약부서
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.2 KiB
재무회계과
911 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row재무회계과
2nd row재무회계과
3rd row재무회계과
4th row재무회계과
5th row재무회계과

Common Values

ValueCountFrequency (%)
재무회계과 911
100.0%

Length

2024-04-29T22:37:21.157195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:37:21.241846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
재무회계과 911
100.0%
Distinct890
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size7.2 KiB
2024-04-29T22:37:21.497635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length42
Mean length26.059276
Min length8

Characters and Unicode

Total characters23740
Distinct characters546
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique875 ?
Unique (%)96.0%

Sample

1st row2022년 연수구 누리집 소프트웨어 유지보수 및 개편 사업
2nd row2022년 선학지하차도 전기안전관리대행 용역
3rd row2022년도 신문스크랩 프로그램 사용 계약
4th row연수공동구 유지관리용역(1차년도)
5th row2022 보건관리 위탁 용역 계약
ValueCountFrequency (%)
2022년 214
 
4.9%
구입 132
 
3.0%
99
 
2.3%
일원 93
 
2.1%
용역 82
 
1.9%
시행 69
 
1.6%
63
 
1.4%
공사 45
 
1.0%
2022년도 43
 
1.0%
설치 43
 
1.0%
Other values (1552) 3485
79.8%
2024-04-29T22:37:21.898641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3480
 
14.7%
2 1064
 
4.5%
647
 
2.7%
567
 
2.4%
543
 
2.3%
421
 
1.8%
) 419
 
1.8%
( 418
 
1.8%
0 366
 
1.5%
342
 
1.4%
Other values (536) 15473
65.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16803
70.8%
Space Separator 3480
 
14.7%
Decimal Number 1813
 
7.6%
Close Punctuation 593
 
2.5%
Open Punctuation 592
 
2.5%
Uppercase Letter 284
 
1.2%
Dash Punctuation 91
 
0.4%
Lowercase Letter 28
 
0.1%
Other Punctuation 27
 
0.1%
Math Symbol 17
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
647
 
3.9%
567
 
3.4%
543
 
3.2%
421
 
2.5%
342
 
2.0%
320
 
1.9%
302
 
1.8%
298
 
1.8%
284
 
1.7%
279
 
1.7%
Other values (472) 12800
76.2%
Uppercase Letter
ValueCountFrequency (%)
L 61
21.5%
C 57
20.1%
T 34
12.0%
V 27
9.5%
D 25
8.8%
E 20
 
7.0%
I 15
 
5.3%
W 8
 
2.8%
P 7
 
2.5%
F 6
 
2.1%
Other values (11) 24
 
8.5%
Lowercase Letter
ValueCountFrequency (%)
e 6
21.4%
s 4
14.3%
o 3
10.7%
m 3
10.7%
g 2
 
7.1%
h 2
 
7.1%
p 2
 
7.1%
t 1
 
3.6%
i 1
 
3.6%
a 1
 
3.6%
Other values (3) 3
10.7%
Decimal Number
ValueCountFrequency (%)
2 1064
58.7%
0 366
 
20.2%
1 158
 
8.7%
4 55
 
3.0%
5 51
 
2.8%
3 50
 
2.8%
7 22
 
1.2%
9 18
 
1.0%
8 15
 
0.8%
6 14
 
0.8%
Other Punctuation
ValueCountFrequency (%)
· 21
77.8%
; 2
 
7.4%
. 1
 
3.7%
! 1
 
3.7%
/ 1
 
3.7%
: 1
 
3.7%
Close Punctuation
ValueCountFrequency (%)
) 419
70.7%
] 102
 
17.2%
49
 
8.3%
22
 
3.7%
1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 418
70.6%
[ 102
 
17.2%
49
 
8.3%
22
 
3.7%
1
 
0.2%
Space Separator
ValueCountFrequency (%)
3480
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 91
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16803
70.8%
Common 6625
 
27.9%
Latin 312
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
647
 
3.9%
567
 
3.4%
543
 
3.2%
421
 
2.5%
342
 
2.0%
320
 
1.9%
302
 
1.8%
298
 
1.8%
284
 
1.7%
279
 
1.7%
Other values (472) 12800
76.2%
Latin
ValueCountFrequency (%)
L 61
19.6%
C 57
18.3%
T 34
10.9%
V 27
8.7%
D 25
8.0%
E 20
 
6.4%
I 15
 
4.8%
W 8
 
2.6%
P 7
 
2.2%
F 6
 
1.9%
Other values (24) 52
16.7%
Common
ValueCountFrequency (%)
3480
52.5%
2 1064
 
16.1%
) 419
 
6.3%
( 418
 
6.3%
0 366
 
5.5%
1 158
 
2.4%
] 102
 
1.5%
[ 102
 
1.5%
- 91
 
1.4%
4 55
 
0.8%
Other values (20) 370
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16801
70.8%
ASCII 6772
28.5%
None 165
 
0.7%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3480
51.4%
2 1064
 
15.7%
) 419
 
6.2%
( 418
 
6.2%
0 366
 
5.4%
1 158
 
2.3%
] 102
 
1.5%
[ 102
 
1.5%
- 91
 
1.3%
L 61
 
0.9%
Other values (47) 511
 
7.5%
Hangul
ValueCountFrequency (%)
647
 
3.9%
567
 
3.4%
543
 
3.2%
421
 
2.5%
342
 
2.0%
320
 
1.9%
302
 
1.8%
298
 
1.8%
284
 
1.7%
279
 
1.7%
Other values (471) 12798
76.2%
None
ValueCountFrequency (%)
49
29.7%
49
29.7%
22
13.3%
22
13.3%
· 21
12.7%
1
 
0.6%
1
 
0.6%
Compat Jamo
ValueCountFrequency (%)
2
100.0%

계약종류
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.2 KiB
물품
406 
공사
278 
용역
227 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row용역
2nd row용역
3rd row용역
4th row용역
5th row용역

Common Values

ValueCountFrequency (%)
물품 406
44.6%
공사 278
30.5%
용역 227
24.9%

Length

2024-04-29T22:37:22.012263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:37:22.097247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
물품 406
44.6%
공사 278
30.5%
용역 227
24.9%

계약유형
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size7.2 KiB
구매
287 
전문
184 
일반
150 
관급자재구매
115 
전기
55 
Other values (14)
120 

Length

Max length6
Median length2
Mean length2.5993414
Min length2

Unique

Unique4 ?
Unique (%)0.4%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
구매 287
31.5%
전문 184
20.2%
일반 150
16.5%
관급자재구매 115
12.6%
전기 55
 
6.0%
폐기물 42
 
4.6%
기술 19
 
2.1%
기타 15
 
1.6%
정보통신 15
 
1.6%
산림 10
 
1.1%
Other values (9) 19
 
2.1%

Length

2024-04-29T22:37:22.243281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
구매 287
31.5%
전문 184
20.2%
일반 150
16.5%
관급자재구매 115
12.6%
전기 55
 
6.0%
폐기물 42
 
4.6%
기술 19
 
2.1%
기타 15
 
1.6%
정보통신 15
 
1.6%
산림 10
 
1.1%
Other values (9) 19
 
2.1%
Distinct226
Distinct (%)24.8%
Missing0
Missing (%)0.0%
Memory size7.2 KiB
Minimum2021-12-23 00:00:00
Maximum2022-12-30 00:00:00
2024-04-29T22:37:22.385345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:37:22.536076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

계약금액
Real number (ℝ)

Distinct869
Distinct (%)95.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.0394771 × 108
Minimum10000060
Maximum1.1702641 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.1 KiB
2024-04-29T22:37:22.686987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10000060
5-th percentile11332250
Q117094500
median23994000
Q365518500
95-th percentile3.3263443 × 108
Maximum1.1702641 × 1010
Range1.1692641 × 1010
Interquartile range (IQR)48424000

Descriptive statistics

Standard deviation4.8079541 × 108
Coefficient of variation (CV)4.6253583
Kurtosis408.81523
Mean1.0394771 × 108
Median Absolute Deviation (MAD)11346000
Skewness18.487761
Sum9.4696364 × 1010
Variance2.3116423 × 1017
MonotonicityNot monotonic
2024-04-29T22:37:22.873903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
19000000 8
 
0.9%
20900000 6
 
0.7%
18000000 4
 
0.4%
19400000 3
 
0.3%
26400000 3
 
0.3%
100000000 3
 
0.3%
12490000 2
 
0.2%
19600000 2
 
0.2%
10450000 2
 
0.2%
19440000 2
 
0.2%
Other values (859) 876
96.2%
ValueCountFrequency (%)
10000060 1
0.1%
10000550 1
0.1%
10003000 1
0.1%
10006710 1
0.1%
10008540 1
0.1%
10027000 1
0.1%
10027600 1
0.1%
10031000 1
0.1%
10033360 1
0.1%
10038800 1
0.1%
ValueCountFrequency (%)
11702640700 1
0.1%
6565735000 1
0.1%
2047769850 1
0.1%
2026539040 1
0.1%
1708404350 1
0.1%
1459000000 1
0.1%
1441033160 1
0.1%
1343524000 1
0.1%
1322080200 1
0.1%
1234375210 1
0.1%
Distinct656
Distinct (%)72.0%
Missing0
Missing (%)0.0%
Memory size7.2 KiB
2024-04-29T22:37:23.093663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length8.3347969
Min length2

Characters and Unicode

Total characters7593
Distinct characters384
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique514 ?
Unique (%)56.4%

Sample

1st row(주)도프네트웍
2nd row(주)대신전기안전
3rd row(주)다하미커뮤니케이션즈
4th row주식회사 광원
5th row의료법인 길 의료재단
ValueCountFrequency (%)
주식회사 321
 
25.2%
에스지이 14
 
1.1%
주)삼진콘크리트 7
 
0.5%
주)유원환경 7
 
0.5%
신유이엘 6
 
0.5%
주)은성환경 6
 
0.5%
연수구위생공사 6
 
0.5%
주)대도환경 6
 
0.5%
주)동우환경 6
 
0.5%
청연 5
 
0.4%
Other values (671) 890
69.9%
2024-04-29T22:37:23.501927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
700
 
9.2%
453
 
6.0%
385
 
5.1%
365
 
4.8%
363
 
4.8%
) 336
 
4.4%
( 336
 
4.4%
209
 
2.8%
170
 
2.2%
134
 
1.8%
Other values (374) 4142
54.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6527
86.0%
Space Separator 363
 
4.8%
Close Punctuation 336
 
4.4%
Open Punctuation 336
 
4.4%
Uppercase Letter 16
 
0.2%
Decimal Number 8
 
0.1%
Lowercase Letter 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
700
 
10.7%
453
 
6.9%
385
 
5.9%
365
 
5.6%
209
 
3.2%
170
 
2.6%
134
 
2.1%
113
 
1.7%
106
 
1.6%
105
 
1.6%
Other values (351) 3787
58.0%
Uppercase Letter
ValueCountFrequency (%)
B 3
18.8%
I 3
18.8%
D 2
12.5%
T 2
12.5%
H 1
 
6.2%
J 1
 
6.2%
W 1
 
6.2%
X 1
 
6.2%
R 1
 
6.2%
G 1
 
6.2%
Lowercase Letter
ValueCountFrequency (%)
s 2
28.6%
k 1
14.3%
o 1
14.3%
c 1
14.3%
e 1
14.3%
y 1
14.3%
Decimal Number
ValueCountFrequency (%)
8 4
50.0%
2 2
25.0%
3 1
 
12.5%
0 1
 
12.5%
Space Separator
ValueCountFrequency (%)
363
100.0%
Close Punctuation
ValueCountFrequency (%)
) 336
100.0%
Open Punctuation
ValueCountFrequency (%)
( 336
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6523
85.9%
Common 1043
 
13.7%
Latin 23
 
0.3%
Han 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
700
 
10.7%
453
 
6.9%
385
 
5.9%
365
 
5.6%
209
 
3.2%
170
 
2.6%
134
 
2.1%
113
 
1.7%
106
 
1.6%
105
 
1.6%
Other values (347) 3783
58.0%
Latin
ValueCountFrequency (%)
B 3
13.0%
I 3
13.0%
D 2
 
8.7%
T 2
 
8.7%
s 2
 
8.7%
H 1
 
4.3%
J 1
 
4.3%
W 1
 
4.3%
X 1
 
4.3%
k 1
 
4.3%
Other values (6) 6
26.1%
Common
ValueCountFrequency (%)
363
34.8%
) 336
32.2%
( 336
32.2%
8 4
 
0.4%
2 2
 
0.2%
3 1
 
0.1%
0 1
 
0.1%
Han
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6523
85.9%
ASCII 1066
 
14.0%
CJK 4
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
700
 
10.7%
453
 
6.9%
385
 
5.9%
365
 
5.6%
209
 
3.2%
170
 
2.6%
134
 
2.1%
113
 
1.7%
106
 
1.6%
105
 
1.6%
Other values (347) 3783
58.0%
ASCII
ValueCountFrequency (%)
363
34.1%
) 336
31.5%
( 336
31.5%
8 4
 
0.4%
B 3
 
0.3%
I 3
 
0.3%
D 2
 
0.2%
2 2
 
0.2%
T 2
 
0.2%
s 2
 
0.2%
Other values (13) 13
 
1.2%
CJK
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Interactions

2024-04-29T22:37:20.754236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-29T22:37:23.600454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계약종류계약유형계약금액
계약종류1.0000.9930.068
계약유형0.9931.0000.640
계약금액0.0680.6401.000
2024-04-29T22:37:23.697030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계약유형계약종류
계약유형1.0000.984
계약종류0.9841.000
2024-04-29T22:37:23.787565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계약금액계약종류계약유형
계약금액1.0000.0640.405
계약종류0.0641.0000.984
계약유형0.4050.9841.000

Missing values

2024-04-29T22:37:20.935925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-29T22:37:21.049981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

계약부서계약명계약종류계약유형계약일자계약금액계약상대자
0재무회계과2022년 연수구 누리집 소프트웨어 유지보수 및 개편 사업용역일반2021-12-2343124400(주)도프네트웍
1재무회계과2022년 선학지하차도 전기안전관리대행 용역용역일반2021-12-2714252000(주)대신전기안전
2재무회계과2022년도 신문스크랩 프로그램 사용 계약용역일반2021-12-2715972000(주)다하미커뮤니케이션즈
3재무회계과연수공동구 유지관리용역(1차년도)용역일반2021-12-271198369810주식회사 광원
4재무회계과2022 보건관리 위탁 용역 계약용역일반2021-12-2812480000의료법인 길 의료재단
5재무회계과2022년 부동산종합공부시스템 전산장비 유지보수 용역용역일반2021-12-2813400000주식회사 더넷
6재무회계과2022 산업안전관리 위탁대행 용역용역일반2021-12-2813728000(주)세이프티컨설팅
7재무회계과2022년도 보도육교 승강기 유지관리용역용역일반2021-12-2815441000상원엘리베이터주식회사
8재무회계과2022년 보안등 설치 및 보수공사(연간단가)공사전기2021-12-2899352000보령기전주식회사
9재무회계과2022년 무인민원발급기(연수1동 외 7개소) 유지보수 용역용역일반2021-12-2912541000한국타피(주)
계약부서계약명계약종류계약유형계약일자계약금액계약상대자
901재무회계과다가치세움소 리모델링 공사(소방)공사소방2022-12-1920210000(주)부석이엔씨
902재무회계과옥련시장 방음벽 설치공사 시행물품관급자재구매2022-12-2118000000광스틸
903재무회계과연수 구정·비전·운영 홍보 영상 제작용역일반2022-12-2119400000이미지텔링
904재무회계과다가치세움소 리모델링 공사(통신)공사정보통신2022-12-2135354240(주)우진코퍼레이션
905재무회계과2023년 직원 업무수첩 제작물품구매2022-12-2218988000애플기획
906재무회계과2022년 도로환경미화원 정년퇴직자 부상품 구입물품구매2022-12-2318271000명보석
907재무회계과청학동 63-10번지 일원 석면 처리용역용역일반2022-12-2712367210진성산업개발(주)
908재무회계과송도국제도시도서관 가설재울타리 설치공사공사전문2022-12-2935000000(주)승지건설
909재무회계과송도 2공구 자동집하시설 관로 및 집하시설 개선공사 실시설계용역 시행용역일반2022-12-2986806000주식회사 유신
910재무회계과송도2공구 자동집하시설 제어시스템 제작·설치 시행물품구매2022-12-30832588760주식회사 코젠