Overview

Dataset statistics

Number of variables6
Number of observations25
Missing cells9
Missing cells (%)6.0%
Duplicate rows1
Duplicate rows (%)4.0%
Total size in memory1.3 KiB
Average record size in memory54.1 B

Variable types

Numeric1
Categorical3
Text2

Dataset

Description부산광역시 사상구 내 등록되어 있는 대부업체 현황에 관한 데이터로 상호명, 등록신청 유형, 소재지 등의 정보를 제공합니다.
Author부산광역시 사상구
URLhttps://www.data.go.kr/data/15043155/fileData.do

Alerts

Dataset has 1 (4.0%) duplicate rowsDuplicates
데이터기준일 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
등록신청사업 is highly overall correlated with 시군구명 and 1 other fieldsHigh correlation
시군구명 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
연번 is highly overall correlated with 시군구명 and 1 other fieldsHigh correlation
연번 has 3 (12.0%) missing valuesMissing
상호 has 3 (12.0%) missing valuesMissing
소재지(도로명) has 3 (12.0%) missing valuesMissing

Reproduction

Analysis started2024-03-14 13:54:44.619552
Analysis finished2024-03-14 13:54:46.120414
Duration1.5 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct22
Distinct (%)100.0%
Missing3
Missing (%)12.0%
Infinite0
Infinite (%)0.0%
Mean11.5
Minimum1
Maximum22
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size353.0 B
2024-03-14T22:54:46.322558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.05
Q16.25
median11.5
Q316.75
95-th percentile20.95
Maximum22
Range21
Interquartile range (IQR)10.5

Descriptive statistics

Standard deviation6.4935866
Coefficient of variation (CV)0.5646597
Kurtosis-1.2
Mean11.5
Median Absolute Deviation (MAD)5.5
Skewness0
Sum253
Variance42.166667
MonotonicityStrictly increasing
2024-03-14T22:54:46.693315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
13 1
 
4.0%
22 1
 
4.0%
21 1
 
4.0%
20 1
 
4.0%
19 1
 
4.0%
18 1
 
4.0%
17 1
 
4.0%
16 1
 
4.0%
15 1
 
4.0%
14 1
 
4.0%
Other values (12) 12
48.0%
(Missing) 3
 
12.0%
ValueCountFrequency (%)
1 1
4.0%
2 1
4.0%
3 1
4.0%
4 1
4.0%
5 1
4.0%
6 1
4.0%
7 1
4.0%
8 1
4.0%
9 1
4.0%
10 1
4.0%
ValueCountFrequency (%)
22 1
4.0%
21 1
4.0%
20 1
4.0%
19 1
4.0%
18 1
4.0%
17 1
4.0%
16 1
4.0%
15 1
4.0%
14 1
4.0%
13 1
4.0%

시군구명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size328.0 B
부산광역시 사상구
22 
<NA>

Length

Max length9
Median length9
Mean length8.4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시 사상구
2nd row부산광역시 사상구
3rd row부산광역시 사상구
4th row부산광역시 사상구
5th row부산광역시 사상구

Common Values

ValueCountFrequency (%)
부산광역시 사상구 22
88.0%
<NA> 3
 
12.0%

Length

2024-03-14T22:54:47.077753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T22:54:47.383467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 22
46.8%
사상구 22
46.8%
na 3
 
6.4%

등록신청사업
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Memory size328.0 B
대부업
15 
대부중개업
<NA>

Length

Max length5
Median length3
Mean length3.68
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대부업
2nd row대부중개업
3rd row대부업
4th row대부중개업
5th row대부업

Common Values

ValueCountFrequency (%)
대부업 15
60.0%
대부중개업 7
28.0%
<NA> 3
 
12.0%

Length

2024-03-14T22:54:47.976513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T22:54:48.322393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대부업 15
60.0%
대부중개업 7
28.0%
na 3
 
12.0%

상호
Text

MISSING 

Distinct20
Distinct (%)90.9%
Missing3
Missing (%)12.0%
Memory size328.0 B
2024-03-14T22:54:48.975614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length8.4090909
Min length4

Characters and Unicode

Total characters185
Distinct characters61
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)81.8%

Sample

1st row동반대부
2nd row(주)굿스마일대부
3rd row(주)굿스마일대부
4th row주식회사 투게더에셋대부
5th row미광전당포대부
ValueCountFrequency (%)
주식회사 5
 
17.2%
주)굿스마일대부 2
 
6.9%
부성자산관리(주 2
 
6.9%
동반대부 1
 
3.4%
대부중개 1
 
3.4%
다온 1
 
3.4%
사상전당포대부 1
 
3.4%
대륙전당포대부 1
 
3.4%
신엄궁전당포대부 1
 
3.4%
코리아에셋머니대부 1
 
3.4%
Other values (13) 13
44.8%
2024-03-14T22:54:50.093966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
11.9%
21
 
11.4%
10
 
5.4%
7
 
3.8%
6
 
3.2%
6
 
3.2%
5
 
2.7%
5
 
2.7%
( 4
 
2.2%
4
 
2.2%
Other values (51) 95
51.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 170
91.9%
Space Separator 7
 
3.8%
Open Punctuation 4
 
2.2%
Close Punctuation 4
 
2.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
12.9%
21
 
12.4%
10
 
5.9%
6
 
3.5%
6
 
3.5%
5
 
2.9%
5
 
2.9%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (48) 83
48.8%
Space Separator
ValueCountFrequency (%)
7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 170
91.9%
Common 15
 
8.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
12.9%
21
 
12.4%
10
 
5.9%
6
 
3.5%
6
 
3.5%
5
 
2.9%
5
 
2.9%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (48) 83
48.8%
Common
ValueCountFrequency (%)
7
46.7%
( 4
26.7%
) 4
26.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 170
91.9%
ASCII 15
 
8.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
 
12.9%
21
 
12.4%
10
 
5.9%
6
 
3.5%
6
 
3.5%
5
 
2.9%
5
 
2.9%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (48) 83
48.8%
ASCII
ValueCountFrequency (%)
7
46.7%
( 4
26.7%
) 4
26.7%

소재지(도로명)
Text

MISSING 

Distinct15
Distinct (%)68.2%
Missing3
Missing (%)12.0%
Memory size328.0 B
2024-03-14T22:54:50.722138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length25
Mean length24.409091
Min length17

Characters and Unicode

Total characters537
Distinct characters70
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)40.9%

Sample

1st row부산광역시 사상구 삼락동 418-26 성신태권도
2nd row부산광역시 사상구 모라동 1329-7
3rd row부산광역시 사상구 모라동 1329-7
4th row부산광역시 사상구 괘법동 562
5th row부산광역시 사상구 모라동 1057 이화하이츠
ValueCountFrequency (%)
부산광역시 22
21.0%
사상구 22
21.0%
감전동 9
 
8.6%
주례동 4
 
3.8%
부산산업용재유통상가 3
 
2.9%
152-2 3
 
2.9%
괘법동 3
 
2.9%
모라동 3
 
2.9%
54 2
 
1.9%
948-8 2
 
1.9%
Other values (25) 32
30.5%
2024-03-14T22:54:51.871613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
83
 
15.5%
30
 
5.6%
27
 
5.0%
25
 
4.7%
22
 
4.1%
22
 
4.1%
22
 
4.1%
22
 
4.1%
22
 
4.1%
22
 
4.1%
Other values (60) 240
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 347
64.6%
Decimal Number 94
 
17.5%
Space Separator 83
 
15.5%
Dash Punctuation 13
 
2.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
8.6%
27
 
7.8%
25
 
7.2%
22
 
6.3%
22
 
6.3%
22
 
6.3%
22
 
6.3%
22
 
6.3%
22
 
6.3%
9
 
2.6%
Other values (48) 124
35.7%
Decimal Number
ValueCountFrequency (%)
1 19
20.2%
2 19
20.2%
5 12
12.8%
4 9
9.6%
3 9
9.6%
8 7
 
7.4%
6 6
 
6.4%
9 6
 
6.4%
7 5
 
5.3%
0 2
 
2.1%
Space Separator
ValueCountFrequency (%)
83
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 347
64.6%
Common 190
35.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
8.6%
27
 
7.8%
25
 
7.2%
22
 
6.3%
22
 
6.3%
22
 
6.3%
22
 
6.3%
22
 
6.3%
22
 
6.3%
9
 
2.6%
Other values (48) 124
35.7%
Common
ValueCountFrequency (%)
83
43.7%
1 19
 
10.0%
2 19
 
10.0%
- 13
 
6.8%
5 12
 
6.3%
4 9
 
4.7%
3 9
 
4.7%
8 7
 
3.7%
6 6
 
3.2%
9 6
 
3.2%
Other values (2) 7
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 347
64.6%
ASCII 190
35.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
83
43.7%
1 19
 
10.0%
2 19
 
10.0%
- 13
 
6.8%
5 12
 
6.3%
4 9
 
4.7%
3 9
 
4.7%
8 7
 
3.7%
6 6
 
3.2%
9 6
 
3.2%
Other values (2) 7
 
3.7%
Hangul
ValueCountFrequency (%)
30
 
8.6%
27
 
7.8%
25
 
7.2%
22
 
6.3%
22
 
6.3%
22
 
6.3%
22
 
6.3%
22
 
6.3%
22
 
6.3%
9
 
2.6%
Other values (48) 124
35.7%

데이터기준일
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size328.0 B
2024-03-04
22 
<NA>

Length

Max length10
Median length10
Mean length9.28
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-03-04
2nd row2024-03-04
3rd row2024-03-04
4th row2024-03-04
5th row2024-03-04

Common Values

ValueCountFrequency (%)
2024-03-04 22
88.0%
<NA> 3
 
12.0%

Length

2024-03-14T22:54:52.284065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T22:54:52.591321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-03-04 22
88.0%
na 3
 
12.0%

Interactions

2024-03-14T22:54:44.942497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T22:54:52.749423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등록신청사업상호소재지(도로명)
연번1.0000.0000.9810.681
등록신청사업0.0001.0000.0000.000
상호0.9810.0001.0001.000
소재지(도로명)0.6810.0001.0001.000
2024-03-14T22:54:52.906483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
데이터기준일등록신청사업시군구명
데이터기준일1.0001.0001.000
등록신청사업1.0001.0001.000
시군구명1.0001.0001.000
2024-03-14T22:54:53.155061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군구명등록신청사업데이터기준일
연번1.0001.0000.0001.000
시군구명1.0001.0001.0001.000
등록신청사업0.0001.0001.0001.000
데이터기준일1.0001.0001.0001.000

Missing values

2024-03-14T22:54:45.252442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T22:54:45.600606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T22:54:45.922583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번시군구명등록신청사업상호소재지(도로명)데이터기준일
01부산광역시 사상구대부업동반대부부산광역시 사상구 삼락동 418-26 성신태권도2024-03-04
12부산광역시 사상구대부중개업(주)굿스마일대부부산광역시 사상구 모라동 1329-72024-03-04
23부산광역시 사상구대부업(주)굿스마일대부부산광역시 사상구 모라동 1329-72024-03-04
34부산광역시 사상구대부중개업주식회사 투게더에셋대부부산광역시 사상구 괘법동 5622024-03-04
45부산광역시 사상구대부업미광전당포대부부산광역시 사상구 모라동 1057 이화하이츠2024-03-04
56부산광역시 사상구대부업동주자산관리대부부산광역시 사상구 삼락동 418-26 성신태권도2024-03-04
67부산광역시 사상구대부중개업주식회사 위해대부중개부산광역시 사상구 감전동 132-7 부산디지털밸리아파트형공장2024-03-04
78부산광역시 사상구대부업주식회사 위해대부부산광역시 사상구 감전동 132-7 부산디지털밸리아파트형공장2024-03-04
89부산광역시 사상구대부중개업주식회사 피플앤파이낸스대부중개부산광역시 사상구 감전동 152-2 부산산업용재유통상가2024-03-04
910부산광역시 사상구대부업주식회사 피플앤파이낸스대부부산광역시 사상구 감전동 152-2 부산산업용재유통상가2024-03-04
연번시군구명등록신청사업상호소재지(도로명)데이터기준일
1516부산광역시 사상구대부업부성자산관리(주)부산광역시 사상구 감전동 948-82024-03-04
1617부산광역시 사상구대부업션샤인대부부산광역시 사상구 감전동 152-2 부산산업용재유통상가2024-03-04
1718부산광역시 사상구대부중개업한성파이낸스대부중개부산광역시 사상구 주례동 91번지 21호2024-03-04
1819부산광역시 사상구대부업코리아에셋머니대부부산광역시 사상구 감전동 133번지 24호2024-03-04
1920부산광역시 사상구대부업신엄궁전당포대부부산광역시 사상구 엄궁동 465번지 19호2024-03-04
2021부산광역시 사상구대부업대륙전당포대부부산광역시 사상구 주례동 1162번지 65호2024-03-04
2122부산광역시 사상구대부업사상전당포대부부산광역시 사상구 괘법동 523번지 14호2024-03-04
22<NA><NA><NA><NA><NA><NA>
23<NA><NA><NA><NA><NA><NA>
24<NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

연번시군구명등록신청사업상호소재지(도로명)데이터기준일# duplicates
0<NA><NA><NA><NA><NA><NA>3