Overview

Dataset statistics

Number of variables: 3
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 2089
Duplicate rows (%): 20.9%
Total size in memory: 332.0 KiB
Average record size in memory: 34.0 B
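
The two derived figures in this table follow from the raw counts. A minimal sketch, using only the values reported above; the variable names are illustrative and not part of the report:

```python
# Values copied from the Dataset statistics table.
n_rows = 10_000
duplicate_rows = 2_089
total_size_kib = 332.0

# Share of duplicate rows, as a percentage of all observations.
duplicate_pct = 100 * duplicate_rows / n_rows

# Average record size in bytes (1 KiB = 1024 bytes).
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Both results round to the figures shown in the table (20.9% and 34.0 B).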

Variable types

DateTime: 1
Numeric: 1
Categorical: 1

Dataset

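Description: Statistics on visits to the Incheon Metropolitan City Integrated Electronic Library website (visit date, number of visits, visiting device, etc.).
Author: Incheon Metropolitan City (인천광역시)
URL: https://www.incheon.go.kr/data/DATA010201/view?docId=15049229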

Alerts

방문장치(1-PC_2-모바일) (visit device: 1 = PC, 2 = mobile) has constant value "1" (Constant)
Dataset has 2089 (20.9%) duplicate rows (Duplicates)
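
The Constant alert flags a column that holds a single distinct value. A minimal sketch of such a check, written in plain Python; the toy rows and the `constant_columns` helper are hypothetical and are not the profiler's own implementation:

```python
# Hypothetical toy rows: (visit date, visitor count, device code).
rows = [
    ("2020-04-28", 1, "1"),
    ("2020-11-25", 1, "1"),
    ("2021-08-16", 18, "1"),
]


def constant_columns(rows):
    """Return the positions of columns that contain exactly one distinct value."""
    return [i for i, column in enumerate(zip(*rows)) if len(set(column)) == 1]


constant = constant_columns(rows)
print(constant)  # positions of constant columns in the toy data
```

In the toy data only the device column (position 2) is constant, which mirrors the alert raised for 방문장치(1-PC_2-모바일).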

Reproduction

Analysis started: 2024-01-28 05:38:10.192703
Analysis finished: 2024-01-28 05:38:10.764574
Duration: 0.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

방문일자 (visit date)
DateTime

Distinct: 2033
Distinct (%): 20.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2016-08-30 00:00:00
Maximum: 2022-08-01 00:00:00
[Figure: Histogram with fixed size bins (bins=50)]
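
The minimum, maximum, and distinct count together indicate how densely the date range is covered. A rough sketch using the standard `datetime` module; it assumes the 50 histogram bins split the range from minimum to maximum into equal widths, which the report does not state explicitly:

```python
from datetime import date

first = date(2016, 8, 30)   # Minimum reported for the date variable
last = date(2022, 8, 1)     # Maximum reported for the date variable
distinct_dates = 2033       # Distinct reported for the date variable
n_bins = 50                 # bin count from the histogram caption

span_days = (last - first).days      # days between the two endpoints
calendar_days = span_days + 1        # calendar days, counting both endpoints
coverage = distinct_dates / calendar_days
bin_width_days = span_days / n_bins  # assumes equal-width bins over the span

print(f"Span: {span_days} days ({calendar_days} calendar days)")
print(f"Share of calendar days present in the data: {coverage:.1%}")
print(f"Approximate histogram bin width: {bin_width_days:.1f} days")
```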

방문자수 (number of visitors)
Real number (ℝ)

Distinct: 49
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.7266
Minimum: 1
Maximum: 111
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 2
Q3: 4
95-th percentile: 12
Maximum: 111
Range: 110
Interquartile range (IQR): 3
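
Range and IQR are simple differences of the quantiles listed above. A short sketch with the reported values; the variable names are illustrative:

```python
# Quantiles copied from the Quantile statistics table.
minimum, q1, median, q3, maximum = 1, 1, 2, 4, 111

value_range = maximum - minimum  # spread between the extremes
iqr = q3 - q1                    # spread of the middle half of the data

print(f"Range: {value_range}")
print(f"Interquartile range (IQR): {iqr}")
```

The range is far larger than the IQR, which matches the long right tail suggested by the gap between the 95-th percentile and the maximum.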

Descriptive statistics

Standard deviation: 4.8422306
Coefficient of variation (CV): 1.2993696
Kurtosis: 59.795131
Mean: 3.7266
Median Absolute Deviation (MAD): 1
Skewness: 5.3742372
Sum: 37266
Variance: 23.447197
Monotonicity: Not monotonic
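
Several of these statistics can be cross-checked against each other: the mean is the sum divided by the number of observations, the variance is the square of the standard deviation, and the CV is the standard deviation divided by the mean. A small sketch using only the reported figures:

```python
import math

# Values copied from the report.
n_rows = 10_000
total = 37_266
std = 4.8422306

mean = total / n_rows  # Sum / number of observations
variance = std ** 2    # square of the standard deviation
cv = std / mean        # coefficient of variation

print(f"Mean: {mean:.4f}")
print(f"Variance: {variance:.6f}")
print(f"CV: {cv:.7f}")

# The recomputed values agree with the table up to rounding.
assert math.isclose(mean, 3.7266)
assert math.isclose(variance, 23.447197, rel_tol=1e-6)
assert math.isclose(cv, 1.2993696, rel_tol=1e-6)
```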
[Figure: Histogram with fixed size bins (bins=49)]

Common values

Value              Count  Frequency (%)
1                  3787   37.9%
2                  1932   19.3%
3                  1108   11.1%
4                  771    7.7%
5                  551    5.5%
6                  378    3.8%
7                  292    2.9%
8                  221    2.2%
9                  172    1.7%
10                 120    1.2%
Other values (39)  668    6.7%
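
The "Other values" row can be derived from the rest of the table: its count is whatever remains after the ten listed values, and its distinct count is the variable's 49 distinct values minus those ten. A minimal sketch with the reported counts:

```python
# Counts copied from the common values table (value -> count).
top_counts = {1: 3787, 2: 1932, 3: 1108, 4: 771, 5: 551,
              6: 378, 7: 292, 8: 221, 9: 172, 10: 120}
n_rows = 10_000
distinct_values = 49

other_count = n_rows - sum(top_counts.values())
other_distinct = distinct_values - len(top_counts)

for value, count in top_counts.items():
    print(f"{value:>2}  {count:>5}  {100 * count / n_rows:.1f}%")
print(f"Other values ({other_distinct})  {other_count}  "
      f"{100 * other_count / n_rows:.1f}%")
```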

Minimum 10 values

Value  Count  Frequency (%)
1      3787   37.9%
2      1932   19.3%
3      1108   11.1%
4      771    7.7%
5      551    5.5%
6      378    3.8%
7      292    2.9%
8      221    2.2%
9      172    1.7%
10     120    1.2%

Maximum 10 values

Value  Count  Frequency (%)
111    1      < 0.1%
83     1      < 0.1%
82     1      < 0.1%
70     2      < 0.1%
63     1      < 0.1%
59     1      < 0.1%
54     1      < 0.1%
46     2      < 0.1%
45     1      < 0.1%
43     1      < 0.1%

방문장치(1-PC_2-모바일) (visit device: 1 = PC, 2 = mobile)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Value 1: 10000 rows

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value  Count  Frequency (%)
1      10000  100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Plot of common values; the single value 1 accounts for all 10000 rows (100.0%)]

Interactions

[Figure: Interactions plot]

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

First rows

(index)  방문일자    방문자수  방문장치(1-PC_2-모바일)
65227    2018-03-16  8         1
4494     2022-04-18  1         1
39032    2020-03-23  1         1
11179    2021-12-10  1         1
19056    2021-05-08  1         1
8760     2022-01-24  1         1
15337    2021-08-16  18        1
8381     2022-01-30  1         1
22415    2021-02-12  1         1
13043    2021-10-17  6         1

Last rows

(index)  방문일자    방문자수  방문장치(1-PC_2-모바일)
34535    2020-06-01  1         1
21995    2021-02-20  3         1
37644    2020-04-09  3         1
28825    2020-09-16  13        1
4026     2022-04-29  1         1
34860    2020-05-25  1         1
30021    2020-08-30  3         1
16730    2021-07-14  2         1
44238    2019-11-24  7         1
24271    2021-01-03  2         1

Duplicate rows

Most frequently occurring

(index)  방문일자    방문자수  방문장치(1-PC_2-모바일)  # duplicates
1001     2020-04-28  1         1                        9
1298     2020-11-25  1         1                        9
903      2020-03-19  1         1                        8
925      2020-03-27  1         1                        8
1412     2021-02-19  1         1                        8
1455     2021-03-24  1         1                        8
1650     2021-08-29  1         1                        8
1751     2021-12-03  1         1                        8
1932     2022-03-28  1         1                        8
620      2019-07-17  1         1                        7
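
A table like this one can be produced by counting identical rows and ranking the groups by size. The sketch below uses `collections.Counter` on hypothetical toy rows. It is not the profiler's own code, and the report does not state whether its duplicate total counts rows or groups, so both are computed:

```python
from collections import Counter

# Hypothetical toy rows: (visit date, visitor count, device code).
rows = [
    ("2020-04-28", 1, "1"),
    ("2020-04-28", 1, "1"),
    ("2020-04-28", 1, "1"),
    ("2020-11-25", 1, "1"),
    ("2020-11-25", 1, "1"),
    ("2021-08-16", 18, "1"),
]

counts = Counter(rows)

# Keep only rows that occur more than once.
duplicate_groups = {row: n for row, n in counts.items() if n > 1}
rows_in_groups = sum(duplicate_groups.values())

# Rank groups from most to least frequent, as in the table.
ranked = sorted(duplicate_groups.items(), key=lambda item: item[1], reverse=True)

print(f"Duplicate groups: {len(duplicate_groups)}")
print(f"Rows belonging to a duplicate group: {rows_in_groups}")
for row, n in ranked:
    print(row, n)
```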