Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory390.6 KiB
Average record size in memory40.0 B

Variable types

Categorical1
DateTime1
Text2

Dataset

Description경기도_보도자료 현황
Author경기도뉴스포털
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=APXKY127QXG0TVVR1Y6E28683342&infSeq=1

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2024-05-17 20:58:12.469883
Analysis finished2024-05-17 20:58:14.520171
Duration2.05 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관명
Categorical

Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도
4557 
포천시
1180 
광주시
992 
오산시
730 
성남시
695 
Other values (9)
1846 

Length

Max length8
Median length3
Mean length3.0661
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row경기도
2nd row경기도
3rd row경기도
4th row경기도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 4557
45.6%
포천시 1180
 
11.8%
광주시 992
 
9.9%
오산시 730
 
7.3%
성남시 695
 
7.0%
동두천시 656
 
6.6%
군포시 328
 
3.3%
고양시 302
 
3.0%
양주시 248
 
2.5%
안양시 124
 
1.2%
Other values (4) 188
 
1.9%

Length

2024-05-18T05:58:14.730381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 4557
45.6%
포천시 1180
 
11.8%
광주시 992
 
9.9%
오산시 730
 
7.3%
성남시 695
 
7.0%
동두천시 656
 
6.6%
군포시 328
 
3.3%
고양시 302
 
3.0%
양주시 248
 
2.5%
안양시 124
 
1.2%
Other values (4) 188
 
1.9%

일자
Date

Distinct2366
Distinct (%)23.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2016-04-27 00:00:00
Maximum2024-05-17 00:00:00
2024-05-18T05:58:15.043531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T05:58:15.307466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

제목
Text

Distinct9975
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-18T05:58:15.873283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length88
Median length73
Mean length32.3185
Min length8

Characters and Unicode

Total characters323185
Distinct characters1277
Distinct categories16 ?
Distinct scripts4 ?
Distinct blocks13 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9951 ?
Unique (%)99.5%

Sample

1st row400여 경기북부 소방가족, 가을 음악으로 심신 치유했다
2nd row경기게임아카데미 1기 수료식 개최…‘달려라 할배’ 등 8개 게임 런칭
3rd row지난해 경기북부 구조 출동 전년 대비 16.6% 증가‥일평균 15.3명 구조했다
4th row경기도, 올해 예술인ㆍ장애인 기회소득 총 1만4천명 지급. 내년 지원 규모 확대
5th rowMERS대응 일일상황보고 (9.13.목 18시 기준)
ValueCountFrequency (%)
경기도 1350
 
1.9%
개최 1096
 
1.6%
932
 
1.3%
포천시 786
 
1.1%
광주시 784
 
1.1%
실시 606
 
0.9%
동두천시 462
 
0.7%
위한 453
 
0.6%
성남시 438
 
0.6%
오산시 409
 
0.6%
Other values (26317) 62913
89.6%
2024-05-18T05:58:16.956621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
61072
 
18.9%
7639
 
2.4%
, 7456
 
2.3%
6195
 
1.9%
5003
 
1.5%
3735
 
1.2%
3667
 
1.1%
2 3613
 
1.1%
3481
 
1.1%
1 3339
 
1.0%
Other values (1267) 217985
67.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 222218
68.8%
Space Separator 61072
 
18.9%
Decimal Number 15802
 
4.9%
Other Punctuation 11985
 
3.7%
Initial Punctuation 3113
 
1.0%
Final Punctuation 3096
 
1.0%
Uppercase Letter 2137
 
0.7%
Open Punctuation 1253
 
0.4%
Close Punctuation 1250
 
0.4%
Lowercase Letter 506
 
0.2%
Other values (6) 753
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7639
 
3.4%
6195
 
2.8%
5003
 
2.3%
3735
 
1.7%
3667
 
1.7%
3481
 
1.6%
2667
 
1.2%
2624
 
1.2%
2472
 
1.1%
2468
 
1.1%
Other values (1156) 182267
82.0%
Uppercase Letter
ValueCountFrequency (%)
A 278
13.0%
S 211
 
9.9%
I 163
 
7.6%
C 135
 
6.3%
F 135
 
6.3%
T 123
 
5.8%
D 106
 
5.0%
R 101
 
4.7%
G 100
 
4.7%
E 95
 
4.4%
Other values (16) 690
32.3%
Lowercase Letter
ValueCountFrequency (%)
t 58
11.5%
e 53
10.5%
a 49
 
9.7%
o 48
 
9.5%
g 40
 
7.9%
l 36
 
7.1%
r 27
 
5.3%
n 27
 
5.3%
i 23
 
4.5%
k 21
 
4.2%
Other values (13) 124
24.5%
Other Punctuation
ValueCountFrequency (%)
, 7456
62.2%
. 1346
 
11.2%
· 999
 
8.3%
595
 
5.0%
! 457
 
3.8%
275
 
2.3%
& 171
 
1.4%
% 167
 
1.4%
? 167
 
1.4%
; 142
 
1.2%
Other values (5) 210
 
1.8%
Decimal Number
ValueCountFrequency (%)
2 3613
22.9%
1 3339
21.1%
0 3024
19.1%
3 1075
 
6.8%
9 943
 
6.0%
8 851
 
5.4%
7 836
 
5.3%
5 743
 
4.7%
4 733
 
4.6%
6 645
 
4.1%
Other Symbol
ValueCountFrequency (%)
48
71.6%
6
 
9.0%
3
 
4.5%
3
 
4.5%
2
 
3.0%
1
 
1.5%
1
 
1.5%
1
 
1.5%
° 1
 
1.5%
1
 
1.5%
Math Symbol
ValueCountFrequency (%)
~ 172
77.1%
+ 15
 
6.7%
13
 
5.8%
12
 
5.4%
5
 
2.2%
3
 
1.3%
3
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 1042
83.2%
110
 
8.8%
[ 70
 
5.6%
29
 
2.3%
2
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1041
83.3%
108
 
8.6%
] 70
 
5.6%
29
 
2.3%
2
 
0.2%
Initial Punctuation
ValueCountFrequency (%)
2428
78.0%
685
 
22.0%
Final Punctuation
ValueCountFrequency (%)
2423
78.3%
673
 
21.7%
Letter Number
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
61072
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 349
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 101
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 222056
68.7%
Common 98271
30.4%
Latin 2647
 
0.8%
Han 211
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7639
 
3.4%
6195
 
2.8%
5003
 
2.3%
3735
 
1.7%
3667
 
1.7%
3481
 
1.6%
2667
 
1.2%
2624
 
1.2%
2472
 
1.1%
2468
 
1.1%
Other values (1055) 182105
82.0%
Han
ValueCountFrequency (%)
20
 
9.5%
15
 
7.1%
13
 
6.2%
7
 
3.3%
7
 
3.3%
6
 
2.8%
5
 
2.4%
4
 
1.9%
3
 
1.4%
3
 
1.4%
Other values (93) 128
60.7%
Common
ValueCountFrequency (%)
61072
62.1%
, 7456
 
7.6%
2 3613
 
3.7%
1 3339
 
3.4%
0 3024
 
3.1%
2428
 
2.5%
2423
 
2.5%
. 1346
 
1.4%
3 1075
 
1.1%
( 1042
 
1.1%
Other values (48) 11453
 
11.7%
Latin
ValueCountFrequency (%)
A 278
 
10.5%
S 211
 
8.0%
I 163
 
6.2%
C 135
 
5.1%
F 135
 
5.1%
T 123
 
4.6%
D 106
 
4.0%
R 101
 
3.8%
G 100
 
3.8%
E 95
 
3.6%
Other values (41) 1200
45.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 221995
68.7%
ASCII 92500
28.6%
Punctuation 7081
 
2.2%
None 1329
 
0.4%
CJK 211
 
0.1%
Arrows 33
 
< 0.1%
Compat Jamo 12
 
< 0.1%
CJK Compat 11
 
< 0.1%
Number Forms 4
 
< 0.1%
Math Operators 3
 
< 0.1%
Other values (3) 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
61072
66.0%
, 7456
 
8.1%
2 3613
 
3.9%
1 3339
 
3.6%
0 3024
 
3.3%
. 1346
 
1.5%
3 1075
 
1.2%
( 1042
 
1.1%
) 1041
 
1.1%
9 943
 
1.0%
Other values (70) 8549
 
9.2%
Hangul
ValueCountFrequency (%)
7639
 
3.4%
6195
 
2.8%
5003
 
2.3%
3735
 
1.7%
3667
 
1.7%
3481
 
1.6%
2667
 
1.2%
2624
 
1.2%
2472
 
1.1%
2468
 
1.1%
Other values (1050) 182044
82.0%
Punctuation
ValueCountFrequency (%)
2428
34.3%
2423
34.2%
685
 
9.7%
673
 
9.5%
595
 
8.4%
275
 
3.9%
2
 
< 0.1%
None
ValueCountFrequency (%)
· 999
75.2%
110
 
8.3%
108
 
8.1%
48
 
3.6%
29
 
2.2%
29
 
2.2%
2
 
0.2%
2
 
0.2%
° 1
 
0.1%
1
 
0.1%
CJK
ValueCountFrequency (%)
20
 
9.5%
15
 
7.1%
13
 
6.2%
7
 
3.3%
7
 
3.3%
6
 
2.8%
5
 
2.4%
4
 
1.9%
3
 
1.4%
3
 
1.4%
Other values (93) 128
60.7%
Arrows
ValueCountFrequency (%)
13
39.4%
12
36.4%
5
 
15.2%
3
 
9.1%
Compat Jamo
ValueCountFrequency (%)
9
75.0%
2
 
16.7%
1
 
8.3%
CJK Compat
ValueCountFrequency (%)
6
54.5%
3
27.3%
1
 
9.1%
1
 
9.1%
Math Operators
ValueCountFrequency (%)
3
100.0%
Number Forms
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Geometric Shapes
ValueCountFrequency (%)
3
100.0%
Misc Symbols
ValueCountFrequency (%)
2
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct9998
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-18T05:58:17.543325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length78
Median length78
Mean length77.5906
Min length77

Characters and Unicode

Total characters775906
Distinct characters41
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9996 ?
Unique (%)> 99.9%

Sample

1st rowhttps://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=38527
2nd rowhttps://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=33437
3rd rowhttps://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=52078
4th rowhttps://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=59944
5th rowhttps://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=38247
ValueCountFrequency (%)
https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?bs_code=s017&number=60271 2
 
< 0.1%
https://gnews.gg.go.kr/briefing/brief_sigun_view.do?bs_code=s003&number=107971 2
 
< 0.1%
https://gnews.gg.go.kr/briefing/brief_sigun_view.do?bs_code=s003&number=108120 1
 
< 0.1%
https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?bs_code=s017&number=49956 1
 
< 0.1%
https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?bs_code=s017&number=50106 1
 
< 0.1%
https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?bs_code=s017&number=38527 1
 
< 0.1%
https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?bs_code=s017&number=44054 1
 
< 0.1%
https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?bs_code=s017&number=51365 1
 
< 0.1%
https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?bs_code=s017&number=35693 1
 
< 0.1%
https://gnews.gg.go.kr/briefing/brief_sigun_view.do?bs_code=s003&number=105485 1
 
< 0.1%
Other values (9988) 9988
99.9%
2024-05-18T05:58:18.474560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
g 64557
 
8.3%
e 50000
 
6.4%
i 45443
 
5.9%
r 40000
 
5.2%
n 40000
 
5.2%
. 40000
 
5.2%
/ 40000
 
5.2%
b 34557
 
4.5%
_ 30000
 
3.9%
o 29114
 
3.8%
Other values (31) 362235
46.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 464557
59.9%
Other Punctuation 110000
 
14.2%
Decimal Number 81349
 
10.5%
Uppercase Letter 70000
 
9.0%
Connector Punctuation 30000
 
3.9%
Math Symbol 20000
 
2.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
g 64557
13.9%
e 50000
10.8%
i 45443
9.8%
r 40000
8.6%
n 40000
8.6%
b 34557
 
7.4%
o 29114
 
6.3%
s 25443
 
5.5%
w 20000
 
4.3%
t 20000
 
4.3%
Other values (8) 95443
20.5%
Decimal Number
ValueCountFrequency (%)
0 20675
25.4%
3 10827
13.3%
1 10022
12.3%
7 9627
11.8%
4 5575
 
6.9%
8 5518
 
6.8%
5 5456
 
6.7%
9 5253
 
6.5%
6 4333
 
5.3%
2 4063
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
S 20000
28.6%
D 10000
14.3%
E 10000
14.3%
O 10000
14.3%
C 10000
14.3%
B 10000
14.3%
Other Punctuation
ValueCountFrequency (%)
. 40000
36.4%
/ 40000
36.4%
& 10000
 
9.1%
? 10000
 
9.1%
: 10000
 
9.1%
Connector Punctuation
ValueCountFrequency (%)
_ 30000
100.0%
Math Symbol
ValueCountFrequency (%)
= 20000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 534557
68.9%
Common 241349
31.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
g 64557
 
12.1%
e 50000
 
9.4%
i 45443
 
8.5%
r 40000
 
7.5%
n 40000
 
7.5%
b 34557
 
6.5%
o 29114
 
5.4%
s 25443
 
4.8%
w 20000
 
3.7%
t 20000
 
3.7%
Other values (14) 165443
30.9%
Common
ValueCountFrequency (%)
. 40000
16.6%
/ 40000
16.6%
_ 30000
12.4%
0 20675
8.6%
= 20000
8.3%
3 10827
 
4.5%
1 10022
 
4.2%
& 10000
 
4.1%
? 10000
 
4.1%
: 10000
 
4.1%
Other values (7) 39825
16.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 775906
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
g 64557
 
8.3%
e 50000
 
6.4%
i 45443
 
5.9%
r 40000
 
5.2%
n 40000
 
5.2%
. 40000
 
5.2%
/ 40000
 
5.2%
b 34557
 
4.5%
_ 30000
 
3.9%
o 29114
 
3.8%
Other values (31) 362235
46.7%

Missing values

2024-05-18T05:58:14.235889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-18T05:58:14.419015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관명일자제목링크URL
44143경기도2018-10-24400여 경기북부 소방가족, 가을 음악으로 심신 치유했다https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=38527
59618경기도2017-03-26경기게임아카데미 1기 수료식 개최…‘달려라 할배’ 등 8개 게임 런칭https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=33437
16474경기도2022-02-21지난해 경기북부 구조 출동 전년 대비 16.6% 증가‥일평균 15.3명 구조했다https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=52078
2836경기도2023-12-26경기도, 올해 예술인ㆍ장애인 기회소득 총 1만4천명 지급. 내년 지원 규모 확대https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=59944
45155경기도2018-09-14MERS대응 일일상황보고 (9.13.목 18시 기준)https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=38247
50217오산시2018-03-08오산시,‘공무원 선거중립 실천 결의대회’열어https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=83396
50231경기도2018-03-07경기도 24개 공립박물관, 문체부 우수인증기관으로 선정https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=36544
45160경기도2018-09-13성큼성큼 발걸음 내딛으며 떠나는 산성 테마탐방 참가자 모집https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=38232
49606경기도2018-03-30구제역·고병원성AI 방역대책 추진상황(18.3.30일자)https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=36790
56540포천시2017-07-13포천시, 2017년도 맛앤멋 음식점 지정서 수여https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=79057
기관명일자제목링크URL
30000경기도2020-06-25도민과의 약속을 넘어 대중교통산업 선도…‘경기교통공사 설립 및 운영 조례’ 도의회 통과https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=44785
18687동두천시2021-11-11최용덕 동두천시장, 생활 속 에너지 절약 실천 챌린지 동참https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=100661
19884광주시2021-09-13광주시 드림스타트, 행복 가득한 우리집 만들기 가족융합 상담 지원 실시https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=100053
20533경기도2021-08-16도, 내년까지 소규모 공동주택 총 974개 단지 유지보수 지원https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=50117
3200경기도2023-12-10경기도일자리재단, 경기도기술학교 수료식 진행. 기술인재 105명 배출https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=59758
35656광주시2019-09-30으뜸철강(주), 광주시에 이웃돕기 성금기탁https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=92576
52825광주시2017-11-22광주시 경안동 주민자치위, 6년째 이웃사랑 나눔 실천https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=81606
38919고양시2019-05-10고양시, ‘다문화가정 어린이 스포츠교실’ 5월11일 스타트!https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=90780
12248경기도2022-09-23북부소방재난본부, ‘건설현장 소방안전관리자 선임 제도’ 시행 안내https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=54536
12757경기도2022-08-30한국도자재단, 호주서 ‘한국생활도자전’ 개최…우리 생활도자 우수성 알린다https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=54232

Duplicate rows

Most frequently occurring

기관명일자제목링크URL# duplicates
0경기도2024-01-29오병권 부지사, 평택 공동주택 건설현장 한파대책 및 전통시장 화재 예방 점검https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=602712