‘Will I Regret for This Tweet?’—Twitter User’s Behavior Analysis System for Private Data Disclosure

Geetha, R; Karthika, S; Kumaraguru, Ponnurangam

doi:10.1093/comjnl/bxaa027

Abstract

Twitter is an extensively used micro-blogging site for publishing user’s views on recent happenings. This wide reachability of messages over large audience poses a threat, as the degree of personally identifiable information disclosed might lead to user regrets. The Tweet-Scan-Post system scans the tweets contextually for sensitive messages. The tweet repository was generated using cyber-keywords for personal, professional and health tweets. The Rules of Sensitivity and Contextuality was defined based on standards established by various national regulatory bodies. The naive sensitivity regression function uses the Bag-of-Words model built from short text messages. The imbalanced classes in dataset result in misclassification with 25% of sensitive and 75% of insensitive tweets. The system opted stacked classification to combat the problem of imbalanced classes. The system initially applied various state-of-art algorithms and predicted 26% of the tweets to be sensitive. The proposed stacked classification approach increased the overall proportion of sensitive tweets to 35%. The system contributes a vocabulary set of 201 Sensitive Privacy Keyword using the boosting approach for three tweet categories. Finally, the system formulates a sensitivity scaling called TSP’s Tweet Sensitivity Scale based on Senti-Cyber features composed of Sensitive Privacy Keywords, Cyber-keywords with Non-Sensitive Privacy Keywords and Non-Cyber-keywords to detect the degree of disclosed sensitive information.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

You do not currently have access to this article.

Download all slides

Month:	Total Views:
May 2020	5
June 2020	16
July 2020	5
August 2020	8
September 2020	2
October 2020	3
November 2020	8
December 2020	9
January 2021	6
March 2021	7
April 2021	8
May 2021	1
June 2021	2
July 2021	2
September 2021	1
October 2021	1
November 2021	6
December 2021	1
January 2022	14
February 2022	20
March 2022	6
April 2022	9
May 2022	7
June 2022	13
July 2022	17
August 2022	1
September 2022	7
October 2022	7
November 2022	2
December 2022	6
January 2023	3
February 2023	19
March 2023	12
April 2023	5
May 2023	5
June 2023	7
July 2023	4
September 2023	2
October 2023	8
November 2023	9
December 2023	3
January 2024	3
February 2024	3
March 2024	1
April 2024	1

‘Will I Regret for This Tweet?’—Twitter User’s Behavior Analysis System for Private Data Disclosure

Abstract

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

‘Will I Regret for This Tweet?’—Twitter User’s Behavior Analysis System for Private Data Disclosure

Abstract

Sign in

Personal account

Institutional access

Institutional account management

Get help with access

Institutional access

IP based access

Sign in through your institution

Sign in with a library card

Society Members

Sign in through society site

Sign in using a personal account

Personal account

Viewing your signed in accounts

Signed in but can't access content

Institutional account management

Purchase

Short-term Access

Rental

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only