Abstract

This study presents for the first time the SWB-J index, a subjective well-being indicator for Japan based on Twitter data. The index is composed by eight dimensions of subjective well-being and is estimated relying on Twitter data by using human supervised sentiment analysis. The index is then compared with the analogous SWB-I index for Italy in order to verify possible analogies and cultural differences. Further, through structural equation models, we investigate the relationship between economic and health conditions of the country and the well-being latent variable and illustrate how this latent dimension affects the SWB-J and SWB-I indicators. It turns out that, as expected, economic and health welfare is only one aspect of the multidimensional well-being that is captured by the Twitter-based indicator.

1. Introduction

The improvement of social well-being explicitly entered the policymaker agenda a few decades ago, when it became clear that objective measures of observable quantities – above all, GDP – were unsatisfactory proxies of the welfare conditions of a community (Stiglitz et al. 2009). As a consequence, instruments for measuring and monitoring social well-being began to appear in the toolbox of policymakers, progressively moving the focus from objective to subjective evaluation: multidimensional indicators encompassing both objective and subjective dimensions of well-being (Barrington-Leigh and Escande 2018; Fleurbaey 2009), face-to-face or telephone surveys investigating samples of citizens about their own perception of quality life (Kahneman et al. 2004; Schwarz and Strack 1999), and, after the development of the internet, the application of several techniques to the analysis of individual and collective mood through large-scale data provided by social networking sites (SNS), with the aim of drawing an evaluation of well-being status from conversations (Luhmann 2017; Scollon 2018) or word search (van der Wielen and Barrios 2021) on the web.

In this context, Twitter is one of the most popular SNS, with about 330 million monthly active users worldwide in 2019 (according to Statista.com, see https://www.statista.com/statistics/282087/number-of-monthly-active-twitter-users/). Due to the brevity of the messages allowed and the huge number of tweets potentially available and continually updated, the platform has been considered one of the most suitable information sources to estimate emotional well-being, i.e. the ‘mood’ or short-run component of the life quality evaluation.

Recent literature provides some examples of well-being evaluations that rely on Twitter data and, based on sentiment analysis methods, aim at monitoring the day-by-day evolution of self-declared emotional status of a community.

In particular, Dodds et al. (2011) built a happiness indicator, called the hedonometer, based on a so-called ‘closed vocabulary’ approach: they measured the frequency of use of a set of ten thousand words for which they obtained happiness evaluations on a nine-point scale, using Amazon Mechanical Turk (see online documentation at https://www.mturk.com/). Their dataset was huge, comprised of around 4.6 billion expressions posted by over 63 million Twitter users from September 2008 to September 2011. The project is still ongoing, and the hedonometer is now evaluated daily by the University of Vermont Complex Systems Center, which can thus provide a time series from 2008 (on-line plots are available at https://hedonometer.org/timeseries/en_all/).

A subjective well-being indicator – named the Gross National Happiness Index – has been proposed by Rossouw and Greyling (2020); the indicator has been evaluated since 2019 in three Commonwealth member countries: South Africa, New Zealand and Australia (an on-line dashboard is available at https://gnh.today). The aim of the project is to measure, in real time, the sentiment of countries’ citizens during different economic, social and political events. Its first application was an examination of the well-being impact of social restrictions imposed during the first wave of the Covid-19 pandemic in South Africa (Greyling et al. 2020). In order to calculate the index, sentiment analysis is applied to a live Twitter-feed, and each tweet is assigned either a positive, neutral or negative sentiment. Then, an algorithm evaluates a happiness score on a 0-to-10 scale. The Gross National Happiness Index provides a happiness score per hour for each of the three countries.

Several other studies apply sentiment analysis to the data provided by Twitter in order to monitor short-run levels of happiness (Bollen et al. 2017) but also life satisfaction, defined as a medium-long run evaluation of life quality (Schwartz et al. 2013; Yang and Srinivasan 2016; Lim et al. 2018; Durahim and Coşkun 2015; Abdullah et al. 2015; Quercia et al. 2012; Greco and Polli 2020). We acknowledge that Twitter is not a representative sample of any society, but we also follow previous studies that have successfully shown that Twitter data effectively illustrate what occurs in the real world, ranging from elections (Ceron et al. 2014), to stock markets (Bollen et al. 2011), to public health (Fahey et al. 2020).

An algorithm for sentiment analysis, named Integrated Sentiment Analysis (iSA) (Ceron et al. 2016), is used in this work to obtain a composite subjective well-being indicator for Japan named SWB-J (Subjective Well- Being Japan). The advantage of iSA, compared to the wide range of sentiment analysis algorithms and methods applied to SNS big data repositories, is that iSA is a human-supervised machine learning method, where a sample set of texts (training set) is first read and manually classified by human coders, and then the rest of the corpus (unlabelled set) is automatically classified by the algorithm. This allows for extracting qualitative information from a text without relying on predefined dictionaries or special semantic rules – on the contrary, iSA can investigate cultural, psychological, and emotional aspects of language, grasping all the nuances of informal and colloquial expressions. The feature is particularly significant because the analogue of SWB-J has been estimated for Italy (SWB-I) (Iacus et al. 2019, 2020a,b) with the same methodology: a comparison between the two indicators may be attempted, contributing to the challenging task of disentangling differences in life quality evaluations from cultural and linguistic specificities in expressing and communicating feelings and moods.

The SWB-J Project was created in collaboration between the University of Milan and Insubria in Italy and two Japanese counterparts: The University of Tokyo and Waseda University.

The paper is structured as follows: Section 2 introduces the big data approach in the study of well-being. Section 3 briefly presents some cultural aspects of how emotions are expressed in Japanese language and reviews some recent literature on computational linguistic about extracting emotions from Japanese tweets. Section 4 describes the SNS data used in this study as well as other sources of data used in the subsequent sections, while Section 5 describes the sentiment analysis methodology used to create the SWB-J indicator. Section 6 discusses the resulting new SWB-J indicator and compares its features with the Italian counterpart SWB-I. Section 7 presents a cross-country analysis aimed at explaining what can potentially impact the different patterns of SBW-J and SWB-I through an econometric analysis. Section 8 summarizes the results and limits of this approach, and the Supplementary Appendix contains additional technical material about the construction of the index.

2. Why to Estimate Subjective Well-Being via SNS

Well-being evaluation has turned from the estimation of objective quantities to the assessment of subjective mood and state of mind because observable variables – even in a multidimensional approach – have proven inadequate to accurately account for the welfare conditions of a society (Kuznets 1934; Sen 1980). Criticism of this approach has come to question its empirical relevance and opened doors to an overturning in the strategies for well-being evaluation: if both one-dimensional and multidimensional measures are unreliable due to the limits of observable variables, the only feasible option to estimate individual and collective well-being is to explicitly ask people to express an evaluation about their own condition.

To this aim, surveys and questionnaires have been increasingly and widely used to collect information about well-being levels and dynamics of individuals and communities. Different methods to conduct surveys have been developed – also conditioned by the technology applied (face-to-face interview, telephone, internet) – in order to disentangle the incidental, emotional aspect of self-reported well-being and the evaluation of life satisfaction, which requires examining current and past events from a medium or long-run perspective.

However, survey-based research projects have a significant drawback, which is the bias induced in well-being evaluation by the survey itself. This is the sort of ‘Hawthorne effect’1 that Angus Deaton (Deaton 2012; Deaton and Stone 2016) pointed out: in fact, changing the order of the questions of a survey may be sufficient to affect the evaluation the respondents give about their own mood or quality of life. More generally, when the respondents are aware of being asked for an assessment of their own life and of being observed while giving the evaluation, the answer they give may be biased by this awareness. Therefore, the dilemma the analysts face is quite clear: on one hand, they wish to ask people for a self-evaluation of their well-being, in order to overcome deficiencies due to measurements based only on observable quantities; on the other hand, they should not ask people for a self-reported evaluation, in order to avoid biases due to the awareness of respondents.

With the era of virtual communication, a new source of large-scale data is provided that seems to address the need for this kind of information: in fact, the availability of a huge and continually updated flow of conversations on SNS theoretically provides a real-time opportunity to know what people think about the quality of their own daily life – both from an emotional and an evaluative perspective – without submitting any explicit questionnaire. This is fostering a stream of studies whose aim is to extract meaningful information from the enormous amount of words or images posted on well-known platforms such as Facebook, Twitter, and Instagram (Voukelatou et al. 2021).

In order to appreciate the kinds of information that can be drawn from SNS data when used for evaluating subjective well-being, it should be clarified that the content and meaning of subjective well-being finds different definitions. The widely accepted definitions proposed by the OECD guidelines (OECD, 2013), for instance, distinguish: (a) affect, i.e. the description of a person’s feeling or emotion, typically measured with reference to a particular point in time; (b) life evaluation, which is an assessment of life ‘as a whole’ and requires a judgment by the individual, rather than a description of an emotional state. Life evaluations are not simply the average of punctual evaluations, being – on the contrary – based on how people remember their experiences, which can differ significantly from how they actually experienced things at the time; (c) eudaimonia, which focuses on functioning and the realization of personal potential, involving elements such as autonomy, competence, interest in learning, goal orientation, sense of purpose, resilience, social engagement, caring, and altruism.

Other sources use similar definitions, such as hedonic for affect or emotional well-being (Ryan and Deci 2001; Deaton and Stone 2013; Steptoe et al. 2015). Frequently – but not exhaustively – these definitions are expressed in relation to the time span used to carry out the well-being evaluation.

Twitter – like the other social networking platforms – is usually considered a good data source for the estimation of short-run emotional well-being. This is, in fact, the dimension of subjective well-being these indicators aim to capture. This notwithstanding, Twitter has also been used in some studies to evaluate components of subjective well-being conceived as more structural, commonly indicated as life evaluation or life satisfaction (see Schwartz et al. 2013; Yang and Srinivasan 2016).

One of the main advantages of large-scale datasets coming from SNS is their continuous updating. This offers the opportunity for nowcasting.2 In fact, while variables that are more traditionally assumed to be related to welfare – such as GDP or morbidity rates – are observable only with a time lag, which sometimes makes the policymaker intervention less effective, SNS data allow for real-time monitoring of public sentiment and can anticipate changes in objective variables. Moreover, when the methods for sentiment analysis are language-independent (i.e. they can be applied to texts expressed in different languages, without any particular limitation), a comparison among linguistic and socio-cultural contexts becomes possible, revealing not only differences in the use of language, but also cultural specificities – such as social conventions that impose stricter self-control in expressing emotions – as far as they are recorded in virtual conversations.

On the other hand, SNS data also have some intrinsic limitations. First of all, users of these platforms are not representative of the whole population. Therefore, any social well-being evaluation achievable from the analysis of these data cannot be immediately extended to the whole population. Procedures can be adjusted to make the results more general; but above all, and despite their limited representativeness, SNS can be considered a sort of opinion-making arena, where expressed ideas affect or anticipate collective sentiment and trends. As Salganik (2017) mentioned, there is always a risk of drifting in constructing indicators based on social media data due to the fact that, not only is the reference Twitter population not representative, but it might also change composition over time or the users may post different volumes of tweets at different times, etc. We cannot do much in computational social science at present without, e.g. crossing Twitter accounts with panel survey data, and this was not possible in this study. Nevertheless, our SWB indicators at least are not affected by changes in the volume of data since they are based on relative measures (see equation (1) below) and further based on aggregated analysis, preventing any particular account from dominating the others.

In other words, we think that the opinions of internet users are significant per se: insofar as SNS conversations are public (i.e. the accounts are freely accessible), they can be read – easily and at a low cost – by a wide set of people and exert a sort of ‘lighthouse effect’, in that they may anticipate, affect, and shape public opinion. That is why we think that the sentiment expressed via SNS is important, even beyond its statistical representativeness. This suggests that a second drawback can be imputed to evaluation of well-being via SNS data: the use of social networks itself can alter self-perceived or self-declared well-being. In fact, even if SNS users do not answer any explicit questions about their own personal status, they are aware that they are sharing their feelings with a community, and thus they may distort their well-being self-reports in order to satisfy self-representation needs. Furthermore, SNS messages and texts seem more suitable to reflect short-term mood changes than a long-term evaluation of life quality: therefore, a well-being indicator based on SNS data would be more reliable as a measure of emotional well-being than a source of life evaluation. However, despite the validity of this remark, adequate statistical analysis can help in separating the volatile and structural components of well-being described by virtual conversations on the net.

An important issue raised by the availability of this new data source is a technological one: the increase in computational power of technological devices does not guarantee, per se, the ability to separate helpful information from background noise in virtual conversations. Following Gary King, we can say that ‘Big data is not about the data’ (King, 2016): it is rather about the opportunity to extract knowledge from data, and this requires adequate methodologies and tools. Fortunately, recent advancements in statistical theory and its applications are improving the capacity of social scientists to analyze the content of these large-scale datasets and promoting the dissemination of different methods of sentiment analysis. The subjective well-being indicator we propose in this work is based on the iSA algorithm, which is one of these new methods.

3. Expressing Emotions in Japanese Culture and SNS

As mentioned by Miyake (2007), in traditional Japanese communication, people tend to maintain distance and make sure that neither party loses face (see also Matsumoto 1999). In textual analysis, most studies focus on the identification of big corpora of web blogs or SNS posts (Ptaszynski et al. 2014) in the context of sentiment and affect analysis.

It should be noted that emoticons, or emoji, are peculiar in Japanese written digital communication. For example, before graphical emoticons appeared, in the western cultures horizontal emotions like ‘:)’ were used, while in Japan (and other Asian countries) emoticons were and are traditionally vertical, for instance ‘(`o´)’. Emojis were already installed as a standard package in the messaging platforms of mobile devices in Japan in the late 1990s (e.g. Jphone and iMode). The development of emoji is distinctive in Japan and arguably originates from ‘kanji’ culture, in which characters represent an idea or concept as a graphic symbol (which also applies to other Asian countries that employ Chinese characters). Despite the abundance of emoticons, a large cross-country study seems to prove that regardless of the culture, vertical and horizontal emoticons convey similar concepts (Park et al. 2014). Still, emoticons coupled with adverbs seem to be able to predict better than simple emoticons the affective perception of a text message (Rzepka et al. 2016).

Many other studies related to the association between emoticons and emotions can be found in the literature (e.g. Shoeb and de Melo 2020; Novak et al. 2015), but they are not specific to the Japanese language. In relation to the Japanese cultural practice of expressing emotions by non-verbal means, a graphic design known as ASCII art has been quite extensively used in bulletin boards such as 2channel (one of the most popular online bulletin boards). Personality trait estimation of Japanese Twitter accounts has been studied in Kamijo et al. (2016). Large scale studies on automatic sentiment tagging in Japanese during crisis periods can be found in Vo and Collier (2013). More linguistic analyses on specific Japanese terms related to likeness and happiness have recently appeared (for the word ‘kawaii’ see Iio 2020) as well as gender-specific language studies (Carpi and Iacus 2020).

In summary, all the current studies are either dictionary or semantic rule-based or apply some version of the classical Word2Vec approach (Mikolov et al. 2013) or standard NLP techniques (e.g. Bengio et al. 2003). In this study, as will be described in Section 5, we use a mixed qualitative and quantitative approach which tries to take into account the complexity of the well-being dimensions and the language used to express them. Indeed, the analysis does not focus on special features of a message but on the whole set of the words in a tweet after accurate training by humans who are Japanese native speakers and fully understand all the shadings of the natural language used to express emotions.

4. Data Collection

The data used in this work come from two different repositories that were collected under two different projects, in both cases using Twitter search API. The Japanese tweets were collected using only the filters on language = Japanese and country = Japan and similarly for Italy (Italian and Italy). It is worth mentioning that Twitter posts do not belong to individuals randomly chosen from a physical population (Baker et al. 2013; Murphy et al. 2014). The reference population is the population of posts of all Twitter accounts selected in the analysis. Moreover, Twitter accounts cannot be uniquely associated to individuals, and some accounts are more active than others. For these reasons, the focus of our analysis is on the total volume of the posts collected (in Japan, written in Japanese language, during the reference period) through the public Twitter ‘search’ and ‘streaming’ API (Application Programming Interface). As per the official documentation, Twitter search API only provides a 10% sample of all tweets, though the company does not disclose any information about the representativeness of the sample with respect to the whole universe of tweets posted on the social network. Nevertheless, according to our personal experience, also confirmed in large scale experiments by Hino and Fahey (2019), the coverage of topics and keywords is quite accurate and appears to be randomly selected: therefore we consider the Twitter data used in this study to be a representative sample of what is discussed on Twitter. According to Statista, there are about 8 million accounts active daily in Italy whilst about 52 million are in Japan, therefore the number of tweets posted is not comparable. Despite these limitations, the advantage of using Twitter data is that the collection of data can be done in (almost) continuous time and, moreover, instead of asking something through a web-form (thanks to the human supervised qualitative analysis explained in Section 5) it is possible to capture expressions of well-being from the texts directly.

For Italy, the original project collected about 250.4 million tweets in the period 1 Feb 2012 to 21 June 2018, with a median of 50,000 tweets per day. For Japan, for several technical reasons, we were able to collect at most 50,000 tweets a day, amounting to about 60.8 million tweets in the period 24-08-2015/31-12-2018. Table 3 reports summary statistics. In the same table, we report also the values of the Happy Planet Index (HPI) by New Economics Foundation (NEF 2016) and the Human Development Index (HDI) by the United Nations Development Programme (2019), for the same years available for the SWB-I an SWB-J indexes. Data are taken from the data provider TheGlobalEconomy (web portal for data access at https://TheGlobalEconomy.com) for the other economic variables listed in Table 1, mostly coming from The World Bank, the International Monetary Fund, the United Nations, and the World Economic Forum. We also collected from OECD the variable Life Expectancy of Males at 40, to capture the perception and quality of aging (data available at web portal: https://data.oecd.org/healthstat/). This is the only yearly variable we consider; it varies non-monotonically in time and, moreover, is also correlated positively with economy growth in Italy (ρ = 0.91) but negatively in Japan (ρ = 0.27).

Table 1.

Economic and Environmental Variables Used in the Econometric Analysis of Section 7 Plus Two Additional Well-being Indexes.

Variable Frequency Description
GDP growthQuarterlyPercent change in quarterly real GDP year on year
Consumption growthQuarterlyPercent change year on year
Investment growth GDPQuarterlyPercent change year on year
Unemployment rateMonthlyPercentage of work force
Life Expectancy at 40YearlyMale only
Happy Planet IndexYearly
Human Development IndexYearly
Variable Frequency Description
GDP growthQuarterlyPercent change in quarterly real GDP year on year
Consumption growthQuarterlyPercent change year on year
Investment growth GDPQuarterlyPercent change year on year
Unemployment rateMonthlyPercentage of work force
Life Expectancy at 40YearlyMale only
Happy Planet IndexYearly
Human Development IndexYearly
Table 1.

Economic and Environmental Variables Used in the Econometric Analysis of Section 7 Plus Two Additional Well-being Indexes.

Variable Frequency Description
GDP growthQuarterlyPercent change in quarterly real GDP year on year
Consumption growthQuarterlyPercent change year on year
Investment growth GDPQuarterlyPercent change year on year
Unemployment rateMonthlyPercentage of work force
Life Expectancy at 40YearlyMale only
Happy Planet IndexYearly
Human Development IndexYearly
Variable Frequency Description
GDP growthQuarterlyPercent change in quarterly real GDP year on year
Consumption growthQuarterlyPercent change year on year
Investment growth GDPQuarterlyPercent change year on year
Unemployment rateMonthlyPercentage of work force
Life Expectancy at 40YearlyMale only
Happy Planet IndexYearly
Human Development IndexYearly
Table 3.

Yearly Average Values of SWB-I and SWB-J, Their Standard Deviation in Parentheses, and Number of Tweets in Millions. Data are in the Period 1 February 2012 to 21 June 2018 for Italy and 24 August 2015 to 31 December 2018 for Japan. For the Happy Planet Index (HPI) and Human Development Index (HDI), Data Was Sourced From the World Bank.

Year 2012 2013 2014 2015 2016 2017 2018
SWB-I48.952.249.748.750.557.755.7
(4.2)(3.8)(4.9)(9.8)(7.5)(4.5)(7.1)
tweets44.2M40.8M34.4M38.3M55.2M32.6M14.9M
HPI6.025.955.985.966.00
HDI0.8740.8730.8740.8750.8780.8810.883
SWB-J54.453.653.252.5
(13.4)(11.1)(13.1)(12.7)
tweets6.5M18.2M18.2M17.8M
HPI5.995.925.925.92
HDI0.9060.9100.9130.915
Year 2012 2013 2014 2015 2016 2017 2018
SWB-I48.952.249.748.750.557.755.7
(4.2)(3.8)(4.9)(9.8)(7.5)(4.5)(7.1)
tweets44.2M40.8M34.4M38.3M55.2M32.6M14.9M
HPI6.025.955.985.966.00
HDI0.8740.8730.8740.8750.8780.8810.883
SWB-J54.453.653.252.5
(13.4)(11.1)(13.1)(12.7)
tweets6.5M18.2M18.2M17.8M
HPI5.995.925.925.92
HDI0.9060.9100.9130.915
Table 3.

Yearly Average Values of SWB-I and SWB-J, Their Standard Deviation in Parentheses, and Number of Tweets in Millions. Data are in the Period 1 February 2012 to 21 June 2018 for Italy and 24 August 2015 to 31 December 2018 for Japan. For the Happy Planet Index (HPI) and Human Development Index (HDI), Data Was Sourced From the World Bank.

Year 2012 2013 2014 2015 2016 2017 2018
SWB-I48.952.249.748.750.557.755.7
(4.2)(3.8)(4.9)(9.8)(7.5)(4.5)(7.1)
tweets44.2M40.8M34.4M38.3M55.2M32.6M14.9M
HPI6.025.955.985.966.00
HDI0.8740.8730.8740.8750.8780.8810.883
SWB-J54.453.653.252.5
(13.4)(11.1)(13.1)(12.7)
tweets6.5M18.2M18.2M17.8M
HPI5.995.925.925.92
HDI0.9060.9100.9130.915
Year 2012 2013 2014 2015 2016 2017 2018
SWB-I48.952.249.748.750.557.755.7
(4.2)(3.8)(4.9)(9.8)(7.5)(4.5)(7.1)
tweets44.2M40.8M34.4M38.3M55.2M32.6M14.9M
HPI6.025.955.985.966.00
HDI0.8740.8730.8740.8750.8780.8810.883
SWB-J54.453.653.252.5
(13.4)(11.1)(13.1)(12.7)
tweets6.5M18.2M18.2M17.8M
HPI5.995.925.925.92
HDI0.9060.9100.9130.915

5. How to Extract Subjective Well-Being from Tweets

The SWB-J index is a multidimensional well-being indicator whose components were inspired by the dimensions suggested by the New Economic Foundation think-tank in its reports on national accounts of well-being (NEF 2009, 2012). In summary, the SWB-J mimics the same indicator SWB-I previously built for Italy (Iacus et al. 2019, 2020a, b) and consists of eight dimensions that concern three different well-being areas: personal well-being, social well-being, and well-being at work. In greater detail:

  1. Personal well-being:

  • emotional well-being: the overall balance between the frequency of experiencing positive and negative emotions, with higher scores showing that positive feelings are felt more often than negative ones (emo);

  • satisfying life: having a positive assessment of one’s life overall (sat);

  • vitality: having energy, feeling well-rested and healthy while also being physically active (vit);

  • resilience and self-esteem: a measure of individual psychological resources, of optimism, and of the ability to deal with life stress (res);

  • positive functioning: feeling free to choose and having the opportunity to do it; being able to make use of personal skills while feeling absorbed and gratified in daily activities (fun);

  1. Social well-being:

  • trust and belonging: trusting other people, feeling treated fairly and respectfully while experiencing sentiments of belonging (tru);

  • relationships: the degree and quality of interactions in close relationships with family, friends, and others who provide support (rel);

  1. Well-being at work:

  • quality of job: feeling satisfied with a job, experiencing satisfaction with work-life balance, evaluating the emotional experiences of work and work conditions (wor).

Given the collectivist nature of the Japanese society, it makes sense to measure subjective well-being based on multiple dimensions encompassing social well-being and well-being at work. It is known, in fact, that personal well-being is more closely associated with social well-being and well-being at work in Japan compared to the United States (Kitayama et al. 2000; Ford et al. 2015) and – more recently – China (Wong et al. 2020). It is also known that Asian countries (in particular Pacific Rim countries (Diener et al. 1995)) tend to mark lower scores in reported subjective well-being compared to the US, but few works have attempted to measure the level of subjective well-being with social media data. Thanks to the multi-dimensional nature of the index, one can better understand and investigate the internal dynamics of subjective well-being. It should be noted that, in discussing the concept of well-being in the Japanese society, Kumano (2018) distinguishes two types of well-being: shiawase or hedonic, emotional well-being, and ikigai or eudaimonic well-being. The distinction is quite familiar also to cultures of Western countries and, as far as the different concepts of well-being impact on the daily expression of well-being, they are both captured by our indicators. It must be emphasized, however, that disentangling the hedonic or emotional component from the eudaimonic one is far beyond the scope of this work and, perhaps, beyond the potential of our indicator, which is conceived as a proper measure of hedonic well-being. Examining the relationship between emotional (as expressed on SNS) and eudaimonic well-being, if any, will be a challenge for future research.

To extract semantic meaning from tweets, in this study we use a supervised sentiment analysis method and, in particular, the iSA (Integrated Sentiment Analysis) algorithm (Ceron et al. 2016) which has been also used to capture various aspects of happiness and well-being from Twitter data (Curini et al. 2015). iSA is a human supervised machine learning method wherein a sample of texts (the training set or labelled set) is first read and manually classified by human coders, and then the rest of the corpus (the test set or unlabelled set) is automatically classified by the algorithm. The supervised part is essential in that this is the step where qualitative information can be extracted from a text without relying on dictionaries or special semantic rules but rather on cultural, psychological and emotional interpretation. To this aim, it is very important that the labelling of the texts in the training set is performed by native language speakers with a minimum of field knowledge. We will discuss in full details the coding strategy in Section 5.2. Other approaches based on user-defined dictionaries exist, but mainly focus on the concept of happiness (Bollen et al. 2011; Zhao et al. 2019). The advantage of iSA over other machine learning techniques is that it is designed to estimate directly the aggregated distribution of the opinions (e.g. positive, negative, neutral) without passing through the individual classification of posts in the unlabelled set. This approach vastly reduces the estimation error. Moreover, as iSA is a sequential method, in this context of highly noised data, the size of the training set needed to reach the same accuracy of other methods is usually smaller by a factor of 10 or 20 times. We will briefly present the main technical points of the iSA algorithm in Section 5.1, but the reader can also refer to Ceron et al. (2016) for an in-depth technical explanation of the method.

Once the training set has been completely hand-coded, the iSA algorithm is applied to daily unlabelled sets of data. Each estimated distribution will contain the entries positive, neutral, negative and Off-Topic. The Off-Topic category represents the daily noise in the data, the rest represents the signal. For each component of the index, for example emo, the index is calculated, for day d, as follows:

emod=% positive% positive+% negative[0,1]
(1)

The rationale behind formula (1) is that the intent is to capture expressions or judgments, and this is why comments not expressing any issue, judgments, etc. – i.e. those classified as neutral – are removed from the calculation. The day d is easily extracted from the timestamp of Twitter data which is always present in the metadata of a tweet.

Finally, the SWB-J index is the simple average of the eight components emo, sat, vit, res, fun, tru, rel, and wor. Although it falls outside of the scope of this study, specific weighted averages of the components might be considered in order to obtain a well-being indicator oriented to specific life dimensions.

5.1. The iSA Classification Algorithm

Let us denote by the set D={D0, D1,, DM} possible categories (i.e. sentiments or opinions) in which we want to classify texts. Here D0 is the Off-Topic category, which is usually the most frequently observed category in unstructured data.3 We want to estimate {P(D), D in D}, i.e. the distribution of opinions in the corpus of N texts. Let Si, i = 1, …, K, be a unique vector of L possible stems (i.e. single words, unigrams, bigrams, etc.) which remain in the document-term matrix after the pre-processing4 phase which identifies one of the texts in a corpus. These vectors are sequences of zeros and ones, e.g. s1 = (1,1,1,0,0,0,0,0,0) is associated to document 1, s2 = (0,0,0,1,1,1,0,0,0) for document 2, etc. Of course, more than one document may have the same feature vector, so that, for example, S1 and S20 may both be equal to, e.g. sequence s3, etc. Each vector Si belongs to the space of 0/1 vectors of length L (the number of stems), where each element of Si is either 1 if that stem is contained in a text, or 0 in case of absence. The number of possible different rows in the document-term matrix is then K = 2^L. Most supervised machine learning methods (e.g. multinomial regression, Random Forests (RF), Support Vector Machines (SVM), and Artificial Neural Networks (ANN), just to mention a few) use the individual hand coding from the training set to construct a model P(D|S) for P(D) as a function of S to train a model that predicts the outcome of dj^=D for the texts with S = sj belonging to the test set. Then, when all data have been imputed in this way, these estimated categories dj^ are aggregated to obtain a final estimate of P^(D). In matrix form:

P(D)=P(D | S) P(S)                             (2)M×1  M×K  K×1

where P(D) is a M × 1 vector, P (D| S) is a M × K matrix of conditional probabilities and P(S) is a K × 1 vector which represents the distribution of Si over the corpus of texts. Once P(D|S) is estimated from the training set with, say, P^(D|S) is estimated with the naïve Bayes estimator as the maximizer of the conditional probability, i.e. dj=argmaxDinDP(D|S=sj). Following Hopkins and King (2010), the iSA idea is changing one’s point of view and focusing on what can be really and accurately estimated. Instead of equation (2), one can proceed considering this new equation:

P(S)=P(S | D)   P(D)
K×1       K×M      M×1

where now P(S |D) is a K × M matrix of conditional probabilities whose elements P(S = Sk |D = Di) represent the frequency of a particular stem Sk given the set of texts which actually express the opinion D = Di. In this case, all these probabilities can be considerably well estimated if there is a sufficient number of texts in the training set which are hand coded as D = Di (Hopkins and King 2010; Ceron et al. 2016). While the original method in Hopkins and King (2010) uses a combined approach of bagging on the number of stems used in each step and a bootstrap to obtain standard errors, iSA uses all stems at the same time, reducing the computational burden as well as making possible an accurate estimate of the standard error of estimates. Compared to traditional machine learning methods using equation (2), both the ReadMe method in Hopkins and King (2010) and iSA are at least 20 times more efficient in terms of size of the training set needed to reach the same level of bias (see, e.g. Fig. 3 in Ceron et al. 2016) in the estimation of P(D).

For what follows, it is important to remark that iSA reaches this high accuracy by losing the opportunity of doing individual classification. While this is a limitation in applications like Artificial Intelligence, in social sciences this is a common feature, as the usual target of social studies is the distribution of collective opinion: e.g. we want to know who won the elections more than who voted for whom at an individual level.

5.2. Coding Strategy

The data for the preparation of the training set are extracted according to the filtering keywords listed in Supplementary Appendix Tables 12, 13, 14, 15, 16, 17 and 18 . The choice of these keywords is based on an explicit decision made by the team. Some of these keywords have been extracted by looking at the list of keywords generated in other sentiment analysis studies on the topic, while some of them have been included to capture specific components of our composite indicator (e.g. job, work). Other keywords have been identified through a preliminary random prescreening of the tweets. We understand that this list is not exhaustive and is potentially biased towards our previous knowledge on the topic. Nevertheless, it is notable that, even though the training set was built by filtering the data, the whole statistical analysis was done on the complete repository of tweets collected filtered only by language and country.

Once the training set data have been collected, the qualitative step of the analysis can start. During this qualitative step, coding rules were distributed to human coders:

  • The first general rule is mark/tag/code Off-Topic posts appropriately. At this stage the machine learning algorithm will understand noise.

  • The second rule is: if you are not fully convinced about the semantic context of a post, do not classify it, just skip it go to the next one. These are not Off-Topic, let the algorithm try to classify it for you.

  • It is admissible to classify RT (re-tweets) as original tweets from the account. This is an assumption of transfer of emotion/opinion which we assume to be the same as if they were expressed by the user account directly. Other researchers may disagree with this assumption, of course.

  • As each tweet can be classified along one or more dimensions, always try to consider parallel coding for all the categories and leave unanswered/untagged those which do not apply to a given category;

  • If the text contains an explicit positive/negative judgement about one’s condition with respect to one of the dimensions (e.g. work, resilience), the tweet must be labelled as positive/negative along that dimension. In case of no judgement, the neutral label should be attributed to that text.

Examples of coding rules and real tweet classifications are given in the Supplementary Appendix. The same rules and code book were given to both the Italian and Japanese teams. Both teams were coordinated by the same PIs. We applied the Delphi method to conduct this step: a first set of 150 tweets were labelled by the full team discussing the precise content in order to reach unanimous consensus on the labelling. Then, the rest of the training set was randomly distributed to the coders. Finally, the whole team revised the labelling and solved any conflicts, again reaching unanimous consensus on the labeling. We ended up with 3,069 fully labelled tweets in the Japanese set and 2,952 in the Italian set.

It is very important to remark that the labelling of the texts in the training set had to be performed by native language speakers only and with a minimum of field knowledge. To this aim, we recruited seven coders from among PhD and final-year MA students at Waseda University in Japan, and eight from among PhD and final-year MA students at the University of Milan. All of them were trained for this project, and all of them were social science students. Some of the above students were not paid, as this was part of their training in computational social science.

5.3. Validation of the Coding

After the coding was completed, we performed a cross-validation study on the training set of 3,069 labelled tweets to verify the accuracy of iSA on this set of data. We randomly split, 1,000 times, the training set into two parts x% and (100 x%), where x = 30%, 50% and 80%. We trained iSA using x% of the data to predict the final distribution P(D) of each of the eight components of the SWB-J index: fun, rel, res, sat, tru, vit and wor. Recall that for each component we have a distribution of k = 4 categories: D = {positive, neutral, negative and Off-Topic}. The performance of the iSA classification method is measured through the MAE (Mean Absolute Error) statistic:

MAE=1Ki=0K|P(Di)P(Di)|

where Pˆ() is the final estimated distribution. As can be seen from Table 2, the mean MAE is less than or around 2% when only 30% of the data is used to train iSA, and less than 1% when the training set is 80%. This is more than satisfactory to proceed with the analysis.

Table 2.

Mean Absolute Error (MAE) Summary Statistics for Algorithm iSA in Cross Validation. Japanese Data. MAE in % Points. Number of Simulations = 1,000. Total Number of Labelled Cases = 3,069.

Dimension Min Q1 Median Mean Q3 Max Sub-training Set size
 emo0.141.502.142.232.886.6330%
0.941.371.501.925.17
 fun0.10
0.811.171.321.733.99
 rel0.12
0.761.111.261.606.70
 res0.09
 sat0.130.871.321.421.775.34
0.911.391.572.075.58
 tru0.01
0.961.451.582.065.33
 vit0.01
0.891.251.351.704.05
 wor0.12
 emo0.221.101.531.602.014.4550%
0.721.061.121.443.71
 fun0.08
0.660.930.991.282.95
 rel0.03
0.620.911.021.314.90
 res0.08
 sat0.110.701.071.121.474.07
0.721.091.181.543.85
 tru0.11
0.701.061.151.544.26
 vit0.09
0.731.021.091.383.14
 wor0.14
 emo0.110.600.850.881.122.2680%
0.420.610.650.862.00
 fun0.04
0.400.570.610.791.86
 rel0.04
0.370.560.610.792.09
 res0.05
 sat0.040.410.610.660.872.06
0.410.620.660.851.93
 tru0.05
0.400.600.630.811.83
 vit0.03
0.390.580.610.781.74
 wor0.04
Dimension Min Q1 Median Mean Q3 Max Sub-training Set size
 emo0.141.502.142.232.886.6330%
0.941.371.501.925.17
 fun0.10
0.811.171.321.733.99
 rel0.12
0.761.111.261.606.70
 res0.09
 sat0.130.871.321.421.775.34
0.911.391.572.075.58
 tru0.01
0.961.451.582.065.33
 vit0.01
0.891.251.351.704.05
 wor0.12
 emo0.221.101.531.602.014.4550%
0.721.061.121.443.71
 fun0.08
0.660.930.991.282.95
 rel0.03
0.620.911.021.314.90
 res0.08
 sat0.110.701.071.121.474.07
0.721.091.181.543.85
 tru0.11
0.701.061.151.544.26
 vit0.09
0.731.021.091.383.14
 wor0.14
 emo0.110.600.850.881.122.2680%
0.420.610.650.862.00
 fun0.04
0.400.570.610.791.86
 rel0.04
0.370.560.610.792.09
 res0.05
 sat0.040.410.610.660.872.06
0.410.620.660.851.93
 tru0.05
0.400.600.630.811.83
 vit0.03
0.390.580.610.781.74
 wor0.04
Table 2.

Mean Absolute Error (MAE) Summary Statistics for Algorithm iSA in Cross Validation. Japanese Data. MAE in % Points. Number of Simulations = 1,000. Total Number of Labelled Cases = 3,069.

Dimension Min Q1 Median Mean Q3 Max Sub-training Set size
 emo0.141.502.142.232.886.6330%
0.941.371.501.925.17
 fun0.10
0.811.171.321.733.99
 rel0.12
0.761.111.261.606.70
 res0.09
 sat0.130.871.321.421.775.34
0.911.391.572.075.58
 tru0.01
0.961.451.582.065.33
 vit0.01
0.891.251.351.704.05
 wor0.12
 emo0.221.101.531.602.014.4550%
0.721.061.121.443.71
 fun0.08
0.660.930.991.282.95
 rel0.03
0.620.911.021.314.90
 res0.08
 sat0.110.701.071.121.474.07
0.721.091.181.543.85
 tru0.11
0.701.061.151.544.26
 vit0.09
0.731.021.091.383.14
 wor0.14
 emo0.110.600.850.881.122.2680%
0.420.610.650.862.00
 fun0.04
0.400.570.610.791.86
 rel0.04
0.370.560.610.792.09
 res0.05
 sat0.040.410.610.660.872.06
0.410.620.660.851.93
 tru0.05
0.400.600.630.811.83
 vit0.03
0.390.580.610.781.74
 wor0.04
Dimension Min Q1 Median Mean Q3 Max Sub-training Set size
 emo0.141.502.142.232.886.6330%
0.941.371.501.925.17
 fun0.10
0.811.171.321.733.99
 rel0.12
0.761.111.261.606.70
 res0.09
 sat0.130.871.321.421.775.34
0.911.391.572.075.58
 tru0.01
0.961.451.582.065.33
 vit0.01
0.891.251.351.704.05
 wor0.12
 emo0.221.101.531.602.014.4550%
0.721.061.121.443.71
 fun0.08
0.660.930.991.282.95
 rel0.03
0.620.911.021.314.90
 res0.08
 sat0.110.701.071.121.474.07
0.721.091.181.543.85
 tru0.11
0.701.061.151.544.26
 vit0.09
0.731.021.091.383.14
 wor0.14
 emo0.110.600.850.881.122.2680%
0.420.610.650.862.00
 fun0.04
0.400.570.610.791.86
 rel0.04
0.370.560.610.792.09
 res0.05
 sat0.040.410.610.660.872.06
0.410.620.660.851.93
 tru0.05
0.400.600.630.811.83
 vit0.03
0.390.580.610.781.74
 wor0.04

6. Preliminary Analysis of the SWB-I & SWB-J Indexes

Table 3 shows the yearly average values of SWB-I and SWB-J since 2012 for Italy, and since 2015 for Japan. What we notice is that the Japanese indicator shows a high medium-run stability in the range [52.5, 54.5]. In the Italian case, on the contrary, the medium-run variability is higher – the range of yearly values is [48.7, 57.7] – and the value of SWB-I is, in some years, significantly lower or significantly higher than SWB-J. On the other hand, the standard deviations of the two indicators suggest (and the inspection of Figure 1 proves) that the short-run volatility of SWB-J is definitely higher than SWB-I. These results depict Japanese Twitter users as more reactive to day-by-day events and emotions, while their evaluation of quality of life is, on average, stable around quite satisfactory values. To be more exact, the Italian indicator shows a high variability period, which basically coincides with the year when Italy organized and hosted the Expo 2015 event; that was an abnormal time frame, when heavily negative (due to controversies over delays in the preparation of the event and to allegations of bribes given to some of the organizers) and strongly positive feelings (due to the appreciation and success of the event) rapidly emerged and changed in the Italian public opinion.5

Figure 1.

SWB-I and SWB-J Weekly Average Series With Estimated Local-linear Regression Trends and Standard Errors Bands. The Peak in June–September 2015 for Italy corresponds to the Expo 2015 Event.

An examination of the single components of SWB-J in Table 5 confirms the stability – along with a slight decline – of Japanese subjective well-being. A first paradox emerges when looking at the well-being perception about social relationships: the outstanding value of the sub-component evaluating the quality of family relationships and friendships (rel) is, to some extent, at odds with the perceived well-being in terms of trust and sentiment of belonging (tru). This may indicate that the positive feelings nourished towards family and friends are not generalized to the rest of society. The emotional sub-component of SWB-J is consistent with the global indicator, as are the other dimensions related to personal well-being: slightly above the average we find the fun component (relating to the opportunity to do and to choose, and the involvement and satisfaction in daily activities); slightly below is the self-perception of health and physical vitality (vit). Also, subjective well-being at work (wor) is strictly in line with the average SWB-J; this contrasts with the Italian case (see Table 4), where the wor component is the most volatile and does not show strong correlation with the overall value of SWB-I. This likely documents the strong identification Japanese people feel between their satisfaction as workers and their global well-being, and also that concerns at work are not expressed for cultural reasons, which is not unexpected.

Table 4.

The Values of Each Component of the SWB-I Index.

Year SWB-I emo fun rel res sat tru vit wor
201248.960.567.834.155.143.959.253.916.4
201352.257.373.337.457.255.064.058.015.5
201449.748.268.339.756.152.462.655.215.1
201548.753.152.757.755.433.237.757.042.8
201650.562.240.565.959.730.228.958.458.0
201757.723.559.164.445.879.020.280.688.9
201855.740.457.859.146.464.926.674.576.2
Year SWB-I emo fun rel res sat tru vit wor
201248.960.567.834.155.143.959.253.916.4
201352.257.373.337.457.255.064.058.015.5
201449.748.268.339.756.152.462.655.215.1
201548.753.152.757.755.433.237.757.042.8
201650.562.240.565.959.730.228.958.458.0
201757.723.559.164.445.879.020.280.688.9
201855.740.457.859.146.464.926.674.576.2
Table 4.

The Values of Each Component of the SWB-I Index.

Year SWB-I emo fun rel res sat tru vit wor
201248.960.567.834.155.143.959.253.916.4
201352.257.373.337.457.255.064.058.015.5
201449.748.268.339.756.152.462.655.215.1
201548.753.152.757.755.433.237.757.042.8
201650.562.240.565.959.730.228.958.458.0
201757.723.559.164.445.879.020.280.688.9
201855.740.457.859.146.464.926.674.576.2
Year SWB-I emo fun rel res sat tru vit wor
201248.960.567.834.155.143.959.253.916.4
201352.257.373.337.457.255.064.058.015.5
201449.748.268.339.756.152.462.655.215.1
201548.753.152.757.755.433.237.757.042.8
201650.562.240.565.959.730.228.958.458.0
201757.723.559.164.445.879.020.280.688.9
201855.740.457.859.146.464.926.674.576.2
Table 5.

The Values of Each Component of the SWB-J Index.

Year SWB-J emo fun rel res sat tru vit wor
201554.454.859.375.254.456.935.443.255.9
201653.653.559.473.958.953.035.642.652.2
201753.251.057.975.755.951.536.142.755.0
201852.551.557.072.354.953.435.643.352.2
Year SWB-J emo fun rel res sat tru vit wor
201554.454.859.375.254.456.935.443.255.9
201653.653.559.473.958.953.035.642.652.2
201753.251.057.975.755.951.536.142.755.0
201852.551.557.072.354.953.435.643.352.2
Table 5.

The Values of Each Component of the SWB-J Index.

Year SWB-J emo fun rel res sat tru vit wor
201554.454.859.375.254.456.935.443.255.9
201653.653.559.473.958.953.035.642.652.2
201753.251.057.975.755.951.536.142.755.0
201852.551.557.072.354.953.435.643.352.2
Year SWB-J emo fun rel res sat tru vit wor
201554.454.859.375.254.456.935.443.255.9
201653.653.559.473.958.953.035.642.652.2
201753.251.057.975.755.951.536.142.755.0
201852.551.557.072.354.953.435.643.352.2

A look at the correlation between SWB-J (and SWB-I) and two well-known well-being indicators may raise some concerns. Table 6 shows a high correlation of SWB-J with the Happy Planet Index (HPI), developed for the first time in 2006 by the New Economics Foundation (NEF 2016). The HPI aims to measure sustainable well-being; it compares how efficiently residents of different countries use natural resources to achieve long lives with high well-being. On the other hand, SWB-J is negatively related (with a correlation index equal to 0.99) to the Human Development Index, elaborated since 1990 by the United Nations Development Programme (UNDP), according to Amartya Sen’s capability approach to well-being definition and evaluation (Robeyns 2006). In measuring well-being, HDI takes into account three dimensions: health, education, and material standards of living. Notably, the Italian SWB-I is positively related to both the indicators: a weak relation is shown with HPI and a strong one with HDI. All this should remind us that the plethora of well-being indices currently available seldom gives a measure of the same variable; each indicator addresses a specific definition of well-being, and the relationships among all these definitions are sometimes ambiguous. This does not imply that the measures provided are wrong or unreliable: it only requires extreme clarity in explaining the methodology followed to construct the indicator, the data source, and the definition of well-being that the indicator aspires to account for.

Table 6.

Correlation Between the Yearly Average SWB-I and SWB-J and the Two Indexes Happy Planet Index and Human Development Index.

Happy Planet Index Human Development Index
SWB-I 0.14 0.80
SWB-J0.81−0.99
Happy Planet Index Human Development Index
SWB-I 0.14 0.80
SWB-J0.81−0.99
Table 6.

Correlation Between the Yearly Average SWB-I and SWB-J and the Two Indexes Happy Planet Index and Human Development Index.

Happy Planet Index Human Development Index
SWB-I 0.14 0.80
SWB-J0.81−0.99
Happy Planet Index Human Development Index
SWB-I 0.14 0.80
SWB-J0.81−0.99

6.1. Comparing the SWB-J Index With the Jiji PRESS Survey Data

Although real benchmarking is not possible for the reasons explained in the above, we try to compare the behaviour of the eight SWB-J components with the Jiji Press survey data. Jiji Press has conducted monthly polls interviewing 2,000 randomly selected nationwide samples since 1960, asking about party support and cabinet approval among other issues. Here we will focus on the following three questions.

The first one is about living: ‘How is your present situation compared with the same time last year? Are you feeling better or worse?’, and has six possible answers: Much better (q3c1), Somewhat better (q3c2), Same (q3c3), Somewhat worse (q3c4), Much worse (q3c5), Don’t know (q3c6). From the answers to this question we derive an indicator, namely life, that mimics the construction of our indicators in formula (1), i.e.

life=q3c1+q3c2q3c1+q3c2+q3c4+q3c5     .

The second question is about business conditions: ‘How do you see the national economy as a whole? Do you think it is the same as last month, getting worse, or getting better?’, with the following possible answers: Certainly getting better (q5c1), Somewhat getting better (q5c2), Same (q5c3), Somewhat getting worse (q5c4), Certainly getting worse (q5c5), Don’t know (q5c6). From these answers we construct the variable business as

business=q5c1+q5c2q5c1+q5c2+q5c4+q5c5     .

Finally, we considered a third question, which is about the future of the economy: ‘Do you think your life will get better or worse in the future?’, with these possible answers: Better (q6c1), Same (q6c2), Worse (q6c3), Don’t know (q6c4). Again we construct an indicator feconomy as follows

feconomy=q6c1q6c1+q6c2     .

The Jiji Press data and the SWB-J indicator overlap for the period August 2015 – July 2018. In periods before or after this time span, one of the two datasets has missing data. We considered monthly data for the correlation study, as the Jiji Press data have monthly frequency. Looking at the upper panel of Table 7, we can see that not much correlation emerges from the analysis. Therefore, in order to capture some possible lagged correlation effects, we run a lead-lag analysis. Let θ ∈ (−δ, δ) be the time lag between the two nonlinear time series X and Y. Roughly speaking, the idea is to construct acontrast function Un(θ) = Cov(Xt, Yt+θ) which evaluates the Hayashi-Yoshida covariance estimator (Hayashi and Yoshida 2008, 2005) for the times series Xt and Yt+θ, and then to maximise it as a function of θ. The lead-lag estimator θˆn of θ is defined as (Hoffmann et al. 2013):

Table 7.

Correlation Analysis for the Eight SWB-J Components Versus the Three Derived Indicators from the Jiji Press Survey Data (see text). Top Panel: The Simple Correlation. Middle Panel: Lagged Correlation After Lead-lag Analysis. Bottom Panel: The Result of the Lead-lag Analysis. A Positive Value Means the Variable in the Row Anticipates the Variable in the Column. The P-values of the Test Are in Parentheses.

Simple Correlation emo fun rel res sat tru vit wor
Life0.020.080.110.170.380.020.270.02
Business0.260.040.080.060.050.130.240.20
Feconomy0.070.130.050.100.260.230.330.07
Lagged Correlationemofunrelressattruvitwor
Life0.250.230.170.230.250.380.300.24
Business0.300.160.360.340.200.250.240.41
Feconomy0.330.220.190.440.330.380.330.17
LAG (months)emofunrelressattruvitwor
Life3 (0.057)2 (0.125)0 (0.192)1 (0.210)3 (0.076)0 (0.014)1 (0.034)1 (0.058)
Business4 (0.015)2 (0.260)1 (0.009)2 (0.025)2 (0.152)3 (0.171)0 (0.045)3 (<0.0001)
Feconomy2 (0.054)4 (0.280)3 (0.171)3 (0.011)1 (0.028)1 (0.010)0 (0.017)1 (0.178)
Simple Correlation emo fun rel res sat tru vit wor
Life0.020.080.110.170.380.020.270.02
Business0.260.040.080.060.050.130.240.20
Feconomy0.070.130.050.100.260.230.330.07
Lagged Correlationemofunrelressattruvitwor
Life0.250.230.170.230.250.380.300.24
Business0.300.160.360.340.200.250.240.41
Feconomy0.330.220.190.440.330.380.330.17
LAG (months)emofunrelressattruvitwor
Life3 (0.057)2 (0.125)0 (0.192)1 (0.210)3 (0.076)0 (0.014)1 (0.034)1 (0.058)
Business4 (0.015)2 (0.260)1 (0.009)2 (0.025)2 (0.152)3 (0.171)0 (0.045)3 (<0.0001)
Feconomy2 (0.054)4 (0.280)3 (0.171)3 (0.011)1 (0.028)1 (0.010)0 (0.017)1 (0.178)
Table 7.

Correlation Analysis for the Eight SWB-J Components Versus the Three Derived Indicators from the Jiji Press Survey Data (see text). Top Panel: The Simple Correlation. Middle Panel: Lagged Correlation After Lead-lag Analysis. Bottom Panel: The Result of the Lead-lag Analysis. A Positive Value Means the Variable in the Row Anticipates the Variable in the Column. The P-values of the Test Are in Parentheses.

Simple Correlation emo fun rel res sat tru vit wor
Life0.020.080.110.170.380.020.270.02
Business0.260.040.080.060.050.130.240.20
Feconomy0.070.130.050.100.260.230.330.07
Lagged Correlationemofunrelressattruvitwor
Life0.250.230.170.230.250.380.300.24
Business0.300.160.360.340.200.250.240.41
Feconomy0.330.220.190.440.330.380.330.17
LAG (months)emofunrelressattruvitwor
Life3 (0.057)2 (0.125)0 (0.192)1 (0.210)3 (0.076)0 (0.014)1 (0.034)1 (0.058)
Business4 (0.015)2 (0.260)1 (0.009)2 (0.025)2 (0.152)3 (0.171)0 (0.045)3 (<0.0001)
Feconomy2 (0.054)4 (0.280)3 (0.171)3 (0.011)1 (0.028)1 (0.010)0 (0.017)1 (0.178)
Simple Correlation emo fun rel res sat tru vit wor
Life0.020.080.110.170.380.020.270.02
Business0.260.040.080.060.050.130.240.20
Feconomy0.070.130.050.100.260.230.330.07
Lagged Correlationemofunrelressattruvitwor
Life0.250.230.170.230.250.380.300.24
Business0.300.160.360.340.200.250.240.41
Feconomy0.330.220.190.440.330.380.330.17
LAG (months)emofunrelressattruvitwor
Life3 (0.057)2 (0.125)0 (0.192)1 (0.210)3 (0.076)0 (0.014)1 (0.034)1 (0.058)
Business4 (0.015)2 (0.260)1 (0.009)2 (0.025)2 (0.152)3 (0.171)0 (0.045)3 (<0.0001)
Feconomy2 (0.054)4 (0.280)3 (0.171)3 (0.011)1 (0.028)1 (0.010)0 (0.017)1 (0.178)
θn=argmaxδ<θ<+δ|Un(θ)|

When the value of θˆn is positive it means that Xt and Yt+θ (or Xt-θ and Yt) are strongly correlated, so we say X leads Y by an amount of time θˆn’, so X is the leader and Y is the lagger (and vice-versa for negative θˆn). The lead-lag estimator is provided by the yuima R package (Iacus and Yoshida 2018). The results of the lead-lag analysis are given in Table 7, from which we can see (middle panel) that after lagging the variables, some correlation effects emerge and even some of the correlations change sign. The lags are given in the bottom panel of Table 7, and the P-values of the asymptotic test of significance are given in parentheses. From this analysis we can see that, for example, if we look at P-values lower than 0.05, business leads the res (with negative correlation) and wor (with positive correlation) indicators by one and three months respectively. One interpretation of this might be that while the expectation of future business decreases, the next month the resilience (res) increases as a sign of increasing stress. At the same time, the well-being at work (wor) also decreases. The variable business is instead led by emo and tru with different time lags, both with negative correlation. Notice that the correlation between business and wor it not so small.6

The feconomy variable seems to lead tru and is led by fun and sat by one month lag. On the other side, life seems to be led by vit (with positive correlation) by one month. This might be a nowcasting effect of the vit component over living conditions. These are of course correlation effects that seem to suggest that some relationships might exist between the SWB-J components and the Jiji Press survey.

7. Cross-Country Analysis 2015–2018

In this section we focus our attention on the impact of different economic variables on the SWB indicators. We make use of the monthly and quarterly data from Table 1 and interpolate quarterly data at monthly frequency to make use of as much data as possible. Note that, while Italy was examined over the period 2012–2018, this analysis is restricted to the period 2015–2018 when both countries are considered together for comparison purposes.

For the analysis we used the Structural Equation Modeling (SEM) with the continuous response variable (Bollen 1989) approach. SEM is a common method to test complex relationships between dependent variables, independent variables, mediators, and latent dimensions. We assume that true well-being is a latent variable influenced by the economic status of the country, which itself is supposed to be a latent variable, and by the health status of the country measured through life expectancy, and that the Twitter SWB-I/SWB-J indexes are observable measures of some aspects of the well-being latent variable.

In statistical terms, SEM consists of regression analysis, factor analysis, and path analysis to explore interrelationships between variables. It is a confirmatory technique, wherein an analyst tests a model to check consistency between the relationships put in place. The following latent dimensions are theorized:

  • Economy: captured by GDP growth, Consumption growth, Investment growth, and Unemployment rate;

  • Well-being: we assume this is affected by the Economy latent variable and by life expectancy, taken as a proxy of health conditions; in turn, the Well-being variable determines SBW-I/SWB-J measures. Then, a path diagram is constructed to represent the inter-dependencies of the independent variables (GDP growth, Consumption growth, Investment growth, Unemployment rate, Life expectancy at 40), the latent dimensions, and the dependent variable (SWB-I/SWB-J):

Economy GDP growth + Consumption growth + Investment growth + Unemployment rate

Well-being Economy + Life Expectancy at 40

SWB-J/SWB-J Well-being

Further, the interdependence among the observed variables (GDP growth and Life expectancy at 40, Consumption growth, Investment growth and Unemployment rate) is also captured in the model. The results of the fitted model are presented in Table 8, while Figures 3 and 4 give a graphical representation of the same fitting. The models have been fitted using the lavaan package (Rosseel 2012), and plots have been generated through the semPlot package (Epskamp 2019). It is worth noting that, despite the ambitious scope of the SEM model, we do not test causality here, but just correlation hypotheses through path analysis, as a first attempt to explain how SWB-I and SWB-J relate to official statistics.

Table 8.

Estimated Coefficients for the SEM Model Applied to the Japanese (Top) and Italian (Bottom) Data for the Period 2015–2018.

RelationshipCoefficient Std. Err.
Japan 2015–2018
Well-beingSWB-J 0.940***0.101
EconomyEconomic growth0.4060.497
EconomyUnemployment rate−0.377**0.148
EconomyConsumption growth1.173***0.159
EconomyInvestment growth0.730***0.155
Well-beingEconomy0.1780.123
Well-beingLife expectancy at 40−0.362**0.159
Economic growthcovLife expectancy at 40−0.743***0.174
Economic growthcovConsumption growth0.4040.525
Economic growthcovInvestment growth0.597*0.358
Economic growthcovUnemployment rate−0.440**0.195
Italy 2015–2018
Well-beingSWB-I0.597***0.113
EconomyEconomic growth0.514***0.190
EconomyUnemployment rate−0.581***0.178
EconomyConsumption growth0.597***0.178
EconomyInvestment growth0.398**0.179
Well-beingEconomy0.921**0.375
Well-beingLife expectancy at 400.834***0.242
Economic growthcovLife expectancy at 400.246**0.123
Economic growthcovConsumption growth0.1210.137
Economic growthcovInvestment growth0.2300.134*
Economic growthcovUnemployment rate0.0040.121
RelationshipCoefficient Std. Err.
Japan 2015–2018
Well-beingSWB-J 0.940***0.101
EconomyEconomic growth0.4060.497
EconomyUnemployment rate−0.377**0.148
EconomyConsumption growth1.173***0.159
EconomyInvestment growth0.730***0.155
Well-beingEconomy0.1780.123
Well-beingLife expectancy at 40−0.362**0.159
Economic growthcovLife expectancy at 40−0.743***0.174
Economic growthcovConsumption growth0.4040.525
Economic growthcovInvestment growth0.597*0.358
Economic growthcovUnemployment rate−0.440**0.195
Italy 2015–2018
Well-beingSWB-I0.597***0.113
EconomyEconomic growth0.514***0.190
EconomyUnemployment rate−0.581***0.178
EconomyConsumption growth0.597***0.178
EconomyInvestment growth0.398**0.179
Well-beingEconomy0.921**0.375
Well-beingLife expectancy at 400.834***0.242
Economic growthcovLife expectancy at 400.246**0.123
Economic growthcovConsumption growth0.1210.137
Economic growthcovInvestment growth0.2300.134*
Economic growthcovUnemployment rate0.0040.121

Note:

*P < 0.1;

**P < 0.05;

***P < 0.01.

Table 8.

Estimated Coefficients for the SEM Model Applied to the Japanese (Top) and Italian (Bottom) Data for the Period 2015–2018.

RelationshipCoefficient Std. Err.
Japan 2015–2018
Well-beingSWB-J 0.940***0.101
EconomyEconomic growth0.4060.497
EconomyUnemployment rate−0.377**0.148
EconomyConsumption growth1.173***0.159
EconomyInvestment growth0.730***0.155
Well-beingEconomy0.1780.123
Well-beingLife expectancy at 40−0.362**0.159
Economic growthcovLife expectancy at 40−0.743***0.174
Economic growthcovConsumption growth0.4040.525
Economic growthcovInvestment growth0.597*0.358
Economic growthcovUnemployment rate−0.440**0.195
Italy 2015–2018
Well-beingSWB-I0.597***0.113
EconomyEconomic growth0.514***0.190
EconomyUnemployment rate−0.581***0.178
EconomyConsumption growth0.597***0.178
EconomyInvestment growth0.398**0.179
Well-beingEconomy0.921**0.375
Well-beingLife expectancy at 400.834***0.242
Economic growthcovLife expectancy at 400.246**0.123
Economic growthcovConsumption growth0.1210.137
Economic growthcovInvestment growth0.2300.134*
Economic growthcovUnemployment rate0.0040.121
RelationshipCoefficient Std. Err.
Japan 2015–2018
Well-beingSWB-J 0.940***0.101
EconomyEconomic growth0.4060.497
EconomyUnemployment rate−0.377**0.148
EconomyConsumption growth1.173***0.159
EconomyInvestment growth0.730***0.155
Well-beingEconomy0.1780.123
Well-beingLife expectancy at 40−0.362**0.159
Economic growthcovLife expectancy at 40−0.743***0.174
Economic growthcovConsumption growth0.4040.525
Economic growthcovInvestment growth0.597*0.358
Economic growthcovUnemployment rate−0.440**0.195
Italy 2015–2018
Well-beingSWB-I0.597***0.113
EconomyEconomic growth0.514***0.190
EconomyUnemployment rate−0.581***0.178
EconomyConsumption growth0.597***0.178
EconomyInvestment growth0.398**0.179
Well-beingEconomy0.921**0.375
Well-beingLife expectancy at 400.834***0.242
Economic growthcovLife expectancy at 400.246**0.123
Economic growthcovConsumption growth0.1210.137
Economic growthcovInvestment growth0.2300.134*
Economic growthcovUnemployment rate0.0040.121

Note:

*P < 0.1;

**P < 0.05;

***P < 0.01.

Figure 4.

Graphical Representation of the Estimated SEM Model for the Italian Data for the Period 2015–2018.

7.1. Interpretation of the SEM Model

In the Italian case (see Figure 4 and Table 8 bottom panel), all the observed economic variables have an expected and significant relationship with the Economy latent variable which, in turn, positively and significantly affects Well-being. A few anomalies, on the contrary, may be noted in the analysis of Japan. First of all, we notice from Figure 3 and Table 8 (top panel) that the relationship between the observable economic variables and the Economy latent variable, as well as the inter-dependencies among the observable economic variables, have the expected sign. Moreover, both the Investment growth and the Consumption growth rates show a significant relationship, whose coefficients are higher compared to Italy: as investment and consumption are the main components of aggregate demand, this likely explains why the relationship with GDP growth comes out to be statistically non-significant. Also, the Unemployment rate is negatively and significantly related to the state of the economy. Nevertheless, the Economy latent variable does not significantly affect Well-being: this is probably our main result and suggests that well-being perception among the Japanese is not determined by the objective, observable, and mainly economic variables that have been traditionally used to measure the welfare of a country. It is worth recalling here that Diener et al. (1995) observed a tendency of Asian cultures, compared to continental European and Anglo-Saxon ones, to mark a lower score in reported subjective well-being for similar economic conditions, likely documenting that economic wealth is not necessarily the most important component of perceived well-being.

Figure 3.

Graphical Representation of the Estimated SEM Model for the Japanese Data for the Period 2015–2018.

Life expectancy at 40, which we adopt as a proxy of public health conditions, shows an opposite relationship in Japan compared to Italy, both with the Well-being latent variable and the GDP growth rate. As shown in Figure 2, life expectancy increased in both countries over the period 2015–2108. Similarly, the Italian GDP growth rate showed a positive trend in those years, whilst the Japanese one fluctuated: this justifies the observed different sign of the relationship.

Figure 2.

GDP Growth and Life Expectancy at 40 for Japan and Italy, 2015–2018.

Lastly, the SWB indicator is positively and significantly related to Well-being in both countries. Nevertheless, the coefficient is higher in the Japan case, highlighting that SWB-J more closely resembles the well-being level, as depicted by the latent variable.

8. Discussion and Limits of the Approach

This was the first attempt to elaborate a subjective well-being index based on Twitter data for Japan. This work shows that the same approach used to construct the analogous subjective well-being indicator for Italy can be transposed to Japan. The limitation of the human supervised method is that mother-tongue coders are needed to build the training set, but once that is done, the iSA algorithm (being completely agnostic to language) works seamlessly.

The structural equation model helps emphasize the differences between Japan and Italy, particularly with regard to the relevance of objective elements and circumstances in determining perceived well-being. In fact, it seems that Japanese subjective well-being is less affected by economic conditions, suggesting that personal, relational, and familiar dimensions may play a more important role in the individual and social well-being spheres. By nature, our well-being index is closely related to public perceptions of economic conditions, but we also need to be aware that it is not identical to them. As this is an effective measure of emotional well-being, we may need to be cautious in not conflating it with a measure for economic trends, to be used for forecasting.

However, the short time series for Japan may be one of the causes of the non-significance of some coefficients. Therefore, future work is needed to collect more data in terms of volume and time, in order to test the robustness of the results we obtained. As shown in the Supplementary Appendix, the set of keywords used to select the training set data may be enlarged to encompass more variations on the same concepts. Further, as shown in the Italian case (Iacus et al. 2019, 2020a, 2020b), sub-national differences could be examined in further development and applications of the indicator. As far as the posted messages are geolocalized, in fact, the SWB-J can be evaluated at regional or district level. Unfortunately, geolocalization is not available in this dataset, preventing this extension of the research.

An application of these two Twitter-based SWB indicators to the COVID-19 pandemic has been proposed by these authors in another study. Unfortunately, the data collection was not completely successful in 2020 due to some changes to Twitter APIs and policies. Nevertheless, we could see a generalized drop of the two indexes during the first nine months of 2020. Full details of that analysis are available in Carpi et al. (2021).

Footnotes

1

Also known as ‘observer effect’, it is the phenomenon by which individuals modify their behavior in response to their awareness of being observed (Landsberger, 1958).

2

The term is a contraction for ‘now’ and ‘forecasting’: it refers to the opportunity to collect information about some quantities or variables in real time.

3

For example, in a tv political debate, any non-electoral mention to the candidates or parties are considered as D0, or pure Off-Topic texts like spamming, advertising, etc.

4

By pre-processing phase, we mean all the textual data cleaning operations including: removal of stop words, punctuation, very frequent words (like conjunctions, prepositions, etc), numbers, special characters (#, @, etc.), and the reduction of words into stems, unigrams, etc.

5

In fact, despite several positive spikes, the average SWB-I in 2015 is the lowest of the examined period.

6

It is worth noting that the correlation coefficients among nonlinear time series are usually lower than those in standard linear models.

References

Abdullah
,
Saeed
,
Elizabeth L.
Murnane
,
Jean M. R.
Costa
, et al. .
2015
.
‘Collective Smile: Measuring Societal Happiness from Geolocated Images’
. In
Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, CSCW ’15.
ACM:
361
374
.

Baker
,
Reg
,
J.
Michael Brick
,
Nancy A.
Bates
, et al. .
2013
.
‘Summary Report of the AAPOR Task Force on Non-probability Sampling’
.
Journal of Survey Statistics and Methodology
1
(
2
):
90
143
.

Barrington-Leigh
,
Cristopher
and
Alice
Escande
.
2018
. ‘
Measuring Progress and Well-Being: A Comparative Review of Indicators’
.
Social Indicators Research
135
(
3
):
893
925
.

Bengio
,
Yoshua
,
Réjean
Ducharme
,
Pascal
Vincent
, et al. .
2003
.
‘A Neural Probabilistic Language Model’
.
The Journal of Machine Learning Research
3
:
1137
1155
.

Bollen
,
Kenneth A
.
1989
.
Structural Equations with Latent Variables
.
New York
:
Wiley
.

Bollen
,
Johan
,
Bruno
Gonçalves
,
Guangchen
Ruan
, et al. 
2011
.
‘Happiness Is Assortative in Online Social Networks’
.
Artificial Life
17
(
3
):
237
251
.

Bollen
,
Johan
,
Bruno
Gonçalves
,
Ingrid
van de Leemput
, et al. 
2017
. ‘
The Happiness Paradox: Your Friends Are Happier than You’
.
EPJ Data Science
6
(
4
):
1
10
.

Carpi
,
Tiziana
and
Stefano M.
Iacus
.
2020
.
‘Is Japanese Gendered Language Used on Twitter? A Large Scale Study’
.
Online Journal of Communication and Media Technologies
10
(
4
):
e202024
.

Carpi
,
Tiziana
,
Airo
Hino
,
Stefano M.
Iacus
, et al. 
2021
.
‘Twitter Subjective Well-Being Indicator During Covid-19 Pandemic: A Cross-Country Comparative Study’.
Available at https://arxiv.org/abs/2101.07695.

Ceron
,
Andrea
,
Luigi
Curini
,
Stefano M.
Iacus
, et al. 
2014
.
‘Every Tweet Counts? How Sentiment Analysis of Social Media Can Improve Our Knowledge of Citizens’ Political Preferences with an Application to Italy and France’
.
New Media & Society
,
16
:
340
358
.

Ceron
,
Andrea
,
Luigi
Curini
, and
Stefano M.
Iacus
.
2016
.
‘iSA: A Fast, Scalable and Accurate Algorithm for Sentiment Analysis of Social Media Content’
.
Information Sciences
367-368
:
105
124
.

Curini
,
Luigi
,
Stefano M.
Iacus
, and
Luciano
Canova
.
2015
.
‘Measuring Idiosyncratic Happiness through the Analysis of Twitter: An Application to the Italian Case’
.
Social Indicators Research
121
(
2
):
525
542
.

Deaton
,
Angus S
.
2012
.
‘The Financial Crisis and the Wellbeing of Americans’
.
Oxford Economic Papers
64
(
1
):
1
26
.

Deaton
,
Angus S.
and
Arthur A.
Stone
.
2013
.
‘Economic Analysis of Subjective Well-Being: Two Happiness Puzzles’
.
American Economic Review: Papers & Proceedings
103
(
3
):
591
597
.

Deaton
,
Angus S.
and
Arthur A.
Stone
.
2016
.
‘Understanding Context Effects for a Measure of Life Evaluation: How Responses Matter’
.
Oxford Economic Papers
68
(
4
):
861
870
.

Diener
,
Ed
,
Eunkook M.
Suh
,
Heidi
Smith
, et al. 
1995
.
‘National Differences in Reported Subjective Well-Being: Why Do They Occur?’
Social Indicators Research
34
(
1
):
7
32
.

Dodds
,
Peter S.
,
Kameron D.
Harris
,
Isabel M.
Kloumann
, et al. 
2011
.
‘Temporal Patterns of Happiness and Information in a Global Social Network: Hedonometrics and Twitter’
.
PLoS One
6
(
12
):
e26752
.

Durahim
,
Ahmet O.
and
Mustafa
Coşkun
.
2015
.
‘#iamhappybecause: Gross National Happiness through Twitter Analysis and Big Data’
.
Technological Forecasting and Social Change
99
:
92
105
.

Epskamp
,
Sacha
.
2019
.
‘semPlot: Path Diagrams and Visual Analysis of Various SEM Packages’ Output’
.
CRAN. R Package Version 1.1.2.
Available at https://cran.r-project.org/.

Fahey
,
Robert A.
,
Jeremy
Boo
, and
Michiko
Ueda
.
2020
.
‘Covariance in Diurnal Patterns of Suicide-Related Expressions on Twitter and Recorded Suicide Deaths’
.
Social Science & Medicine
253
:
112960
.

Fleurbaey
,
Marc
.
2009
.
‘Beyond GDP: The Quest for a Measure of Social Welfare’
.
Journal of Economic Literature
47
(
4
):
1029
1075
.

Ford
,
Brett Q.
,
Julia O.
Dmitrieva
,
Daniel
Heller
, et al. 
2015
.
‘Culture Shapes Whether the Pursuit of Happiness Predicts Higher or Lower Well-being’
.
Journal of Experimental Psychology: General
144
(
6
):
1053
.

Greco
,
Francesca
and
Alessandro
Polli
.
2020
.
‘Security Perception and People Well-being’
.
Social Indicators Research
153
(
2
),
741
758
.

Greyling
,
Talita
,
Stephanie
Rossouw
, and
Tamanna
Adhikari
.
2020
.
‘Happiness-Lost: Did Governments Make the Right Decisions to Combat Covid-19?’
. Available at https://ideas.repec.org/p/zbw/glodps/556.html.

Hayashi
,
Takaki
and
Nakahiro
Yoshida
.
2005
.
‘On Covariance Estimation of Non-Synchronously Observed Diffusion Processes’
.
Bernoulli
11
(
2
):
359
379
.

Hayashi
,
Takaki
and
Nakahiro
Yoshida
.
2008
.
‘Asymptotic Normality of a Covariance Estimator for Nonsynchronously Observed Diffusion Processes’
.
Annals of the Institute of Statistical Mathematics
60
:
367
406
.

Hino
,
Airo
and
Robert A.
Fahey
.
2019
.
‘Representing the Twittersphere: Archiving a Representative Sample of Twitter Data under Resource Constraints’
.
International Journal of Information Management
48
:
175
184
.

Hoffmann
,
Marc
,
Mathieu
Rosenbaum
, and
Nakahiro
Yoshida
,
2013
.
‘Estimation of the Lead-Lag Parameter from Non-Synchronous Data’
.
Bernoulli
19
(
2
):
426
461
.

Hopkins
,
Daniel J.
and
Gary
King
.
2010
.
‘A Method of Automated Nonparametric Content Analysis for Social Science’
.
American Journal of Political Science
54
(
1
):
229
247
.

Iacus
,
Stefano M.
and
Nakahiro
Yoshida
.
2018
.
Simulation and Inference for Stochastic Processes with YUIMA: A Comprehensive R Framework for SDEs and Other Stochastic Processes.
New York
:
Springer
.

Iacus
,
Stefano M.
,
Giuseppe
Porro
,
Silvia
Salini
, et al. 
2019
.
‘Social Networks Data and Subjective Well-Being. An Innovative Measurement for Italian Provinces’
.
Scienze Regionali - Italian Journal of Regional Science
18
(
special issue
):
667
678
.

Iacus
,
Stefano M.
,
Giuseppe
Porro
,
Silvia
Salini
, et al. 
2020
a.
‘Controlling for Selection Bias in Social Media Indicators through Official Statistics: A Proposal’
.
Journal of Official Statistics
36
(
2
):
315
338
.

Iacus
,
Stefano M.
,
Giuseppe
Porro
,
Silvia
Salini
, et al. 
2020
b.
‘An Italian Composite Subjective Well-Being Index: The Voice of Twitter Users from 2012 to 2017’
.
Social Indicators Research
1
19
. doi: 10.1007/s11205-020-02319-6

Iio
,
Jun
.
2020
.
‘Kawaii in Tweets: What Emotions Does the Word Describe in Social Media?’
. In
Advances in Networked-based Information Systems
, eds.
Leonard
Barolli
,
Hiroaki
Nishino
,
Tomoya
Enokido
, et al. 
Cham
:
Springer International Publishing
, pp.
715
721
.

Kahneman
,
Daniel
,
Alan B.
Krueger
,
David
Schkade
, et al. 
2004
.
‘Toward National Well-Being Accounts’
.
American Economic Review
94
(
2
):
429
434
.

Kamijo
,
Koichi
,
Tetsuya
Nasukawa
, and
Hideya
Kitamura
.
2016
.
‘Personality Estimation from Japanese Text’
. In
Proceedings of the Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media
.
Osaka
: The COLING 2016 Organizing Committee: pp.
101
109
.

King
,
Gary
.
2016
.
‘Preface: Big Data Is Not about the Data!’
In
Computational Social Science: Discovery and Prediction
, ed.
R.
Michael Alvarez
.
Cambridge
:
Cambridge University Press
: pp.
1
10
.

Kitayama
,
Shinobu
,
Hazel R.
Markus
, and
Masaru
Kurokawa
.
2000
.
‘Culture, Emotion, and Well-Being: Good Feelings in Japan and the United States’
.
Cognition and Emotion
14
(
1
):
93
124
. doi:
10.1080/026999300379003
.

Kumano
,
Michiko
.
2018
.
‘On the Concept of Well-Being in Japan: Feeling Shiawase as Hedonic Well-Being and Feeling Ikigai as Eudaimonic Well-Being’
.
Applied Research in Quality of Life
13
(
2
):
419
433
.

Kuznets
,
Simon
.
1934
.
‘National Income, 1929-1932’
. In
National Income, 1929-1932
.
Cambridge (Mass.)
:
NBER
, pp.
1
12
.

Landsberger
,
Henry A
.
1958
.
Hawthorne Revisited: Management and the Worker, Its Critics, and Developments in Human Relations in Industry.
Ithaca (N.Y.)
:
Cornell University
.

Lim
,
Kwan H.
,
Kate E.
Lee
,
Dave
Kendal
, et al. 
2018
.
‘The Grass Is Greener on the Other Side: Understanding the Effects of Green Spaces on Twitter User Sentiments’
. In
Companion of The Web Conference 2018. International World Wide Web Conferences Steering Committee
. pp.
275
282
.

Luhmann
,
Maike
.
2017
.
‘Using Big Data to Study Subjective Well-Being’
.
Current Opinion in Behavioral Sciences
18
:
28
33
.

Matsumoto
,
David
.
1999
.
‘American-Japanese Cultural Differences in Judgements of Expression Intensity and Subjective Experience’
.
Cognition and Emotion
13
(
2
):
201
218
.

Mikolov
,
Tomas
,
Wen-tau
Yih
, and
Geoffrey
Zweig
.
2013
.
‘Linguistic Regularities in Continuous Space Word Representations’
. In
Proceedings of NAACL HLT 2013.
Atlanta
:
Association for Computational Linguistics
: pp.
746
751
.

Miyake
,
Kazuko
.
2007
.
‘How Young Japanese Express Their Emotions Visually in Mobile Phone Messages: A Sociolinguistic Analysis’
.
Japanese Studies
27
(
1
):
53
72
.

Murphy
,
Joe
,
Michael W.
Link
,
Jennifer H.
Childs
, et al. 
2014
.
‘Social Media in Public Opinion Research. Executive Summary of the AAPOR Task Force on Emerging Technologies in Public Opinion Research’
.
Public Opinion Quarterly
78
(
4
):
788
794
.

NEF
.
2009
.
National Accounts of Well-Being: Bringing Real Wealth onto the Balance Sheet
. Technical report.
London
:
New Economics Foundation
.

NEF
.
2012
.
Measuring Well-being. A Guide for Practitioners
. Technical report.
London
:
New Economics Foundation
.

NEF
.
2016
.
The Happy Planet Index 2016. A Global Index of Sustainable Well-Being.
Technical report.
London
:
New Economics Foundation
.

Novak
,
Petra K.
,
Jasmina
Smailović
,
Borut
Sluban
, et al. 
2015
.
‘Sentiment of Emojis’
.
PLoS One
10
(
12
):
e0144296
.

Park
,
Jaram
,
Young M.
Baek
, and
Meeyoung
Cha
.
2014
.
‘Cross-Cultural Comparison of Nonverbal Cues in Emoticons on Twitter: Evidence from Big Data Analysis
.
Journal of Communication
64
(
2
):
333
354
.

Ptaszynski
,
Michal
,
Rafal
Rzepka
,
Kenji
Araki
,
et al. 
2014
.
‘Automatically Annotating a Five-Billion-Word Corpus of Japanese Blogs for Sentiment and Affect Analysis
.
Computer Speech & Language
28
(
1
):
38
55
.

Quercia
,
Daniele.
,
Jonathan
Ellis
,
Licia
Capra
,
et al. 
2012
.
‘Tracking Gross Community Happiness from Tweets’
. In
Proceedings of the ACM 2012 Conference on Computer Supported Cooperative Work, CSCW’12.
New York:
ACM
: pp.
965
968
.

Robeyns
,
Ingrid
.
2006
.
‘The Capability Approach in Practice’
.
Journal of Political Philosophy
14
(
3
):
351
376
.

Rosseel
,
Yves
.
2012
.
‘lavaan: An R Package for Structural Equation Modeling’
.
Journal of Statistical Software
48
(
2
):
1
36
.

Rossouw
,
Stephanie
and
Talita
Greyling
.
2020
.
‘Big Data and Happiness’.
In
Handbook of Labor, Human Resources and Population Economics
, ed.
Klaus F.
Zimmermann
.
Cham
:
Springer
: pp.
1
35
.

Ryan
,
Richard M.
and
Edward L.
Deci
.
2001
.
‘On Happiness and Human Potentials: A Review of Research on Hedonic and Eudaimonic Well-being’
.
Annual Review of Psychology
52
(
1
):
141
166
.

Rzepka
,
Rafal
,
Urszula
Jagla
,
Pawel
Dybala
, et al. 
2016
.
‘Influence of Emoticons and Adverbs on Affective Perception of Japanese Texts’
. In
Proceedings of the 30th Annual Conference of the Japanese Society for Artificial Intelligence, JSAI 2016
. pp.
1
3
.

Salganik
,
Matthew J
.
2017
.
Bit by Bit: Social Research in the Digital Age
.
Princeton
:
Princeton University Press
.

Schwartz
,
H. Andrew
,
Johannes C.
Eichstaedt
,
Margaret L.
Kern
, et al. 
2013
.
‘Characterizing Geographic Variation in Well-Being Using Tweets’.
In
Proceedings of the International AAAI Conference on Weblogs and Social Media
,
7
(
1
):
583
591
.

Schwarz
,
Norbert
and
Fritz
Strack
.
1999
.
‘Reports of Subjective Well-Being: Judgmental Processes and Their Methodological Implications’
. In
Well-being: The Foundations of Hedonic Psychology
, eds.
Daniel
Kahneman
,
Ed
Diener
, and
Norbert
Schwarz
.
New York
:
Russell Sage Foundation
: pp.
61
84
.

Scollon
,
Christie N
.
2018
.
‘Non-Traditional Measures of Subjective Well-Being and Their Validity: A Review’
. In
Handbook of Well-Being
, eds.
Ed
Diener
,
Shigehiro
Oishi
, and
Louis
Tay
.
Salt Lake City (UT)
:
DEF Publishers
: pp.
1
12
.

Sen
,
Amartya
.
1980
.
‘Equality of What?’
. In
The Tanner Lectures on Human Values
, vol.
1,
ed.
Sterling M.
MacMurrin
.
Cambridge
:
Cambridge University Press
: pp.
195
220
.

Shoeb
,
Abu
and
Gerard
de Melo
.
2020
.
‘Are Emojis Emotional? A Study to Understand the Association between Emojis and Emotions’.
Available at https://arxiv.org/abs/2005.00693.

Steptoe
,
Andrew
,
Angus S.
Deaton
, and
Arthur A.
Stone
.
2015
.
‘Subjective Wellbeing, Health, and Ageing’
.
The Lancet
385
(
9968
):
640
648
.

Stiglitz
,
Joseph
,
Amartya
Sen
, and
Jean-Paul
Fitoussi
.
2009
.
Report by the Commission on the Measurement of Economic Performance and Social Progress.
Technical report.
Paris
:
INSEE
.

United Nations Development Programme
.
2019
.
Human Development Report 2019. Beyond Income, Beyond Averages, Beyond Today: Inequalities in Human Development in the 21st Century.
Technical report.
New York
:
United Nations Development Programme
.

Van der Wielen
,
Wouter
and
Salvador
Barrios
.
2021
.
‘Economic Sentiment during the Covid Pandemic: Evidence from Search Behaviour in the EU’
.
Journal of Economics and Business
115
:
105970
.

Vo
,
Bao-Khanh H.
and
Nigel
Collier
.
2013
.
‘Twitter Emotion Analysis in Earthquake Situations’
.
International Journal of Computational Linguistics and Applications
4
(
1
):
159
173
.

Voukelatou
,
Vasiliki
, et al. 
2021
.
‘Measuring Objective and Subjective Well-Being: Dimensions and Data Sources’
.
International Journal of Data Science and Analytics
11
(
4
):
279
309
.

Wong
,
Natalie
,
Xianmin
Gong
, and
Helene H.
Fung
.
2020
.
‘Does Valuing Happiness Enhance Subjective Well-Being? The Age-Differential Effect of Interdependence’
.
Journal of Happiness Studies
21
(
1
):
1
14
.

Yang
,
Chao
and
Padmini
Srinivasan
.
2016
.
‘Life Satisfaction and the Pursuit of Happiness on Twitter’
.
PLoS One
11
(
3
):
e0150881
.

Zhao
,
Yukun
,
Feng
Yu
,
Bo
Jing
, et al. 
2019
.
‘An Analysis of Well-Being Determinants at the City Level in China Using Big Data’
.
Social Indicators Research
143
(
3
):
973
994
.

Author notes

The data collection was performed with the Japan Science and Technology Agency CREST (Core Research for Evolutional Science and Technology) project, grant n. JPMJCR14D7. This work was also performed in collaboration with the Waseda Institute of Social Media Data (WISDOM), especially the human coding of the training set. We thank Jiji Press for making their monthly polls data available and Hanako Ohmura for her valuable support.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.