key: cord-138978-hfmgc5ve authors: Venigalla, Akhila Sri Manasa; Vagavolu, Dheeraj; Chimalakonda, Sridhar title: Mood of India During Covid-19 -- An Interactive Web Portal Based on Emotion Analysis of Twitter Data date: 2020-05-06 journal: nan DOI: nan sha: doc_id: 138978 cord_uid: hfmgc5ve The severe outbreak of Covid-19 pandemic has affected many countries across the world, and disrupted the day to day activities of many people. During such outbreaks, understanding the emotional state of citizens of a country could be of interest to various organizations to carry out tasks and to take necessary measures. Several studies have been performed on data available on various social media platforms and websites to understand the emotions of people against many events, inclusive of Covid-19, across the world. Twitter and other social media platforms have been bridging the gap between the citizens and government in various countries and are of more prominence in India. Sentiment Analysis of posts on twitter is observed to accurately reveal the sentiments. Analysing real time posts on twitter in India during Covid-19, could help in identifying the mood of the nation. However, most of the existing studies related to Covid-19, on twitter and other social media platforms are performed on data posted during a specific interval. We are not aware of any research that identifies emotional state of India on a daily basis. Hence, we present a web portal that aims to display mood of India during Covid-19, based on real time twitter data. This portal also enables users to select date range, specific date and state in India to display mood of people belonging to the specified region, on the specified date or during the specified date range. Also, the number of Covid-19 cases and mood of people at specific cities and states on specific dates is visualized on the country map. As of May 6 2020, the web portal has about 194370 tweets, and each of these tweets are classified into seven categories that include six basic emotions and a neutral category. A list of Trigger Events are also specified, to allow users to view the mood of India on specific events happening in the country during Covid-19. Covid-19 pandemic has been severely affecting many countries across the world. The severity of the pandemic is growing very fast across the world. In India, the number of Covid-19 cases have greatly increased, from 724 cases to around 29K cases, in a span of one month, from 28 March, 2020 to 28 April, 2020. The total number of deaths reported in the country have increased from 17 on 28 March, 2020 to 934 on 28 April, 2020. There has been a growth rate of more than 100%, both in view of number of cases and number of deaths in the country, in a span 10 days. Increased number of cases being reported, from around 14K to around 29K and deaths from 480 to 934, indicate the rapid growth of the pandemic in the country (from 18 April, 2020 to 28 April, 2020). It has been observed that the sudden outrages of such pandemics affect public emotion and result in either constructive or destructive behavioural changes [Hou, Du, Jiang, Zhou, and Lin (2020) ]. Human behavior has been observed to play an important role in either controlling or scaling up spread of a disease, and is more prominent in case of highly contagious pandemics [Siegrist and Zingg (2014) ]. The most common emotions witnessed among the people during several pandemics and crisis situations are fear and anger [Lin, Savoia, Agboola, and Viswanath (2014) ] [Vaughan and Tinker (2009) ]. Though fear is observed as a basic instinct during the onset of unexpected situations involving danger, researchers also warn that excessive fear results in anxiety disorders, intensifying psychiatric disorders [Garcia (2017) ]. Also, such psychiatric disorders and anxiety might lead to cardiovascular disorders among the vulnerable population [Shin and Liberzon (2010) ]. Mental health should hence be considered as an important part of pandemic response [Douglas, Douglas, Harrigan, and Douglas (2009) ]. Lack of proper guidance to face the pandemic situation might sometimes result in acute fear, leading to self harm intentions including suicides [Mamun and Griffiths (2020) ] [Shigemura, Ursano, Morganstein, Kurosawa, and Benedek (2020) ]. Considering the severity of the pandemic in the context of Covid-19, it is extremely important to consider the psychological state and motivate the people accordingly, in controlling the spread of Covid-19. Several cases of anxiety, stress and panic have been reported in various countries affected with Covid-19 such s China, Japan and so on [J.-B. Li et al. (2020) ] [Shigemura et al. (2020) ]. Providing appropriate guidance to the population helps them to respond in a better way when contracted with Covid-19 [Zhao and Xu (2020) ]. Several organizations across the world such as WHO and CDC, have suggested various measures to be taken up to protect and enhance mental state of vulnerable population during pandemics, including Covid-19 [CDC (2020)][ WHO (2018) ]. Thus, analysing public emotions against Covid-19 helps in understanding the perception and preparedness towards the pandemic among the public [Hou et al. (2020) ]. It helps in broadcasting effective public health messages that are inline with public emotions, which could help the public in taking appropriate measures against Covid-19 [Van Bavel et al. (2020) ]. Several countries have issued strict home isolation and quarantine instructions towards battling Covid-19. Governments across the world have taken up health initiatives to reduce the negative psychological impact on the population during the period of home confinement and quarantine [Garriga et al. (2020) ] [Rubin and Wessely (2020) ]. The Government of India has taken up several measures towards controlling the pandemic, such as strengthening medical care, issuing strict lockdown instructions, carrying out awareness campaigns and so on 1 . The instructions passed by the government and severity of Covid-19 has greatly affected the day to day lives of citizens in the country. This is also observed to result in psychological stress among few people in the country. Several measures to reduce this affect are being taken up by both public and private sector organizations in the country. The government has set up helpline centers that aim towards helping people with psychological issues during the pandemic, through telephone. Also, many audio and video awareness clippings are being advertised by many organizations to prevent the psychological stress and to improve awareness among people towards the pandemic. Understanding the mood of people across various parts of the country could help various organizations in taking better measures to help citizens of the country in maintaining better psychological balance. Social Networking platforms such as Twitter, Facebook and so on serve as a source to analyse and understand emotional state of the public. Such platforms play an impor-tant role during pandemics, in assessing the mood and mental health of people across the world. Researchers across the globe have made several attempts to understand the emotions of people in various countries towards Covid-19, with an aim to help health organizations and government in taking up measures accordingly. However, most of the existing studies summarize the emotions of posts on social networking platforms during a specified time interval, but not on day to day posts. Depicting the emotion of people on a daily basis could help various organizations in understanding the changing mood of people. Hence, we present a web portal-Mood of India During Covid-19, that is aimed to provide visualizations of across various states in the country. The tweets posted on twitter related to Covid-19 are analysed and classified into one of the seven categories that include six emotions -Anger, Sadness, Happiness, Surprise, Fear and Disgust [Ekman (1992) ], and Neutral category, which are visualized on India Map based on the location from which the tweets have been posted. The emotional state of people plays an important role in responding to a pandemic. Understanding the psychological state of population helps the governments in formulating guidelines and in taking necessary measures that are intended to motivate the public towards taking appropriate measures to prevent spread of pandemics and to restore mental well being amongst vulnerable population. Hence, several studies have been conducted to analyse the emotional state of the public during pandemics, including Covid-19. The influence of various information sources and awareness campaigns aimed towards educating people about the mode of spread, safety measures to be taken against Covid-19 and so on has been analysed through a open-ended answer based questionnaire based on Situational Awareness Theory and Theory Planned Behaviour [Qazi et al. (2020) ]. A user survey has been conducted based on the questionnaire and sentiments of 82 responses received have been analysed. The results of this study indicate that enhanced situational awareness among the people motivates them in adopting better protective measures [Qazi et al. (2020) ]. Sentiments of 538 responses obtained from an online survey containing open ended questions related to Health Anxiety have been analysed among people in Philippines. The results of this analysis revealed moderate "level of symptoms of Hypochondriasis, Attitude on Acquiring Covid-19 Avoidance, and Reassurance Seeking" among the Philippines population [Nicomedes and Avila (2020) ]. Zhao et al. have analysed the attention of public to events related to Covid-19 in China and observed an increase in public attention towards information related Covid-19. This analysis has been conducted on Chinese social media platform -Sina Microblog, with an aim to help government in formulating better principles in communicating on health related aspects with the public [Zhao and Xu (2020) ]. Another study conducted on data from three social media platforms in China during the period of December 1, 2019 to February 15, 2020 aimed to observe public attention, emotion, behavioral response and so on. This data analysis revealed low public attention in the initial stages of the outbreak, delaying the control of Covid-19. Also, it has been observed that delayed information broadcasting has triggered negative emotions and resulted in panic buying in many cases [Hou et al. (2020) ]. Li et al. have analysed more than 17K Weibo posts from 13 January 2020 to 26 January 2020. The analysis was based on identifying the psychological profile of users based on Online Ecological Recognition and predictive machine learning models, and consequently identified the emotions of users. The results indicate a visible decline in life satisfaction and increase in negative emotions [S. Li, Wang, Xue, Zhao, and Zhu (2020) ]. Twitter is one of the most commonly used platform and a rich medium to analyse various factors of population such as public sentiments, public response to a situation across the world, predict outbreaks of diseases and so on. Posts on twitter during 14 January 2020 and 28 January 2020 related Covid-19 have been extracted have been extracted to understand the changes in sentiments and opinions among people towards Covid-19. It has been observed that negative emotion tweets are observed in areas having more number of cases reported [Medford, Saleh, Sumarsono, Perl, and Lehmann (2020) ]. Sharma et al. have designed a dashboard that identifies misinformation being spread with respect to Covid-19, reactions of users to various emergency policies, country-wide sentiments and so on, and displays them on the dashboard [Sharma et al. (2020) ]. Considering the importance of posts on twitter, a multilanguage twitter dataset has been created by Lopez et al. This dataset is also expected to provide insights on public response towards the pandemic across several countries and multiple languages [Lopez, Vasu, and Gallemore (2020) ]. About 364K posts on twitter during 31 December 2019 to 6 February 2020 have been extracted and analysed to predict the outbreak of Covid-19. It has been observed that the number of tweets all over the world were directly proportional to the number of cases being reported in respective countries [Jahanbin and Rahmanian (2020) ]. Chen et al. have analysed posts on twitter during 23 March 2020 and 5 April 2020 and respective emotions associated with the posts [Chen, Lyu, Yang, Wang, and Luo (2020) ]. They observed correlations between the nature of terms used in the posts and respective sentiments of the posts. They have classified the terms into two classes -controversial and non-controversial using LDA topic modelling and the results of analysis indicated that posts with controversial terms exhibit a higher level of negative emotions [Chen et al. (2020) ]. Public emotions have been analysed based on twitter data of around 20 million posts, tweeted during 28 January 2020 and 9 April 2020 [Lwin et al. (n.d.) ]. The tweets have been classified into (2020)]. It has been observed that positive sentiments have outnumbered the negative ones in the country [Barkur and Vibha (2020) ]. In spite of several studies being conducted on psychological state of public across the world, there is limited research in understanding the psychological state of public in India. Also, most of the existing studies deal with data only during specific timelines. In addition to this, to the best of knowledge, we are not aware of any real time, streaming twitter datasets, that are specific to India. Hence, we present an interactive web portal, aimed to display the mood of India during Covid-19, based on streaming data of twitter. This web portal provides visualizations of number of Covid cases reported and emotional states of various states in the country. Also, the users are facilitated to select date range and state, and to view respective statistics of emotion change as a graph. Mood of India During Covid-19 thus provides a platform to view the sentiment of people across in each state on each day. It also helps in viewing the trends in emotion change across the country during a specific interval. Twitter has been regarded as one of the richest platforms to assess trends, predict several activities, understand emotions and response of people towards various scenarios and so on. It has been widely used in analysing mental health and emotions of people during crisis situations. Hence, analysing emotions posts on twitter during Covid-19 in India could help in understanding the emotional states of population across the country. Several studies have reported that texts in twitter posts could be associated with one of the six basic emotions proposed by Ekman [Ekman (1992) ]. Also, it has been observed that several studies aimed to analyse the sentiment of people based on twitter posts during such pandemics and crisis situations, have classified the twitter posts into the six basic emotions [Do, Lim, Kim, and Choi (2016) ]. Do et al. have classified emotions on twitter during the outbreak of MERS in Korea during 2015, into seven categories [Do et al. (2016) ]. We observed the idea of classifying tweets into emotion categories during Covid-19, to be similar to that of classifying them during MERS in Korea [Do et al. (2016) ]. Hence, we considered a total seven emotions into which each of the tweets was classified, inline with those considered by Do et al. They included the six basic emotions -Anger, Disgust, Fear, Happiness, Sadness and Surprise, and an additional category -Neutral. Mood of India During Covid-19 provides emotion of a state in India through a six step mechanism as shown in Figure 1 . Step 1 -Data Extraction. Tweets posted from India with six hashtags related to Covid-19 are downloaded from Twitter. These hashtags included -CoronaVirus, Covid-19, India Fight Corona, Covid, Lockdown All such tweets (not more than 10,000 per day), posted during a day are downloaded along with their corresponding locations. Based on the location co-ordinates, corresponding states are identified. Step 2 -Data Preprocessing. The downloaded tweets are preprocessed using NLTK library. This method first tokenizes all the input sentences in the tweets and the parts of speech of each word in tweet in identified. All nouns and prepositions are removed as they do not convey any emotion. Step 3 -Sentence Ranking. An existing dataset containing synonyms of keywords corresponding to each of the six emotions -anger, disgust, fear, happiness, sadness and surprise has been downloaded from a github repository 2 . The number of keywords for each of the six emotions is presented in Table 1 . The resultant tweets after preprocessing are compared to the set of keywords corresponding to each emotion. The tweets are then assigned ranks with respect to each emotion category based on the percentage of keywords of each category present in the tweet. Step 4 -Emotion based Classification. The ranks of all emotions in each tweet are analysed and the highest ranked emotion in the tweet is considered to be the emotion of the tweet. The tweets are then labelled with their corresponding emotion. The tweets having no rank for any of the six emotions indicate the absence of keywords corresponding to all six emotions. Such tweets are considered to express neutral emotion and hence are labelled as Neutral. All the tweets containing facts also fall into the neutral category. Step 5 -Identify mood of states. The emotions of all the tweets in a day, in each state are analysed. Tweets having similar emotions are grouped together and sum of tweets conveying the same emotion are calculated. The emotion having the largest number of tweets in each state is identified and the states are labelled corresponding to the identified emotion. Step 6 -Store into Database. The tweets and states with corresponding emotions are stored into the database, to be displayed onto the web portal as required. Apart from the tweet data, information related to Covid-19 cases, including the number of new cases reported, number of recovered cases and the number of deaths reported are extracted using Covid19 API 3 . This data is rendered on the portal as a heat map, plotted on the country map. Apart from choosing to visualize emotion of states in the country, users can also view emotions of any one of the six cities-Mumbai, Chennai, Pune, Hyderabad, Bangalore and Tirupati. The fundamental motivation behind developing the web portal is to provide insights on feelings of people during Covid-19, based on twitter information. We have used the Flask framework as a lightweight back-end system and HTML5 for the front-end for the development, which enables us to serve Mood of India During Covid-19 as a web portal. Using the python scheduler library, we update the previous day's data precisely at 16:00 hours daily. We retrieve and store two kinds of data -Covid-19 Case data from covid19India 4 API and Daily tweet data. We then convert this data into .csv files for processing and generate JSON files with meta-data for reducing the stress on the backend. Using the tweet data, we generate two kinds of graphs, a bar graph for a single day evaluation and a line graph for evaluation of a continuous range of dates. These graphs are present for India, it's states and cities. ChartJS has been used to generate all the graphs in the application. The Covid-19 case data contains the number of state-wise Confirmed, Recovered and Deceased cases, which is shown to the right of the Map. It is visualized by generating a Heat Map of India. The Map shows state-wise Covid-19 confirmed cases based on the selected date. We built the Map using an SVG image with all the states listed in it and programmed it manually to respond to user interaction. By combining the slider and the Map, the user can select a specific state and know the statistics at any point in time. We plot pins in the graph showing the top two highest emotion/emotions in that state. To convert the geographical co-ordinates into a position on the Map, we use the amCharts library. A report is generated for the selected date range, which summarises the available details. The web portal, Mood of India During Covid-19, displays the emotion of population towards Covid-19, across India, with an aim to provide insights about the mood of the country. When the portal is visited on any specific day, emotions of the country from 14 March,2020 to the present day are loaded by default based on twitter data during the range, as shown in Figure 2 Table 2 depicts the number of tweets classified into each of the seven categories, for every state and union territory of India, along with Covid-19 statistics in the corresponding regions during March 14, 2020 to May 6, 2020. State, when the Extension of Lockdown by 2 weeks has been announced on May 1 2020. This graph is generated by selecting the Trigger Event -Extension of Lockdown by 2 weeks. The graph displays mixed emotions, with about 26% of the tweets expressing happiness, 18% expressing sadness and so on, indicating that most of the people in the state are happy about the announcement. Mood of India During Covid-19 has been developed as a webportal, that is aimed to provide information about emotions of the population during the pandemic. Data in the form of posts from twitter are mined on a daily basis, along with the location from which they are posted. Each of tweets is classified into one of the seven categories, based on the presence of percentage of keywords belonging to the six emotion categories. The tweets containing facts are classified into the neutral category. Currently, our database consists of about 86K tweets, during the period of March and April. However, more number of tweets for the two months could be mined gradually. Also, the maximum number of tweets being mined per day currently is 5K tweets from each state. Though more number of tweets could be mined, we observe that the total number of Covid-19 related posts are around 10k-12K per day across the country, on an average. Hence, we assume that the current number of tweets would suffice for analysis. However, this number could be increased if the number of tweets are observed to raise. Tweets are being classified into respective emotions based on comparison of the processed tweets with set of keywords present in the dataset extracted from the github repository 5 . Tweets containing words that would belong to an emotion, but not present in the set of respective keywords would not be identified to express the specific emotion. The existing list of keywords contains a minimum of 70 keywords per emotion. Since the number of keywords in the dataset for each emotion are not equal, the classification might be inclined towards emotions with more number of keywords in the dataset. Identifying and implementing Natural Language Processing and Machine Learning techniques that result in better accuracies could be used to improve the classification of tweets and consequently in better predicting the mood of the country. The existing portal displays mood of the population only upto state level. Mood of population in only six cities -Pune, Hyderabad, Mumbai, Tirupati, Chennai and Bangalore is represented. Representing the emotions at multiple levels such as nation, state, city, district and so on, could help in understanding the emotional states of people belonging to more specific regions. Considering the importance of understanding public emotions and the affects on psychological state of people during a crisis, in this paper, we present a web portal to identify the mood of India during the current Covid-19 pandemic. The web portal marks top most emotions of people across various states and cities in the country based on the sentiments associated with tweets in each region. The emotions of tweets are identified by ranking tweets based on comparison of words in the tweets with preloaded list of keywords for each emotion. The results of emotions in a specific state in the country are displayed on the country map on any day selected by user from March 14, 2020. Also, a heat map of number of new cases reported in the country across all the states is plotted on the country map. The trends of emotions changing across the country in a selected date range are also visualized as line graphs. Users can select to view information related to mood of the country. Also, bar graphs depicting percentage of tweets expressing specific emotions are presented to the users. Viewing the country map on any specific day could help users in understanding the emotion of the region with respect to number of Covid-19 cases reported in the region. In the future versions, we plan to increase the number of tweets being considered for emotion analysis. Also, we plan to mine more number of tweets related to Covid-19 in the country during the two month period of March and April to increase the accuracy of emotions being displayed. We also plan to improve the accuracy of classification model by exploring newer NLP and ML based approaches that could classify the tweets based on emotions. Furthermore, the existing portal could be improved to display mood of population with more specificity, which could include districts and cities of each state. Sentiment analysis of nationwide lockdown due to covid 19 outbreak: Evidence from india the eyes of the beholder: Sentiment and topic analyses on social media use of neutral and controversial terms for covid-19 Analyzing emotions in twitter during a crisis: A case study of the 2015 middle east respiratory syndrome outbreak in korea Preparing for pandemic influenza and its aftermath: mental health issues considered. International journal of emergency mental health An argument for basic emotions Neurobiology of fear and specific phobias The role of mental health home hospitalization care during the covid-19 pandemic Assessment of public attention, risk perception, emotional and behavioural responses to the covid-19 outbreak: social media surveillance in china. Risk Perception, Emotional and Behavioural Responses to the COVID-19 Outbreak: Social Media Using twitter and web news mining to predict covid-19 outbreak Chinese publics knowledge, perceived severity, and perceived controllability of the covid-19 and their associations with emotional and behavioural reactions, social participation The impact of covid-19 epidemic declaration on psychological consequences: a study on active weibo users What have we learned about communication inequalities during the h1n1 pandemic: a systematic review of the literature Understanding the perception of covid-19 policies by mining a multilanguage twitter dataset First covid-19 suicide case in bangladesh due to fear of covid-19 and xenophobia: possible suicide prevention strategies Leveraging high-volume twitter data to understand public sentiment for the covid-19 outbreak. medRxiv An analysis on the panic of filipinos during covid-19 pandemic in the philippines Analyzing situational awareness through public opinion to predict adoption of social distancing amid pandemic covid-19 The psychological effects of quarantining a city Coronavirus on social media: Analyzing misinformation in twitter conversations Public responses to the novel 2019 coronavirus (2019-ncov) in japan: mental health consequences and target populations The neurocircuitry of fear, stress, and anxiety disorders The role of public trust during pandemics Using social and behavioural science to support covid-19 pandemic response Effective health risk communication about pandemic influenza for vulnerable populations Global health estimates 2016: Disease burden by cause, age, sex, by country and by region Chinese public attention to covid-19 epidemic: Based on social media. medRxiv Bihar 169 32 1491 464 48 1011 1580 4795 528 127 4 Goa 469 52 2955 1017 97 2154 3355 10099 7 7 0 Punjab 857 122 4825 1680 253 3379 5136 16252 1232 128 23 Uttar 392 14 2344 607 45 1143 2551 7096 2766 802