key: cord-0656066-6prsigso authors: Suh, Jina; Horvitz, Eric; White, Ryen W.; Althoff, Tim title: Population-Scale Study of Human Needs During the COVID-19 Pandemic: Analysis and Implications date: 2020-08-17 journal: nan DOI: nan sha: 9738d5a34c4bed82e4e400404d7b5e97ef770431 doc_id: 656066 cord_uid: 6prsigso Most work to date on mitigating the COVID-19 pandemic is focused urgently on biomedicine and epidemiology. However, pandemic-related policy decisions cannot be made on health information alone but need to take into account the broader impacts on people and their needs. Quantifying human needs across the population is challenging as it requires high geo-temporal granularity, high coverage across the population, and appropriate adjustment for seasonal and other external effects. Here, we propose a computational framework, based on Maslow's hierarchy of needs, that can characterize a holistic view of relative changes in needs following the pandemic through a difference-in-differences approach that corrects for seasonality and query volume variations. We apply this framework to characterize changes in human needs across physiological, socioeconomic, and psychological realms in the US, based on more than 35 billion search interactions spanning over 36,000 ZIP codes over a period of 14 months. Our analyses reveal that the expression of basic human needs has increased exponentially while higher-level aspirations declined during the pandemic in comparison to the pre-pandemic period. In exploring the timing and variations in statewide policies, we find that the duration of shelter-in-place mandates significantly influenced social and emotional needs. We demonstrate that potential barriers to addressing critical needs such as support for unemployment and domestic violence can be identified through web search interactions. Our approach and results suggest that population-scale monitoring of shifts in human needs can inform policies and recovery efforts for current and anticipated needs. Many of the existing studies and datasets of the COVID-19 global pandemic focus on the biomedical and epidemiological aspect of the case and fatality rates, including efforts in detection, infection propagation, therapeutic intervention, and vaccine design, with a gaze fixed on the virus and illness it causes. However, despite their general effectiveness on specific goals of mitigating the spread of infection [60] , pandemic-related policy decisions and investments cannot be made on health information alone. Challenges span societal (e.g., disparities [14] ), economic (e.g., unemployment [8, 15] ), and psychosocial (e.g., stress, anxiety [46] , loneliness [56] ) realms. Recent work has called for identifying and understanding the multi-level system of humans needs and well-being for pandemic response and recovery strategies [49] . Our goal is to better understand the influences of the pandemic and associated policy decisions on a multitude of human needs. We hope that new insights about rising human needs can guide valuable refinements of policies and motivate the development of new interventions, programs, and investments. Quantifying human needs across the population is important, but challenging as it requires innovative, ethical, privacy-preserving approaches to collecting and assessing them with high geo-temporal granularity and coverage. A standard way to assess human needs is through survey-based measures [23] . Surveys can be costly and time-consuming to conduct at large scales. They are difficult to manage across time and geographies when the desire is to provide fine-grained analyses longitudinally and in near real-time to be able to understand and react. Other possible approaches to accessing this information involve passively observing human behaviors, grounded in the fact that physical and psychosocial needs motivate human behaviors to express and fulfill those needs when they are unmet [38] . For example, in market research, historical purchase behaviors are used to predict future consumer needs [1] , but this approach also relies on smaller-scale methods and is often limited to consumer and commercial interests. Ecommerce platforms (e.g., Amazon marketplace) or specialized service providers (e.g., Talkspace for mental health, Coursera for education) may have access to large-scale, real-time analysis of customer behaviors, but they are often myopic to a specific need. Publicly available social network data, such as Twitter, have been used to characterize needs [4, 63] , but these studies examine a subset of needs from a dataset that only portrays externalized behaviors. Such fragmentation of data presents a limitation in capturing broader expressions and comparisons across all aspects of human needs beyond specific platforms. Here, we address these limitations in obtaining signals about human needs by observing the behaviors of people through their everyday interactions with a web search engine. Human behaviors, through which human needs are expressed or fulfilled, often involve seeking information or obtaining tangible support or material items, for which web search has been an integral component. Thus, web search logs provide a unique lens into human needs via providing signals about human behaviors in their natural state, at large scale, and on already routinely collected data. We propose a computational framework built on constructs and theories of human needs by Maslow [38, 39] and Max-Neef [41] . We characterize post-pandemic changes across a full spectrum of fundamental human needs, spanning five broad human needs categories-Self Actualization, Cognitive, Love and Belonging, Safety, and Physiological-and 79 subcategories of these needs. We apply this framework to a dataset of 35+ billion search interactions across 36,000+ ZIP codes in the United States and over 14 months (7 months in 2019 and 7 months in 2020) to map search query strings and click interactions to human needs, resulting in over three billion expressions of human needs (Sec. 3). We demonstrate how this approach enables the examination of shifts in fundamental human needs based on disruptions induced by the pandemic. Concretely, our contributions include: • We propose a new computational framework for characterizing a holistic view of human behaviors, intents, and unmet needs based on web search logs and human motivational theories (Sec. 4). • Our framework leverages a difference-in-differences approach [32] to quantify the impact of the pandemic and its associated policies on the relative changes in needs while controlling for seasonality and external factors (Sec. 4.4). • We present the first population-scale analysis across a holistic set of human needs during the COVID-19 pandemic in the United States through the use of web search logs (35+ billion search interactions for 14 months on 86% of US ZIP codes; Sections 5-8). • We find that search interactions in pursuit of basic human needs (i.e., Physiological, Safety) have exponentially increased during the pandemic while several higher-level aspirations (i.e., Self Actualization, Cognitive) have declined (Sections 5, 6). • We observe geographical differences in how differing statewide shelter-in-place policies are associated with short-term and longterm changes in social and emotional needs (Sec. 7). • We demonstrate that potential barriers to accessing critical resources, in support of people facing unemployment or domestic violence, can be identified through search interactions combined with external data sources (Sec. 8). Our work suggests that signals from web search logs can be used to characterize and to monitor over time human needs at a population scale. Our findings also emphasize the importance of tracking broad sets of human needs in combination with other reported measures to identify gaps in our current understanding of challenges and support, to measure the impact of policy changes, and to design policies and programs that meet the human needs. Quantifying Human Needs. Theories about basic human needs have been studied for close to a century [47] . In particular, Maslow's hierarchy of needs [38, 39] has been applied to many different domains [12, 43, 57, 63] , despite criticisms of the validity of his theory [58] . These theories provide a holistic understanding of human needs which is increasingly relevant during the pandemic [49] . However, most studies use survey-based methods to measure human needs [12, 40, 42, 44, 54] . Others have applied human needs theories in computational social science [4, 36, 63] , but they focus deeply on specific topics (e.g., consumer behavior, well-being) and leverage publicly available social network data. In contrast, we introduce a computational methodology to extract the full spectrum of human needs at population scales, which is critical for aligning policies with societal needs that they are intended to support. Web Search Logs for Human Needs Analysis. In addition, our work uses search interaction data which captures more natural observations of human needs [19] . Web search logs have been used to understand human behaviors across many different domains [5, 25, 59, 61] , time [6, 7, 22, 45] , location [50, 62] , and to predict the now and the future [13, 20] , but prior studies typically focus on a single aspect of human well-being. Google Trends APIs have helped to stimulate a prolific range of research in the context of the COVID-19 pandemic for physical [35] , psychological [55] , and socioeconomic [2, 27] well-being that operate on highly normalized and aggregated data. In distinction, our work leverages finegrained geospatial comparisons across 79 need subcategories and a difference-in-differences framework, which allows for controlling confounders and understanding national coverage with flexible geotemporal normalization and aggregation. We also improve detection of needs by leveraging click interactions. We collected a dataset containing a random sample of deidentified search interactions from the first seven months of the years 2019 and 2020 obtained from Microsoft's Bing search engine. For each search interaction, we collected the search query strings, all subsequent clicks from the search results page (e.g., clicked URL), and time and ZIP code location of the search interaction. The resulting dataset contains 35+ billion search interactions and represents the web search traffic of over 86% (36,667/42,632) of US ZIP codes associated with at least 100 queries per month so as to preserve anonymity ( Table 1) . All data was deidentified, aggregated to ZIP code levels or higher, and stored in a way to preserve the privacy of the users and in accordance to Bing's Privacy Policy. Our study was approved by the Microsoft Research Institutional Review Board (IRB). Considering potential threats to validity, we examined the dataset from three perspectives: coverage of the population, analysis of selection bias, and reliability of the trends. Analyzing National Representation. To understand how much of the US population is represented by the collected data, we obtained demographics data from the Census Reporter API [10] . The demographics of the ZIP codes in our dataset closely matched the US population demographics (Appendix A.1). Although query volumes are not uniformly distributed across these ZIP codes, the vast majority of the ZIP codes is included in our dataset. We leverage location information for our analysis when geographical differentiation is necessary. Analyzing Selection Bias. We sought to understand potential biases in socioeconomic circumstances that would influence the usage of Bing search engine. We leveraged deidentified client id as a proxy for a unique user to estimate the 'client rate', or how much of the population in a ZIP code is using the Bing search engine. We examined the correlation between the client rate and various demographic factors. Although unsurprisingly the factors describe some of the variance, none were correlated more strongly than r =-0.058 (% Housing Owned). This suggests that the dataset is not strongly biased towards any single demographic (Appendix A.2). Trends. Many Americans use other search engines such as Google. Therefore, we compared search trends for Bing with data available via the Google Trends API 1 for the same time period and for specific keywords in each need category. We found that the search trends are highly similar with a median Pearson correlations of 0.96 (min=0.45, max=0.98, all p<0.001). This implies that our findings are not simply an artifact of using one search engine over another (Appendix A.3). We draw inspiration from the widely used Maslow's hierarchy of fundamental human needs [38, 39] to tag each search interaction with one or more of five broad categories of needs. Safety and Physiological are considered as 'basic' needs. Love and Belonging, Cognitive, and Self Actualization are often considered to be 'psychological' needs, and Cognitive and Self Actualization are considered as 'growth' needs, defined in more detail below: Self Actualization needs are about realizing personal potential, seeking personal growth, and self-fulfillment. Topics include: hobbies; parenting; wedding; talent acquisition; goals; charity. Cognitive needs are about pursuing knowledge and intelligence through learning and discovering. Topics include: online education, learning materials; educational degrees; cognition; memory; focus. Love and Belonging are social and emotional needs and include emotionally-based relationships such as friendship, family, dating, sexual intimacy. Topics include: mental health or emotions; social network or activities; relationships, dating, divorce or breakup. Safety needs stem from our desire to seek order, stability, and protection from elements in the world. Topics include: personal protection; finances; banking; job search; unemployment; housing. Physiological needs are the basic animal needs such as air, food, drink, shelter, warmth, sex, and other body needs. Topics include: health; food and groceries; basic staples; sleep; transportation. To understand the nuances of the needs, we further subdivided the five main categories into 79 subcategories. Several researchers independently brainstormed and categorized subcategories which were combined and resolved collaboratively through consensus meetings. We resolved remaining disagreements by closely following the theories of human needs [38] . Appendix A.4 describes the process in detail and presents the full taxonomy of our need categories, including example queries and/or clicked page URLs. A search interaction can be an observation of a human need in two ways: (1) an expression of a potential satisfier (physical or information) for that need, or (2) a direct expression of the deficiency or satisfaction of that need. For example, a search query for 'bandages' with a subsequent click on 'amazon.com' could indicate a purchase intent that satisfies a Physiological need. We require the additional click into one of many e-commerce domains to solidify that this interaction is a purchase intent. Information search about 'online games with friends' could satisfy a Love and Belonging need. A need (satisfaction or deficiency thereof) could be directly expressed in experiential statements such as 'I feel depressed' (Love and Belonging). We match each search interaction to a corresponding need subcategory through simple detectors based on regular expressions and basic propositional logic. Each need subcategory could have multiple regular expressions applied to either the query string, the clicked URL, or both, depending on the complexity of the expression and the need subcategory. Similarly to need categorization, we arrived at these regular expressions based on our data through several collaborative consensus meetings until we were satisfied with precision and recall (Appendix A.4). Overall 9.1% of our sample of queries or 3.2 billion instances matched at least one of the need categories. Each search interaction can satisfy multiple human needs [41] , so we allowed each search interaction to be tagged by multiple need categories (this only represents 0.32% of all queries). We then aggregated matched search interactions across need categories and subcategories, time (e.g., day, week), and geography (e.g., ZIP code, county, state). Figure 1 illustrates these steps in detecting and processing of human needs. Our goal is highly precise detection of a large number of needs across a broad set of categories. Precision. We sampled 1.2 million search interactions that matched at least one need category as a candidate set. From this sample, we randomly chose 100 unique tuples of search query string and clicked URL (e.g., '15 lb dumbbells' and click on 'walmart.com') for each of the five high-level need categories, for a total of 500 unique tuples representing 1,530 search interactions in our evaluation set. We selected unique tuples to avoid duplication in labeling, but we mapped the labels back to the original 1,530 search interactions to compute precision on the distribution of the source evaluation set. We then collected human labels for each tuple via Amazon's Mechanical Turk, where each tuple could be tagged with none, one, or more of the five needs categories. All labels and predictions have Boolean values with no ranking among needs categories. Upon inspecting the label quality, we found common systematic label errors such as labeling 'recipe'-related queries as Physiological or Cognitive needs, 'divorce'-related queries as Physiological needs, or visits to specific government unemployment information sites as Physiological needs, where the workers placed the label in a completely incorrect category according to the definitions we specified in the task. Other errors were due to inherent ambiguities in search. For example, 'rent in florida coronavirus' is tagged as Physiological for 'coronavirus' but not as Safety for 'rent' because our high precision detector requires more qualified keywords such as 'apartment rent.' Although the worker tagged this as Safety (i.e., rent for shelter), the use of the word 'rent' here may not be shelter-related. We took a conservative approach of only correcting definitive label errors and not ambiguous errors, and our evaluation set achieved a precision of 97.2%, using the example-based precision metric defined in [64] for multi-label classification. Recall. Although it is infeasible to ensure a perfect recall across a massive dataset, it is important that we capture a significant number of needs. We find that 9.1% of our search interactions match at least one need category. While this recall is significant and led to more than 3.2 billion detections of need expressions, we note that high recall is not necessary for an unbiased analysis approach, because we conduct a fair comparison among the outputs of the same detectors across pre-and post-pandemic periods. We did not find that the exact expressions of needs varied drastically within our dataset that would indicate any temporal bias. We also investigated whether our need expressions were dominated by a few categories. Clicks to Youtube or Facebook dominated, but still only represented 1% of our dataset. We categorized visits to these sites based on their primary functions (i.e., Facebook for social networking and Youtube for media consumption). We found that our results were robust, whether or not we included these high-traffic sites in our dataset. Our goal is to quantify the change in human needs during the pandemic relative to the pre-pandemic period. This can be challenging due to potential confounding effects of yearly seasonal variations, weekly seasonal variations, and variations in query volume over time. Conceptually, we control for yearly seasonal effects through comparisons with the previous year, for weekly seasonal effects by matching the day of the week between both years (i.e., Mon Jan 6, 2020 is aligned to Mon Jan 7, 2019), and by considering relative proportions of the query volume represented by each need over time. Formally, we follow a difference-in-differences methodology [18, 32] , commonly used in economics, to account for confounding effects between comparison groups. Finally, our R e l a t i v e C h a n g e i n Ne e d s i n L o g 2 S c a l e ( b e t we e n p r e -p a n d e mi c b a s e l i n e a n d e a c h d a y ) A c t p a s s e d adjusted effect size is the logarithm of the ratio between two groups. This is effectively the difference-in-differences approach applied to the logarithmic effect sizes and has the advantage of the effect sizes having symmetric properties (i.e., ∆(t 1 ; t 2 )=−∆(t 2 ; t 1 )) [16, 26] . This step allows for appropriate comparison of effect sizes across both increases and decreases in need. Our estimate of the relative change in human need C between two time periods is defined as where E(t 2020 2 , n) is the expression of need n at some time t 2020 2 in 2020 (i.e., after the pandemic declaration) and E(t 2020 1 , n) is the expression of need at t 2020 1 (i.e., before the pandemic). Across all following analyses, we choose the mean daily expression of needs between Jan 6 to Feb 23, 2020 as the 'pre-pandemic baseline', referred to throughout the paper, and dates on or after Mar 16 as the 'post-pandemic period' because individual states declared the state of emergency at different times (Feb 29 to Mar 15). We then compute the 95% confidence interval on this multiplicative effect size by using bootstrap resampling with replacement (N=500). We report mean estimates and p-values throughout the text and 95% confidence intervals in all figures and tables where applicable. We first consider how human needs change over time across the US in the context of major events surrounding the pandemic. We compute the daily relative change in the expressed needs in comparison to the pre-pandemic baseline, as described in Section 4.4. We perform this computation for the duration of our entire dataset, giving us per-day relative changes in all need categories and subcategories. For each inflection point and major national event, we examine need subcategories with the highest relative changes to understand which contribute the most to the overall need. Elevated Needs and Contributing Subcategories. Figure 2 illustrates daily relative changes of needs on a log scale, where zero indicates no change. Overall, we see that all need categories were at elevated rates during March through May relative to the earlier months. A few of the local inflection points correspond to US national events, such as the declaration of national emergency on Mar 13 or the first stimulus checks being deposited on Apr 11. Physiological needs start to increase first around February ( Fig. 2A) , dominated by health condition related queries (C=1.46 on Feb 29) and subsequently by toilet paper purchase and health measurement equipment purchase (C=1.14, 0.76 on Mar 6 respectively; Figure 2B ). Around Mar 16, Physiological needs peak at over 3.8 times the baseline (2 1.91 ; Figure 2C ). Following national emergency declaration (Mar 13) and mandated lock downs (first on Mar 21), we see a sharp increase in Cognitive needs (Fig. 2D) , dominated by educational site visits and online education queries (C=1.97, 1.42 on Mar 23). Self Actualization needs peak around Apr 11 (Fig. 2E) , dominated by cooking site visits and cooking related queries (C=1.73, 1.15), and online social activities queries and social technology uses dominate Love and Belonging needs (C=1.77, 1.72 on Apr 11). A sharp spike of Safety needs can be seen shortly after the first stimulus checks were deposited: stimulus related queries, state unemployment site visits, COVID-19 protection purchase dominate Safety needs (C=8.17, 5.49, 5.12 on Apr 18; Figure 2F ). While other needs start to trend downwards or stabilize throughout much of May-July, Physiological needs increase for a second time with additional interests in health conditions, followed by Safety needs with queries related to economic stimulus and loans (Fig. 2G ), which aligns with the rise of COVID-19 cases in the US around Jun 6 2 . Shifting of Needs. Based on the severe health impacts of the COVID-19 pandemic, we expected to see and confirmed that Physiological needs dominate throughout our dataset, as COVID-19 is still a major US public health issue at the time of this writing. At a glance, we see two instances of the surge in Physiological needs followed by a subsequent increase in Safety needs. As Physiological concerns rise, public health responses (e.g., business closures or restrictions) could potentially induce instabilities in Safety needs, and this observation needs to be further investigated. We see basic needs expressed before other needs consistent with the hypothesis by Maslow [38] and observations by others [54] . We also expected to see a decrease in the expression of growth needs (Self Actualization and Cognitive) as people's attention shifts toward basic needs. However, both Cognitive and Self Actualization needs increased overall, with the increase in Cognitive needs being more temporary and Self Actualization being more sustained. Despite health and economic concerns, interests in recreational activities or hobbies (e.g., cooking, gaming) contribute to this steady 23% (2 0.3 -1=23%) increase in Self Actualization needs. Further research into the temporary nature of Cognitive needs and the long-term impact of such sustained interest in Self Actualization is necessary. We also note that the peak in Physiological needs occurs around four weeks before the peak in Safety needs (between Figure 2C and F), while the second set of peaks are a few days apart (Fig. 2H ). This could be an indication of phenomena like resilience or endurance from economics and disaster management that requires further examination [21, 37] . Next, we examine individual need subcategories that present the largest increase or decrease in search expressions, possibly due to the pandemic impact. To explore these two ends of the spectrum, we compute the mean relative changes in needs during the initial four weeks during the post-pandemic period (Mar 16 to Apr 12) compared to the pre-pandemic baseline as described in Section 4.4. We then examined the top 12 need subcategories with the largest increase or decrease in the relative change. Heightened Physiological and Safety Needs. Table 2 shows that 11 of 12 need subcategories with the most increase fall under Physiological and Safety needs as seen from the temporal trends in Section 5, and one (online social activities queries) belongs to the Love and Belonging need category. Toilet paper purchase reached a maximum increase of 127 times the pre-pandemic baseline (2 7 -1=127 on Mar 16) . Recall that these are not just queries containing 'toilet paper', but purchase intents as indicated by subsequent clicks to e-commerce sites (Sec. 4.2). Such a high level of interest in toilet papers is commonly attributed to panic buying due to the supply scarcity [28] and media coverage [24] . Stimulus related queries, including general terms like 'loan forgiveness, ' reached an even higher maximum increase of 287 times the baseline (2 8.17 -1=288 on Apr 18) and is sustained at that high level through July, reflecting the magnitude of the pandemic's impact on the US economy. When we examine the daily trends, we see that COVID-19 protection purchase exhibits a small peak (C=1.0 on Jan 29) after the first reported COVID-19 case (Jan 20), and the needs quickly escalate from Feb 20 (Fig. 3B) , at least three weeks earlier than other needs (Fig. 3A ,C,D) that do not escalate until the national emergency declaration. A subsequent peak on Apr 3 (C=5.6) coincides with CDC's updated recommendation on cloth-based mask use. We also find that indicators of social-economic instabilities such as unemployment site visits and food assistance related queries still have not returned to their baseline levels (Fig. 3A,C) , arguably because the pandemic is still in effect. Online social activities queries follow a similar pattern (Fig. 3D) , reflecting the need to satisfy lock-down induced social isolation through online services, but it also raises the question of potential permanent shifts in ways of satisfying social needs. Our findings revealed that only a few of these needs have returned to the pre-pandemic baselines while many of them are sustained at elevated rates. Shifts Away from Positive Outlooks. Table 3 shows the most decrease in the expression of several Self Actualization and one Cognitive need subcategories Table 3 . Specifically, indications of Self Actualization needs for partnership have declined by more than 64% of their baseline (2 −1.49 -1=−64.4%) throughout the typical wedding season around Spring and early Summer, which is expected given restrictions on large gatherings. In addition, needs that are typically associated with growth, positive outlook, or new opportunities have taken a large toll. In Self Actualization, queries about life goals also see a large decline. In other need categories, needs expressed by educational degree related queries, job search related queries, job search site visits, or housing related queries have declined by over 34% (2 −0.61 -1=−34.5%) of their baseline. Upon inspection of daily trends, expressions of forward-looking needs have decreased and remain below the pre-pandemic baseline (Fig. 3E,F,G) . The sustained decline in job search related queries (Fig. 3G ) juxtaposed with near 30 times increase in unemployment needs (Fig. 3A) is a troubling evidence of the declining labor force as seen in other studies [15] . Indications of growth interests in educational degrees or life goals have not recovered at all (Fig. 3F) . These results combined with heightened Physiological needs suggests a shift of focus away from individual growth. In addition, divorce related queries exhibited a maximum of 47.3% decline (2 −0.93 -1=−47.3%; Figure 3H ), possibly reflecting the challenges that families face in proceeding with divorce during the lock downs [31] . Therefore, underlying mechanisms for these shifts and long-term impact of the lack of growth needs or support for relationships should be further studied. See Appendix A.5 for corresponding figures on all 24 subcategories. So far, we have focused on changes aggregrated at the US national level. We shift our attention to how the pandemic and its related policies differentially influence local subpopulations. We use a set of statewide policies 3 readily available through the COVID-19 US State Policy Database [48] . In particular, we examine the impact of the shelter-in-place mandate (its duration and effective date) on social-emotional and relationship needs (i.e., eight Love and Belonging and two wedding-related Self Actualization subcategories, see Appendix A.4). We hypothesized that social isolation induced by longer shelter-in-place mandates generates more expressions of social-emotional needs. We used the date on which the shelter-in-place mandate was enacted and relaxed or lifted for 38 states to derive the duration of the mandate 4 . We compute two relative changes in needs for each state. First, to understand the short-term impact of the shelter-in-place mandate, we compute the relative change in needs for two weeks after each state's shelter-in-place mandate compared to the one week before the mandates 5 . Second, to understand the long-term impact of the mandate, we compute the relative change in needs between the pre-pandemic baseline and the last four weeks of our dataset (Jul 6 to Aug 2), which is at least two weeks after the last state lifted its mandate (Jun 19). To quantify the potential impact of these mandates, we ran a Pearson correlation analysis (1) between the start date of the shelter-in-place mandates (i.e., ISO day number) and the short-term relative changes in needs, and (2) between the duration of the mandates and the long-term relative changes in needs. Early Adjustments to Social Needs. The start date of shelterin-place mandates ranged between Mar 21 and Apr 7. We find 3 Although our dataset allows ZIP code level analysis, a comprehensive list of local policies across the US are difficult to obtain at ZIP code, city, or even county levels. 4 Only 38 states had the start and end dates in the dataset as of this writing. 5 We chose one week before the mandate because it is the maximum number of full weeks after the declaration of national emergency and before the earliest start date (Mar 21) for any shelter-in-place mandate. Figure 4 : Relationships between shelter-in-place start date and short-term relative changes in mental health and online social activities needs between 1 week before and 2 weeks after shelterin-place mandates (A-B), and between shelter-in-place duration and long-term relative changes in mental health and wedding needs between pre-pandemic baseline and 4 weeks in July (C-D). that the relative changes in online social activities queries (r =−0.53, p<0.001), wedding site visits (r =0.48, p=0.002), wedding related purchase (r =0.43, p=0.006), and mental health resource site visits (r =0.47, p=0.003) needs were most correlated with the start date. People from states that have earlier shelter-in-place mandates expressed significantly reduced interests in weddings and mental health site visits (Fig. 4A) and significantly more need for online social activities (Fig. 4B ). For example, in the two weeks after the mandate, people in New Jersey (mandate on Mar 21) sought online mental health resources 25.7% less than the week before the mandate, while people in South Carolina (mandate on Apr 7) sought those resources 13.4% more than the week before. Long-term Social Impact. When we examine the long-term changes in expressed needs, similar social-emotional needs surface but in different ways. We find that changes in negative mental health experiential queries (r =0.42, p = 0.010, Figure 4C ), wedding site visits (r =−0.37, p=0.022, Figure 4D ), and wedding related purchase (r =−0.35, p=0.033) were most correlated with the duration of programs around sheltering and closures. The duration of the shelter-in-place mandates were highly correlated with the start date (r =−0.62, p<0.001), indicating that people faced with earlier mandates are also impacted by longer mandates. As we discovered above, these people likely suppressed their needs for weddings or mental health support early during the pandemic. At the same time, as wedding needs slowly recover to the pre-pandemic baseline level (Fig. 3E) , those impacted by longer mandates are even slower in their recovery of wedding needs and are more likely to express negative mental health issues. For example, people in Mississippi (24 days of shelter-in-place) express 33.2% less negative mental health experiences than before the pandemic while people in Oregon (88 days) express 27.2% more negative mental health experiences than before (Fig. 4C) . Others attribute this increase in negative emotions during the pandemic to the shift to basic needs [12] . Per-state analysis of this shift along with differential prevalence of COVID-19 may be necessary to understand the mechanisms for this increase in negative emotions and to provide appropriate mental health support. We have proposed a framework for quantifying expressions of human needs and tracking their changes in response to major events. As we have demonstrated so far, there are many needs that are well expressed by web search interactions such as purchasing goods online, looking for health information online, or accessing economic assistance through government facing websites. Our final analysis examines a gap between how web search facilitates the expression of these needs and the reported fulfillment of these needs in order to discover potential barriers to accessing critical resources. To demonstrate how our approach allows for exploring these barriers, we specifically look at two need subcategories: unemployment and domestic violence. Expressed Unemployment Needs vs. Reported Claims. We obtained weekly, seasonally adjusted initial unemployment claims for 2020 from the US Department of Labor 6 and computed the relative changes in unemployment claims using the same approach as in Section 4.4. We compare the change in unemployment claims with the change in expression of unemployment needs in the dataset and find that these two changes are closely aligned (Pearson r =0.996, p<0.001), approaching over 32 times the baseline (2 5 ) at the end of March (Fig. 5A ). In addition, we see that the expression of unemployment needs remain at a higher rate than the reported claims. This discrepancy corroborates with known issues with the unemployment benefits: many people were confused about the benefits or the waivers (e.g., job search requirement) and were being denied, having to search for more information or file multiple applications [9, 51] . Although our analysis excludes those who use traditional methods for filing unemployment claims (e.g., mail, phone), our results indicate that web search is a critical resource that facilitates unemployment needs and highlight a potential gap in satisfying unemployment needs which requires further investigation. Expressed Domestic Violence Needs vs. Reports. One of the dire consequences of the COVID-19 pandemic is an increased risk of domestic violence due to shelter-in-place mandates, where the risk is further increased by physical, financial, and socialemotional stressors as well as increased alcohol consumption at home [11] . News outlets have reported increased cases of domestic violence based on fragmented data points from select cities [30, 33] . In contrast, our analysis shows that the expression of needs for domestic violence queries has dropped by nearly 36.7% (2 −0.66 -1=−36.7%) since the pandemic and stabilized at around 15.9% (2 −0.25 -1=−15.9%) below pre-pandemic levels (Fig. 5B) . In fact, some cities have reported decreased number of domestic violence incidents with victims unable to reach out for help or seek resources under constant surveillance by the offenders at home and fearful of exposing themselves to the virus at a shelter [33, 52] . Our data also shows that interests in firearm purchase have increased by over 40% (2 0.5 -1=41.4% (Fig. 5B) , which corroborates with other reports of increased gun sales and gun violence [29, 53] . Given that firearm access is highly linked to fatal domestic violence incidents [34] , others have raised concerns about domestic-violence related homicides during the pandemic [11] . That is, our results frame an urgent question that needs to be resolved through expertise that resides in domestic violence and social work organizations to identify the underlying explanations for our findings, including the possibility that the decline in domestic violence queries amidst reports of increased risks is based in the rise of barriers to communications and resources during the pandemic. We have presented a computational framework based on theories of human needs to quantify the effects of the pandemic and its related events on web search interactions. Although all need categories had elevated changes in needs, we found that basic need subcategories were elevated the most while growth-based need subcategories, indicative of positive outlooks in life, were subdued (Sections 5, 6). We also found that earlier and longer shelter-in-place mandates may come with an unintended impact on mental health needs (Sec. 7). We used unemployment and domestic violence related queries to demonstrate how our methodology could help expose gaps between the expression and fulfillment of needs-and frame questions and directions for urgent investigation (Sec. 8). Limitations. We cannot use our methodology to make causal claims. If other major, concurrent events (e.g., Black Lives Matter) have significantly influenced human needs during the observations window, we are not be able to distinguish between needs caused by the pandemic or these events. However, note that we could plausibly explain the major changes over time through COVID-19 related events alone (Fig. 2) . In addition, we use the difference-in-differences method along with matched controls for the largest temporal effects in search data such as daily, weekly, and yearly query volume variations. Our analysis relies on heuristics and correct inference of needs from search interactions. But our framework is theory-driven, comprehensive, and achieves 97% precision across five main need categories. Further finetuning of the framework should be motivated by precision-recall requirements of individual analysis goals. Although expressions of needs may not reflect the actual needs or the fulfillment of the needs, the methodology serves as a useful detector for needs, as we have demonstrated through its corroboration with several external findings. In contrast to traditional methods of measuring needs, our approach can operate at population scales relying solely on data that is already routinely collected by web search engines, and thus allows for effective, retrospective, and high-granularity studies with observation periods of over one year. Implications: Resilience and Vulnerability. Our work has demonstrated major shifts in a spectrum of human needs during the pandemic, echoing the need to understand the societal, economic, and psychosocial effects of these events and the underlying system of human needs. Such understanding is essential for guiding programs around outreach and support of people during the current pandemic and preparing for future pandemic responses and recovery efforts [49] . We observed that some needs have returned to their baseline levels while others are sustained at elevated (e.g., health and economic instability) or suppressed (e.g., job search, educational degree) levels (Fig. 3) . As we saw in Section 5, temporal variations on how and when certain needs arise in response to major disruptions could indicate a degree of psychological and economic resilience [21, 37] 7 . Understanding how well a region can endure disruptions is crucial for balancing health risks against societal risks, especially for vulnerable populations who are severely impacted by the pandemic [14] . Our framework could be used to quantify how much a community has been and might be able to endure social and economic distress. For example, a region may be able to withstand prolonged business closures before loan forgiveness needs surge, while another region may show immediate need for financial stimulus. In addition, when coupled with demographic variables, our framework could be used to examine disparities in needs which may help with understandings of the disparate impacts of blanket policies on the most vulnerable populations. Implications: Preparedness and Resources. Our work also revealed strong correlations indicative of a long-term mental health impact of longer shelter-in-place mandate, suggesting that a state with longer duration of shelter-in-place mandates may need to provide additional support for social emotional well-being (Fig. 4) . We also saw potential barriers to accessing critical resources for unemployment and domestic violence; a decrease in domestic violence queries in spite of conflicting anecdotes should raise questions and an alarm (Fig. 5) . These examples suggest that expressions of needs through web interactions could highlight resource deficiencies or barriers. As regions prepare recovery efforts from the current pandemic or make plans for future disruptions, the methodology we describe can be harnessed to monitor or anticipate a spectrum of needs at various geotemporal granularities; this could help to reveal associated impacts of the pandemic (e.g., mental health [17] ) and to cater the appropriate interventions that meet individual and societal needs. Future Directions. We see this work as a step towards achieving a more holistic understanding of the multiple influences of the pandemic and associated policies and programs. While we focused on retrospective analyses, our approach has the potential to be used in near real-time to monitor human needs and support future policy decisions. Understanding the disparate impacts of the pandemic and its policies on a full spectrum of human needs, especially for vulnerable populations, is critical for designing response and recovery efforts for major disruptions. Our work highlighted future research opportunities and calls for action and collaboration across epidemiology, economy, social sciences, risk management, and more. We look forward to further refinement of this framework and hope the work will encourage discussions on the broad spectrum of human needs during a global crisis. To understand how much of the US population is represented by the collected data, we obtained demographics data from the Census Reporter API [10] for ZIP codes represented in our dataset. Census Reporter API provides demographics data for 32,989 US ZIP codes, and not all of the ZIP codes in our dataset have available demographics information through this service. Between our dataset and the demographics data, we have 96.4% overlap of ZIP codes, representing 97.5% of total queries in our dataset. We sought to understand potential biases in socioeconomic circumstances that would influence the usage of Bing search engine. We leveraged deidentified client id as a proxy for a unique user to estimate the 'client rate', or how much of the population in a ZIP code is using the Bing search engine. We examined the correlation between the client rate and various demographic factors at the ZIP code level (e.g., income, race, age, gender, education, housing, internet access). Although unsurprisingly the factors describe some of the variance, none were correlated more strongly than r =-0.058 (% Housing Owned). Table A2 summarizes Pearson correlation statistics. These results suggest that our dataset is not strongly biased towards any single demographic. Although our analysis relies solely on Bing search data, many Americans use other search engines such as Google. Therefore, we compared search trends for Bing with data available via the Google Trends API for the same time period and a subset of keywords. We chose to use select keywords from each need category, rather than applying our needs aggregation pipeline (described in Section 4.2), because the Google Trends public API does not support regular expressions or access to the click interactions. We chose 9 keywords that were representative of their respective need subcategory (i.e., 'online learning' captures online education queries), had a significantly larger query volume compared to other keywords in the need categories (i.e., 'hand sanitizer' had more query volume than 'mask'), or may have a seasonal effect (i.e., query volume for 'tax' could depend on the tax season). We conducted a correlation analysis on a moving average of a full week to account for timezone differences between the two data sources. Visual inspection of both Google and Bing trends confirm that search patterns across these two search engines are remarkably similar ( Figure A1 ), and Pearson correlation coefficients are very high with a median of 0.96 (min=0.45, max=0.98, all p<0.001). Table A3 summarizes Pearson correlation outputs. These results imply that our findings are not simply an artifact of using one search engine over another. We draw inspiration from Maslow's hierarchy of fundamental human needs [38, 39] to tag each search interaction. Of the eight toplevel human needs from Maslow's expanded hierarchy of needs, we omit Esteem, Aesthetics, and Transcendence because they are difficult to operationalize from observational data alone. We categorize search behaviors into one or more of five broad categories of needs. Safety and Physiological are considered as 'basic' needs. Love and Belonging, Cognitive, and Self Actualization are often considered to be 'psychological' needs, and Cognitive and Self Actualization are considered as 'growth' needs. We obtain further granularity of the 5 human need categories by decomposing each need category into many sub-need categories. This extra level of granularity has advantages in teasing apart the specific aspects of the need (e.g., shelter vs. finances in Safety needs) and in debugging and refining the detection logic. Table A4 enumerates 79 sub-needs and their examples. For each need subcategory, we had multiple regular expressions applied to either the query string, the clicked URL, or both. This is noted in Table A4 under the 'Logic' column. 'Keyword and domain' logic indicates that there is one regular expression for matching query strings and another regular expression for matching clicked URLs. Both must be matched for the search interaction to be categorized as that need. 'Queries' logic indicates that there is one regular expression for query strings alone regardless of the clicked URLs. 'Domains' logic indicates that there is one regular expression for clicked URLs regardless of the query strings. There were several steps in arriving at these subcategories and regular expressions. We first identified a set of e-commerce websites (1) from market research blog posts and reports that were publicly available online and (2) from enumerating top 100 URL hosts from a sample of search interactions that were tagged as purchase or commerce related using built-in classifiers in Bing. We combined these two data sources to manually curate a list of 61 e-commerce domains. Next, we collected another sample of query strings that led to subsequent clicks to these e-commerce domains as well as page snippets that were displayed on the search result page. Using the Latent Dirichlet Allocation module in gensim, a popular topic modeling tool in python, we trained an unsupervised topic model on the query strings and the associated page snippets for 100, 500, and 1000 topics on 100,000 unigrams. We manually inspected the keywords in the topics to curate 22 topics of over 800 keywords. We also extracted frequent bigrams from the same dataset to expand our keywords if existing unigrams were too ambiguous. These topics represented purchase categories because they were collected from search interactions with subsequent clicks to e-commerce websites. Our next step was to categorize these topics into five main need categories. We used a hybrid card sorting method [3] with 5 participants who merged, split, edited, or created these topics into need subcategories. Then these need subcategories were then placed into the 5 main need categories. Using the outputs from the card sorting activity as a basis, we further brainstormed subcategories beyond purchase categories. Several researchers independently brainstormed and categorized subcategories which were combined and resolved collaboratively through consensus meetings. We referred to the definitions presented in the theories of human needs to resolve any remaining disagreements [38] . Once the need categories and subcategories were identified, we leveraged keywords and domains from manually curated topics and card sort outputs. We also obtained a list of topical domains from publicly available blog posts and articles summarizing recommended websites (e.g., search for best parenting website) or manually curated a list of domains (e.g., search for unemployment benefit page for each state). Again, we independently brainstormed and curated lists of keywords, keyword patterns, and domains for each subcategory and collaboratively iterated on the lists through consensus meetings. These keywords, keyword patterns, and domain URLs were combined into many regular expressions that we used to tag each search interaction. We validated our detection logic by collecting human labels from Amazon's Mechanical Turk, and our evaluation set achieved a precision of 97.2%, as described in detail in Section 4.3. Logic Example A Look at How Home Care Product Claim Preferences Have Shifted Amid the COVID-19 Pandemic Winners and Losers from COVID-19: Global Evidence from Google Search Measuring the user experience: collecting, analyzing, and presenting usability metrics A dataset for psychological human needs detection from social networks Psychomotor function measured via online activity predicts motor vehicle fatality risk Harnessing the web for population-scale physiological sensing: A case study of sleep and performance Influence of Pokémon Go on physical activity: Study and implications Unemployment effects of stay-at-home orders: Evidence from high frequency claims data Most Job Search Requirements for Unemployment Benefits Are Waived. So Why Do State Websites Say Otherwise? American Community Survey 5-year estimates An increasing risk of family violence during the Covid-19 pandemic: Strengthening community collaborations to save lives A nation-wide survey on emotional and psychological impacts of COVID-19 social distancing Predicting the present with Google Trends Racial health disparities and Covid-19âĂŤcaution and context Labor markets during the covid-19 crisis: A preliminary view Statistics notes: percentage differences, symmetry, and natural logarithms Mental Health, Substance Use, and Suicidal Ideation During the COVID-19 Pandemic -United States Methods for evaluating changes in health care policy: the difference-in-differences approach Understanding user behavior through log data and analysis Google it!'Forecasting the US unemployment rate with a Google job search index Psychological resilience: A review and critique of definitions, concepts, and theory Exploring time-dependent concerns about pregnancy and childbirth from search logs Human needs and satisfactions: A global survey The novel coronavirus (COVID-2019) outbreak: Amplification of public health consequences by media exposure Detecting influenza epidemics using search engine query data Expressing relative differences (in percent) by the difference of natural logarithms Effects of social distancing policy on labor market outcomes 2020. Scarcity and Coronavirus Trauma does not quarantine: Violence during the Covid-19 pandemic Police see rise in domestic violence calls amid coronavirus lockdown The Challenges of COVID-19 for Divorcing and Post-divorce Families The estimation of causal effects by difference-in-difference methods Is Domestic Violence Rising During the Coronavirus Shutdown? Here's What the Data Shows Patterns of multiple family homicide Google searches for the keywords of âĂIJwash handsâĂİ predict the speed of national spread of COVID-19 outbreak among 21 countries NeedFull-A Tweet Analysis Platform to study Human Needs during the COVID-19 pandemic in New York State On the notion of regional economic resilience: conceptualization and explanation A Theory of Human Motivation New introduction: Religions, values, and peak-experiences An empirical test of Maslow's theory of motivation Human scale development: conception, application and further reflections Measurement of Maslow's need hierarchy An introduction to the entertainment industry. The entertainment industry: An introduction The relationship among deficiency needs and growth needs: An empirical investigation of Maslow's theory Search and breast cancer: On episodic shifts of attention over life histories of an illness Mental health and the Covid-19 pandemic Basic human needs COVID-19 US state policy database COVID-19 Community Stabilization and Sustainability Framework: An Integration of the Maslow Hierarchy of Needs and Social Determinants of Health Machine-learned epidemiology: real-time detection of foodborne illness at scale Stymied in Seeking Benefits, Millions of Unemployed Go Uncounted Why a Drop in Domestic Violence Reports Might Not Be a Good Sign Gun violence during COVID-19 pandemic: Paradoxical trends Needs and subjective well-being around the world Narrative economics, public policy and mental health Life in the pandemic: Social isolation and mental health Understanding socioeconomic inequalities in food choice behaviour: can Maslow's pyramid help? Maslow reconsidered: A review of research on the need hierarchy theory Mining web query logs to analyze political issues Impacts of state-level policies on social distancing in the united states using aggregated mobility data during the covid-19 pandemic From cookies to cooks: Insights on dietary patterns via analysis of web usage logs Here and there: Goals, activities, and predictions about location from geotagged queries Identifying user needs from social media. IBM Research Division A review on multi-label learning algorithms