key: cord-0063928-sx76tn0b authors: Nisha, Zairu; Kumar, Sachin; Pal, Saibal K.; Singh, Jagvinder; Singh, Ompal title: Insights and reflections using data analytics approach of COVID-19 situation among the Indian states and ethical considerations date: 2021-06-02 journal: Int J Syst Assur Eng Manag DOI: 10.1007/s13198-021-01153-4 sha: a0d0ec16720eeaf9d9df5d3efee4479dd9728039 doc_id: 63928 cord_uid: sx76tn0b COVID-19, a pandemic has suddenly created an unprecedented crisis in the world. COVID-19 has severely affected the lives of all the people around the world by forcing the closure or suspension of almost all socio-economic activities. Some of the countries including developed and developing such as China, the USA, Italy, Spain, the UK, France, Iran and India, etc are comprehensively affected to a deep level by this virus in terms of an incurable disease. Several countries have no options as they were hit by COVID-19 in total surprise and had no case study to follow. But some countries have attempted to face the virus such as China, Italy, Spain, Iran, etc. In this context, India has strict preventive majors in the form of a lockdown strategy besides the loosely executed testing and tracking methods. Despite all policies and decisions implementation, and their stringent execution, it is spread, and now India is in the top three COVID-19 affected countries. The rampant increase of COVID-19 cases, their response, and its impact on mortality, the effectiveness of controlling the situation by States need sharp analysis. The proposed work is an attempt to critically assess and explore the pervasive pandemic situation spread in most of the Indian state utilizing comparative analysis using data science methods contrasting them horizontally. Furthermore, the research work suggests few in-depth insights and recommends directions, steps for the public, government and other stakeholders for better management of the COVID-19 situation. Additionally, the study also deals with some ethical issues involved in the data creation, curation, and reporting on the various platforms used in the research work. COVID-19 is a pandemic; a disease affecting the whole world from small to large extent country after country. The first case of novel virus infection from COVID-19 was reported in China, Wuhan province in December, 2019 (Hui et al. 2019 . Since then, it has spread in Europe, the USA, and Asian and Latin American Countries. COVID-19's induced situation became extremely threatening and WHO declared COVID-19 a world health emergency on 13th March, 2020 (World Health Organization et al 2020). Countries such as China, Italy, UK, Spain and Iran were observed to be badly affected in February and early March 2020. Although In India, the first case of the COVID-19 was reported from Kerala, a southern state in the last of December 2019, the Indian government and its administration could not prepare a plan and strategy for COVID-19 till March 2020. Consequently, the Indian government decided suddenly to impose a severe and stringent lockdown on 25th March to contain the spread of the virus and break the transmission and infection chain 2020 (Sinha and Sinha 2020) . There have been two observations on COVID-19 and its unstoppable infection transmission and adopted measures across the world imposing lockdown. Firstly, there has been no cure for the virus. Although scientists and researchers are working day and night to find the cure for COVID-19 in the form of vaccines. Secondly, the only way to stop the transmission of the virus is to keep a safe distance from the other person. Hence, Countries in a desperate attempt to save lives and control the COVID-19 transmission preferred the lockdown approach. Simultaneously, the administration also pushed for extensive testing of the patient through contact tracing and of those showing the symptoms of COVID-19 such as high fever, coughing, body pain etc (Cohen and Kupferschmidt 2020). The Indian government has also strictly imposed the restriction of going outside except for essential services such as food, health, law and order, and National Security Services. Due to the nature of the economy and its impact on the life of people due to the suspension of economic activities. The International Monetary Fund (IMF) and other international policy institutions have expressed great concerns regarding the job losses and contraction of the gross domestic product (GDP) of the world economy (Fernandes 2020) . Few countries may be hard hit. Furthermore, few countries may go into recession and their socio-economic systems may collapse. Although it is quite clear that the economies in Europe, the USA, China, India, Japan, and Singapore are badly hit from COVID-19 (Loayza and Pennings 2020) . COVID-19 has wide implications starting from economic issues such as unemployment, job losses, salary reduction, etc to personal, ethical and psychological issues about life and the future. It is not wrong to say that post corona's big implications and consequences may force us to change and adapt to the new lifestyle across the world which is sustainable for the present generation and for the future generation. Thus, India's COVID-19 situation needs detailed and thorough investigation and assessment at the state level. India is the second-largest country in terms of population, sixth largest in terms of purchasing power parity (PPP) (Mathur 2020) due to this demographic conditioning, India reported the first covid positive case in Kerala in January, whereas the rest of the country was doing well. The Indian government also observed and witnessed the severity of the situation in other affected countries which gave the policymakers and administration at least two weeks extra time to prepare and to respond effectively. The study has incorporated all the considerations to provide a wide spectrum. Following are the questions which have been addressed and discussed in depth. 1. Discussion on initiation and growth analysis of the pandemic COVID-19 in India. 2. Analysis of growth rate, fatality and recovery rate and death state wise. 3. Analysis of test-related statistics such as testing rate, infection cases, confirmed cases state wise. 4. COVID-19 transmission cases, their trends and analysis state wise. 5. Insights and recommendation for policymakers and administrators. 6. Discussion some of the ethical issues related to the data quality and data collection methods. With this line of thought, the present paper is organised in the following manner; the first section presents the introduction. Section 2 deals with the literature survey. Section 3 explains the research methodology. Section 4 presents results and analysis. Section 5 presents insights and recommendations and Section 7 concludes the study. An unprecedented situation posed by COVID-19 has forced not only the general public but also the scientific community, policymakers and decision-makers to think seriously and get the insights and directions based on objectivity, rationality keeping in mind the welfare of the state and the lives of people at large. Several studies have been conducted to have scientific opinion-based decision making addressing several dimensions of the COVID-19 scenario. Emanuel et al. (2020) have focused on medical resource availability and fair resource allocation. Alhazzani et al. (2020) have addressed the decision making processes and recommended some guidelines for critically ill adults to save their life and better health care. At present the real-time situation is changing rapidly, country by country are coming into the grip (Del Rio and Malani 2020) . Each country has some specific characteristics depending on its population, demographic profile, geographic location, cultural aspects and development patterns. Hence the impact of COVID-19 and its intensity varies. Rodriguez-Morales et al. (2020) analysed Latin American cases with implications after the first reporting of the case in Brazil. Similarly, Gilbert et al. (2020) conducted a preparedness and vulnerability analysis of African countries. It is quite evident that COVID-19 is a new phenomenon and no sufficient past data is available related to a pandemic with countries which they could use for their benefit and for future course of actions. As researchers were able to get the enough data to detect the COVID-19 though many advanced and latest technological developments such as AI and Machine Learning conducted experiments and presented the results to the scientific community in order to assist the decision making. Data science, Artificial Intelligence and Machine Learning have been used in many diverse fields earlier also rapidly growing and having an impact on daily life. For instance, AI have been used in the development of smart cities, energy prediction tool (Kumar et al. 2018a; Kumar and Singh 2019) , intelligent transportation, especially in accident detection system (Kumar et al. 2017 ) and intelligent health management systems (Kumar et al. 2018b Research methodology and data visualisation tools provided directions to get insights and deep horizontal analysis of the COVID-19 situation in Indian states wise. All states and UTs have been compared on several important and relevant parameters to assess the performance and efficiency of states (having some level of autonomous decision making) in monitoring and mitigation of COVID-19 infection transmission, mortality and recovery patterns and trends. Additionally, states weaknesses and advantages have also been incorporated in the result discussions. Moreover, some demographic related data have been made the foundation to reason and support the claims as obtained and observed through data analysis. This makes better sense and gets a deep insight into the prevailing situation to make objective, rational and reliable recommendations. Figure 1 represents the time series of the COVID-19 cases reported in India in all the states and UTs starting from 25th March to 2020 to 25th April 2020, for a one-month duration. Carefully observing the plot signifies that the death rate is very less in comparison to the transmission of the COVID-19. After 25th April, COVID-19's transmission has sped up exponentially. The recovery rate is quite good. In this plot, it is observed that although initial COVID-19 cases were reported in the state, Kerala. But the number of the cases were very small by mid of April 2020. India has not seen the exponential growth of the cases or the community transmission as expected by many researchers in the predictions till the mid-week. This has been the result of stringent lockdown which stops the movement of human beings working as vectors for spreading the COVID-19. The good recovery rate has been attributed to the lifestyle of Indians and BCG vaccination in the 1960s by researchers to ensure a better immunity of the virus in the Indian population (Shivendu et al. 2020 Recovery per death tells that more people recover and fewer are dying after the infection. Haryana is a considered physically healthy State of India due to its food habits and physical strengths and farming occupation. The best recovery rate is in the Himalayan States such as Uttrakhand, Jammu and Kashmir, Himachal Pradesh and northeastern states as compared to states having plain geography. States having plain geography and falling in between the Yamuna and the Ganga have huge populations, high levels of water and air pollution, lack of health cards and educational facilities due to a high degree of competition with fewer resources and corruption. The highest days of hospitalisation have been observed in the Ladakh region due to the extreme weather of the region. In most of the States and UTs, the hospitalisation days are similar that is from 10 days to 15 days. Figure 4 presents the growth of COVID-19 infections in terms of the rate of doubling day wise. More clearly, it is the number of days in doubling the number of COVID-19 cases. This is a very important parameter. During the linear growth, this parameter is less but as the growth rate takes the shape of the exponential function, it reduced drastically. This happens during the community transmission stage when there is no control on the contact tracing and infection transmission and the only way left is to upgrade the health infrastructure and respond to the uncontrollable situation. To prevent this situation, Lockdown has been proposed and implemented. The plot gives the impression that the doubling rate is high in Delhi, Gujarat, Maharashtra, Tamil Nadu and Andhra Pradesh. All these states have substantial levels of industrialisation and migration from other states. These states also have more urbanisation as compared to the other states which have shown a low doubling rate due to their difficult terrain and less economic development. Such observations have inferred that more industrialised and urbanised regions must have better health and residential services to avoid the disease. States which are geographically difficult are better at having a high doubling rate. Here regional planning may also come into the picture if the case of Chandigarh is taken which has quite a high doubling rate and the city is a well developed and urban region of the State, Punjab. Figure 5 puts forward the trends of the testing strategy and methods adopted by each State and UT. The figure compares states and UTs on tests per million, positive cases per million and per cent of confirmed over total tests. It is observed and reported in international media that India's testing rate is very less as compared to other countries taking into consideration its population (IANS 2020). It is even less than that of Pakistan (Mygov WHO 2020). The testing rate measures the number of COVID-19 test cases conducted per million population. In India, Delhi is leading in terms of number of tests per million cases. Delhi is considered a good state in terms of health facilities due to the priority of the Delhi government which has allocated approximately 20 per cent of the economic budget to health and introduced several acclaimed new health facilities and schemes such as new hospitals and decentralisation of health facilities through PolyClinic and Mohalla Clinic. This is one of the reasons for having more cases as more testing is being done. This is reflected in reporting of positive cases as that number is also higher in Delhi. The other States and UTs are facing a problem of lack of testing kits and lack of trained health professionals (Shekhar 2020) . They are more focused on stringent lockdown imposition strategies and social distancing. Although, it seems that Lockdown is not the final answer. As soon as the lockdown is lifted and people having COVID-19 cases which are also asymptomatics start travelling, COVID-19 infections will spread in the second wave. Experts report that second waves of COVID-19 may be more fatal than this phase. So more testing must be done and more resources should be allocated to contain the spread and stop the community transmission stage. Figure 6 compares the results of the several states and UTs on four very important points related to COVID-19. Four parameters are the percentage share of states and UTs in total COVID-19 infections/cases, the share of states and UTs in total COVID-19 active cases, the share of states and UTs in total COVID-19 death cases, the share of states and UTs in COVID-19 tests. From the plot it is clear that the share of the rate of mortality and share COVID-19 death cases is highest in Maharashtra, followed by Gujarat and Delhi and Madhya Pradesh(MP). Interestingly, Each of the states and UTs has different reasons for the spread of COVID-19 and death due to COVID-19. Maharashtra and Gujarat follow similar trends and reasons for COVID-19 as both of them have an almost similar level of industrialisation and urbanisation. Decision making has been in confusion due to change in the administration and political authority. This situation forced the delay in active decision making in MP. Specifically, the top level of bureaucracy also suffered from COVID-19 and their health workers of top-level also faced similar problems forcing them and the system to complete breakdown. NCT Delhi has been able to manage its death rate to thirty due to the proactive testing rate and proactive measures taken by the government and volunteers. The figure depicts that the deaths are mostly in these four states and UTs. Another observed situation from our data visualisation is that in states like Gujarat and Maharashtra, their rate of testing is not near the satisfactory level. Good testing rate of COVID-19 helps in better health care provisions, mitigation and recovery than the lack of test cases which only contributes to more virus spread but also increases the mortality rate. In the case of the share of the COVID-19 cases, Maharashtra and Gujarat are leading followed by Delhi and Maharashtra. This is depicted in Fig. 7 , also with state and UTs wise map of India. The reasons again are the same as States have more congested living and lack of health and other facilities for migrant workers from other poor states, part of the country. There is an interesting note here that there is no or very few active cases of COVID-19 in Chhattisgarh, Goa, Mizoram, Nagaland Meghalaya, Ladakh, Tripura, Uttarakhand, Pondicherry and Sikkim etc. All these states either have better life expectancy and immunity or better way of living and also better education and awareness about the issues. These states are also from mountainous regions. After discussion and critical analysis of the results, this section suggests some measures which are better for decision making and administration for government officials and policymaking. States which are doing good are NCT Delhi and other remote areas such as the Himalayan States and Northeastern states including Kerala and Goa. Other states need to learn from their strategies, especially political and administrative decision making. Lockdown is a good measure but not a sufficient measure. Lockdown needs the support of extensive testing support to ensure the tracing and quarantine of affected persons. Imposing lockdown for a longer period will surely have a bad impact on the large population of this country, especially on poor people, daily wages earners, farmers and small shopkeepers as they don't have enough food and money to support their livelihood. District wise strategy should be adopted and locking of districts with clear and unambiguous guidelines keeping future circumstances and situations in mind, needs to be followed. Better educated states and UTs are better in health facilities and consequently better equipped and proactive in dealing with the pandemic. Hence investment in human capital is very much important and needed. Better education and health promote cleanliness, awareness and hygiene which improves the immunity of human beings and ultimately result in better and fewer chances of disease. It has been observed that the death rate is higher in the older population (Dowd et al. 2020) which is greater than 60 years so a better and targeted approach should be used about them and allow the younger population for the economic activity to run the cyclone of development. COVID-19 transmission and infection in the community have triggered collaborative efforts at the individual level, community level, institutional level, government level and intergovernmental levels. Technological giants and individuals are contributing to the data and its collection methods and in the development of analytical tools in order to understand the deep pattern of transmission and spread, demographic attribute's relationships with COVID-19 infections, relationship with medical health history, current location, movements of infected people. All this is being done through data collections systems such as websites and apps. Contact tracing apps such as the Arogya Setu app by the Indian Government, have been pushed to be installed for keeping safe the healthy person from the infected person by informing them of the rule-based quantified risk exposure in the surroundings of individuals. There are many individual efforts whose data validity and reliability are questioned. Contact tracing Apps are one of digital surveillance in the pandemic which needs proper protection of individual data and medical history and attributes from going in the hand of adversaries. In addition to this, in the present circumstances as posed by COVID-19, the Digital technologies and analytics framework of prediction and data collection should include ethical data collection practices, consent taking steps in all their data collection methods and stages. Digital technology tools accessing data need to conform to the international guidelines of data collection and access and storage of information without compromising the security and privacy of individuals. Ethical practices related to data sharing should also be considered so that the community trusts the finding of the research work. There is a need for clarity of perspective on the rapidly evolving field of human data collection using digital tools. The ethical and legal boundaries of such tools for disease surveillance and contact tracing are unclear. This needs clarity, equitable and egalitarian policies for the stakeholders. This proposed research work has done extensive analysis of the information related to Indian states and UTs and their adopted measures and strategies and outcome of those strategies. Results and discussion shows that States with higher education and better health services and proactive leadership are performing better in mitigation and containment of COVID-19 spread. States with more industrialised economies and labour in or out from the other states are lagging behind and suffering more from COVID-19. States with better law and order with stringent rules and regulation and their execution are also doing better. Geologically different terrain states are also in a better position to handle the COVID-19 situation. There is a need to learn from better-performing states and more focus should be Fig. 7 State wise map of COVID-19 infection cases. Dark color represents more COVID-19 cases and light color represents less number of cases (Mygov WHO 2020) given to testing rate and improving health services in states. Additionally, data collection and disease surveillance tools should be ethically and legally conforming to the guidelines and framework of the international community and stakeholders. Correlation of chest CT and RT-PCR testing in coronavirus disease 2019 (COVID-19) in China: a report of 1014 cases Surviving sepsis campaign: guidelines on the management of critically ill adults with coronavirus disease 2019 (COVID-19) Characteristics and outcomes of 21 critically ill patients with COVID-19 in Washington state Di Napoli R (2020) Features, evaluation and treatment coronavirus (COVID-19). In: Statpearls Countries test tactics in 'war' against Gender dimensions of the COVID-19 pandemic Covid-19-new insights on a rapidly changing epidemic A validation study for the successful isolation policy in china: a meta-analysis in COVID-19 Demographic science aids in understanding the spread and fatality rates of COVID-19 Fair allocation of scarce medical resources in the time of Economic effects of coronavirus outbreak (COVID-19) on the world economy Driving public compliance with the health policy of social distancing: the case of the COVID-19 pandemic in India The continuing 2019-ncov epidemic threat of novel coronaviruses to global health-the latest novel coronavirus outbreak in Wuhan, China IANS (2020) India's covid testing rate is low, says who chief scientist. The Times of India Forecasting the daily and cumulative number of cases for the COVID-19 pandemic in India Dynamics of the COVID-19 pandemic in India Predictive analysis of alertness related features for driver drowsiness detection A novel method based on extreme learning machine to predict heating and cooling load through design and structural attributes A conceptual architectural design for intelligent health information system: case study on India A novel hybrid model based on particle swarm optimisation and extreme learning machine for short-term temperature prediction using ambient sensors Strategies for solving wicked problems of true uncertainty: tackling pandemics like covid-19 (version Socio-economic implications of the COVID-19 pandemic on smallholder livelihoods in Zimbabwe Mihailov A et al (2020) Quantifying the macroeconomic effects of the COVID-19 lockdown: Comparative simulations of the estimated galĂ­-smets-wouters model Covid-19 as a complex intergovernmental problem A mathematical model for COVID-19 transmission dynamics with a case study of India Modeling and forecasting the COVID-19 pandemic in India Effect of restricted emissions during COVID-19 on air quality in India A quantitative framework for expanding testing capacity of COVID-19 in India Is there evidence that BCG vaccination has non-specific protective effects for COVID-19 infections or is it an illusion created by lack of testing? Age-structured impact of social distancing on the COVID-19 epidemic in India Modelling of covid-19 infection propagation data in Indian population Effectiveness of testing, tracing, social distancing and hygiene in tackling COVID-19 in India: a system dynamics model A descriptive study of indian general public's psychological responses during COVID-19 pandemic lockdown period in India Response to COVID-19 in Taiwan: big data analytics, new technology, and proactive testing When is the COVID-19 pandemic over? Evidence from the stay-at-home policy execution in 106 Chinese cities World Health Organization et al (2020) Who director-general's opening remarks at the media briefing on COVID-19-11 Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations