key: cord-0947069-1kqcry8y authors: Rostaminia, Samaneh; Aghaei, Seyed Soheil; Farahmand, Behrokh; Nazari, Raziye; Ghaemi, Amir title: Computational Design and Analysis of a Multi-epitope Against Influenza A virus date: 2021-09-12 journal: Int J Pept Res Ther DOI: 10.1007/s10989-021-10278-w sha: a1dda29a9845abd3bc5d08f842224a7bde7ac2db doc_id: 947069 cord_uid: 1kqcry8y Influenza A viruses are among the most studied viruses, however no effective prevention against influenza infection has been developed. So, designing an effective vaccine against Influenza A virus is a critical issue in the field of medical biotechnology. For this reason, to combat this disease, we have designed a novel multi-epitope vaccine candidate based on the several conserved and potential linear B-cell and T-cell binding epitopes by using the in silico approach. This vaccine consists of an ER signal conserved sequence, the PADRE conserved epitope and two conserved epitopes of Influenza matrix protein 2. T-cell binding epitopes from Matrix protein 2 were predicted by in silico tools of epitope prediction. The selected epitopes were joined by flexible linkers and physicochemical properties, toxicity, and allergenecity were investigated. The designed vaccine was antigenic, immunogenic, and non-allergenic with suitable physicochemical properties and has higher solubility. The final multi-epitope construct was modeled, confirmed by different programs and the molecular interactions with immune receptors were considered. The molecular docking assay indicated the interactions with immune-stimulatory toll-like receptor 3 (TLR3) and major histocompatibility complex class I (MHCI). The HADDOCK and H DOCK servers were used to make docking analysis, respectively. The docking analysis indicated a strong and stable binding interaction between the vaccine construct with major histocompatibility complex (MHC) class I and toll-like receptor 3. Overall, the findings suggest that the current vaccine may be a promising vaccine to prevent Influenza infection. Influenza A viruses are still widely spread in nature and can infect a diversity of birds and mammals. Influenza A virus has always been a major reason of morbidity and mortality during history. Both the ability of the virus genome to change at a very high rate and its special gene mechanism often lead to the emergence of highly virulent strains that may potentially become the cause of a pandemic, or at least some panic worldwide, for example the recent outbreak of swine-origin influenza A (H1N1) (Shao et al. 2017) . The swine-origin influenza A (H1N1) virus that appeared in 2009 and was first found in human beings in Mexico. The 2009 swine H1N1 flu pandemic-responsible for more than 17,000 deaths worldwide-originated in pigs from a very small region in central Mexico, a research team is reporting. The scientists say their findings represent the first time that the origin of an influenza pandemic virus has been determined in such detail (Shao et al. 2017) . Vaccination remains the most efficient strategies to control and prevent influenza infections in human and animal populations. Current vaccination approach against influenza virus consists of a live-attenuated or killed virus vaccine that are attention to be most widespread in the prospect influenza season. Thus, influenza vaccines have to be reformulated with the ever-changing influenza strains. In order to efficiently control and protect against pandemic outbreak, vaccine induced-immunity using conserved epitopes are protective and have been displayed to be an ''universal'' vaccine against various strains of the virus (Sabbaghi et al. 2021) . Influenza viruses belong to Orthomyxoviridae type A and type B, which cause diseases in humans. Their genomes consists of eight single-stranded RNA segments that code for 10 different proteins, one nucleoprotein (NP), two external glycoproteins hemagglutinin (HA) and neuraminidase (NA), two nonstructural proteins (NS1 and NS2), two matrix proteins (M1 and M2) and three polymerase proteins (PA, PB1, and PB2) . In view of the restrictions of usual influenza vaccines, researchers suggested an completely different approach derived from the greatly conserved extracellular domain of the viral M2 protein (M2e) (Ebrahimi and Tebianian 2011) . M2 is a structural protein, which serves a crucial function in the virus life cycle. The M2 is a type III membrane protein with 96 amino acid residues (Iwatsuki-Horimoto et al. 2006; Ebrahimi and Tebianian 2011) . The N-terminal ectodomain (M2e) contains 23 residues; this sequence has stayed nearly unchanged since the first human influenza strain was isolated in 1933 (Meshcheryakova et al. 2009 ). As the M2e sequence is really conservative, many M2e vaccines have been designed and successfully experienced for value against a group of different influenza viruses in animal models (Kotomina et al. 2020 ). Here we focus on the development of universal influenza A vaccine candidates based on the conserved M2e epitopes. In addition, we have selected many conserved epitopes for vaccine development: ER (Ig j chain) (Määttänen et al. 2010) , the PADRE Th epitope (Nezafat et al. 2014; Yarmarkovich et al. 2020) . The selected epitopes were joined together using the appropriate linker to construct the multi-epitope vaccine. In order to generate a stable immune response, it is important for the vaccine to interact with target immune cell receptors. For studying such interactions, molecular docking studies were performed with Toll-like receptors. Toll-like receptors (TLRs) have a central role in innate immunity as they detect conserved pathogen-associated molecular patterns (PAMPs) on a range of microbes, including viruses, leading to innate immune activation and adaptive immune response. Certain studies on Influenza A even indicated the noteworthy role of TLR 3 in generating protective responses against the virus. Immune responses are a crucial step to the pathophysiology of the Influenza virus-related disease, and initiation of immune response targeting TLR3 can trigger the anti-viral host defense mechanisms necessary for the elimination of the Influenza related infection (Astuti 2020) . Therefore, molecular docking studies of the vaccine candidate with TLR3 was conducted. Production of epitope-based vaccines by extremely conserved regions of Influenza virus proteins specially matrix protein 2 is considered as an important way to control the Influenza virus (Kumar et al. 2008; Astuti 2020) . The improvement and creation of a vaccine are costly and will take many years to reach this goal, the different approaches were used to reduce the time and costs of this procedure (Suresh and Basu 2008) . Reverse vaccinology is a new method that joins immunogenomics and bioinformatics to expand novel vaccines (Sarkar et al. 2020) . This method has many benefits over usual vaccinology, it decreases the vaccine development time and costs. The computational programs enable to use a large immunological data; including antigen presentation and processing them to acquire sensitive analysis. Currently, in-silico vaccine design studies are facilitated via application of the applied bioinformatics tools; such as protein modeling programs, epitope prediction and protein-protein interaction analysis tools (Kumar et al. 2008) . These vaccines are considered to be safer than traditional vaccines and focus on the most crucial antigenic elements of the pathogen to generate protective immunity (De Groot 2009) . In this study, some specific B cell and T-cell epitopes from various proteins were chosen according to their antigenicity, stability, and length. The chosen epitopes were joined together by different linkers to construct the final potential multi-epitope construct, and different properties of the structures were predicted via in silico approaches. The M2 protein (MATRIX PROTEIN 2) sequences of Influenza virus were retrieved from UniProtKB (www. unipr ot. org/) Or from Influenza Virus Resource of National Center Biotechnology Information website (http:// www. ncbi. nlm. nih. gov/ genom es/ FLU/ FLU. html/) in FASTA format and the sequence was used for computational prediction. Then, analyses of the sequences were performed to study antigenicity, solvent-available areas, surface openness, litheness and MHC class-I binding sites (Hasan et al. 2013; Khan et al. 2015) . M2 genes are one of the most conserved between Influenza serovars (with 99.1% similarities at the amino acid sequence level) (Ngamie 2020; Rosyda and Chaerul 2020) . This was confirmed by Clustal omega, an algorithm of multiple amino acid sequence alignment available at (http:// www. ebi. ac. uk/ Tools/ msa/ Clust alo). BLAST search was used to assemble the sequences of antigen reported for different serotypes Amimo et al. 2020 ). The resulting consensus sequence would characterize all of the virus's major serotypes. To evaluate the physicochemical features of M2 antigen the ProtParam tool at (https:// web. expasy. org/ protp aram/) (Tahir ul Qamar et al. 2020; Sharma, et al. 2021 ) was used. a number of factors given by the ProtParam tool, such as, molecular weight, theoretical pI, amino acid composition, atomic composition, extinction coefficient, estimated half-life, instability index, aliphatic index, and grand average of hydropathicity (GRAVY) were performed for this principle and self optimized prediction method with alignment (SOPMA) server at (https:// npsa-prabi. ibcp. fr/ cgi-bin/ npsa_ autom at. pl? page=/ NPSA/ npsa_ sopma. html) (Roy 2020) were used to obtain a prediction about the secondary structure of the chosen protein. Secondary structure calculation of a protein plans to forecast solvent convenience, transmembrane helices, globular areas, and coiled-coil regions. The purposes of this step were to conclude the protein's strength and task. Invariable forecasts of CTL (Cytotoxic T lymphocyte) epitopes are very critical for proposing consistent vaccine. For this reason, The T-cell epitopes of antigen (MHC I) was predicted, using the online prediction server Immune Epitope Database (IEDB) at (http:// www. iedb. org/) (Vita et al. 2019) . The binding affinity of the HLA haplotype to individual epitopes is the major factor related to the immune-dominance of CTL (Cytotoxic T lymphocyte) epitopes. According to allele frequency of different Iranian populations and among all HLA alleles, HLA-A*0201 is the most available ones in the world (Creary et al. 2019; Bahrami et al. 2020 ); Therefore, epitopes prediction was performed for this allele. The MHC I (HLA-A*0201)-restricted T cell epitopes were predicted by SYFPEITHI at (http:// www. syfpe ithi.de/) (Rashidian et al. 2020 ) and Bioinformatics and Molecular Analysis Section (BIMAS) at (https:// www-bimas. cit. nih. gov/ molbio/ hla_ bind/) (Pan et al. 2017 ); Furthermore, MHCpred at (http:// www. ddg-pharm fac.net/ mhcpred/MHCPred/) (Li et al. 2010 ). The purpose in increasing our T cell vaccine against influenza virus was initial to intend an immunogen using the conserved areas of influenza virus proteins like antigens and next to generate a recombinant vaccinia virus that can be applied as a wide-scale vaccine to stimulate specific immune response to influenza. For this aim, we created one kind of immunogen. Peptides were lined up together to satisfy one basic criteria: ER (Ig j chain) signal sequence. This signals the transportation of proteins into the ER where they are corrupted into peptides and loaded onto MHC class I molecules (Adler et al. 2017; Blackburn et al. 2017) . The PADRE epitope has been shown to have a high MHCbinding affinity to a wide range of mouse and human MHC class II haplotypes and to induce Th responses restricted through class II of both species (Kissick et al. 2014; Yarmarkovich et al. 2020) . Th1 peptide was separated by ARY extension since cytoplasmic Th1 epitope must gain access to secretory cavity to appear in the context of HLA class II molecules (Benveniste et al. 2015; Kuhn 2019) . The flexible rich Alanine-aspartic acid linkers were used to join the selected epitopes. Besides linker we had added extra additions as Alanine/Arginine/Tyrosine (ARY) into siding area to enhance the peptide sympathy for TAP molecule (Salaun et al. 2020) . the selected epitopes are fused together using proper amino acid linker from Linker Database at http:// www. ibi. vu. nl/ progr ams/ linke rdb. The flexible AD rich (one Alanine and one aspartic acid residues) linkers were used to join the selected epitopes. the multi-epitope were analysed with common on-line algorithms: NetCTL 1.2 (Kumar et al. 2020) , nHLApred (Jahangiri et al. 2018) ,CTLPRED (Charoenkwan et al. 2020) ,TAPPRED (Bidmeshki et al. 2018) ,P-CLEAVAGE (Liu et al. 2019 ) and IEDB ). This approach was followed for peptide arrangements of individual protein and then the most probable arrangements satisfying basic criteria was selected to be combined again in a longer polytope. the physicochemical features of the multi-epitope were calculated by ProtParam tool. After verification of the construct features, the multiepitope peptide was chemically synthesized in proper vector by biobasic Biological Engineering Co (CANADA). The polytope proteins were then submitted in the Vaxi-Jen v.2.0 server at (http:// www. jenner. ac. uk/ VaxiJ en) with default parameters to isolate the antigenic proteins. To determine the antigenicity of the poly-epitope"virus" option was selected as a target organism. The accuracy of this server based on the target organisms (i.e. bacterial, viral, and tumor protein datasets). The ANTIGENpro at (http:// scrat ch. prote omics. ics. uci. edu) was also used to predict the whole protein antigenicity (Mehmood et al. 2019) . Protein allergenicity was predicted exploiting the AlgPred web server at (http:// www. imtech. res. in/ ragha va/ algpr ed/) (Sharma et al. 2020) (accuracy with the threshold of 0.4). We used SOPMA protein secondary structure prediction servers (Sharma et al. 2020) . Additionally, for further verification, Jpred v.4 algorithm to predict the secondary structure of the polytope (> 80.0% accuracy) (Khodabandelou and Nakib 2021) . This server utilizes the hybrid method to predict the protein structure, which combines the alignment based and single sequence-based methods. The tertiary structure of the polytope protein was predicted using the 3DLigandSite server at (http:// www. sbg. bio. ic. ac. uk/ 3dlig andsi te/) (Khodabandelou and Nakib 2021) . The I-TASSER at (http:// zhang lab. ccmb. med. umich. edu/I-TASSER/ downl oad/) (Yang et al. 2015) was also applied to improve the accuracy of tertiary structure prediction. The quality of the 3D models was measured using the QMEAN and PROSA software at (https:// swiss model. expasy. org/ qmean/) (Pereira et al. 2019 ) and (https:// prosa. servi ces. came. sbg. ac. at/ prosa. php) (Basak et al. 2020 ), respectively. The quality of the predicted structures was assessed by Rampage software at (http:// mordr ed. bioc. cam. ac. uk/ ~rapper/ rampa ge. php) (Bahrami et al. 2020) . Finally, the 3D models were visually analyzed by the PyMOL software (Pymol molecular graphics systemV. 1) (Bahrami et al. 2020; Gainza et al. 2020 ). Netchop 3.1 at (http:// www. cbs. dtu. dk/ servi ces/ NetCh op/) ) and P-cleavage online servers were used to determine the proteasome cleavage sites of the polytope protein. Peptide binding affinity to TAP protein has been computed by TAPPred server at (http:// crdd. osdd. net/ ragha va/ tappr ed/) (Keshtvarz et al. 2021 ). The predicted multi-epitope was further tested by in silico docking simulation to find out whether or not this peptide will bind to the human leukocyte antigen (HLA) molecules and TLR3 molecule when it will be applied in vivo and The immunoreceptor-vaccine interaction is highly significant to elicit the immune response. Consequently, it was inexorable to perform molecular docking of the vaccine construct with immunological receptor molecule, TLR-3. Furthermore, the difficult among the ultimate vaccine and major histocompatibility complex (MHC-I) and TLR3 was calculated by molecular docking. To accomplish the simulation, a crystal structure of the HLA-A*02:01 molecule and TLR3, was retrieved from the Protein Data Bank (PDB) (Fatoba et al. 2021) . Docking simulation for the predicted multi-epitope with prepared HLA-A*0201 and TLR3 were done by using HADDCK (High Ambiguity Driven protein-protein DOCKing) and HDOCK (Yan et al. 2020) which are informationdriven flexible docking approaches for the modeling of bio molecular complexes (Van Zundert et al. 2016 ). The Docked clusters were formed and based on the lowest HADDOCK and HDOCK score, and one cluster was identified. A representative structure from this cluster was subjected to molecular refinements for better positioning. In current study, the protein sequence of the matrix protein 2 was retrieved from the UniProt database and then performed BLAST p. Based on previous articles, all the sequences were stored as a FASTA format for further analysis. From plenty of homologues, we have selected only 17 homologues having more than 60% identical sequences. MSA were performed and a phylogenetic tree was constructed. The findings from the MSA and phylogenetic data documented that the protein sequences have a closer relationship. The sequence under Gen Bank accession number of NP_040979.2 for M2 antigen was selected for the following analyses. The physiochemical properties for the matrix protein 2 antigen was determined in Table 1 . The trans-membrane topology analysis of antigen showed that matrix protein2 antigen could have transmembrane localization (Fig. 1) . In this study, we have two categories of epitopes, which include antigenic and non-antigenic epitopes. The antigenic epitopes with highest scores which were confirmed with different software were selected as final epitopes (Table 2) . MHC I processing (this tool combines predictors of proteasomal processing, TAP transport and MHC binding to produce an overall score for each peptides potential of being a T cell epitope). Based on this overall score, six best epitopes were selected for further evaluation Amimo et al. 2020) . The degree of similarity between the epitope and the target (i.e. given) sequence is elucidated by epitope conservancy. This property of epitope gives us the promise of its availability in a range of different strains. Hence for the analysis of the epitope conservancy, the web-based tool from IEDB analysis resources was used (http:// www. iedb. org/) (Vita et al. 2019) . Finally, two antigenic epitopes (SLLTEVETP (9 length) and EVETPIRN (9 length) with the most particular conserved and antigenic epitopes were chosen for further investigations. Also, two non-antigenic epitopes ER signal sequence (MGMQVQIQSLFLLLLWVPGSRG) with 22 length and the PADRE (Th epitope) (AKFVAAWTLK) with 10 length were selected from previous articles (Adler et al. 2017; Blackburn et al. 2017) . This study focuses on predicting MHC-I binding epitopes and the prediction of MHC-II binding epitopes has not been done. It should be noted, there is only one MHC-II binding epitope in multi-epitope (the PADRE epitope (AKFVAAWTLK)) structure that has been taken from previous articles (Kissick et al. 2014; Yarmarkovich et al. 2020 ). Peptides were lined up together to satisfy one basic criteria: ER (Ig jchain) signal sequence, the PADRE (Th epitope) and two high scores and more conservancy epitopes including the HLA-A*0201-restricted epitopes SLLTEVET and EVETPIRN (Valkenburg et al. 2016) . The selected epitopes were fused together using proper amino acid linker from Linker Database at http:// www. ibi. vu. nl/ progr ams/ linke rdb. The flexible AD (one alanine and one aspartic acid residues). Linkers have vital functions in the flexibility of the construct, protein folding, separating the functional domains and thus granting an overall stability to the protein. The selected epitopes of analyzed antigens were linked to each other using suitable linkers to build the final antigen. Besides linker we had added extra additions as Alanine/Arginine/ Tyrosine (ARY) into flanking area to increase the peptide affinity for TAP molecule (Seyed et al. 2014 ). The final construct of the polytope is consisted of 92 amino acid residues; ER region contributes 22 amino acids at the N-terminal of Fig. 1 A schema of transmembrane structure prediction of matrix protein 2 using TMHMM software the construct (1 − 22), the PADRE Th epitope (23-33) and 2 epitopes are derived from M2 protein of influenza virus, respectively (Fig. 2) . The physiochemical properties for each antigen were determined and summarized in Table 3 . The isoelectric point (pI) and molecular weight (MW) of the polytope were computed to be 5.38 and 10,481.98 kDa, respectively. The protein halflife was estimated to be 3.5 h within mammalian reticulocytes (In vitro) and > 10 h within Escherichia coli (In vivo). The aliphatic index and instability index was defined to be 97.61 and 0.217, respectively. These results indicate that the multi-epitope is a stable protein (Table 3) . The Vaxijen software is the first server for arrangementindependent forecast of defensive antigens; it was developed to allow antigen classification solely based on the physicochemical properties of proteins without recourse to sequence alignment. Based on this software the antigenicity of the multi-epitope construct with a threshold of 0.4 was predicted to be 0.3360 and 0.312% by ANTIGENpro. These results show that the multi-epitope is probably antigenic. The Aller-Tope results showed that the multi-epitope is probably non allergen. Moreover, based on the Algpred server results this protein is a non-allergen with the score of -1.366. The computerized protein sequence was inserted in the available text box. The default output width is 70. Meaning 70 amino acids are displayed in each line. The output width is changable. Regarding the parameters, options such as 'Number of conformational states', 'Similarity threshold' and 'Window width' are available. For the 'Number of conformational states', either '(3Helix, Sheet, Coil)' or '(4Helix, Sheet, Turn, Coil) can be selected. The former predicts the percentage of helix, sheet and coil structure while the latter predicts percentage of helix, coil, turn and sheet. Here we set the output width as 70; therefore 70 amino acids and corresponding predicted structures are visible in each line. The sequence length is also displayed in the output (92 amino acids in this case). The percentage of each structure is also included in this page. For instance, for Alpha helix it is 39.13%. There are two graphs shown in the result page of SOPMA. The first page visualizes the prediction. The second contains score curves for all predicted states. It also shows the parameters such as window width, number of states etc. that are used for the prediction. It provides a link on the prediction result file which gives the result in a text format. There are links to find the intermediate result files also. The prediction of secondary structure by SOPMA indicated that the multi-epitope consists of 39.13% α-helix (H), 6.52% extended strand (E), 5.43% beta turn (T) and 48.91% random coil (C) elements (Fig. 3) . The high ratio of random coils and extended strands in the structure of multiepitope indicates that the designed protein is probably capable of forming antigenic epitopes. The Phyre and I-TASSER servers have returned one and five 3D models of the designed protein, respectively. Between the five models that were presented by I-TASSER, a model with the highest C-score (-0.77) was selected as the best model (Fig. 4) . It is worth noting the C-score is usually within the range of (McMahon 2014; McLauchlan 2000) , the higher C-scores are associated with high confidence and vice versa. The higher quality of the selected model was confirmed by QMEAN results. The ProSA result indicated that the selected model did not appear within the range of native proteins of similar size and needs to be refined. The selected primary model was refined and energy minimized. The model refinement and energy minimization run have improved the quality of the selected 3D model. To validate the quality of the refined model, two online validation methods like Ramachandran plot (https:// servi cesn. mbi. ucla. edu/ PROCH ECK/) (Laskowski et al. 2012) and ProSA (https:// prosa. servi ces. came. sbg. ac. at/ prosa. php) (Wiederstein and Sippl 2007) were used. Ramachandran plot assessment showed that 84.6% of the structure was under favored region, 13.3% was under the allowed region, and 2.1% was observed under the disallowed region, signaling a high quality of the predicted vaccine structure (Fig. 5) . Additionally, Pros A-web showed a Z-score of − 3.85, which lies inside the range of acceptable scores and Z-score was required prior to studying molecular interactions Haghighi and Moradi 2020) . The obtained results suggest that the refinement run have relocated most of the amino acids within the allowed area. Molecular docking studies can reveal the binding interactions between modeled protein and receptor molecules. For this purpose, we submitted the refined vaccine model as ligand and HLA-A*02:01 and TLR3 proteins as immunological receptors into the HADDOCK output (Vangone et al. 2017) , for molecular docking. The HAD-DOCK output (Vangone et al. 2017 ) consisted of several models, among which the 10 highest clusters were selected. According to HADDOCK, the most valid cluster is the top one; however, this ranking also depends on its Z-score: the more negative the Z-score the more reliable the cluster. The top cluster is the most reliable according to HADDOCK. Its Z-score indicates how many standard Fig. 3 A The predicted Secondary structure plot of multiepitope construct secondary using SOPMA software. H: Alpha helix, E: Extended strand, T: Beta-turn and C: Random coil. B The graphical representation of secondary elements in multi-epitope construct (blue: alpha helix, red: extended strand, green: beta turn, yellow: random coil) (Color figure online) deviations from the average this cluster is located in terms of score (the lower the z-score is, the better). The docking analysis showed good interaction between the vaccine construct and HLA-A*02:01 with -1.9 Z-score (Fig. 6) . The TLR3 receptor (PDB ID-2A0Z) was selected and downloaded from the PDB server (Sussman et al. 1998 ). Initially, the receptor was prepared by separating the attached ligand from the protein, followed by the removal of waters and other chemicals. All these processes were performed in PyMOL v2.3.4 software (DeLano 2002). Binding interactions and residues involved in the interacting plane were analyzed with Discovery Studio 2017. The docking between the vaccine (ligand) and TLR3 (receptor) was performed to anticipate their binding affinity and interactions. In doing so, the HDOCK server provided 30 docked complexes with different poses. Among them, we selected the complex with the least energy score and binding pose with functional interactions. Thus, model 3 of TLR-3 -vaccine docked having complex lowest energy of -323.73 kcal/mol, were selected for further analysis (Fig. 7) . P-cleavage (server for identification of protesomal cleavage sites in a protein sequence) and Net Chop servers have been used for selected cleavage site. The servers have predicted 92 cleavage sites for the multi-epitope protein (data not shown). The proteasome complex consists of enzymes that cut the peptide bonds, changing the proteins into peptides. The peptide molecules from proteasome cleavage connect with class-1 MHC molecules and the peptide-MHC molecules are then moved to the cell membrane where they are presented to T helper cell. The multi-epitope TAP binding affinity results, predicted by TAPPred (The server that prediction of TAP binding peptides for understanding of peptide internalization to ER) are listed in Table 4 . In this case, the total score was predicted and the higher the score the higher the efficiency of all these processes/a higher score meant a higher efficiency for all these processes (Fig. 7) . Influenza remains a serious respiratory disease in spite of the availability of anti-virals and inactivated trivalent vaccines, which are effective for most recipients. The key challenges for creating vaccines against influenza A virus are antigenic drift and antigenic shift. Preventive effects of present influenza virus vaccines are quite specific to particular strains of influenza virus and are based generally on antibody responses to surface antigens of the virus hemagglutinin and neuraminidase (Nachbagauer et al. 2019) . Thus, influenza virus vaccines are not capable to afford protection from a different scale of viral strains. Therefore, to evade eruptions of epidemics among human populations, modern vaccine needs the growth of new universal and multi-epitope vaccines that are able to offer protection from a large variety of influenza A virus strains and would not need to be updated every year. Multi-epitope vaccines offer a viable alternative since they can elicit potent immune responses without causing undesirable allergic reactions and have been already used against a variety of Influenza strains successfully (Rakib et al. 2020) . These vaccines are a group of recombinant products with high specificity, safety, stability and lowcost of production (Chan et al. 2020 ). Therefore, any study Fig. 4 Representation of the modeled structure using (The PYMOL Molecular Graphics System). A structure of predicted multi-epitope. B structure of HLA-A*02:01. C structure of toll like receptor3 that could help to improve the quality of these agents is important. Since the laboratory-based works require application of a considerable amount of consumables, materials, human workforce, and sacrifice of animals; using computer-based methods could help to reduce the burden from laboratory, and decrease the costs of errors by using insilico methods. Immunoinformatics or computational immunology is a branch of bioinformatics that includes investigations such as B-and T-cell epitope mapping, along with many other aspects including toxicity, and allergenicity (Raghuvanshi et al. 2020) . The potential of insilico designed vaccines for Influenza has also been confirmed by many in vivo and in vitro studies, for example, a study by Rodrigueza et al. reported a novel epitope-based vaccine candidate with protective efficacy against Influenza A in a mouse model, and the output confirmed efficacy of the designed peptide against the virus (Onyango et al. 2012) . Generation of novel vaccines that can overcome antigenic diversity and inherent low immunogenicity of vaccines against Influenza A virus is critical for pandemic preparation. In this regard, in silico tools were utilized at the current research. Moreover, application of the graphic approaches for studying the biological and medical systems could present an intuitive knowledge to help analyzing the complicated molecular interactions (Chou 2020) . Our purpose was to design an epitope vaccine by targeting the Matrix protein 2 which is a viral ion channel protein that mediates viral uncoating and is the target protein for the therapeutic drug amantadine, and its methyl derivative rimantadine (Jalily et al. 2020) . The M2 protein can be segregated into three parts: the extracellular N-terminal domain (M2e, locations 2-24), the trans membrane (TM) domain (locations 25-46) and the intracellular C-terminal domain (positions 47-97) (72). The extracellular part of the M2 protein is remarkably conserved, as shown by alignment of the sequences of M2 proteins, isolated from different human strains of influenza A virus (De Filette et al. 2006) . The high degree of conservation of M2 makes it capable of stimulating both humoral and cellular immune responses (Kordon et al. 2018) . The ectodomain of influenza A virus M2 protein (M2e) is made of 24 amino acids (Kolpe et al. Fig. 5 Predicted 3D structure and validation of multi-epitope vaccine construct. A 3D structure of vaccine construct. B Ramachandran plot showed that 84.6% of the proposed vaccine construct residues were present in favored regions, 13.3% in allowed regions while only 2.1% in disallowed regions. C PROSA validation of 3D structure showing Z-score (− 3.85). The z-score indicates overall model quality 2017; Mezhenskaya et al. 2019) . Until recently, there were several universal flu vaccine candidates that focused on the highly conserved M2e protein. A area close to the beginning of M2e which is general to both M1 and M2 proteins, was exposed to be the highly conserved epitope, SLLTEVET. Definitely, monoclonal antibodies (mAb) against this epitope strongly reduced the replication of influenza A virus H1 and H3 subtypes in MDCK cells (Yamayoshi et al. 2017) . This particular area of M2e is still believed to characterize a probable epitope to be contained in a generally protective influenza vaccine. Furthermore, it has been demonstrated that the epitope EVETPIRN (aa6-13) on M2e could be responsible for the stimulation of the protective immunity (Zhou et al. 2012 ). At the current research, we aimed to design a potential multi-epitope vaccine candidate that contain the conserved epitopes of Matrix protein 2, ER (Ig jchain) the signal sequence and the 'universal' Th epitope (PADRE). we used Repetition of M2 selective epitopes in construct for increasing immune system stimulation (de Matos et al. 2020) . The coupling between matrix protein 2 epitopes that is mentioned above, invokes a desired increase in effectiveness of the protein (Kirsteina et al. 2020) . ER signal sequence directs the ribosome to the ER membrane and initiates transport of the growing poly-peptide across the ER membrane (Serwold et al. 2002) . This signal targets processing of proteins to the ER where they are corrupted into peptides and loaded onto MHC class I molecules. The 'universal' Th epitope, Pan-DR epitope (PADRE) has been shown to have a high MHC-binding affinity to a wide range of mouse and human MHC class II haplotypes (Rosa et al. 2010) . The multi-epitope construct was designed based on the conserved epitopes of different proteins, attached together using different linkers, and was subjected to different bioinformatics assays. It is remarkable that linkers could have an impact on the role and arrangement of multi-epitope vaccines. In this approach, the last multi-epitope construct has been designed including the flexible linkers between the selected epitopes. Flexible linkers are generally wealthy in small or polar amino acids such as Alanine and Aspartic acid to offer good flexibility and solubility (Chen et al. 2013) . The physicochemical properties of the proteins play pivotal roles in various biological designs. The computed pI, instability index, aliphatic index, GRAVY index values reveal stability, thermostability, hydrophobicity of the multiepitope protein respectively. These properties showed that hydrophilic residues could be placed on the protein surface and they are suitable for ligand binding. The vaccine structure has a half-life of 30, 20, and 10 h in mammalian, yeast, and E. coli, respectively, therefore the vaccine is exposed to the immune system for a relatively long time and causes more immune responses (Nikolich-Žugich 2018). The multi-epitope vaccine's instability index was calculated at 15.41, and as it was less than 40, it was considered to be a stable protein (Singh et al. 2020) . The calculated aliphatic index and the GRAVY were 97.61 and − 0.217, respectively. The higher aliphatic index value suggests greater thermostability and the negative GRAVY value shows the vaccine's hydrophilic nature therefore it can show strong interaction with water molecules (Sanami et al. 2020) . Results also suggested that the vaccine was soluble, non allergenic, and non-toxic. The secondary structure of protein is predicted using SOPMA (Self-Optimized Prediction Method with Alignment). This tool evaluates properties such as solvent accessibility, globular regions, transmembrane helices, random coil, bend region and coiled-coil region. The secondary structure analysis revealed higher random coil content in the construct. Random coils play an important role in the high flexibility of proteins. The high coil structural percentage of the designed multi-epitope (48.91%) was due to the high content of Alanine-Aspartic acid linker sequences. The role of Alanine-aspartic-rich residue in protein flexibility has been reported previously (Elfiky 2020) . Information regarding the three-dimensional (3D) structures of proteins and their complexes with their potential ligands are of critical importance for designing novel therapeutic agents; such as multi-epitope vaccines. Application of the in silico methods for prediction of 3D structure of the protein has been proven an efficient method by using bioinformatics algorithms (Alkema et al. 2016) . The 3D structure of multi-epitope construct was modeled by I-TASSER and Structure of this construct was confirmed by different programs, considering different aspects, such as Ramachandran plot, GalaxyRefine server,RAMPAGE server and ProSAweb. In the validation test of 3D structure, we found a good number of Z-score (-3.85) and the superior features of most favored, accepted and disallowed regions for the Ramachandran plot. HADDOCK server conducted the docking process between the multi-epitope vaccine with HLA-A*02:01 and TLR3. the value of lowest energy and Z-score indicated that the interaction of the vaccine with the HLA-A*02:01 and TLR3. The score of lowest energy and Z-score showed good interaction between them. Overall, this study provided a potential multi-epitope peptide vaccine candidate against Influenza A virus; based on the different conserved epitopes. The limitations of currently in market seasonal Influenza vaccines and the persistent threat of future pandemics have made it necessary for novel vaccine design. This designed construct could cover many subtypes of the virus and serve as wide rage protection against this seasonal disease. In this study, a series of computational approaches led to the discovery of potential T-and B-cell epitopes that eventually embroidered into a multi-epitope vaccine. These epitopes were chosen based on their antigenicity and stability. Importantly, it was able to bind with the immune receptor TLR3 and MHC class I strongly as well as to elicit strong immune response upon Influenza infection. The results provided by the current in silico study showed that the multi-epitope construct can provide promising outcomes against Influenza A virus and could be devoted for future in vitro, and in vivo studies. systematic analysis by the UN Maternal Mortality Estimation Inter-Agency Group Whole genome sequence analysis of porcine astroviruses reveals novel genetically diverse strains circulating in East African Smallholder pig farms Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2): an overview of viral structure and host response Computational design and analysis of a poly-epitope fusion protein: a new vaccine candidate for Hepatitis and Poliovirus Expression profiling and in silico homology modeling of Inositol penta kis phosphate 2-kinase, a potential candidate gene for low phytate trait in soybean Amyloid deposits and inflammatory infiltrates in sporadic inclusion body myositis: the inflammatory egg comes before the degenerative chicken A New Design and Epitopes Analysis for Recombinant Vaccine Against Salmonella typhi by In silico Analysis FCRLA-a resident endoplasmic reticulum protein that associates with multiple immunoglobulin isotypes in B lineage cells Enhancement of tetravalent immune responses to highly conserved epitopes of a dengue peptide vaccine conjugated to polystyrene nanoparticles Shoombuatong W (2020) iTTCA-Hybrid: Improved and robust identification of tumor T cell antigens by utilizing hybrid feature representation Fusion protein linkers: property, design and functionality Proposing pseudo amino acid components is an important milestone for proteome and genome analyses Next-generation sequencing reveals new information about HLA allele and haplotype diversity in a large European American population Improved design and intranasal delivery of an M2e-based human influenza A vaccine Pymol: an open-source molecular graphics tool The sixties unplugged: a kaleidoscopic history of a disorderly decade Oncolytic viruses and the immune system: the dynamic duo IEDB-AR: immune epitope database-analysis resource in 2019 Influenza A viruses: why focusing on M2e-based universal vaccines Anti-HCV, nucleotide inhibitors, repurposing against COVID-19 Immunoinformatics prediction of overlapping CD8+ IFN-γ and IL-4 inducer CD4+ T-cell and linear B-cell epitopes based vaccines against COVID-19 (SARS-CoV-2) Deciphering interaction fingerprints from protein molecular surfaces using geometric deep learning Homology modeling and molecular docking studies of glutamate dehydrogenase (GDH) from cyanobacterium Synechocystis sp. PCC 6803 In silico study of the structure and ligand interactions of alcohol dehydrogenase from Cyanobacterium Synechocystis sp. PCC 6803 as a key enzyme for biofuel production A computational assay to design an epitope-based Peptide vaccine against Saint Louis encephalitis virus The cytoplasmic tail of the influenza A virus M2 protein plays a role in viral assembly In silico analyses of staphylococcal enterotoxin B as a DNA vaccine for cancer therapy Put a cork in it: Plugging the M2 viral ion channel to sink influenza In silico analysis of STX2a-PE15-P4A8 chimeric protein as a novel immunotoxin for cancer therapy Epitope-based peptide vaccine design and target site depiction against Ebola viruses: an immunoinformatics study H-polytope decomposition-based algorithm for continuous optimization Construction and immunogenicity of a novel multivalent vaccine prototype based on conserved Influenza virus antigens Immunization with a peptide containing MHC class I and II epitopes derived from the tumor antigen SIM2 induces an effective CD4 and CD8 T-cell response M2-based influenza vaccines: recent advances and clinical potential Innate immune responses in fish: antigen presenting cells and professional phagocytes Generation and characterization of universal live-attenuated influenza vaccine candidates containing multiple M2e epitopes Overview of the immune response Conditions for propagating synchronous spiking and asynchronous firing rates in a cortical network model Structural, glycosylation and antigenic variation between 2019 novel coronavirus (2019-nCoV) and SARS coronavirus On predicting the T cell and B cell epitopes of platelet membrane glycoprotein II b/III a antibody from human and mice I2/TBHP mediated divergent C (sp2)-P cleavage of allenylphosphine oxides: substituentcontrolled regioselectivity Properties of the hepatitis C virus core protein: a structural protein that modulates cellular processes Chronic hepatitis B virus infection Prediction and validation of potent peptides against herpes simplex virus type 1 via immunoinformatic and systems biology approach Cowpea mosaic virus chimeric particles bearing the ectodomain of matrix protein 2 (M2E) of the influenza A virus: production and characterization M2e-based universal influenza vaccines: a historical overview and new approaches to development Pandemic influenza virus vaccines boost hemagglutinin stalk-specific antibody responses in primed adult and pediatric cohorts A novel multi-epitope peptide vaccine against cancer: an in silico approach Genetic detection and characterization of Influenza viruses in selected wildlife and humans at high-risk interface in Laikipia County Influenza surveillance among children with pneumonia admitted to a district hospital in coastal Kenya Bioinformatic prediction of the epitopes of Echinococcus granulosus antigen 5 In silico analysis and molecular dynamics simulation of human superoxide dismutase 3 (SOD3) genetic variants Immunoinformaticsguided design of an epitope-based vaccine against severe acute respiratory syndrome coronavirus 2 spike glycoprotein Immunoinformatics approach to engineer a potent poly-epitope fusion protein vaccine against Coxiella burnetii CD4+ T cell epitope discovery and rational vaccine design Physicochemical properties analysis of predicted SARS-Cov2 B-Cell epitope Protective cellular and mucosal immune responses following nasal administration of a whole gamma-irradiated influenza A (subtype H1N1) vaccine adjuvanted with interleukin-28B in a mouse model The linker domain of the SNARE protein SNAP25 acts as a flexible molecular spacer that ensures efficient S-acylation: role of the SNAP25 linker domain in S-acylation Design of a multi-epitope vaccine against SARS-CoV-2 using immunoinformatics approach A systematic and reverse vaccinology approach to design novel subunit vaccines against Dengue virus type-1 (DENV-1) and human Papillomavirus-16 (HPV-16) ERAAP customizes peptides for MHC class I molecules in the endoplasmic reticulum Immunogenicity evaluation of a rationally designed polytope construct encoding HLA-A* 0201 restricted epitopes derived from Leishmania major related proteins in HLA-A2/DR1 transgenic mice: steps toward polytope vaccine Evolution of influenza a virus by mutation and re-assortment AlgPred 2.0: an improved method for predicting allergenic proteins and mapping of IgE epitopes Immunoinformatics approach for a novel multiepitope subunit vaccine design against various subtypes of Influenza A virus Designing of precise vaccine construct against visceral leishmaniasis through predicted epitope ensemble: a contemporary approach Improving pharmaceutical product development and manufacturing: impact on cost of drug development and cost of goods sold of pharmaceuticals Protein Data Bank (PDB): database of three-dimensional structural information of biological macromolecules Multiepitope-based subunit vaccine design and evaluation against respiratory syncytial virus using reverse vaccinology approach Molecular basis for T-cell immunity against influenza viruses The HADDOCK2. 2 web server: user-friendly integrative modeling of biomolecular complexes Sense and simplicity in HADDOCK scoring: lessons from CASP-CAPRI round 1 The immune epitope database (IEDB): 2018 update OR7 MiHAIP: Comprehensive pipeline to discover immunogenic minor histocompatibility antigens via whole genome sequences of HLA-matched donorrecipient pairs ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins A broadly reactive human anti-hemagglutinin stem monoclonal antibody that inhibits influenza A virus particle release The HDOCK server for integrated protein-protein docking The I-TASSER Suite: protein structure and function prediction Identification of SARS-CoV-2 vaccine epitopes predicted to induce long-term population-scale immunity Salmonella serotype determination utilizing high-throughput genome sequencing data Immunization with high epitope density of M2e derived from 2009 pandemic H1N1 elicits protective immunity in mice The authors would like to acknowledgement Pasteur institute of Iran for the financial support.