key: cord-0254623-l0cyw4bi authors: Tovar, Adelaide; Smith, Gregory J.; Thomas, Joseph M.; McFadden, Kathryn M.; Kelada, Samir N. P. title: A novel gene-by-environment quantitative trait locus on mouse chromosome 15 underlies susceptibility to acute ozone-induced lung injury date: 2021-05-20 journal: bioRxiv DOI: 10.1101/2021.05.20.445039 sha: 0756c8c8a93b6169e8fe22a1348f3745df256400 doc_id: 254623 cord_uid: l0cyw4bi Respiratory toxicity caused by the common urban air pollutant ozone (O3) varies considerably within the human population and across inbred mouse strains, suggestive of gene-environment interactions (GxE). Though previous studies genetic mapping studies using classical inbred strains have identified several and quantitative trait locus (QTL) and candidate genes underlying responses to O3 exposure, precise mechanisms of susceptibility remain incompletely described. We sought to expand our understanding of the genetic architecture of O3 responsiveness using the Collaborative Cross (CC) recombinant inbred mouse panel, which contains more genetic diversity than previous inbred strain panels. We evaluated hallmark O3-induced respiratory phenotypes in 56 CC strains after exposure to filtered air or 2 ppm O3, and performed focused genetic analysis of variation in lung injury as measured by the total bronchoalveolar lavage protein concentration. Because animals were exposed in sex- and batch-matched pairs, we defined a protein response phenotype as the difference in lavage protein between the O3- and FA-exposed animal within a pair. The protein response phenotype was heritable, and QTL mapping revealed two novel loci on Chromosomes 10 (peak: 26.2 Mb; 80% CI: 24.6-43.6 Mb) and 15 (peak: 47.1 Mb; 80% CI: 40.2-54.9 Mb), the latter surpassing the 95% significance threshold. At the Chr. 15 locus, C57BL/6J and CAST/EiJ founder haplotypes were associated with higher protein responses compared to all other CC founder strain haplotypes. Using additional statistical analysis and high-density SNP data, we delimited the Chr. 15 QTL to a ∼2 Mb region containing 21 genes (10 protein coding). Using a weight of evidence approach that incorporated candidate variant analysis, functional annotations, and publicly available lung gene expression data, we nominated three candidate genes (Oxr1, Rspo2, and Angpt1). In summary, we have shown that O3-induced lung injury is modulated by genetic variation and demonstrated the value of the CC for uncovering and dissecting gene-environment interactions. Ozone (O3) is a potent oxidant gas and ground-level air pollutant. Acute O3 exposure causes temporary decrements in lung function (1, 2) , respiratory inflammation (3, 4) , and tissue injury (5, 6) , as well as aggravating symptoms of common chronic lung diseases including asthma and chronic obstructive pulmonary disease (7) (8) (9) . Short-term O3 exposure is also associated with an increased risk of respiratory tract infections and hospitalization (10) . Importantly, these adverse outcomes have been linked with the pathogenesis of respiratory diseases (11) . Thus, because of its involvement in disease incidence and exacerbations (12) , identifying the molecular circuitry by which O3 exposure causes pulmonary inflammation and injury is a critical public health concern. Controlled exposure and candidate gene studies in humans have provided strong evidence that responses to O3 vary widely and reproducibly across individuals, in a fashion partially attributable to genetic variation (13) (14) (15) (16) (17) (18) (19) . Studies using panels of genetically diverse rodent strains have corroborated these findings, provided evidence of heritability, and identified multiple loci that control hallmark respiratory responses such as airway neutrophilia and injury (20, 21) . These foundational studies proposed important candidate genes including Tlr4, Nos2, and Tnf, whose roles in O3 respiratory toxicity have since been thoroughly studied (22) (23) (24) (25) (26) (27) . Here, we sought to further characterize the genetic architecture of respiratory responses to acute O3 exposure, thereby illuminating novel gene-by-environment interactions (GxE) for future experimental validation and investigation. For the current study, we used the Collaborative Cross (CC) genetic reference population, a multi-parental inbred strain panel derived by funnel inbreeding of five classical (A/J, C57BL/6J, 129S1SvImJ, NOD/ShiLtJ, NZO/H1LtJ) and three wild-derived mouse strains (CAST/EiJ, PWK/PhJ, WSB/EiJ) (28, 29) . Nearly all previously published studies investigating genetic contributions to O3 responses focused on responses in classical laboratory strains, failing to capture the full breadth of genetic diversity available within the Mus musculus species. The CC captures over 90% of circulating genetic variants (> 40 million SNPs and several million indels and small structural variants), owing in large part to the inclusion of wild-derived strains from three different Mus musculus subspecies (M. m. domesticus, castaneus and musculus) (30) . Moreover, the breeding strategy used to generate CC lines created novel allelic combinations, thus enhancing the range of phenotypic variation observed compared to that across the founder strains alone (31, 32) . This population has already been used to identify genomic regions associated with various respiratory disease phenotypes including allergic inflammation in the airways (33) (34) (35) and susceptibility to virus-induced respiratory disease endpoints (36, 37) . Hence, this resource is well-suited for toxicogenomic discovery and identification of genes and variants contributing to O3-induced responses in the lung. We measured inflammatory and injury responses in mice from 56 CC strains after a 3hour exposure to filtered air (FA) or 2 ppm O3. The analyses here are primarily focused on total protein concentration in the bronchoalveolar lavage fluid, a non-specific marker of lung injury, for which we identified two novel genomic loci (one significant and one suggestive) associated with variation in responsiveness. Using a weight of evidence approach along with available genome sequence data, we prioritized candidate genes within the significant locus, located on Chromosome 15. Female and male mice from 56 Collaborative Cross strains were obtained from the UNC Systems Genetics Core Facility between August and November 2018. Mice were delivered as litter-and sex-matched pairs or trios (matches) upon weaning and aged on investigator's racks until the time of exposure. Animals were housed in groups of three or more under normal 12hour light/dark cycles, in polycarbonate cages with ad libitum food (Envigo 2929) and water on Teklad 7070C bedding (Envigo). All studies were conducted in compliance with a protocol reviewed by the University of North Carolina Institutional Animal Care and Use Committee in animal facilities approved and accredited by the Association for Assessment and Accreditation of Laboratory Animal Care International. All mice were 9-12 weeks of age at the time of filtered air or 2 ppm ozone exposure. Mice were randomly assigned to experimental groups within a pair or trio (referred to as a match), and all matches within a strain were randomized over the course of the study to address batch effects. A full calendar detailing the batches is included in Supplemental Figure S1 . Mice were exposed to O3 in individual wire-mesh chambers without access to food or water, as described previously (38, 39) . Exposures were performed at roughly the same time (from 9 am-12 pm), with highly stable concentrations of O3 delivered across all batches (Supplemental Figure S2 ). Lung phenotyping. Mice were returned to their normal housing upon cessation of exposure for 21 hours, at which point mice were anesthetized via an i.p. injection of urethane (2 g/kg) and euthanized by exsanguination via the inferior vena cava/descending abdominal aorta. Two bronchoalveolar lavage (BAL) fractions were collected as described previously (39) . Supernatant from the first BAL fraction was stored at -80°C for biochemical analysis. Pellets from both BAL fractions were pooled, and a portion was used for performing total and differential cell counts. Total protein measurement. Total BAL protein was quantified using the Quant-iT Protein Assay kit (Thermo Scientific), using manufacturer's instructions. Assays were performed in black 384well plates using 1 μL of BAL fluid and 75 μL of assay reagent. Samples were randomly assigned to 1 of 5 plates and each sample was plated in triplicate. Fluorescence was measured using the Cytation 5 multi-mode plate reader (BioTek), and values were calculated from the mean of the triplicate compared to the standard curve. Definition of ozone-induced lung injury phenotype using BAL protein data. Here, we define a protein response phenotype, indicative of lung injury, as the difference in total protein concentration between an O3-exposed mouse (or mice) and its FA-exposed match. For trios, the mean value from the two O3-exposed mice was used. Data from unpaired mice were excluded. Heritability calculations. Broad-sense heritability (H 2 ) was estimated using a Bayesian linear mixed model approach implemented in R/INLA (40) , as described previously with Heterogeneous Stock rats (41) . Importantly, because a strain-identity kinship matrix is used for the inbred CC strains to estimate genetic relatedness, additive and dominant genetic effects are confounded; thus, broad-sense rather than narrow-sense heritability is calculated, in contrast to the approach used in Keele et al. (41) . CC strain genotypes and haplotype mosaics. Inferred CC founder haplotype contributions were previously reconstructed by the UNC Systems Genetics Core using a Hidden Markov Model on weighted consensus genotype calls from the most recent common ancestors for a given CC strain. These founder genotype probability files are publicly available (http://csbio.unc.edu/CCstatus/index.py?run=FounderProbs). The genome cache was constructed with a reduced number of total loci, as described previously (42) . Briefly, to minimize computational burden and reduce the testing penalty when performing quantitative trait locus where yi represents the phenotype value for an individual pair or trio, i, represents the population mean, variables for batch date (with 17 levels, b = 1…17) and sex of a given match are included as covariates. Then, a full model was built, where QTL denotes the additive QTL effect of founder haplotypes at a given locus. The likelihoods of the two models were compared using a likelihood ratio test. QTL were deemed significant if they surpassed the genome-wide significance threshold, which was determined by permutation (n=1000). QTL effect size and QTL location estimation. The variance attributed to a QTL was calculated by linear regression. Effect size was computed as where SS is the sum of squares, or variation, attributed to founder haplotype at the peak marker (44) . Candidate variant (merge) analysis. A multiallelic merge analysis was used to identify candidate variants within the 80% confidence interval of the significant QTL peak (45) (46) (47) . This procedure tests whether a significant QTL signal, identified by the 8-haplotype association model, can be explained more parsimoniously by the pattern of known alleles at a given variant. Weight of evidence criteria for candidate gene identification. To evaluate candidate genes within the QTL region, we used a weight of evidence approach with the following criteria: 1. Existence of variants (within or near a gene, identified by merge analysis) whose strain distribution pattern is similar to or the same as the haplotype effects from the eight-allele model used for mapping. 2. Presence of variants (ascertained from #1) that alter the coding portion of the transcript, with an emphasis on those that also alter or disrupt the amino acid sequence. 3. Evidence that the candidate gene is expressed in any compartment or cell type within the lung. For this criterion, we queried (1) our previously published bulk RNA-seq data (39) generated from conducting airways tissue and airway macrophages isolated from female C57BL/6J mice exposed to FA and 2 ppm O3, and (2) We utilized a design in which mice within a given Collaborative Cross (CC) strain were exposed to filtered air (FA) or ozone (O3) in sex-and litter-matched pairs or trios, hereafter referred to as matches ( Figure 1 ). We observed an induction of neutrophilia that was highly variable across strains after O3 exposure ( Figure 2 ). Using data from O3-exposed animals only (because most strains have no neutrophils after FA exposure), we estimated the broad-sense heritability (H 2 ) to be 0.47 (95% CI: 0.36-0.60) and 0.52 (95% CI: 0.23-0.78) for percentage and total number of neutrophils in BAL, respectively. As a quantitative measure of lung injury, we measured total protein concentration in bronchoalveolar lavage (BAL) fluid ( Figure 3A ). This is a non-specific marker of serum contents (primarily serum albumin (4, 52) ) that leak into the airspace upon damage to the alveolar epithelium and increased permeability of the underlying capillary beds. Evidence of modest strain effects were observed in FA-treated animals, while treatment effects were clear in O3-exposed animals. H 2 was estimated at 0.26 (95% CI: 0.16-0.40) and 0.53 (95% CI: 0.41-0.64) for FA-and O3-exposed animals, respectively. We defined the total protein response as the difference in total protein concentration between the O3-exposed animal(s) and the FA-exposed animal for a given match within a strain. The range of total protein response values were nearly normally distributed, indicative of a polygenic, complex trait ( Figure 3B ), and H 2 for this response trait was estimated at 0.39 (95% CI: 0.26-0.54). To identify genomic regions associated with variation in O3-induced neutrophilia and total protein response, we performed quantitative trait locus (QTL) mapping using a haplotypebased regression procedure accounting for the effects of sex and batch. While we did not detect significant loci associated with neutrophilia, two QTL were associated with total protein response: one locus each on Chromosomes (Chr.) 15 (Oipq1, ozone-induced protein response QTL 1) and 10 (Oipq2), the first of which surpassed the genome-wide significance threshold of α = 0.05 ( Figure 4A ). These loci explained roughly 27 and 13% of variation in the phenotype, respectively. Oipq1 mapped to a region on Chr. 15 extending ~15 Mb (80% CI: 40.17-54.88 Mb) which contained 31 protein-coding genes ( Figure 4B ). To identify candidate genes underlying this QTL, we initially inspected the founder haplotype effects pattern to identify whether there were functionally distinct alleles that could be used to partition and prioritize candidates. Founder haplotype probabilities at the peak marker within Oipq1 (JAX00061625_to_UNC25526264: 47.12 Mb) showed an overrepresentation of CC strain pairs with C57BL/6J or CAST/EiJ associated with high total protein response, while strain pairs with A/J or WSB/EiJ were associated with low total protein response ( Figure 4C ). Haplotype substitution effects were modeled using Diploffect (43) , indicating a strong positive effect of the C57BL/6J and CAST/EiJ haplotypes on total protein response ( Figure 4D ). It should be noted that estimated allele effects for CAST/EiJ are less certain than for C57BL/6J due to lower representation of and less confident genotype calls for this founder haplotype at the peak locus ( Figure 4C ). Interestingly, when visualizing the subspecies origin of the CC founder haplotypes within the QTL interval using the Mouse Phylogeny Viewer (53, 54), we discovered a ~2 Mb region of Mus musculus domesticus intersubspecific introgression into the genome of CAST/EiJ (which is of castaneus origin) from ~43-45 Mb (Supplemental Figure S3 ). We then used the statistical approach TIMBR (44) to infer the allelic series, i.e., the number of functional alleles into which the founder haplotypes group. The results indicated a high likelihood of two functional alleles at this QTL, with greatest weight for a haplotype grouping of C57BL/6J and CAST/EiJ (Supplemental Figure S4 , Supplemental Tables S1 & S2). We note that, as we observed with Diploffect, there was a similar level of uncertainty for CAST/EiJ haplotype effects (Supplemental Figure S4) Figure S5A) . We examined the founder haplotype probabilities at the peak marker (UNC17621935: 26.204 Mb), which were sorted into a less defined pattern than at Oipq1. Strain matches with WSB/EiJ haplotype at this marker were associated with high total protein response, but all other haplotypes were spread evenly across the phenotypic spectrum (Supplemental Figure S5B) . Intriguingly, within much of this region (including at the peak marker) in the CC strains surveyed, there is essentially no or lowconfidence representation of the PWK/PhJ haplotype, and low representation of the other two wild-derived strains (30) . Previous work has established that PWK/PhJ has reduced genomewide contributions in living CC strains (30) , and selection against the PWK/PhJ haplotype may have occurred at this locus to maintain reproductive compatibility in the course of inbreeding (55) . Because Oipq1 surpassed the genome-wide significance threshold and had more clearly defined haplotype effects, we prioritized this locus for further analysis. To further rank candidate genes and variants within the QTL region, we performed a merge analysis with modifications to accommodate multi-allelic variants, as described previously (45) (46) (47) . In brief, this approach moves from association at the haplotype level to association at the level of individual variants (both single nucleotides and small insertions/deletions) with variation in the phenotype of interest. By using sequence information from the Inbred Strain Variant Database (56) for each of the CC founders and CC strains that were used for QTL mapping, variants can be identified that are distributed amongst the CC strains in concordance with the haplotype effects pattern. Founder strain haplotypes are "merged" into 2-7 groups in accordance with how the variants are distributed (i.e., the strain distribution profile). Variants in the merged model that explain trait variation equally well or better than the full haplotype model (but with fewer parameters) can be considered candidate quantitative trait variants (QTVs). In the Oipq1 region, 995 variants were identified using this method, with a -log10(p-value) > 4 (roughly the cut-off used for QTL mapping; Figure 5A ). Variants identified using merge analysis represented a variety of strain distribution patterns (SDPs), with two SDPs more common than the others: C57BL/6J alone or C57BL/6J and CAST/EiJ discordant from all other strains ( Figure 5B ). These variants were predicted to have several consequences on gene/protein function, though most were present within introns or other non-coding regions of the QTL ( Figure 5C ). Overall, this subset of variants was located within or near 21 genes (13 protein-coding, 8 predicted), most of which were concentrated within the region between ~40.9-44.6 Mb containing 16 genes (10 protein-coding) ( Figure 5D , Supplemental Table S4 ). We used a weight of evidence approach to further prioritize candidate genes within the interval, using the following criteria: (1) presence of variants that are concordant with the haplotype effects pattern, with an emphasis on C57BL/6J and/or CAST/EiJ discordant from all other founder strains; (2) presence of coding variants that lead to amino acid alterations; (3) evidence that a gene is expressed the lungs (from our previously unpublished and published data (39) , or publicly available datasets); and (4) biological relevance, as judged by a gene's known function and prior description in the literature (Table 1) . Two genes within the interval met all criteria: Angpt1 (angiopoietin-1) and Oxr1 (oxidation resistance 1). A third candidate (Rspo2, Rspondin 2) met three of the criteria used, and extensive prior annotation in the literature provided additional biological plausibility. Using information from Ensembl, UniProt, and multiple variant consequence prediction tools, we inspected the three missense variants within Oxr1 (rs50179186, rs31574788) and Angpt1 (rs32511504) to determine whether any had putative consequences on protein structure and/or function ( Here, we present discovery of two loci associated with ozone (O3)-induced lung injury, Oipq1 and Oipq2, located on Chr. 15 and 10, respectively. Notably, the region encompassed by Oipq1 has been previously implicated in two studies aiming to identify genetic variants associated with lung injury. One study utilized a C57BL/6J:129X1/SvJ F2 population to map QTL driving fatality due to hyperoxic (58) . In that study, the C57BL/6J allele at the QTL was associated with higher lung injury, consistent with the results of our study. Similarly, an overlapping QTL for pulmonary hemorrhage was identified in a study utilizing a CC003/Unc:CC053/Unc F2 intercross to identify genetic determinants of pulmonary responses to SARS-CoV. Though this QTL was not the focus of their study (59) , the authors did observe a positive association between the CAST/EiJ allele and pulmonary hemorrhage and the inverse relationship with the PWK/PhJ allele, matching the haplotype effects observed in our study. Therefore, there is strong evidence that this locus mediates responses to multiple stimuli that induce lung injury. Within Oipq1, we nominated three candidate genes: Rspo2 (R-spondin 2), Angpt1 (angiopoietin-1), and Oxr1 (oxidation resistance 1). Information about missense variants within Angpt1 and Oxr1 is lacking, because the results from variant prediction tools were conflicting and/or largely suggested that the missense variants had neutral effects on protein function. While we used a suite of approaches that each incorporate unique features for prediction including sequence homology, physico-chemical properties, and predicted secondary structure, even stateof-the-art methods fail to fully recapitulate functional characterization through experimental strategies (i.e., deep mutational scanning), which remain the gold standard (60) . Thus, direct experimental classification is needed to determine whether these variants have effects on protein function. It is also possible, perhaps even likely, that variants in this region affect the expression of these candidate genes (i.e., are eQTL), and that variation in gene expression underlies the Oipq1 QTL. Future studies addressing this hypothesis are also needed. One candidate gene at Oipq1, Rspo2, is a member of the R-spondin (RSPO) protein family, a class of secreted ligands known to regulate Wnt signaling, tissue regeneration and organization in various regions of the body (61). In particular, RSPO2 is required for lung, limb, and craniofacial development (62) , and Rspo2-deficient mice are born with various skeletal defects (63, 64) , and die immediately upon birth due to respiratory failure (65, 66) . Recessive mutations in RSPO2 cause tetra-amelia syndrome-2, a human syndrome characterized by partial or complete absence of limbs along with incomplete lung development (67). Jackson, et al. recently examined the consequences of Rspo2 conditional deletion in adult mice. They reported that Rspo2 loss caused lung neutrophilia via a disrupted lung endothelial barrier (68) . While there are no variants within this gene, we noted multiple noncoding variants within the in/near the gene, and its function is in alignment with features of the pathology caused by O3 exposure. Its expression is largely restricted to the lung mesenchyme; therefore, we were unable to directly interrogate whether it is altered by O3 exposure using our previously published data or other datasets. Additionally, its expression is quite low in all life stages beyond embryonic development, thus detecting whether Rspo2 expression is changed in the context of other lung insults using publicly available data has proven challenging. Nevertheless, the phenotypes observed after its conditional deletion in adulthood are compelling, and warrant further investigation in the context of O3-induced lung injury. The second candidate gene Angpt1 encodes angiopoietin-1 (ANGPT1), which is a secreted ligand that binds to and activates the Tie2 receptor tyrosine kinase (encoded by Tek) to promote endothelial barrier function and vascular growth (69) . It has also been suggested that ANGPT1 activity can improve and reinforce leaky or otherwise poorly functioning vessels (70, 71) . ANGPT1 is largely produced by vascular support cells and platelets, and its activity can be antagonized by ANGPT2 (72) . Disrupted balance of ANGPT2/ANGPT1 in serum is associated with poor outcomes in a variety of disease states, including acute respiratory distress syndrome, bronchopulmonary dysplasia, and sepsis (73) (74) (75) . Genetic association studies have identified variants in ANGPT2 associated with susceptibility to acute lung injury (76) . Together, these studies suggest that ANGPT2 is often a pathogenic regulator of acute lung injury and barrier dysfunction, and balance to this system can be restored by supplementation with ANGPT1. Thus, one hypothesis arising from our study is that in strains with haplotypes associated with more severe protein response (i.e., C57BL/6J and potentially CAST/EiJ), there may be variants in or near Angpt1 that alter its activity and/or function, thereby disrupting its ability to negatively regulate ANGPT2/TEK signaling. In vitro and in vivo functional validation studies in models of O3 exposure or other types of acute lung injury will be necessary to evaluate this hypothesis. Our final candidate gene, Oxr1 (oxidation resistance 1), has less evidence tying its functions to acute lung injury and epithelial barrier integrity, though these remain to be examined. This protein, while aptly named for a role in response to oxidant gases, has largely been implicated in maintaining genome integrity and cell survival in the face of stressors that cause either oxidative stress-dependent or -independent DNA damage (e.g., reactive oxygen species, radiation, alkylating agents) (77) . This gene was first discovered in an E. coli screen for human genes involved in repairing or preventing oxidative DNA damage (78) , and later discovered to function largely in the mitochondria (79) . Oxr1 has since been studied largely in the context of neurological diseases including amyotrophic lateral sclerosis (ALS) (80) (81) (82) and other neurodegenerative conditions, as well as a recent study describing recessive loss-offunction variants in OXR1 associated with cerebellar atrophy, seizures, developmental delays, among other clinical features (83) . Only a few studies have mentioned this gene in the context of lung disease, including one examining vanadium pentoxide (V2O5)-induced occupational bronchitis where OXR1 expression was induced in human lung fibroblasts after exposure to V2O5 (84) . It is worth mentioning that mice express multiple isoforms of Oxr1 whose tissue specificity was recently characterized (85): many tissues expressed the shortest version (Oxr1D) and Oxr1B1-4, while the longest (Oxr1A) was restricted to the brain. The cited study went on to characterize the functions of the OXR1A isoform, demonstrating that its TLDc domain (present in all OXR1 isoforms) facilitates interactions with a variety of proteins including the PRMT5 methyltransferase, thus representing one pathway by which this gene alters cellular function in response to stress. However, further studies will be required to understand its roles in oxidative stress responses in the lungs and assess its relationship to the protein response phenotype measured here. We did not detect any loci associated with variation in airway neutrophilia, another hallmark O3-induced phenotype, despite the fact that others have previously identified QTL for this trait (20) . However, we note that this phenotype was highly heritable (H 2 : ~0.47 for percentage and ~0.52 for total number). Thus, one potential explanation for the lack of QTL is that the genetic architecture of airway neutrophilia may be more complex than lung injury and involve contributions from many loci with individually small effects. One option to address this would be to incorporate information about phenotypes that lie along the causal chain from O3 exposure to airway neutrophilia (e.g., cytokines), as we expect these intermediate phenotypes to have higher heritabilities and simpler genetic architectures. Alternatively, to achieve greater statistical power for QTL mapping, one could make use of the Diversity Outbred mouse population since a larger number of genetically distinct mice can be examined in that population (86) . However, we note here that the protein response QTL we identified here was detected using a "delta" framework (in which the phenotype was the difference between O3-vs FA-exposed mice), which required inbred strains so that baseline (filtered air) effects could be accounted for. In conclusion, we have identified a significant QTL on mouse Chr. 15 associated with O3-induced lung injury. Through additional genetic and bioinformatic analyses, we delimited the QTL region and identified three high priority candidate genes worthy of additional investigation. Our study also demonstrates the utility of the Collaborative Cross genetic reference population for identifying interactions between genetic variants and environmental exposures. Lung function and inflammatory responses in healthy young adults exposed to 0.06 ppm ozone for 6.6 hours 6-hour inhalation of ozone concentrations from 60 to 87 parts per billion in healthy humans Ozone-induced airway inflammation in human subjects as determined by airway lavage and biopsy Ozone-induced inflammation in the lower airways of human subjects Inflammation and cell damage induced by repeated exposure of humans to ozone Ozone exposure increases respiratory epithelial permeability in humans Association between ozone and hospitalization for respiratory diseases in 16 Canadian cities Association of low-level ozone and fine particles with respiratory symptoms in children with asthma Short-term associations between ambient air pollutants and pediatric asthma emergency department visits Air pollution and acute respiratory infections among children 0-4 years of age: an 18-year time-series study Oxidants and the pathogenesis of lung diseases Outdoor Air Pollution and New-Onset Airway Disease Pulmonary function, bronchial reactivity, and epithelial permeability are response phenotypes to ozone and develop differentially in healthy humans Ozone-induced airway inflammatory changes differ between individuals and are reproducible Reproducibility of individual responses to ozone exposure The glutathione-S-transferase Mu 1 null genotype modulates ozoneinduced airway inflammation in human subjects Association of tumor necrosis factor-alpha polymorphisms and ozoneinduced change in lung function Ozone exposure, antioxidant genes, and lung function in an elderly cohort: VA normative aging study Ozone exposure, vitamin C intake, and genetic susceptibility of asthmatic children in Mexico City: a cohort study Linkage analysis of susceptibility to ozone-induced lung inflammation in inbred mice Genetic analysis of ozone-induced acute lung injury in sensitive and resistant strains of mice Genetic susceptibility to ozoneinduced lung hyperpermeability: role of toll-like receptor 4 Ozone-induced lung inflammation and hyperreactivity are mediated via tumor necrosis factor-alpha receptors Identification of candidate genes downstream of TLR4 signaling after ozone exposure in mice: a role for heat-shock protein 70 Deficiency in inducible nitric oxide synthase protects mice from ozone-induced lung inflammation and tissue injury Ozone-induced lung injury and sterile inflammation Tumor necrosis factor receptor 2 contributes to ozone-induced airway hyperresponsiveness in mice The Collaborative Cross, a community resource for the genetic analysis of complex traits The genome architecture of the Collaborative Cross mouse genetic reference population Genomes of the Mouse Collaborative Cross Genetic analysis of complex traits in the emerging Collaborative Cross Genetic analysis in the Collaborative Cross breeding population Integrative genetic analysis of allergic inflammation in the murine lung Identification of trans Protein QTL for Secreted Airway Mucins in Mice and a Causal Role for Bpifb1 Genetic regulation of Zfp30, CXCL1, and neutrophilic inflammation in murine lung Modeling host genetic regulation of influenza pathogenesis in the collaborative cross Genome Wide Identification of SARS-CoV Susceptibility Loci Using the Collaborative Cross Development of a large-scale computer-controlled ozone inhalation exposure system for rodents Transcriptional Profiling of the Murine Airway Response to Acute Ozone Exposure Animal models and integrated nested Laplace approximations Genetic Fine-Mapping and Identification of Candidate Genes and Variants for Adiposity Traits in Outbred Rats Integrative QTL analysis of gene expression and chromatin accessibility identifies multi-tissue patterns of genetic regulation Bayesian modeling of haplotype effects in multiparent populations Inferring the Allelic Series at QTL in Multiparental Populations Identification of Candidate Risk Factor Genes for Human Idelalisib Toxicity Using a Collaborative Cross Approach Editor's Highlight: Candidate Risk Factors and Mechanisms for Tolvaptan-Induced Liver Injury Are Identified Using a Collaborative Cross Approach Using progenitor strain information to identify quantitative trait nucleotides in outbred mice Predicting deleterious amino acid substitutions A method and server for predicting damaging missense mutations Predicting the functional effect of amino acid substitutions and indels Better prediction of functional effects for sequence variants The lung response to ozone is determined by age and is partially dependent on toll-Like receptor 4 Subspecific origin and haplotype diversity in the laboratory mouse Comparative analysis and visualization of multiple collinear genomes Male Infertility Is Responsible for Nearly Half of the Extinction Observed in the Mouse Collaborative Cross A Repository for Probabilistically Informed Sequence Differences Among the Collaborative Cross Strains and Their Founders. G3 Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies Genetic analysis of hyperoxic acute lung injury survival in reciprocal intercross mice Allelic Variation in the Toll-Like Receptor Adaptor Protein Ticam2 Contributes to SARS-Coronavirus Pathogenesis in Mice Variant effect predictions capture some aspects of deep mutational scanning experiments The R-spondin protein family R-spondin 2 is required for normal laryngeal-tracheal, lung and limb morphogenesis Asymmetric limb malformations in a new transgene insertional mutant, footless R-spondin2 expression in the apical ectodermal ridge is essential for outgrowth and patterning in mouse limb development Craniofacial malformation in R-spondin2 knockout mice Mouse R-spondin2 is required for apical ectodermal ridge maintenance in the hindlimb RSPO2 inhibition of RNF43 and ZNRF3 governs limb development independently of LGR4/5/6 R-spondin 2 mediates neutrophil egress into the alveolar space through increased lung permeability Requisite role of angiopoietin-1, a ligand for the TIE2 receptor, during embryonic angiogenesis Leakage-resistant blood vessels in mice transgenically overexpressing angiopoietin-1 Angiopoietin-1 protects the adult vasculature against plasma leakage Angiopoietin-2, a natural antagonist for Tie2 that disrupts in vivo angiogenesis Excess circulating angiopoietin-2 may contribute to pulmonary vascular leak in sepsis in humans Hyperoxia causes angiopoietin 2-mediated acute lung injury and necrotic cell death Circulating angiopoietin 2 correlates with mortality in a surgical population with acute lung injury/adult respiratory distress syndrome ANGPT2 genetic variant is associated with trauma-associated acute lung injury and altered plasma angiopoietin-2 isoform ratio Oxidation resistance 1 functions in the maintenance of cellular survival and genome stability in response to oxidative stress-independent DNA damage Functional genomics reveals a family of eukaryotic oxidation protection genes Stress induction and mitochondrial localization of Oxr1 proteins in yeast and humans Oxr1 is essential for protection against oxidative stress-induced neurodegeneration Neuron-specific antioxidant OXR1 extends survival of a mouse model of amyotrophic lateral sclerosis Oxr1 improves pathogenic cellular features of ALS-associated FUS and TDP-43 mutations Loss of Oxidation Resistance 1, OXR1, Is Associated with an Autosomal-Recessive Neurological Disease with Cerebellar Atrophy and Lysosomal Dysfunction Genomic analysis of human lung fibroblasts exposed to vanadium pentoxide to identify candidate genes for occupational bronchitis OXR1A, a Coactivator of PRMT5 Regulating Histone Arginine Methylation High-resolution genetic mapping using the Mouse Diversity outbred population The authors would like to acknowledge the assistance of Daniel Vargas and Jessica Bustamante Figure 1 . Study design. We exposed adult female and male mice from 56 Collaborative Cross (CC) strains in litter-and sex-matched pairs/trios to filtered air (FA) or 2 ppm ozone (O3) for 3 hours. Animals were necropsied 21 hours after exposure, and various parameters were measured including the type and quantity of inflammatory cells and the total protein concentration (lung injury marker) in the bronchoalveolar lavage (BAL) fluid. Matches were randomized over 17 batches. To examine strain-by-treatment interactions, we defined a protein response phenotype as the difference in total protein concentration between O3exposed mice and FA-exposed mice within a matched pair or trio. The distribution of this phenotype is roughly normally distributed, which suggests that it is a complex trait influenced by multiple genetic loci. (n = 2 pairs and 2 trios per strain, on average)