| Sign In to gain access to subscriptions and/or personal tools. |
Best Practices for Use of Historical Control Data of Proliferative Rodent Lesions
1 GlaxoSmithKline, King of Prussia, Pennsylvania, USA Correspondence: Address correspondence to: Charlotte Keenan, GlaxoSmithKline, 709 Swedeland Rd, UE0360, King of Prussia, PA 19406, USA; e-mail: charlotte.m.keenan{at}gsk.com.
Key Words: historical control data rodent tumors carcinogenicity studies best practices
The Historical Control Data Working Group, under the direction of the Scientific and Regulatory Policy Committee (SRPC) of the Society of Toxicologic Pathology (STP) was tasked with reviewing the current scientific practices, regulatory guidance, and relevant literature pertaining to rodent microscopic historical control data (HCD) of proliferative lesions to provide best practice recommendations for locating, generating, and applying such data. The Working Group focused exclusively on HCD of proliferative lesions from nonclinical rodent carcinogenicity studies. The HCD Working Group recommends the following consensus principles to guide the use of HCD of proliferative lesions from chronic rodent (rats/mice) bioassays:
The Historical Control Data Working Group, under the direction of the SRPC of the STP, was tasked with reviewing current scientific practices and the relevant literature and regulatory guidances to provide best practice recommendations for locating, accessing, generating, and applying rodent microscopic HCD for proliferative lesions from rodent carcinogenicity studies. The usefulness of HCD has been an ongoing topic of vigorous discussion within the field of toxicologic pathology. Authors have either dismissed or supported the use of HCD as a reliable comparator to concurrent study-specific control data. However, there is general consensus that for any particular study, the incidence of proliferative lesions in the concurrent control group or groups is the most appropriate and accurate comparison to that of the treated groups. When HCD are used as a comparator, the design of studies comprising them should be similar to the study under review (Boorman et al. 2002; Deschl et al. 2002; Gopinath 1994; Greim et al. 2003; Haseman, Huff, and Boorman 1984; Haseman 1995a, 1995b; Keenan et al. 2002; van Zwieten et al. 1988). The use of HCD can be valuable when, within a specific study, the concurrent control results give equivocal comparisons and interpretations to the treated groups or when there is a need to provide quality control for intercurrent factors that may have compromised the survival of the control or treated animals (Deschl et al. 2002; van Zwieten et al. 1988). A typical example of a result that may be difficult to explain would be the presence or high incidence of a rare tumor or other uncommon findings in the treated animals compared to the control animals. The reverse situation would be when an unusually variable or high incidence of a tumor type is found in the control animals, but at a lower incidence in the treated animals, thereby possibly masking a compound-related effect (Deschl et al. 2002; van Zwieten et al. 1988). The potential limitation for the use of HCD generally focuses on the variability and drift over time in animal and study-related factors such as animal genetics, the experimental environment, and the macroscopic and microscopic pathological interpretations (Wolf and Mann 2005). It is generally believed that HCD from within a laboratory are more homogeneous. In contrast, there is a concern that data from multiple laboratories may be of limited interpretive value because of increased variability of diagnostic interpretation (Roe 1994; Yoshimura and Matsumoto 1994). Progress has been made in recent years by various groups such as the Registry of Industrial Toxicology Animal Data (RITA), North American Control Animal Database (NACAD) International Agency for Research on Cancer (IARC), National Toxicology Program (NTP), STP, and others to reduce some of the sources of variability with international efforts to improve harmonization and standardization of terminology, trimming procedures, and study designs. Examples include the efforts by the International Conference on Harmonization (ICH; http://www.i-ch.org/cache/compo/502-272-1.html); guides produced by the International Agency for Research on Cancer (IARC) (International Classification of Rodent Tumours, part 1, Rat, ed. U. Mohr [Lyon, France: IARC Scientific Publications, 1992]; complete listing provided at http://reni.item.fraunhofer.de/reni/public/rita/icrt_rat.htm [accessed March 17, 2009]); the Revised Guides for Organ Sampling and Trimming in Rats and Mice: Parts 1–3 provided by the RITA group (http://www.item.fraunhofer.de/reni/trimming/index.php [accessed March 13, 2009]); and the Society of Toxicologic Pathology Standardized System of Nomenclature and Diagnostic Criteria (SSNDC) Guides (http://www.tox-path.org/ssndc.asp). The U.S. Food and Drug Administration (FDA) has recognized the efforts to harmonize, and this is reflected in the Redbook (http://www.cfsan.fda.gov/~redbook/red-ivc6.html). The STP also regularly publishes position papers outlining Best Practice Guidelines (Crissman et al. 2004; Morton et al. 2006). In 1994, a project to harmonize rat nomenclature was established by the Joint STPs and International Life Sciences Institute (ILSI) Committee on International Harmonization of Nomenclature and Diagnostic Criteria in Toxicologic Pathology. The results of discussions among the Rat Nomenclature Reconciliation Subcommittees, a presentation during the 1999 annual STP meeting in Washington, D.C., and comments that have been sent to the committee are available on Web sites (http://www.item.fraunhofer.de/reni/rat_nomenclature/index.htm [accessed March 13, 2009]; http://www.toxpath.org/nomen/index.htm [accessed March 13, 2009]). This project evolved into the current International Harmonization of Nomenclature and Diagnostic Criteria for Lesions in Rats and Mice (INHAND) initiative. This project is sponsored by several Societies of Toxicologic Pathology Japanese Society of Toxicologic Pathology (JSTP), British Society of Toxicologic Pathologists (BSTP) and European Society of Toxicologic Pathology (ESTP) and has been organized to describe and publish uniform nomenclature for both proliferative and nonproliferative lesions in laboratory rodents (http://www.goreni.org/back_inhand.php [accessed March 13, 2009]). This harmonization of nomenclature should improve uniformity and accuracy of histopathological diagnoses and further increase the value of HCD. At present, detailed procedures that assist with the use of HCD in interpreting the significance of lesions within a rodent study submitted for regulatory review are not available. To gain a better understanding of how HCD are currently used in the professional community, a survey was undertaken. The survey questions focused on current practices and use of historical data, the source of data, factors affecting the model system, presentation of the data, and demographic information. Respondents represented the pharmaceutical and chemical industry as well as government and regulatory bodies. This article summarizes existing regulatory guidance pertaining to HCD; provides a report of current practice by interpreting the results of the survey; presents a review of the relevant literature; and compiles recommendations for generating, locating, accessing, and applying HCD.
A short summary of guidances from a variety of regulatory agencies addressing HCD is provided.
U.S. Environmental Protection Agency (EPA)
U.S. Food and Drug Administration (FDA)
European Medicines Agency (EMEA) Although the 2004 guidance states that a factor limiting the regulatory acceptance of transgenic models for carcinogenicity assessment is the relatively small database for neoplastic HCD available for these models (EMEA 2004), the body of information on these models continues to grow.
Organization for Economic Cooperation and Development (OECD) In the OECD guidance for the evaluation and analysis of chronic toxicity and carcinogenicity studies (OECD 2002), it is stated that concurrent control groups should always be used and that statistical comparisons with HCD are generally not appropriate because of the many variables affecting the incidence of spontaneous tumors. The guidance does, however, note that HCD can be useful for establishing the acceptability of the "normal" data from the control groups and for judging the biological significance of the occurrence of rare or unusual tumors and nonneoplastic abnormalities. However, the guidance stresses that any HCD used should be comparable with the study in question. Ideally, HCD should be generated by the same laboratory at which the study being assessed was performed. Furthermore, the HCD should come from studies conducted within five years prior to or two to three years after the study being evaluated. The guidance recommends that parameters that could affect the occurrence of spontaneous tumors in HCD are identified. The guidance refers the reader to the European requirements for the submission of HCD in the European Commission (EC) Directive 91/414/EEC (European Economic Community 1991). Within this directive, Amendment M4—Section 5.5—Long Term Toxicity and Carcinogenicity, Test Conditions, pp. 61–62, states that while the standard reference point for evaluating treatment responses in long-term toxicity and carcinogenicity studies is the concurrent control group, HCD may be helpful in the interpretation of particular studies. The HCD should, however, come from animals of contemporaneous studies in the same species and strain, maintained under similar conditions.
Japan In summary, all regulatory guidance suggests that the concurrent control should always be the principal comparator and that properly selected HCD can be informative in specific cases where there may be questions on whether the treated groups show an effect or not.
An informal survey (Table 1) was sent to more than one hundred contacts covering a wide range of industry and regulatory agencies throughout the world, and fifty-five responses were returned. Responses consisted of individual opinions of contact representatives as well as of single, harmonized responses from organizations with multiple operational sites. Because of the relatively small sample size, no formal statistical analysis was possible. The responses were suitably proportional to the groups surveyed with the majority from pharmaceutical industry, followed by chemical industry, contract research organizations (CROs), government research organizations (e.g., NIH), and regulatory authorities. Regulatory respondents included staff from U.S. FDA, U.S. EPA, Health Canada, the European Union (EU), and Japan. Approximately 44% of respondents were from the United States, 33% from Europe, 7% from Japan, 4% from Canada, and 12% indicated a worldwide location.
For most survey questions, respondents had more than one option to answer the question (Table 1); therefore, the total of all percentage rates given is greater than 100%. In general, 49% of respondents indicated reviewing between one and five carcinogenicity studies per year. Approximately 24% of respondents review more than five studies a year. This suggests a substantial experience base among respondents as well as a substantial need for availability of high-quality HCD on a regular basis. The majority of respondents use HCD whenever there are statistically significant increased incidences of neoplastic or hyperplastic lesions (91%). Most respondents replied that HCD are particularly useful for the interpretation of rare proliferative lesions (93%) or borderline differences from concurrent control groups (73%). Frequently respondents stated they consult HCD for biological trends regardless of statistical significance (60%), incorporate HCD as part of their interpretation on the request of regulatory agencies (53%), and/or use them to evaluate the consistency of the concurrent control data (45%). The majority of respondents (60%) prefer to use HCD from animal models similar to those of the study being evaluated. Respondents felt that rodent strain, feeding practices, and route of administration are the three most important comparators when evaluating HCD. Typically, respondents use up to five years of collected HCD if available (53%), while 33% of respondents include more than five years, 13% include the most recent three years, and 1% include only the most recent two years. On how data are routinely presented, most respondents chose more than one of the five possible answers; therefore, the total percentage rate is greater than 100%. However, most respondents chose to report a historical control range of incidence (74%) followed by presentation of HCD as percent (51%) and range or mean with scientific opinion on relevance (44%). A majority of respondents (91%) report HCD without performing statistical analyses. There is no consistent preference for the source of the data as respondents appeared to routinely use with equal frequency published literature (71%), internal databases (70%), public and nonpublic sources (e.g., the public Charles River [56%] and NTP [54%] databases and the proprietary database of the RITA [34%]) as well as data from the laboratory that conducted the study (50%). About two-thirds of the respondents using RITA as a source for HCD are located in Europe or are part of globally operating organizations. An important concern expressed by 67% of respondents is the lack of consistency and robustness of the databases available to obtain HCD. As a consequence, some respondents commented that HCD are not applied appropriately in evaluating the relevance of proliferative lesions. Specifically, the respondents concerns centered on the variability in recording background lesions due to the lack of a standard lexicon; variability in the experience of the study pathologist; and differences between studies as well as dissimilar parameters used with regard to animal age, feed, strain, caging, and dose regimen. The consistency of the application of diagnostic criteria (26%); the fact that HCD are overvalued, overinterpreted, or abused (23%); and the lack of a reliable source for robust HCD (23%) were additional concerns expressed by respondents. Some presented concerns with genetic drift (19%) and the need for considering the biological significance of study findings and HCD (16%). A common theme in the "comments" section of the survey for ways to improve HCD was the desire for a readily accessible, centralized, independent, and standardized database for HCD. Many respondents suggested that such a database for HCD would be best served by the use of harmonized diagnostic criteria, terminology, and enhanced statistical methods.
When using HCD, several factors must be considered that can be consolidated into two categories: the in-life (strain, age, study duration, body weight, housing, route of administration, diet, vehicle, and test article cross-contamination of controls) and the postmortem factors (necropsy, trimming, histopathologic criteria, and terminology).
Strain (Source/Vendor, Age-Matched) There is a general presumption that genetic drift occurs over time, resulting in changes in the reported frequency of certain lesions leading to the recommendation of using only HCD of certain time intervals (U.S. EPA, http://www.epa.gov/iris/backgrd.htm; EMEA 2002). However, longitudinal analyses (Eiben and Bomhard 1999; Eiben 2001; Haseman et al. 1989; Tennekes et al. 2004a, 2004b) have shown that some parameters may change while others remain stable for periods of up to twenty years (Deschl et al. 2002). As the species and strain of the animals used in carcinogenicity testing of chemicals influence the spontaneous tumor profile, likewise the sex of the test animal also influences the profile. Gender differences in the spontaneous tumor incidences of rats and mice are well documented (Attia 1996; Baldrick 2005; Eiben 2001). Other factors may influence the variety and incidence rates of proliferative lesions within HCD, such as litter size and birth weight (Roe 1994). In addition, since most neoplasms increase in incidence with age, it is important to consider the age of the animal when comparing historical controls to concurrent controls and treatment groups. It may not be clear in reports or publications of tumor incidence if tumors identified in decedents (unscheduled deaths) have been combined with those at scheduled termination. Most rodent studies are conducted for 18 to 24 months in mice and 24 months in rats. However, some individual investigators extend the study to full lifetime or "natural" death of the animal. These studies can last significantly longer than most standard bioassays and can be difficult to interpret relative to HCD. As an example, in one study, there were clear trends of higher incidence rates over time in pituitary tumors, adrenal pheochromocytomas, and mammary gland tumors, which also showed a shift towards malignancy from 2/31 malignant tumors after 24 months to 34/101 after 30 months in study durations (Bomhard 1992; Bomhard and Rinke 1994). A study investigating strain differences between Sprague-Dawley and Wistar rats showed that pancreatic islet cell carcinomas occurred late in the study (Germann et al. 1999). In 36 male Wistar rats, the first carcinoma was observed at 649 days of age (mean age 847 days) and on day 575 in females (mean age 781 days of 7 cases). In Sprague Dawley rats, which had more severe nephropathy and did not survive as long, the first male with carcinoma was 591 days old (mean age 709 days of 25 cases) and the first female was 682 days old (mean age 761 days). These data highlight the importance of considering age effects when comparing historical controls, concurrent controls, and treatment groups.
Housing (Caging, Identification Methods, Temperature, Humidity, Lighting) The type of caging and number of animals per cage can influence proliferative responses. A decrease in testicular interstitial cell tumors and increase in pituitary tumors in F344 rats was reported in studies where rats were housed individually compared to studies with group housing (Haseman et al. 1997; Nyska et al. 1998, 2002). The survival rate for male B6C3F1 mice housed individually was significantly lower (66.5% vs. 82%) when housed in wire mesh instead of polycarbonate cages (Rao and Crockett 2003). However, in this evaluation, mice kept in wire mesh cages were used in inhalation studies and compared to animals from feeding studies. The unique identification of animals is critical for every toxicological study. The implantation of electronically readable transponder microchips is a reliable and frequently used identification method. However, induction of proliferative lesions associated with these implants have been reported as an additional variable impacting HCD (Rao and Edmondson 1990; Tillmann et al. 1997; Elcock et al. 2001; Le Calvez, Perron-Lepage, and Burnett 2006; Blanchard et al. 1999). There are also physiological effects from different levels of noise, lighting, temperature, and humidity (http://www.nal.usda.gov/awic/pubs/Rodents/noise_light_temp.htm [accessed March 13, 2009]). Recent experiments have evaluated light cycle disruption mimicking jet-lag in mouse tumor models. Prior to inoculation with tumor cells, mice were synchronized with 12 hours of light and 12 hours of darkness or underwent repeat 8-hour advances of the light/dark cycle every 2 days to simulate jet-lag. The 24-hour rest/activity cycle was ablated, and the rhythms of body temperature, serum corticosterone, and clock protein expression were markedly altered in jet-lagged mice as compared with controls. Tumors grew faster in the jet-lagged animals, compared with controls, suggesting that altered environmental conditions can disrupt circadian clock molecular coordination in peripheral organs including tumors and could play a role in malignant progression (Filipski et al. 2004).
Route of Exposure
Diet (Type and Feeding Practice) Extensive research has consistently shown the longevity benefits of moderate caloric restriction or diet modification by reducing protein and/or increasing fiber compared to the ad libitum feeding of a standard nutritionally balanced rodent diet. Dietary modifications that contribute to a reduction in body weight gain and a consequent reduced incidence and severity of chronic degenerative conditions lead to improved survival (Keenan et al. 1999; Rao, Edmondson, and Elwell 1993; Masoro, Shimokawa, and Yu 1991). Improved survival allows for the detection of tumors that occur later in life. Some debate has occurred regarding whether the restricted food consumption results in a decreased sensitivity to the development of tumors (Allaben et al. 1996); therefore, a distinction must be made between moderate caloric restriction and more severe caloric restriction, which has been shown to delay the onset of tumors in rodents (Kritchevsky 1993). In studies where rats were given an amount of nutritionally balanced feed that provided for initial growth and then maintenance over the life span of rodents, the incidence of spontaneous tumors observed at two years was in the same range as that from ad libitum fed rodents, but the trend was for tumors to be identified at the terminal necropsy rather than in early unscheduled necropsies (Keenan et al. 1999). Because of the impact of diet on the development of tumors and degenerative disease, diet is a major factor in comparing tumor data, either within the same laboratory or between laboratories. Information on the type of diet (standard, reduced protein, or increased fiber) and the amount provided is important to a sound comparative review. Additionally, depending on the type of feeder used, or whether animals are individually or group housed, partial food restriction may occur unintentionally even in studies with an ad libitum feeding regimen. Another important consideration is the potential for vehicle effects on study results. Of the many commonly used vehicles, only corn oil has been associated with increased body weights, an increased incidence of pancreatic acinar cell adenomas, and with a decreased incidence of mononuclear cell leukemia in male F344 rats (Haseman et al. 1985; Haseman and Rao 1992; NTP 1994). This observation underscores that any novel vehicle used in long-term studies needs to be evaluated for potential effects before their study data are used as HCD for comparative purposes. The potential contamination of control animals with test article also needs to be considered. Trace levels of contamination that are below the lower limit of quantification are, in principle, considered to be nonrelevant (EMEA 2005).
Tissue Sampling and Trimming Procedures
Diagnostic Criteria and Terminology (Use of Standardized Nomenclature) When considering published HCD, the use of different nomenclature for the same lesion can be misleading to those unaware that terminology may be synonymous. This can be a problem when different terminology for the same lesion is used in the same study (Hardisty 1985). There may be a need for different terminologies when describing a lesion in morphological descriptive terms or to address differences in topography (Goodman 1988). The use of synonymous terms within a study can impact the statistical interpretation of that study. Considering HCD, the use of different terminologies for the same change in different studies can be confusing and may lead to erroneous conclusions unless the synonyms are defined and understood (Haseman, Huff, and Boorman 1984; U.S. FDA CDER 2001; U.S. EPA 2005). A complete record of the diagnostic criteria by which the data were compiled is critical for appropriate use of HCD. When there are multiple instances of the same tumor in a tissue or an animal, it should be clear if they were counted separately or combined. In addition, there are neoplasms where classification as hyperplastic or benign or malignant can be difficult and somewhat subjective. For example, the diagnosis of thyroid follicular tumors can be challenging and differentiation of hyperplasia from adenoma is difficult; often the key feature separating these two entities is compression (Society of Toxicologic Pathology, SSNDC Guides, http://www.toxpath.org/ ssdnc/ThyroidParathyroidPro.pdf [accessed March 13, 2009]). Other complexities include when tumors that are related ontogenically are combined according to the predominant differentiated cell type, though they may have variable differentiation, such as basal cell tumors in the epidermis, adnexa, or Zymbals glands. In all of these cases, information on how these decisions were made within a particular study(ies) used as part of HCD would assist interpretation. When utilizing published HCD, it is helpful to know which diagnostic criteria were used; however, this information is not typically available. A number of papers have been published presenting and evaluating HCD from studies carried out by the NTP (Haseman, Hailey, and Morris 1998; Haseman and Rao 1992; Brix et al. 2005). Consistent diagnostic criteria were applied in the interpretation of these findings. Some authors did publish the diagnostic criteria used in their studies but reported that the criteria changed for studies included in the published historical data (Baldrick and Reeve 2007; McMartin et al. 1992; Tennekes et al. 2004a, 2004b). Other authors have compiled HCD tables where the same pathologists were involved in all aspects of the evaluation and peer review process, to ensure consistency between the studies used in the tabulation of data (Brix et al. 2005). This information should be taken into consideration when published data are used for comparison with study data. An important aim in the development of the RITA historical control database was the establishment of harmonized nomenclature and standard diagnostic criteria for proliferative lesions in rodents to ensure consistency between pathologists and laboratories (Mohr et al. 1990; Morawietz, Rittinghausen, and Mohr 1992). This effort resulted in the International Classification of Rodent Tumors published by IARC (1992–1997). As noted in the introduction, there is a current effort under way with INHAND to further increase global harmonization of diagnostic criteria and terminology.
Peer Review Histopathologic diagnoses are one of the most important sources of variability in tumor rates (Haseman 1993). A peer-review procedure conducted at the laboratory of origin ensures that consistent criteria are used for the diagnoses of all tumors in studies conducted at that laboratory and increases the reliability of study data (Gopinath 1994; Greim et al. 2003; Hardisty 1985; Haseman 1990, 1992, 1993, 1995a, 1995b; van Zwieten et al. 1988; Ward et al. 1995). The STP has published recommendations specifically on the peer-review process (STP 1991). The peer-review process is described within the Best Practices Guidelines for Toxicologic Histopathology (Crissman et al. 2004). A database for control animal pathology data, therefore, must have established peer-review procedures to ensure comparability of histopathology diagnoses for all studies entered into such a database (Haseman 1992). The peer-review process is a pivotal procedure completed prior to incorporation of data into the NTP and RITA historical control databases (Boorman et al. 2002; Deschl et al. 2002; Haseman et al. 1997; Morawietz, Rittinghausen, and Mohr 1992; Ward and Reznik 1983).
Innate Biological Variability
HCD may be useful in the interpretation of rare tumors, marginally greater incidences and/or severity of proliferative changes in treated animals compared to controls, and unexpected increases or decreases of tumor incidences in study control animals. HCD can be used as a tool, including the possibility of statistical evaluation, to provide scientific perspective of disparate findings in dual concurrent control groups and review trends in tumor biology and behavior that may evolve over time in these rodent models (Haseman, Huff, and Boorman 1984). There are a number of situations in which HCD may be used to assist in interpretation of study data. It is beyond the scope of this article to address all potential situations. Occasionally, the concurrent controls may have an incidence that is at the high end of the HCD range, which may mask a treatment effect; or the concurrent controls are at the low end of a normal range for the laboratory, which could result in an overinterpretation of a possible treatment related effect. There is also the possibility that the study is flawed for design or technical reasons that may bias a study toward an apparent increase, or decrease, in a specific tumor incidence unrelated to a test article effect (Ettlin and Prentice 2002). For example, apparently simple differences in housing—group versus individual—can lead to significant differences in the occurrence of certain tumor types, such as testicular and pituitary tumors, in otherwise identical studies (Haseman, Hailey, and Morris 1998). An important consideration impacting selection of HCD for comparison with study data is whether to combine neoplasms from different anatomic locations and similar histologic ontogeny but different histomorphology. Guidelines for combining neoplasms have shown that some neoplasms can be combined and, in some cases, preneoplastic lesions such as hyperplasia could be included as part of the weight of evidence in support of carcinogenicity (McConnell et al. 1986). Two examples are the transition from hyperplasia or dysplasia to malignancy for tumors of the nasal cavity and of the glandular stomach in rats. The guidelines also list neoplasms for which combining is inappropriate, such as combining malignant lymphomas and histiocytic sarcomas, which are of different cellular lineage (McConnell et al. 1986). Appropriate combination of lesions such as benign and malignant hepatocellular tumors (similar cellular lineage) can provide evidence in determining a mode of action of a compound. In contrast, inappropriate combinations can result in overinterpretation of an effect or masking of a response when one is actually present (McConnell et al. 1986; Linkov et al. 2000).
HCD from the laboratory that conducted the study under review will generally be more comparable than HCD collected from several laboratories, but the laboratory may not always have an adequate number of studies for compilation of HCD. HCD are widely available in organized databases and in published literature, apart from company-owned or CRO databases. Organized historical control databases have been, and continue to be, compiled by large institutions and organizations such as the NTP, Charles River (CR), and RITA. Each of these databases differs in the way the data are collected, handled and presented (Table 2).
The NTP was established in 1978 as a cooperative effort to coordinate toxicology testing programs within the federal government. Other goals were to strengthen the science base in toxicology; develop and validate improved testing methods; and provide information about potential adverse human health effects of chemicals to health, regulatory, and research agencies, the scientific and medical communities, and the public. As a way to follow changes in the biology of the test species and to evaluate test results, a database of HCD of neoplastic lesions from untreated or control groups was established and is available electronically through their Web site at http://ntp.niehs.nih.gov/?objectid=92E61F1B-F1F6-975E-7D3BED551F07DC0A (accessed March 17, 2009). To ensure that current HCD are presented, the NTP database is maintained as a five-year window of the most recent NTP data and updated annually. Costs to maintain this database are provided for in the conduct of the studies. To date, the NTP has published technical reports from more than 500 two-year, two species, toxicology and carcinogenicity bioassays in F344/N rats, Sprague Dawley rats, and B6C3F1 mice; and all corresponding raw data are provided at http://ntp.niehs.nih.gov/ntpweb (accessed March 17, 2009). The rigorous pathology peer-review process includes three independent pathology reviews and a final pathology working group of nine to eleven pathologists who meet to review and decide on diagnostic or terminology discrepancies. The publicly available historical control database includes tumor incidences and growth and survival curves. These data are summarized by species, strain, sex, route of administration, and vehicle. Over the past several decades, at their own cost, CR has compiled HCD on CR-produced rodent strains used in chronic studies conducted in various laboratories in the United States and Europe and published them as strain-specific monographs. Some are limited to specific organs (e.g., ophthalmic) and specific parameters (e.g., caloric restriction). Although there has been some attempt to standardize the diagnostic nomenclature, the data are, for the most part, presented as received from the laboratories where the studies were performed. Each publication specifies common study parameters such as time frame, rodent strain production site(s), diet versus gavage, untreated versus vehicle controls, and various husbandry/environmental factors dependent on sponsor disclosure. Each publications focus is on a specific CR rodent and contains data from multiple studies composed of thousands of control animals. These published compilations can be obtained from CR (http://www.criver.com/en-US/ProdServ/ByType/ResModOver/Pages/On-lineLiterature.aspx [accessed March 17, 2009]). RITA is a proprietary pathology database for historical control data founded in 1988 in Hannover, Germany, as a cooperative venture between the Fraunhofer Institute of Toxicology and Experimental Medicine (Fraunhofer ITEM) and thirteen pharmaceutical and chemical companies from Germany and Switzerland (Morawietz, Rittinghausen, and Mohr 1992). The objective of this coalition was to establish a centralized European database providing standardized and valid historical background data in specific rodents to be used for carcinogenic risk assessment (Deschl et al. 2002). From the very beginning, the development of standardized nomenclature and diagnostic criteria was considered as key for success (Mohr et al. 1990) and resulted in the previously mentioned publication of ten IARC/WHO fascicles, International Classification of Rodent Tumours, part 1, The Rat (1992—1997), each dealing with an organ system, and later the International Classification of Rodent Tumors: The Mouse, which was edited as a book (Mohr 2001). The RITA data collection has been ongoing since 1988 and contains data on animals from more than two hundred carcinogenicity studies of the major rat and mouse strains used in Europe including the Sprague Dawley and Wistar Han rats and the CD1 and B6C3F1 mice. The animals are from a variety of breeders and vendors. Detailed information is available for each individual animal and includes environmental factors such as housing conditions, feeding, group size, and others as described previously in this article. Companies throughout Europe and North America are currently participating in the RITA Group effort via a membership fee. The RITA project adheres to a rigorous peer-review process in which every preneoplastic and neoplastic lesion entered into the database is confirmed by an actual examination of the respective tissue section by an experienced independent pathologist, with all questionable findings submitted to a panel of experienced pathologists to establish a final diagnosis. By using systemized trimming procedures, nomenclature, and diagnostic criteria, the group adheres to standardized data acquisition and data validation procedures. Photomicrographs representing typical and equivocal histopathologic diagnoses are available for group members and to a large extent to users of the password-protected Web program "goRENI," which is accessible to members of any Toxicologic Pathology Society on request (http://www.goreni.org/back_inhand.php [accessed March 13, 2009]). Findings are searchable in the database by various criteria such as strain, breeder, time period, and study duration. There are many published reports of proliferative changes in rodents, and these often present results from individual toxicology studies conducted by investigators in industry or academia and represent a variety of rodent strains, study designs, and feeding and husbandry practices. Individually, the reports do not provide the comprehensive incidence of proliferative lesions that are found in larger databases such as those discussed above; however, they do provide useful information regarding tumor occurrences. These reports often summarize current literature on specific tumor types and/or incidence rates in specific organ systems and may include consideration of variables in study design, feeding practices, and strain among others. Published literature may serve as a useful supplement to the larger organized databases when searching for HCD. An organized listing of publications is available on the STP Web site, http://www.toxpath.org/positions.asp (accessed April 10, 2009). This compilation provides a listing of references in regard to strain, tumor types, and other factors.
When HCD are used, descriptions of the statistical analyses and all relevant data (including adjustments for survival) should be a component of study reports and available for review. A few examples of data presentation are provided on the STP Web site. As described above in the HCD survey results, HCD are commonly presented as a mean, standard deviation, and range of tumor incidence from a historical control database or published literature from studies with same route and all exposure routes combined. This method provides a representation of reported observations of a given tumor incidence. When using this approach, it should be noted that the range can be influenced by extreme outliers even from a single study. For example, consider mammary gland carcinoma in F344/N rats. In the NTPs database for the twenty studies conducted during the period January 6, 1997, and August 6, 2001 (based on NTP 2000 feed), there were 64 rats out of 1,050 diagnosed with mammary gland carcinoma. These studies had a mean of 6.0%, a standard deviation of 4.39%, and a range of 0% to 20% for all routes and all vehicles. However, the 20% incidence (10 out of 50 rats) was found in only one inhalation study in a total of twenty studies. The next largest incidence was 10%. Without the control from this single study, the range would have been 0% to 10%. To address this concern, incidences per study may also be reported as part of the HCD data-set. Reporting of incidences per study will allow both presentation and use of a broad range of observations and provide transparency of potential influences of outlier populations. Since the early 1980s, a number of attempts have been made to develop statistical procedures for analyzing concurrent experimental data by formally making use of the HCD (Tarone 1982; Dempster, Selwyn, and Weeks 1983; Hoel 1983; Hoel and Yanagawa 1986; Tamura and Young 1986, 1987; Prentice et al. 1992; Ibrahim and Ryan 1996; Ibrahim, Rynn, and Chen 1998; Dunson and Dinse 2001). Each of these methods has strengths and limitations. There may, however, be value, in some situations, in analyzing the concurrent data by applying additional informal or formal statistical methods to historical controls and evaluating HCD in the context of a specific data set or study (U.S. FDA CDER 2001; Elmore and Peddada 2009). HCD may aid in the interpretation of data for the assessment of a xenobiotic-induced proliferative change, but its use and application should be presented in the context of sound biological principles with regard to the pooling of findings, combining ontogenetically similar tissues, and related criteria.
While the concurrent control group provides the most relevant control data, one should consider HCD as one of many sources of information that add to the "weight of evidence" approach when assessing the potential carcinogenic effect of a compound. HCD can be used as a tool to assess the spontaneous tumor rates in the concurrent control group and to evaluate disparate findings in dual concurrent control groups (Haseman 1984; Haseman, Huff, and Boorman 1984). The HCD can help to determine whether marginally significant trends in common tumors are likely real or false positives. If the tumor rates in the treated groups are within the range of reliable HCD, then a marginally significant trend for a common tumor could be discounted due to a random occurrence of a low concurrent control rate. However, the incidences of other lesions of similar cell lineage (hypertrophy, hyperplasia, papilloma, adenoma, etc.) may also be considered in this weight of evidence approach. Such a weight of evidence approach is particularly helpful when the proliferative lesions are considered to be on a biological continuum as in the forestomach of the mouse where the lesions progress from focal hyperplasia to papilloma to squamous cell carcinoma (Leininger and Jokinen 1994; Leininger et al. 1999). In data on a pesticide submitted to the U.S. EPA, the incidence of thyroid follicular cell tumors was increased in treated female rats (0/60, 0/60, 2/60, 2/60, and 4/60 at 0, 200, 1,000, 4,000, and 20,000 ppm, respectively). The increased incidence of thyroid adenomas was statistically significant in the female 20,000 ppm group by the Cochran-Armitage trend test but not by the Fishers pair-wise exact test. There was a lack of hyperplasia or other preneoplastic morphologic changes and no progression of the thyroid follicular adenomas to carcinomas at the high dose. There was a lack of evidence for the typical progression of thyroid follicular adenomas, which typically includes perturbation of the thyroid pituitary axis and increased follicular cell hyperplasia. The effect on the thyroid was in female rats, but there was no associated thyroid follicular effect in male rats, which tend to be the more sensitive sex for follicular cell hypertrophy and neoplasia. In addition, an evaluation of the HCD from the source of the study animals showed that the spontaneous incidence of thyroid follicular adenomas in the source population was 1.1% to 6.1%. The incidence of thyroid follicular adenomas from the testing facility was 0% to 3%. The incidence of thyroid follicular adenomas in the high-dose treated female rats (6.7%) was near the upper end of the source historical control range and greater than the laboratory historical range. Based on the weight of the biological evidence, the slight increase in incidence of thyroid follicular cell adenomas was interpreted to be unrelated to treatment (Douglas Wolf, March 11, 2008, Environmental Protection Agency, Research Triangle Park, NC). One could also consider if there is histological evidence that any of the malignant lesions (e.g., carcinoma) arise within the benign counterparts (e.g., adenoma) and if there is multiplicity in site-specific tumors. Other issues to consider include body weight, survival, plasma concentration of test compound, time of tumor onset, if the neoplastic lesion occurs in both males and females (although there may be differences due to sex steroids), if it occurs in both rodent species, if there is a positive dose-related response, or if there are bilateral lesions in paired organs. Combining of benign and malignant neoplasms of the same histogenesis in the same or different organs for statistical analyses (i.e., hepatocellular adenomas and carcinomas; cardiovascular system, vascular endothelium, hemangiomas, hemangiosarcomas) may also add to the weight of evidence.
In conclusion, when evaluating proliferative lesions from nonclinical rodent carcinogenicity studies, the concurrent control group is the most relevant. However, when the biological significance of a change in incidence of proliferative lesions in compound-treated groups relative to concurrent controls is uncertain, historical control data can aid in the overall evaluation. When using HCD, several issues should be taken into consideration such as source and quality of HCD, in-life and postmortem factors associated with the origination of the HCD, and type of statistical tools used to present the HCD, which reflect the consensus principles summarized by the HCD Working Group.
The Working Group would like to thank Drs. C. Gopinath, S. Rittinghausen, C. Clifford, T. Peters, and J. Haseman and members of the various STPs for their review and critical comments.
Conflict of interest: The authors have not declared any conflict of interest. The views expressed in this article are those of the authors and do not necessarily represent the policies, positions, or opinions of their respective agencies and organizations. This research was supported (in part) by the Intramural Research Program of the NIH, National Institute of Environmental Health Sciences. The recommendations in this article are endorsed by the European Society of Toxicologic Pathology and the British Society of Toxicologic Pathologists.
This version was published on August
1, 2009 Toxicologic Pathology, Vol. 37, No. 5,
679-693 (2009) This article has been cited by other articles:
|
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||


