中文(简体)
Albanian
Arabic
Armenian
Azerbaijani
Belarusian
Bengali
Bosnian
Catalan
Czech
Danish
Deutsch
Dutch
English
Estonian
Finnish
Français
Greek
Haitian Creole
Hebrew
Hindi
Hungarian
Icelandic
Indonesian
Irish
Italian
Japanese
Korean
Latvian
Lithuanian
Macedonian
Mongolian
Norwegian
Persian
Polish
Portuguese
Romanian
Russian
Serbian
Slovak
Slovenian
Spanish
Swahili
Swedish
Turkish
Ukrainian
Vietnamese
Български
中文(简体)
中文(繁體)

Predicting breast cancer treatment outcome

只有注册用户可以翻译文章
登陆注册
链接已保存到剪贴板
Mark Erlander
Xiao-Jun
Dennis Sgroi

关键词

专利信息

专利号9856533
已提起02/05/2004
专利日期01/01/2018

抽象

Methods and compositions are provided for the identification of expression signatures in ER+ breast cancer cases, where the signatures correlate with responsiveness, or lack thereof, to treatment with tamoxifen or another antiestrogen agent against breast cancer The signature profiles are identified based upon sampling of reference breast tissue samples from independent cases of breast cancer and provide a reliable set of molecular criteria for predicting the efficacy of treating a subject with breast cancer with tamoxifen or another antiestrogen agent against breast cancer. Additional methods and compositions are provided for predicting responsiveness to tamoxifen or another antiestrogen agent against breast cancer in cases of breast cancer by use of three biomarkers. Two biomarkers display increased expression correlated with tamoxifen response while the third biomarker displays decreased expression correlated with tamoxifen response.

索偿

What is claimed is:

1. A method for treating a human subject having ER+ (estrogen receptor positive) breast cancer, comprising assaying a breast cancer cell sample from the human subject by determining a ratio of HoxB13 and IL17BR mRNA expression levels in the breast cancer cell sample; comparing the ratio of HoxB13 and IL17BR mRNA expression levels in the breast cancer cell sample to a HoxB13 and IL17BR mRNA expression level threshold ratio, wherein a ratio of HoxB13 and IL17BR mRNA expression levels above the threshold ratio indicates an outcome comprising cancer recurrence via metastasis following tamoxifen or letrozole treatment, and treating the human subject with an alternative therapy other than tamoxifen or letrozole, if the ratio of HoxB13 and IL17BR mRNA expression levels is above the threshold ratio, wherein the alternative therapy other than tamoxifen or letrozole comprises a selective estrogen receptor modulator (SERM), a selective estrogen receptor downregulator (SERD), an aromatase inhibitor (AI), surgical ovarian ablation, or chemical ovarian ablation.

2. The method of claim 1, wherein said mRNA expression levels are determined by quantitative PCR.

3. The method of claim 1, wherein said assaying comprises RT-PCR (reverse transcription polymerase chain reaction).

4. The method of claim 3, wherein the ratio of HoxB13 and IL17BR mRNA expression levels is expressed as ACT, wherein CT is the PCR amplification cycle in which the HoxB13 or IL17BR mRNA reaches a threshold amount, and wherein ACT is the CT difference between HoxB13 or IL17BR mRNA.

5. The method of claim 1, wherein said breast cancer cell sample is a formalin fixed paraffin embedded (FFPE), ductal lavage or fine needle aspiration sample.

6. The method of claim 1, wherein said breast cancer cell sample is a section of tissue from a subject or comprises cells microdissected from said section.

7. The method of claim 1, wherein the alternative therapy other than tamoxifen or letrozole comprises a selective estrogen receptor modulator (SERM), a selective estrogen receptor downregulator (SERD), or an aromatase inhibitor (AI).

8. The method of claim 1, wherein the alternative therapy other than tamoxifen or letrozole comprises anastrozole, vorozole, exemestane, androstenedione, or formestane.

9. The method of claim 1, wherein the alternative therapy other than tamoxifen or letrozole comprises surgical or chemical ovarian ablation.

描述

FIELD OF THE INVENTION

The invention relates to the identification and use of gene expression profiles, or patterns, with clinical relevance to the treatment of breast cancer using tamoxifen (nolvadex) and other "antiestrogen" agents against breast cancer, including other "selective estrogen receptor modulators" ("SERM"s), "selective estrogen receptor downregulators" ("SERD"s), and aromatase inhibitors ("AI"s). In particular, the invention provides the identities of gene sequences the expression of which are correlated with patient survival and breast cancer recurrence in women treated with tamoxifen or other "antiestrogen" agents against breast cancer. The gene expression profiles, whether embodied in nucleic acid expression, protein expression, or other expression formats, may be used to select subjects afflicted with breast cancer who will likely respond positively to treatment with tamoxifen or another "antiestrogen" agent against breast cancer as well as those who will likely be non-responsive and thus candidates for other treatments. The invention also provides the identities of three sets of sequences from three genes with expression patterns that are strongly predictive of responsiveness to tamoxifen and other "antiestrogen" agents against breast cancer.

BACKGROUND OF THE INVENTION

Breast cancer is by far the most common cancer among women. Each year, more than 180,000 and 1 million women in the U.S. and worldwide, respectively, are diagnosed with breast cancer. Breast cancer is the leading cause of death for women between ages 50-55, and is the most common non-preventable malignancy in women in the Western Hemisphere. An estimated 2,167,000 women in the United States are currently living with the disease (National Cancer Institute, Surveillance Epidemiology and End Results (NCI SEER) program, Cancer Statistics Review (CSR), www-seer.ims.nci.nih.gov/Publications/CSR1973 (1998)). Based on cancer rates from 1995 through 1997, a report from the National Cancer Institute (NCI) estimates that about 1 in 8 women in the United States (approximately 12.8 percent) will develop breast cancer during her lifetime (NCI's Surveillance, Epidemiology, and End Results Program (SEER) publication SEER Cancer Statistics Review 1973-1997). Breast cancer is the second most common form of cancer, after skin cancer, among women in the United States. An estimated 250,100 new cases of breast cancer are expected to be diagnosed in the United States in 2001. Of these, 192,200 new cases of more advanced (invasive) breast cancer are expected to occur among women (an increase of 5% over last year), 46,400 new cases of early stage (in situ) breast cancer are expected to occur among women (up 9% from last year), and about 1,500 new cases of breast cancer are expected to be diagnosed in men (Cancer Facts & Figures 2001 American Cancer Society). An estimated 40,600 deaths (40,300 women, 400 men) from breast cancer are expected in 2001. Breast cancer ranks second only to lung cancer among causes of cancer deaths in women. Nearly 86% of women who are diagnosed with breast cancer are likely to still be alive five years later, though 24% of them will die of breast cancer after 10 years, and nearly half (47%) will die of breast cancer after 20 years.

Every woman is at risk for breast cancer. Over 70 percent of breast cancers occur in women who have no identifiable risk factors other than age (U.S. General Accounting Office. Breast Cancer, 1971-1991: Prevention, Treatment and Research. GAO/PEMD-92-12; 1991). Only 5 to 10% of breast cancers are linked to a family history of breast cancer (Henderson I C, Breast Cancer. In: Murphy G P, Lawrence W L, Lenhard R E (eds). Clinical Oncology. Atlanta, Ga.: American Cancer Society; 1995:198-219).

Each breast has 15 to 20 sections called lobes. Within each lobe are many smaller lobules. Lobules end in dozens of tiny bulbs that can produce milk. The lobes, lobules, and bulbs are all linked by thin tubes called ducts. These ducts lead to the nipple in the center of a dark area of skin called the areola. Fat surrounds the lobules and ducts. There are no muscles in the breast, but muscles lie under each breast and cover the ribs. Each breast also contains blood vessels and lymph vessels. The lymph vessels carry colorless fluid called lymph, and lead to the lymph nodes. Clusters of lymph nodes are found near the breast in the axilla (under the arm), above the collarbone, and in the chest.

Breast tumors can be either benign or malignant. Benign tumors are not cancerous, they do not spread to other parts of the body, and are not a threat to life. They can usually be removed, and in most cases, do not come back. Malignant tumors are cancerous, and can invade and damage nearby tissues and organs. Malignant tumor cells may metastasize, entering the bloodstream or lymphatic system. When breast cancer cells metastasize outside the breast, they are often found in the lymph nodes under the arm (axillary lymph nodes). If the cancer has reached these nodes, it means that cancer cells may have spread to other lymph nodes or other organs, such as bones, liver, or lungs.

Major and intensive research has been focused on early detection, treatment and prevention. This has included an emphasis on determining the presence of precancerous or cancerous ductal epithelial cells. These cells are analyzed, for example, for cell morphology, for protein markers, for nucleic acid markers, for chromosomal abnormalities, for biochemical markers, and for other characteristic changes that would signal the presence of cancerous or precancerous cells. This has led to various molecular alterations that have been reported in breast cancer, few of which have been well characterized in human clinical breast specimens. Molecular alterations include presence/absence of estrogen and progesterone steroid receptors, HER-2 expression/amplification (Mark H F, et al. HER-2/neu gene amplification in stages I-IV breast cancer detected by fluorescent in situ hybridization. Genet Med; 1(3):98-103 1999), Ki-67 (an antigen that is present in all stages of the cell cycle except G0 and used as a marker for tumor cell proliferation, and prognostic markers (including oncogenes, tumor suppressor genes, and angiogenesis markers) like p53, p27, Cathepsin D, pS2, multi-drug resistance (MDR) gene, and CD31.

Tamoxifen is the antiestrogen agent most frequently prescribed in women with both early stage and metastatic hormone receptor-positive breast cancer (for reviews, see Clarke, R. et al. "Antiestrogen resistance in breast cancer and the role of estrogen receptor signaling." Oncogene 22, 7316-39 (2003) and Jordan, C. "Historical perspective on hormonal therapy of advanced breast Cancer." Clin. Ther. 24 Suppl A, A3-16 (2002)). In the adjuvant setting, tamoxifen therapy results in a 40-50% reduction in the annual risk of recurrence, leading to a 5.6% improvement in 10 year survival in lymph node negative patients, and a corresponding 10.9% improvement in node-positive patients (Group, E.B.C.T.C. Tamoxifen for early breast cancer. Cochrane Database Syst Rev, CD000486 (2001)). Tamoxifen is thought to act primarily as a competitive inhibitor of estrogen binding to estrogen receptor (ER). The absolute levels of ER expression, as well as that of the progesterone receptor (PR, an indicator of a functional ER pathway), are currently the best predictors of tamoxifen response in the clinical setting (Group, (2001) and Bardou, V. J. et al. "Progesterone receptor status significantly improves outcome prediction over estrogen receptor status alone for adjuvant endocrine therapy in two large breast cancer databases." J Clin Oncol 21, 1973-9 (2003)).

However, 25% of ER+/PR+ tumors, 66% of ER+/PR- cases and 55% of ER-/PR+ cases fail to respond, or develop early resistance to tamoxifen, through mechanisms that remain largely unclear (see Clarke et al.; Nicholson, R. I. et al. "The biology of antihormone failure in breast cancer." Breast Cancer Res Treat 80 Suppl 1, S29-34; discussion S35 (2003) and Osborne, C. K. et al. "Growth factor receptor cross-talk with estrogen receptor as a mechanism for tamoxifen resistance in breast cancer." Breast 12, 362-7 (2003)). Currently, no reliable means exist to allow the identification of these non-responders. In these patients, the use of alternative hormonal therapies, such as the aromatase inhibitors letrozole and anastrozole (Ellis, M. J. et al. "Letrozole is more effective neoadjuvant endocrine therapy than tamoxifen for ErbB-1- and/or ErbB-2-positive, estrogen receptorpositive primary breast cancer: evidence from a phase III randomized trial." J Clin Oncol 19, 3808-16 (2001); Buzdar, A. U. "Anastrozole: a new addition to the armamentarium against advanced breast cancer." Am J Clin Oncol 21, 161-6 (1998); and Goss, P. E. et al. "A randomized trial of letrozole in postmenopausal women after five years of tamoxifen therapy for early-stage breast cancer." N Engl J Med 349, 1793-802 (2003)); chemotherapeutic agents, or inhibitors of other signaling pathways, such as trastuzmab and gefitinib might offer the possibility of improving clinical outcome. Therefore, the ability to accurately predict tamoxifen treatment outcome should significantly advance the management of early stage breast cancer by identifying patients who are unlikely to benefit from TAM so that additional or alternative therapies may be sought.

Citation of documents herein is not intended as an admission that any is pertinent prior art. All statements as to the date or representation as to the contents of documents is based on the information available to the applicant and does not constitute any admission as to the correctness of the dates or contents of the documents.

SUMMARY OF THE INVENTION

The present invention relates to the identification and use of gene expression patterns (or profiles or "signatures") and the expression levels of individual gene sequences which are clinically relevant to breast cancer. In particular, the identities of genes that are correlated with patient survival and breast cancer recurrence (e.g. metastasis of the breast cancer) are provided. The gene expression profiles, whether embodied in nucleic acid expression, protein expression, or other expression formats, may be used to predict survival of subjects afflicted with breast cancer and the likelihood of breast cancer recurrence, including cancer metastasis.

The invention thus provides for the identification and use of gene expression patterns (or profiles or "signatures") and the expression levels of individual gene sequences which correlate with (and thus are able to discriminate between) patients with good or poor survival outcomes. In one embodiment, the invention provides patterns that are able to distinguish patients with estrogen receptor (.alpha. isoform) positive (ER+) breast tumors into those with that are responsive, or likely to be responsive, to treatment with tamoxifen (TAM) or another "antiestrogen" agent against breast cancer (such as a "selective estrogen receptor modulator" ("SERM"), "selective estrogen receptor downregulator" ("SERD"), or aromatase inhibitor ("AI")) and those that are non-responsive, or likely to be non-responsive, to such treatment. In an alternative embodiment, the invention may be applied to patients with breast tumors that do not display detectable levels of ER expression (so called "ER-" subjects) but where the patient will nonetheless benefit from application of the invention due to the presence of some low level ER expression. Responsiveness may be viewed in terms of better survival outcomes over time. These patterns are thus able to distinguish patients with ER+ breast tumors into at least two subtypes.

In a first aspect, the present invention provides a non-subjective means for the identification of patients with breast cancer (ER+ or ER-) as likely to have a good or poor survival outcome following treatment with TAM or another "antiestrogen" agent against breast cancer by assaying for the expression patterns disclosed herein. Thus where subjective interpretation may have been previously used to determine the prognosis and/or treatment of breast cancer patients, the present invention provides objective gene expression patterns, which may used alone or in combination with subjective criteria to provide a more accurate assessment of ER+ or ER- breast cancer patient outcomes or expected outcomes, including survival and the recurrence of cancer, following treatment with TAM or another "antiestrogen" agent against breast cancer. The expression patterns of the invention thus provide a means to determine ER+ or ER- breast cancer prognosis. Furthermore, the expression patterns can also be used as a means to assay small, node negative tumors that are not readily assayed by other means.

The gene expression patterns comprise one or more than one gene capable of discriminating between breast cancer outcomes with significant accuracy. The gene sequence(s) are identified as correlated with ER+ breast cancer outcomes such that the levels of their expression are relevant to a determination of the preferred treatment protocols for a patient, whether ER+ or ER-. Thus in one embodiment, the invention provides a method to determine the outcome of a subject afflicted with breast cancer by assaying a cell containing sample from said subject for expression of one or more than one gene disclosed herein as correlated with breast cancer outcomes following treatment with TAM or another "antiestrogen" agent against breast cancer.

The ability to correlate gene expression with breast cancer outcome and responsiveness to TAM is particularly advantageous in light of the possibility that up to 40% of ER+ subjects that undergo TAM treatment are non-responders. Therefore, the ability to identify, with confidence, these non-responders at an early time point permits the consideration and/or application of alternative therapies (such as a different "antiestrogen" agent against breast cancer or other anti-breast cancer treatments) to the non-responders. Stated differently, the ability to identify TAM non-responder subjects permits medical personnel to consider and/or utilize alternative therapies for the treatment of the subjects before time is spent on ineffective TAM therapy. Time spent on an ineffective therapy often permits further cancer growth, and the likelihood of success with alternative therapies diminishes over time given such growth. Therefore, the invention also provides methods to improve the survival outcome of non-responders by use of the methods disclosed herein to identify non-responders for treatment with alternative therapies.

Gene expression patterns of the invention are identified as described below. Generally, a large sampling of the gene expression profile of a sample is obtained through quantifying the expression levels of mRNA corresponding to many genes. This profile is then analyzed to identify genes, the expression of which are positively, or negatively, correlated, with ER+ breast cancer outcome upon treatment with TAM or another "antiestrogen" agent against breast cancer. An expression profile of a subset of human genes may then be identified by the methods of the present invention as correlated with a particular outcome. The use of multiple samples increases the confidence which a gene may be believed to be correlated with a particular survival outcome. Without sufficient confidence, it remains unpredictable whether expression of a particular gene is actually correlated with an outcome and also unpredictable whether expression of a particular gene may be successfully used to identify the outcome for a breast cancer patient. While the invention may be practiced based on the identities of the gene sequences disclosed herein or the actual sequences used independent of identification, the invention may also be practiced with any other sequences the expression of which is correlated with the expression of sequences disclosed herein. Such additional sequences may be identified by any means known in the art, including the methods disclosed herein.

A profile of genes that are highly correlated with one outcome relative to another may be used to assay an sample from a subject afflicted with breast cancer to predict the likely responsiveness (or lack thereof) to TAM or another "antiestrogen" agent against breast cancer in the subject from whom the sample was obtained. Such an assay may be used as part of a method to determine the therapeutic treatment for said subject based upon the breast cancer outcome identified.

As discussed below, the correlated genes may be used singly with significant accuracy or in combination to increase the ability to accurately correlating a molecular expression phenotype with a breast cancer outcome. This correlation is a way to molecularly provide for the determination of survival outcomes as disclosed herein. Additional uses of the correlated gene(s) are in the classification of cells and tissues; determination of diagnosis and/or prognosis; and determination and/or alteration of therapy.

The ability to discriminate is conferred by the identification of expression of the individual genes as relevant and not by the form of the assay used to determine the actual level of expression. An assay may utilize any identifying feature of an identified individual gene as disclosed herein as long as the assay reflects, quantitatively or qualitatively, expression of the gene in the "transcriptome" (the transcribed fraction of genes in a genome) or the "proteome" (the translated fraction of expressed genes in a genome). Additional assays include those based on the detection of polypeptide fragments of the relevant member or members of the proteome. Identifying features include, but are not limited to, unique nucleic acid sequences used to encode (DNA), or express (RNA), said gene or epitopes specific to, or activities of, a protein encoded by said gene. All that is required are the gene sequence(s) necessary to discriminate between breast cancer outcomes and an appropriate cell containing sample for use in an expression assay.

In another embodiment, the invention provides for the identification of the gene expression patterns by analyzing global, or near global, gene expression from single cells or homogenous cell populations which have been dissected away from, or otherwise isolated or purified from, contaminating cells beyond that possible by a simple biopsy. Because the expression of numerous genes fluctuate between cells from different patients as well as between cells from the same patient sample, multiple data from expression of individual genes and gene expression patterns are used as reference data to generate models which in turn permit the identification of individual gene(s), the expression of which are most highly correlated with particular breast cancer outcomes.

In additional embodiments, the invention provides physical and methodological means for detecting the expression of gene(s) identified by the models generated by individual expression patterns. These means may be directed to assaying one or more aspects of the DNA template(s) underlying the expression of the gene(s), of the RNA used as an intermediate to express the gene(s), or of the proteinaceous product expressed by the gene(s).

In further embodiments, the gene(s) identified by a model as capable of discriminating between breast cancer outcomes may be used to identify the cellular state of an unknown sample of cell(s) from the breast. Preferably, the sample is isolated via non-invasive means. The expression of said gene(s) in said unknown sample may be determined and compared to the expression of said gene(s) in reference data of gene expression patterns correlated with breast cancer outcomes. Optionally, the comparison to reference samples may be by comparison to the model(s) constructed based on the reference samples.

One advantage provided by the present invention is that contaminating, non-breast cells (such as infiltrating lymphocytes or other immune system cells) are not present to possibly affect the genes identified or the subsequent analysis of gene expression to identify the survival outcomes of patients with breast cancer. Such contamination is present where a biopsy is used to generate gene expression profiles. However, and as noted herein, the invention includes the identity of genes that may be used with significant accuracy even in the presence of contaminating cells.

In a second aspect, the invention provides a non-subjective means based on the expression of three genes, or combinations thereof, for the identification of patients with breast cancer as likely to have a good or poor survival outcome following treatment with TAM or another "antiestrogen" agent against breast cancer. These three genes are members of the expression patterns disclosed herein which have been found to be strongly predictive of clinical outcome following TAM treatment of ER+ breast cancer.

The present invention thus provides gene sequences identified as differentially expressed in ER+ breast cancer in correlation to TAM responsiveness. The sequences of two of the genes display increased expression in ER+ breast cells that respond to TAM treatment (and thus lack of increased expression in nonresponsive cases). The sequences of the third gene display decreased expression in ER+ breast cells that respond to TAM treatment (and thus lack of decreased expression in nonresponsive cases).

The first set of sequences found to be more highly expressed in TAM responsive, ER+ breast cells are those of interleukin 17 receptor B (IL17RB), which has been mapped to human chromosome 3 at 3p21.1. IL17RB is also referred to as interleukin 17B receptor (IL17BR) and sequences corresponding to it, and thus may be used in the practice of the instant invention, are identified by UniGene Cluster Hs.5470.

The second set of sequences found to be more highly expressed in TAM responsive, ER+ breast cells are those of the calcium channel, voltage-dependent, L type, alpha 1D subunit (CACNA1D), which has been mapped to human chromosome 3 at 3p14.3. Sequences corresponding to CACNA1D, and thus may be used in the practice of the instant invention, are identified by UniGene Cluster Hs.399966.

The set of sequences found to be expressed at lower levels in TAM responsive, ER+ breast cells are those of homeobox B13 (HOXB13), which has been mapped to human chromosome 17 at 17q21.2. Sequences corresponding to HOXB13, and thus may be used in the practice of the instant invention, are identified by UniGene Cluster Hs.66731.

While the invention may be practiced based on the identities of these three gene sequences or the actual sequences used independent of the assigned identity, the invention may also be practiced with any other sequence the expression of which is correlated with the expression of these disclosed sequences. Such additional sequences may be identified by any means known in the art, including the methods disclosed herein.

The identified sequences may thus be used in methods of determining the responsiveness, or non-responsiveness, of a subject's ER+ or ER- breast cancer to TAM treatment, or treatment with another "antiestrogen" agent against breast cancer, via analysis of breast cells in a tissue or cell containing sample from a subject. As non-limiting examples, the lack of increased expression of IL17BR and CACNA1D sequences and/or the lack of decreased expression of HOXB13 sequences may be used as an indicator of nonresponsive cases. The present invention provides an non-empirical means for determining responsiveness to TAM or another SERM in ER+ or ER- patients. This provides advantages over the use of a "wait and see" approach following treatment with TAM or other "antiestrogen" agent against breast cancer. The expression levels of these sequences may also be used as a means to assay small, node negative tumors that are not readily assessed by conventional means.

The expression levels of the identified sequences may be used alone or in combination with other sequences capable of determining responsiveness to treatment with TAM or another "antiestrogen" agent against breast cancer. Preferably, the sequences of the invention are used alone or in combination with each other, such as in the format of a ratio of expression levels that can have improved predictive power over analysis based on expression of sequences corresponding to individual genes. The invention provides for ratios of the expression level of a sequence that is underexpressed to the expression level of a sequence that is overexpressed as a indicator of responsiveness or non-responsiveness.

The present invention provides means for correlating a molecular expression phenotype with a physiological response in a subject with ER+ or ER- breast cancer. This correlation provides a way to molecularly diagnose and/or determine treatment for a breast cancer afflicted subject. Additional uses of the sequences are in the classification of cells and tissues; and determination of diagnosis and/or prognosis. Use of the sequences to identify cells of a sample as responsive, or not, to treatment with TAM or other "antiestrogen" agent against breast cancer may be used to determine the choice, or alteration, of therapy used to treat such cells in the subject, as well as the subject itself, from which the sample originated.

Such methods of the invention may be used to assist the determination of providing tamoxifen or another "antiestrogen" agent against breast cancer as a chemopreventive or chemoprotective agent to a subject at high risk for development of breast cancer. These methods of the invention are an advance over the studies of Fabian et al. (J Natl Cancer Inst. 92(15):1217-27, 2000), which proposed a combination of cytomorphology and the Gail risk model to identify high risk patients. The methods may be used in combination with assessments of relative risk of breast cancer such as that discussed by Tan-Chiu et al. (J Natl Cancer Inst. 95(4):302-307, 2003). Non-limiting examples include assaying of minimally invasive sampling, such as random (periareolar) fine needle aspirates or ductal lavage samples (such as that described by Fabian et al. and optionally in combination with or as an addition to a mammogram positive for benign or malignant breast cancer), of breast cells for the expression levels of gene sequences as disclosed herein to assist in the determination of administering therapy with an "antiestrogen" agent against breast cancer, such as that which may occur in cases of high risk subjects (like those described by Tan-Chiu et al.). The assays would thus lead to the identification of subjects for who the application of an "antiestrogen" agent against breast cancer would likely be beneficial as a chemopreventive or chemoprotective agent. It is contemplated that such application as enabled by the instant invention could lead to beneficial effects such as those seen with the administration of tamoxifen (see for example, Wickerham D. L., Breast Cancer Res. and Treatment 75 Suppl 1:S7-12, Discussion S33-5, 2000). Other applications of the invention include assaying of advanced breast cancer, including metastatic cancer, to determine the responsiveness, or non-responsiveness, thereof to treatment with an "antiestrogen" agent against breast cancer.

An assay of the invention may utilize a means related to the expression level of the sequences disclosed herein as long as the assay reflects, quantitatively or qualitatively, expression of the sequence. Preferably, however, a quantitative assay means is preferred. The ability to determine responsiveness to TAM or other "antiestrogen" agent against breast cancer and thus outcome of treatment therewith is provided by the recognition of the relevancy of the level of expression of the identified sequences and not by the form of the assay used to determine the actual level of expression. Identifying features of the sequences include, but are not limited to, unique nucleic acid sequences used to encode (DNA), or express (RNA), the disclosed sequences or epitopes specific to, or activities of, proteins encoded by the sequences. Alternative means include detection of nucleic acid amplification as indicative of increased expression levels and nucleic acid inactivation, deletion, or methylation, as indicative of decreased expression levels. Stated differently, the invention may be practiced by assaying one or more aspect of the DNA template(s) underlying the expression of the disclosed sequence(s), of the RNA used as an intermediate to express the sequence(s), or of the proteinaceous product expressed by the sequence(s), as well as proteolytic fragments of such products. As such, the detection of the presence of, amount of, stability of, or degradation (including rate) of, such DNA, RNA and proteinaceous molecules may be used in the practice of the invention.

The practice of the present invention is unaffected by the presence of minor mismatches between the disclosed sequences and those expressed by cells of a subject's sample. A non-limiting example of the existence of such mismatches are seen in cases of sequence polymorphisms between individuals of a species, such as individual human patients within Homo sapiens. Knowledge that expression of the disclosed sequences (and sequences that vary due to minor mismatches) is correlated with the presence of non-normal or abnormal breast cells and breast cancer is sufficient for the practice of the invention with an appropriate cell containing sample via an assay for expression.

In one embodiment, the invention provides for the identification of the expression levels of the disclosed sequences by analysis of their expression in a sample containing ER+ or ER- breast cells. In one preferred embodiment, the sample contains single cells or homogenous cell populations which have been dissected away from, or otherwise isolated or purified from, contaminating cells beyond that possible by a simple biopsy. Alternatively, undissected cells within a "section" of tissue may be used. Multiple means for such analysis are available, including detection of expression within an assay for global, or near global, gene expression in a sample (e.g. as part of a gene expression profiling analysis such as on a microarray) or by specific detection, such as quantitative PCR (Q-PCR), or real time quantitative PCR.

Preferably, the sample is isolated via non-invasive or minimally invasive means. The expression of the disclosed sequence(s) in the sample may be determined and compared to the expression of said sequence(s) in reference data of non-normal or cancerous breast cells. Alternatively, the expression level may be compared to expression levels in normal or non-cancerous cells, preferably from the same sample or subject. In embodiments of the invention utilizing Q-PCR, the expression level may be compared to expression levels of reference genes in the same sample or a ratio of expression levels may be used.

When individual breast cells are isolated in the practice of the invention, one benefit is that contaminating, non-breast cells (such as infiltrating lymphocytes or other immune system cells) are not present to possibly affect detection of expression of the disclosed sequence(s). Such contamination is present where a biopsy is used to generate gene expression profiles. However, analysis of differential gene expression and correlation to ER+ breast cancer outcomes with both isolated and non-isolated samples, as described herein, increases the confidence level of the disclosed sequences as capable of having significant predictive power with either type of sample.

While the present invention is described mainly in the context of human breast cancer, it may be practiced in the context of breast cancer of any animal known to be potentially afflicted by breast cancer. Preferred animals for the application of the present invention are mammals, particularly those important to agricultural applications (such as, but not limited to, cattle, sheep, horses, and other "farm animals"), animal models of breast cancer, and animals for human companionship (such as, but not limited to, dogs and cats).

The above aspects and embodiments of the invention may be applied equally with respect to use of more than one "antiestrogen" agent against breast cancer. In the case of a combination of agents, any combination of more than one SERM, SERD, or AI may be used in place of TAM or another "antiestrogen" agent against breast cancer. Aromatase is an enzyme that provides a major source of estrogen in body tissues including the breast, liver, muscle and fat. Without being bound by theory, and solely provided to assist in a better understanding of the invention, AIs are understood to function in a manner comparable to TAM and other "antiestrogen" agents against breast cancer, which are thought to act as antagonists of estrogen receptor in breast tissues and thus as against breast cancer. AIs may be either nonsteroidal or steroidal agents. Examples of the former, which inhibit aromatase via the heme prosthetic group) include, but are not limited to, anastrozole (arimidex), letrozole (femara), and vorozole (rivisor), which have been used or contemplated as treatments for metastatic breast cancer. Examples of steroidal AIs, which inactivate aromatase, include, but are not limited to, exemestane (aromasin), androstenedione, and formestane (lentaron).

Other forms of therapy to reduce estrogen levels include surgical or chemical ovarian ablation. The former is physical removal of the ovaries while the latter is the use of agents to block ovarian production of estrogen. One non-limiting example of the latter are agonists of gonadotropin releasing hormone (GnRH), such as goserelin (zoladex). Of course the instant invention may also be practiced with these therapies in place of treatment with one or more "antiestrogen" agent against breast cancer.

The invention disclosed herein is based in part on the performance of a genome-wide microarray analysis of hormone receptor-positive invasive breast tumors from 60 patients treated with adjuvant tamoxifen alone, leading to the identification of a two-gene expression ratio that is highly predictive of clinical outcome. This expression ratio, which is readily adapted to PCR-based analysis of standard paraffin-embedded clinical specimens, was validated in an independent set of 20 patients as described below.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows receiver operating characteristic (ROC) analyses of IL17BR, HOXB13, and CACNA1D expression levels as predictors of breast cancer outcomes in whole tissue sections (top 3 graphs) and laser microdissected cells (bottom 3 graphs). AUC refers to area under the curve.

FIG. 2 contains six parts relating to the validation of a ratio of HOXB13 expression to IL17BR expression as an indicator of responsiveness, or lack thereof, to TAM. Parts a and b show the results of gene expression analysis of HOXB13 and IL17BR sequences by Q-PCR in both Responder and Non-responder samples. Plots of the Responder and Non-responder training and validation data sets are shown in Parts c and d, where "0" indicates Responder datapoints in both and "1" indicates Non-responder datapoints in both. Parts e and f show plots of the Responder and Non-responder training and validation data sets as a function of survival, where the upper line in each Part represents the Responders and the lower line represents the Non-responders.

MODES OF PRACTICING THE INVENTION

Definitions of terms as used herein:

A gene expression "pattern" or "profile" or "signature" refers to the relative expression of genes correlated with responsiveness to treatment of ER+ breast cancer with TAM or another "antiestrogen" agent against breast cancer. Responsiveness or lack thereof may be expressed as survival outcomes which are correlated with an expression "pattern" or "profile" or "signature" that is able to distinguish between, and predict, said outcomes.

A "selective estrogen receptor modulator" or SERM is an "antiestrogen" agent that in some tissues act like estrogens (agonist) but block estrogen action in other tissues (antagonist). A "selective estrogen receptor downregulators" (or "SERD"s) or "pure" antiestrogens includes agents which block estrogen activity in all tissues. See Howell et al. (Best Bractice & Res. Clin. Endocrinol. Metab. 18(1):47-66, 2004). Preferred SERMs of the invention are those that are antagonists of estrogen in breast tissues and cells, including those of breast cancer. Non-limiting examples of such include TAM, raloxifene, GW5638, and ICI 182,780. The possible mechanisms of action by various SERMs have been reviewed (see for example Jordan et al., 2003, Breast Cancer Res. 5:281-283; Hall et al., 2001, J. Biol. Chem. 276(40):36869-36872; Dutertre et al. 2000, J. Pharmacol. Exp. Therap. 295(2):431-437; and Wijayaratne et al., 1999, Endocrinology 140(12):5828-5840). Other non-limiting examples of SERMs in the context of the invention include triphenylethylenes, such as tamoxifen, GW5638, TAT-59, clomiphene, toremifene, droloxifene, and idoxifene; benzothiophenes, such as arzoxiphene (LY353381 or LY353381-HCl); benzopyrans, such as EM-800; naphthalenes, such as CP-336,156; and ERA-923.

Non-limiting examples of SERD or "pure" antiestrogens include agents such as ICI 182,780 (fulvestrant or faslodex) or the oral analogue SR16243 and ZK 191703 as well as aromatase inhibitors and chemical ovarian ablation agents as described herein.

Other agents encompassed by SERM as used herein include progesterone receptor inhibitors and related drugs, such as progestomimetics like medroxyprogesterone acetate, megace, and RU-486; and peptide based inhibitors of ER action, such as LH-RH analogs (leuprolide, zoladex, [D-Trp6]LH-RH), somatostatin analogs, and LXXLL motif mimics of ER as well as tibolone and resveratrol. As noted above, preferred SERMs of the invention are those that are antagonist of estrogen in breast tissues and cells, including those of breast cancer. Non-limiting examples of preferred SERMs include the actual or contemplated metabolites (in vivo) of any SERM, such as, but not limited to, 4-hydroxytamoxifen (metabolite of tamoxifen), EM652 (or SCH 57068 where EM-800 is a prodrug of EM-652), and GW7604 (metabolite of GW5638). See Willson et al. (1997, Endocrinology 138(9):3901-3911) and Dauvois et al. (1992, Proc. Nat'l. Acad. Sci., USA 89:4037-4041) for discussions of some specific SERMs.

Other preferred SERMs are those that produce the same relevant gene expression profile as tamoxifen or 4-hydroxytamoxifen. One example of means to identify such SERMs is provided by Levenson et al. (2002, Cancer Res. 62:4419-4426).

A "gene" is a polynucleotide that encodes a discrete product, whether RNA or proteinaceous in nature. It is appreciated that more than one polynucleotide may be capable of encoding a discrete product. The term includes alleles and polymorphisms of a gene that encodes the same product, or a functionally associated (including gain, loss, or modulation of function) analog thereof, based upon chromosomal location and ability to recombine during normal mitosis.

A "sequence" or "gene sequence" as used herein is a nucleic acid molecule or polynucleotide composed of a discrete order of nucleotide bases. The term includes the ordering of bases that encodes a discrete product (i.e. "coding region"), whether RNA or proteinaceous in nature, as well as the ordered bases that precede or follow a "coding region". Non-limiting examples of the latter include 5' and 3' untranslated regions of a gene. It is appreciated that more than one polynucleotide may be capable of encoding a discrete product. It is also appreciated that alleles and polymorphisms of the disclosed sequences may exist and may be used in the practice of the invention to identify the expression level(s) of the disclosed sequences or the allele or polymorphism. Identification of an allele or polymorphism depends in part upon chromosomal location and ability to recombine during mitosis.

The terms "correlate" or "correlation" or equivalents thereof refer to an association between expression of one or more genes and a physiological response of a breast cancer cell and/or a breast cancer patient in comparison to the lack of the response. A gene may be expressed at higher or lower levels and still be correlated with responsiveness, non-responsiveness or breast cancer survival or outcome. The invention provides for the correlation between increases in expression of IL17BR and CACNA1D sequences and responsiveness of ER+ breast cells to TAM or another "antiestrogen" agent against breast cancer. Thus increases are indicative of responsiveness. Conversely, the lack of increases, including unchanged expression levels, are indicators of non-responsiveness. Similarly, the invention provides for the correlation between decreases in expression of HOXB13 sequences and responsiveness of ER+ breast cells to TAM or another SERM. Thus decreases are indicative of responsiveness while the lack of decreases, including unchanged expression levels, are indicators of non-responsiveness. Increases and decreases may be readily expressed in the form of a ratio between expression in a non-normal cell and a normal cell such that a ratio of one (1) indicates no difference while ratios of two (2) and one-half indicate twice as much, and half as much, expression in the non-normal cell versus the normal cell, respectively. Expression levels can be readily determined by quantitative methods as described below.

For example, increases in IL17BR, CACNA1D, or HOXB13 expression can be indicated by ratios of or about 1.1, of or about 1.2, of or about 1.3, of or about 1.4, of or about 1.5, of or about 1.6, of or about 1.7, of or about 1.8, of or about 1.9, of or about 2, of or about 2.5, of or about 3, of or about 3.5, of or about 4, of or about 4.5, of or about 5, of or about 5.5, of or about 6, of or about 6.5, of or about 7, of or about 7.5, of or about 8, of or about 8.5, of or about 9, of or about 9.5, of or about 10, of or about 15, of or about 20, of or about 30, of or about 40, of or about 50, of or about 60, of or about 70, of or about 80, of or about 90, of or about 100, of or about 150, of or about 200, of or about 300, of or about 400, of or about 500, of or about 600, of or about 700, of or about 800, of or about 900, or of or about 1000. A ratio of 2 is a 100% (or a two-fold) increase in expression. Decreases in IL17BR, CACNA1D, or HOXB13 expression can be indicated by ratios of or about 0.9, of or about 0.8, of or about 0.7, of or about 0.6, of or about 0.5, of or about 0.4, of or about 0.3, of or about 0.2, of or about 0.1, of or about 0.05, of or about 0.01, of or about 0.005, of or about 0.001, of or about 0.0005, of or about 0.0001, of or about 0.00005, of or about 0.00001, of or about 0.000005, or of or about 0.000001.

For a given phenotype, a ratio of the expression of a gene sequence expressed at increased levels in correlation with the phenotype to the expression of a gene sequence expressed at decreased levels in correlation with the phenotype may also be used as an indicator of the phenotype. As a non-limiting example, the phenotype of non-responsiveness to tamoxifen treatment of breast cancer is correlated with increased expression of HOXB13 as well as decreased expression of IL17BR and CACNA1D. Therefore, a ratio of the expression levels of HOXB13 to IL17BR (or CACNA1D) may be used as an indicator of non-responsiveness.

A "polynucleotide" is a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides. This term refers only to the primary structure of the molecule. Thus, this term includes double- and single-stranded DNA and RNA. It also includes known types of modifications including labels known in the art, methylation, "caps", substitution of one or more of the naturally occurring nucleotides with an analog, and internucleotide modifications such as uncharged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), as well as unmodified forms of the polynucleotide.

The term "amplify" is used in the broad sense to mean creating an amplification product can be made enzymatically with DNA or RNA polymerases. "Amplification," as used herein, generally refers to the process of producing multiple copies of a desired sequence, particularly those of a sample. "Multiple copies" mean at least 2 copies. A "copy" does not necessarily mean perfect sequence complementarity or identity to the template sequence. Methods for amplifying mRNA are generally known in the art, and include reverse transcription PCR (RT-PCR) and those described in U.S. patent application Ser. No. 10/062,857 (filed on Oct. 25, 2001), as well as U.S. Provisional Patent Application 60/298,847 (filed Jun. 15, 2001) and 60/257,801 (filed Dec. 22, 2000), all of which are hereby incorporated by reference in their entireties as if fully set forth. Another method which may be used is quantitative PCR (or Q-PCR). Alternatively, RNA may be directly labeled as the corresponding cDNA by methods known in the art.

By "corresponding", it is meant that a nucleic acid molecule shares a substantial amount of sequence identity with another nucleic acid molecule. Substantial amount means at least 95%, usually at least 98% and more usually at least 99%, and sequence identity is determined using the BLAST algorithm, as described in Altschul et al. (1990), J. Mol. Biol. 215:403-410 (using the published default setting, i.e. parameters w=4, t=17).

A "microarray" is a linear or two-dimensional or three dimensional (and solid phase) array of preferably discrete regions, each having a defined area, formed on the surface of a solid support such as, but not limited to, glass, plastic, or synthetic membrane. The density of the discrete regions on a microarray is determined by the total numbers of immobilized polynucleotides to be detected on the surface of a single solid phase support, preferably at least about 50/cm.sup.2, more preferably at least about 100/cm.sup.2, even more preferably at least about 500/cm.sup.2, but preferably below about 1,000/cm.sup.2. Preferably, the arrays contain less than about 500, about 1000, about 1500, about 2000, about 2500, or about 3000 immobilized polynucleotides in total. As used herein, a DNA microarray is an array of oligonucleotides or polynucleotides placed on a chip or other surfaces used to hybridize to amplified or cloned polynucleotides from a sample. Since the position of each particular group of primers in the array is known, the identities of a sample polynucleotides can be determined based on their binding to a particular position in the microarray. As an alternative to the use of a microarray, an array of any size may be used in the practice of the invention, including an arrangement of one or more position of a two-dimensional or three dimensional arrangement in a solid phase to detect expression of a single gene sequence.

Because the invention relies upon the identification of genes that are over- or underexpressed, one embodiment of the invention involves determining expression by hybridization of mRNA, or an amplified or cloned version thereof, of a sample cell to a polynucleotide that is unique to a particular gene sequence. Preferred polynucleotides of this type contain at least about 16, at least about 18, at least about 20, at least about 22, at least about 24, at least about 26, at least about 28, at least about 30, or at least about 32 consecutive basepairs of a gene sequence that is not found in other gene sequences. The term "about" as used in the previous sentence refers to an increase or decrease of 1 from the stated numerical value. Even more preferred are polynucleotides of at least or about 50, at least or about 100, at least about or 150, at least or about 200, at least or about 250, at least or about 300, at least or about 350, at least or about 400, at least or about 450, or at least or about 500 consecutive bases of a sequence that is not found in other gene sequences. The term "about" as used in the preceding sentence refers to an increase or decrease of 10% from the stated numerical value. Longer polynucleotides may of course contain minor mismatches (e.g. via the presence of mutations) which do not affect hybridization to the nucleic acids of a sample. Such polynucleotides may also be referred to as polynucleotide probes that are capable of hybridizing to sequences of the genes, or unique portions thereof, described herein. Such polynucleotides may be labeled to assist in their detection. Preferably, the sequences are those of mRNA encoded by the genes, the corresponding cDNA to such mRNAs, and/or amplified versions of such sequences. In preferred embodiments of the invention, the polynucleotide probes are immobilized on an array, other solid support devices, or in individual spots that localize the probes.

In another embodiment of the invention, all or part of a disclosed sequence may be amplified and detected by methods such as the polymerase chain reaction (PCR) and variations thereof, such as, but not limited to, quantitative PCR (Q-PCR), reverse transcription PCR (RT-PCR), and real-time PCR (including as a means of measuring the initial amounts of mRNA copies for each sequence in a sample), optionally real-time RT-PCR or real-time Q-PCR. Such methods would utilize one or two primers that are complementary to portions of a disclosed sequence, where the primers are used to prime nucleic acid synthesis. The newly synthesized nucleic acids are optionally labeled and may be detected directly or by hybridization to a polynucleotide of the invention. The newly synthesized nucleic acids may be contacted with polynucleotides (containing sequences) of the invention under conditions which allow for their hybridization. Additional methods to detect the expression of expressed nucleic acids include RNAse protection assays, including liquid phase hybridizations, and in situ hybridization of cells.

Alternatively, and in yet another embodiment of the invention, gene expression may be determined by analysis of expressed protein in a cell sample of interest by use of one or more antibodies specific for one or more epitopes of individual gene products (proteins), or proteolytic fragments thereof, in said cell sample or in a bodily fluid of a subject. The cell sample may be one of breast cancer epithelial cells enriched from the blood of a subject, such as by use of labeled antibodies against cell surface markers followed by fluorescence activated cell sorting (FACS). Such antibodies are preferably labeled to permit their easy detection after binding to the gene product. Detection methodologies suitable for use in the practice of the invention include, but are not limited to, immunohistochemistry of cell containing samples or tissue, enzyme linked immunosorbent assays (ELISAs) including antibody sandwich assays of cell containing tissues or blood samples, mass spectroscopy, and immuno-PCR.

The term "label" refers to a composition capable of producing a detectable signal indicative of the presence of the labeled molecule. Suitable labels include radioisotopes, nucleotide chromophores, enzymes, substrates, fluorescent molecules, chemiluminescent moieties, magnetic particles, bioluminescent moieties, and the like. As such, a label is any composition detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means.

The term "support" refers to conventional supports such as beads, particles, dipsticks, fibers, filters, membranes and silane or silicate supports such as glass slides.

As used herein, a "breast tissue sample" or "breast cell sample" refers to a sample of breast tissue or fluid isolated from an individual suspected of being afflicted with, or at risk of developing, breast cancer. Such samples are primary isolates (in contrast to cultured cells) and may be collected by any non-invasive or minimally invasive means, including, but not limited to, ductal lavage, fine needle aspiration, needle biopsy, the devices and methods described in U.S. Pat. No. 6,328,709, or any other suitable means recognized in the art. Alternatively, the "sample" may be collected by an invasive method, including, but not limited to, surgical biopsy.

"Expression" and "gene expression" include transcription and/or translation of nucleic acid material.

As used herein, the term "comprising" and its cognates are used in their inclusive sense; that is, equivalent to the term "including" and its corresponding cognates.

Conditions that "allow" an event to occur or conditions that are "suitable" for an event to occur, such as hybridization, strand extension, and the like, or "suitable" conditions are conditions that do not prevent such events from occurring. Thus, these conditions permit, enhance, facilitate, and/or are conducive to the event. Such conditions, known in the art and described herein, depend upon, for example, the nature of the nucleotide sequence, temperature, and buffer conditions. These conditions also depend on what event is desired, such as hybridization, cleavage, strand extension or transcription.

Sequence "mutation," as used herein, refers to any sequence alteration in the sequence of a gene disclosed herein interest in comparison to a reference sequence. A sequence mutation includes single nucleotide changes, or alterations of more than one nucleotide in a sequence, due to mechanisms such as substitution, deletion or insertion. Single nucleotide polymorphism (SNP) is also a sequence mutation as used herein. Because the present invention is based on the relative level of gene expression, mutations in non-coding regions of genes as disclosed herein may also be assayed in the practice of the invention.

"Detection" includes any means of detecting, including direct and indirect detection of gene expression and changes therein. For example, "detectably less" products may be observed directly or indirectly, and the term indicates any reduction (including the absence of detectable signal). Similarly, "detectably more" product means any increase, whether observed directly or indirectly.

Increases and decreases in expression of the disclosed sequences are defined in the following terms based upon percent or fold changes over expression in normal cells. Increases may be of 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, or 200% relative to expression levels in normal cells. Alternatively, fold increases may be of 1, 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5, or 10 fold over expression levels in normal cells. Decreases may be of 10, 20, 30, 40, 50, 55, 60, 65, 70, 75, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 99 or 100% relative to expression levels in normal cells.

Unless defined otherwise all technical and scientific terms used herein have the same meaning as commonly understood to one of ordinary skill in the art to which this invention belongs.

Embodiments of the Invention

In a first aspect, the disclosed invention relates to the identification and use of gene expression patterns (or profiles or "signatures") which discriminate between (or are correlated with) breast cancer survival in a subject treated with tamoxifen (TAM) or another "antiestrogen" agent against breast cancer. Such patterns may be determined by the methods of the invention by use of a number of reference cell or tissue samples, such as those reviewed by a pathologist of ordinary skill in the pathology of breast cancer, which reflect breast cancer cells as opposed to normal or other non-cancerous cells. The outcomes experienced by the subjects from whom the samples may be correlated with expression data to identify patterns that correlate with the outcomes following treatment with TAM or another "antiestrogen" agent against breast cancer. Because the overall gene expression profile differs from person to person, cancer to cancer, and cancer cell to cancer cell, correlations between certain cells and genes expressed or underexpressed may be made as disclosed herein to identify genes that are capable of discriminating between breast cancer outcomes.

The present invention may be practiced with any number of the genes believed, or likely to be, differentially expressed with respect to breast cancer outcomes, particularly in cases of ER+ breast cancer. The identification may be made by using expression profiles of various homogenous breast cancer cell populations, which were isolated by microdissection, such as, but not limited to, laser capture microdissection (LCM) of 100-1000 cells. The expression level of each gene of the expression profile may be correlated with a particular outcome. Alternatively, the expression levels of multiple genes may be clustered to identify correlations with particular outcomes.

Genes with significant correlations to breast cancer survival when the subject is treated with tamoxifen may be used to generate models of gene expressions that would maximally discriminate between outcomes where a subject responds to treatment with tamoxifen or another "antiestrogen" agent against breast cancer and outcomes where the treatment is not successful. Alternatively, genes with significant correlations may be used in combination with genes with lower correlations without significant loss of ability to discriminate between outcomes. Such models may be generated by any appropriate means recognized in the art, including, but not limited to, cluster analysis, supported vector machines, neural networks or other algorithm known in the art. The models are capable of predicting the classification of a unknown sample based upon the expression of the genes used for discrimination in the models. "Leave one out" cross-validation may be used to test the performance of various models and to help identify weights (genes) that are uninformative or detrimental to the predictive ability of the models. Cross-validation may also be used to identify genes that enhance the predictive ability of the models.

The gene(s) identified as correlated with particular breast cancer outcomes relating to tamoxifen treatment by the above models provide the ability to focus gene expression analysis to only those genes that contribute to the ability to identify a subject as likely to have a particular outcome relative to another. The expression of other genes in a breast cancer cell would be relatively unable to provide information concerning, and thus assist in the discrimination of, a breast cancer outcome.

As will be appreciated by those skilled in the art, the models are highly useful with even a small set of reference gene expression data and can become increasingly accurate with the inclusion of more reference data although the incremental increase in accuracy will likely diminish with each additional datum. The preparation of additional reference gene expression data using genes identified and disclosed herein for discriminating between different outcomes in breast cancer following treatment with tamoxifen or another "antiestrogen" agent against breast cancer is routine and may be readily performed by the skilled artisan to permit the generation of models as described above to predict the status of an unknown sample based upon the expression levels of those genes.

To determine the (increased or decreased) expression levels of genes in the practice of the present invention, any method known in the art may be utilized. In one preferred embodiment of the invention, expression based on detection of RNA which hybridizes to the genes identified and disclosed herein is used. This is readily performed by any RNA detection or amplification+detection method known or recognized as equivalent in the art such as, but not limited to, reverse transcription-PCR, the methods disclosed in U.S. patent application Ser. No. 10/062,857 (filed on Oct. 25, 2001) as well as U.S. Provisional Patent Application 60/298,847 (filed Jun. 15, 2001) and 60/257,801 (filed Dec. 22, 2000), and methods to detect the presence, or absence, of RNA stabilizing or destabilizing sequences.

Alternatively, expression based on detection of DNA status may be used. Detection of the DNA of an identified gene as methylated or deleted may be used for genes that have decreased expression in correlation with a particular breast cancer outcome. This may be readily performed by PCR based methods known in the art, including, but not limited to, Q-PCR. Conversely, detection of the DNA of an identified gene as amplified may be used for genes that have increased expression in correlation with a particular breast cancer outcome. This may be readily performed by PCR based, fluorescent in situ hybridization (FISH) and chromosome in situ hybridization (CISH) methods known in the art.

Expression based on detection of a presence, increase, or decrease in protein levels or activity may also be used. Detection may be performed by any immunohistochemistry (IHC) based, blood based (especially for secreted proteins), antibody (including autoantibodies against the protein) based, exfoliate cell (from the cancer) based, mass spectroscopy based, and image (including used of labeled ligand) based method known in the art and recognized as appropriate for the detection of the protein. Antibody and image based methods are additionally useful for the localization of tumors after determination of cancer by use of cells obtained by a non-invasive procedure (such as ductal lavage or fine needle aspiration), where the source of the cancerous cells is not known. A labeled antibody or ligand may be used to localize the carcinoma(s) within a patient or to assist in the enrichment of exfoliated cancer cells from a bodily fluid.

A preferred embodiment using a nucleic acid based assay to determine expression is by immobilization of one or more sequences of the genes identified herein on a solid support, including, but not limited to, a solid substrate as an array or to beads or bead based technology as known in the art. Alternatively, solution based expression assays known in the art may also be used. The immobilized gene(s) may be in the form of polynucleotides that are unique or otherwise specific to the gene(s) such that the polynucleotide would be capable of hybridizing to a DNA or RNA corresponding to the gene(s). These polynucleotides may be the full length of the gene(s) or be short sequences of the genes (up to one nucleotide shorter than the full length sequence known in the art by deletion from the 5' or 3' end of the sequence) that are optionally minimally interrupted (such as by mismatches or inserted non-complementary basepairs) such that hybridization with a DNA or RNA corresponding to the gene(s) is not affected. Preferably, the polynucleotides used are from the 3' end of the gene, such as within about 350, about 300, about 250, about 200, about 150, about 100, or about 50 nucleotides from the polyadenylation signal or polyadenylation site of a gene or expressed sequence. Polynucleotides containing mutations relative to the sequences of the disclosed genes may also be used so long as the presence of the mutations still allows hybridization to produce a detectable signal.

The immobilized gene(s) may be used to determine the state of nucleic acid samples prepared from sample breast cell(s) for which the outcome of the sample's subject (e.g. patient from whom the sample is obtained) is not known or for confirmation of an outcome that is already assigned to the sample's subject. Without limiting the invention, such a cell may be from a patient with ER+ or ER- breast cancer. The immobilized polynucleotide(s) need only be sufficient to specifically hybridize to the corresponding nucleic acid molecules derived from the sample under suitable conditions. While even a single correlated gene sequence may to able to provide adequate accuracy in discriminating between two breast cancer outcomes, two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, ten or more, or eleven or more of the genes identified herein may be used as a subset capable of discriminating may be used in combination to increase the accuracy of the method. The invention specifically contemplates the selection of more than one, two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, ten or more, or eleven or more of the genes disclosed in the tables and figures herein for use as a subset in the identification of breast cancer survival outcome.

Of course 1 or more, 2 or more, 3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or more, 9 or more, or all the genes provided in Tables 2 and/or 3 below may be used. "Accession" as used in the context of the Tables herein as well as the present invention refers to the GenBank accession number of a sequence of each gene, the sequences of which are hereby incorporated by reference in their entireties as they are available from GenBank as accessed on the filing date of the present application. P value refers to values assigned as described in the Examples below. The indications of "E-xx" where "xx" is a two digit number refers to alternative notation for exponential figures where "E-xx" is "10.sup.-xx". Thus in combination with the numbers to the left of "E-xx", the value being represented is the numbers to the left times 10.sup.-xx. "Description" as used in the Tables provides a brief identifier of what the sequence/gene encodes.

Genes with a correlation identified by a p value below or about 0.02, below or about 0.01, below or about 0.005, or below or about 0.001 are preferred for use in the practice of the invention. The present invention includes the use of gene(s) the expression of which identify different breast cancer outcomes after treatment with TAM or another "antiestrogen" agent against breast cancer to permit simultaneous identification of breast cancer survival outcome of a patient based upon assaying a breast cancer sample from said patient.

In a second aspect, the present invention relates to the identification and use of three sets of sequences for the determination of responsiveness of ER+ breast cancer to treatment with TAM or another "antiestrogen" agent against breast cancer. The differential expression of these sequences in breast cancer relative to normal breast cells is used to predict responsiveness to TAM or another "antiestrogen" agent against breast cancer in a subject.

To identify gene expression patterns in ER positive, early stage invasive breast cancers that might predict response to hormonal therapy, microarray gene expression analysis was performed on tumors from 60 women uniformly treated with adjuvant tamoxifen alone. These patients were identified from a total of 103 ER+ early stage cases presenting to Massachusetts General Hospital between 1987 and 1997, from whom tumor specimens were snap frozen and for whom minimal 5 year follow-up was available (see Table 1 for details). Within this cohort, 28 (46%) women developed distant metastasis with a median time to recurrence of 4 years ("tamoxifen non-responders") and 32 (54%) women remained disease-free with median follow-up of 10 years ("tamoxifen responders"). Responders were matched with non-responder cases with respect to TNM staging (see Singletary, S. E. et al. "Revision of the American Joint Committee on Cancer staging system for breast cancer." J Clin Oncol 20, 3628-36 (2002)) and tumor grade (see Dalton, L. W. et al. "Histologic grading of breast cancer: linkage of patient outcome with level of pathologist agreement." Mod Pathol 13, 730-5. (2000)).

Previous studies linking gene expression profiles to clinical outcome in breast cancer have demonstrated that the potential for distant metastasis and overall survival probability may be predictable through biological characteristics of the primary tumor at the time of diagnosis (see Huang, E. et al. "Gene expression predictors of breast cancer outcomes." Lancet 361, 1590-6 (2003); Sorlie, T. et al. "Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications." Proc Natl Acad Sci USA 98:10869-74 (2001); Sorlie, T. et al. "Repeated observation of breast tumor subtypes in independent gene expression data sets." Proc Natl Acad Sci USA 100, 8418-23 (2003); Sotiriou, C. et al. "Breast cancer classification and prognosis based on gene expression profiles from a population-based study." Proc Natl Acad Sci USA 100, 10393-8 (2003); van't Veer, L. J. et al. "Gene expression profiling predicts clinical outcome of breast cancer." Nature 415, 530-6 (2002); and van de Vijver, M. J. et al. "A gene-expression signature as a predictor of survival in breast cancer." N Engl J Med 347, 1999-2009 (2002)). In particular, a 70-gene expression signature has proven to be a strong prognostic factor, out-performing all known clinicopathological parameters. However, in those studies patients either received no adjuvant therapy (van't Veer, L. J. et al. Nature 2002) or were treated non-uniformly with hormonal and chemotherapeutic regimens (Huang, E. et al.; Sorlie, T. et al.; Sorlie, T. et al.; Sotiriou, C. et al.; and van de Vijver, M. J. et al. N Engl J Med 2002). Patients with ER+ early-stage breast cancer treated with tamoxifen alone, such as the cohort studied here, represent only a subset of the population tested with the 70-gene signature. Of note, 61 of the genes in the 70-gene signature were present on the microarray used as described below, but no significant association with clinical outcome was observed in the defined subset of patients.

In comparison with existing biomarkers, including ESR1, PGR, ERBB2 and EGFR, three sets of gene sequences disclosed herein are significantly more predictive of responsiveness to TAM treatment. Multivariate analysis indicated that these three genes were significant predictors of clinical outcome independent of tumor size, nodal status and tumor grade. ER and progesterone receptor (PR) expression have been the major clinicopathological predictors for response to TAM. However, up to 40% of ER+ tumors fail to respond or develop resistance to TAM. The invention thus provides for the use of the identified biomarkers to allow better patient management by identifying patients who are more likely to benefit from TAM or other endocrine therapy and those who are likely to develop resistance and tumor recurrence.

As noted herein, the sequences(s) identified by the present invention are expressed in correlation with ER+ breast cancer cells. For example, IL17BR, identified by I.M.A.G.E. Consortium Clusters NM_018725 and NM_172234 ("The I.M.A.G.E. Consortium: An Integrated Molecular Analysis of Genomes and their Expression," Lennon et al., 1996, Genomics 33:151-152; see also image.llnl.gov) has been found to be useful in predicting responsiveness to TAM treatment.

In preferred embodiments of the invention, any sequence, or unique portion thereof, of the IL17BR sequences of the cluster, as well as the UniGene Homo sapiens cluster Hs.5470, may be used. Similarly, any sequence encoding all or a part of the protein encoded by any IL17BR sequence disclosed herein may be used. Consensus sequences of I.M.A.G.E. Consortium clusters are as follows, with the assigned coding region (ending with a termination codon) underlined and preceded by the 5' untranslated and/or non-coding region and followed by the 3' untranslated and/or non-coding region:

SEQ ID NO:1 (Consensus Sequence for IL17BR, Transcript Variant 1, Identified as NM_018725 or NM_018725.2)

TABLE-US-00001 agcgcagcgt gcgggtggcc tggatcccgc gcagtggccc ggcgatgtcg ctcgtgctgc taagcctggc cgcgctgtgc aggagcgccg taccccgaga gccgaccgtt caatgtggct ctgaaactgg gccatctcca gagtggatgc tacaacatga tctaatcccc ggagacttga gggacctccg agtagaacct gttacaacta gtgttgcaac aggggactat tcaattttga tgaatgtaag ctgggtactc cgggcagatg ccagcatccg cttgttgaag gccaccaaga tttgtgtgac gggcaaaagc aacttccagt cctacagctg tgtgaggtgc aattacacag aggccttcca gactcagacc agaccctctg gtggtaaatg gacattttcc tacatcggct tccctgtaga gctgaacaca gtctatttca ttggggccca taatattcct aatgcaaata tgaatgaaga tggcccttcc atgtctgtga atttcacctc accaggctgc ctagaccaca taatgaaata taaaaaaaag tgtgtcaagg ccggaagcct gtgggatccg aacatcactg cttgtaagaa gaatgaggag acagtagaag tgaacttcac aaccactccc ctgggaaaca gatacatggc tcttatccaa cacagcacta tcatcgggtt ttctcaggtg tttgagccac accagaagaa acaaacgcga gcttcagtgg tgattccagt gactggggat agtgaaggtg ctacggtgca gctgactcca tattttccta cttgtggcag cgactgcatc cgacataaag gaacagttgt gctctgccca caaacaggcg tccctttccc tctggataac aacaaaagca agccgggagg ctggctgcct ctcctcctgc tgtctctgct ggtggccaca tgggtgctgg tggcagggat ctatctaatg tggaggcacg aaaggatcaa gaagacttcc ttttctacca ccacactact gccccccatt aaggttcttg tggtttaccc atctgaaata tgtttccatc acacaatttg ttacttcact gaatttcttc aaaaccattg cagaagtgag gtcatccttg aaaagtggca gaaaaagaaa atagcagaga tgggtccagt gcagtggctt gccactcaaa agaaggcagc agacaaagtc gtcttccttc tttccaatga cgtcaacagt gtgtgcgatg gtacctgtgg caagagcgag ggcagtccca gtgagaactc tcaagacctc ttcccccttg cctttaacct tttctgcagt gatctaagaa gccagattca tctgcacaaa tacgtggtgg tctactttag agagattgat acaaaagacg attacaatgc tctcagtgtc tgccccaagt accacctcat gaaggatgcc actgctttct gtgcagaact tctccatgtc aagcagcagg tgtcagcagg aaaaagatca caagcctgcc acgatggctg ctgctccttg tagcccaccc atgagaagca agagacctta aaggcttcct atcccaccaa ttacagggaa aaaacgtgtg atgatcctga agcttactat gcagcctaca aacagcctta gtaattaaaa cattttatac caataaaatt ttcaaatatt gctaactaat gtagcattaa ctaacgattg gaaactacat ttacaacttc aaagctgttt tatacataga aatcaattac agttttaatt gaaaactata accattttga taatgcaaca ataaagcatc ttcagccaaa catctagtct tccatagacc atgcattgca gtgtacccag aactgtttag ctaatattct atgtttaatt aatgaatact aactctaaga acccctcact gattcactca atagcatctt aagtgaaaaa ccttctatta catgcaaaaa atcattgttt ttaagataac aaaagtaggg aataaacaag ctgaacccac ttttaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa

SEQ ID NO:2 (Consensus Sequence for IL17BR, Transcript Variant 2, Identified as NM_172234 or NM_172234.1)

TABLE-US-00002 agcgcagcgt gcgggtggcc tggatcccgc gcagtggccc ggcgatgtcg ctcgtgctgc taagcctggc cgcgctgtgc aggagcgccg taccccgaga gccgaccgtt caatgtggct ctgaaactgg gccatctcca gagtggatgc tacaacatga tctaatcccc ggagacttga gggacctccg agtagaacct gttacaacta gtgttgcaac aggggactat tcaattttga tgaatgtaag ctgggtactc cgggcagatg ccagcatccg cttgttgaag gccaccaaga tttgtgtgac gggcaaaagc aacttccagt cctacagctg tgtgaggtgc aattacacag aggccttcca gactcagacc agaccctctg gtggtaaatg gacattttcc tacatcggct tccctgtaga gctgaacaca gtctatttca ttggggccca taatattcct aatgcaaata tgaatgaaga tggcccttcc atgtctgtga atttcacctc accaggctgc ctagaccaca taatgaaata taaaaaaaag tgtgtcaagg ccggaagcct gtgggatccg aacatcactg cttgtaagaa gaatgaggag acagtagaag tgaacttcac aaccactccc ctgggaaaca gatacatggc tcttatccaa cacagcacta tcatcgggtt ttctcaggtg tttgagccac accagaagaa acaaacgcga gcttcagtgg tgattccagt gactggggat agtgaaggtg ctacggtgca ggtaaagttc agtgagctgc tctggggagg gaagggacat agaagactgt tccatcattc attgctttta aggatgagtt ctctcttgtc aaatgcactt ctgccagcag acaccagtta agtggcgttc atgggggctc tttcgctgca gcctccaccg tgctgaggtc aggaggccga cgtggcagtt gtggtccctt ttgcttgtat taatggctgc tgaccttcca aagcactttt tattttcatt ttctgtcaca gacactcagg gatagcagta ccattttact tccgcaagcc tttaactgca agatgaagct gcaaagggtt tgaaatggga aggtttgagt tccaggcagc gtatgaactc tggagagggg ctgccagtcc tctctgggcc gcagcggacc cagctggaac acaggaagtt ggagcagtag gtgctccttc acctctcagt atgtctcttt caactctagt ttttgaggtg gggacacagg aggtccagtg ggacacagcc actccccaaa gagtaaggag cttccatgct tcattccctg gcataaaaag tgctcaaaca caccagaggg ggcaggcacc agccagggta tgatggctac tacccttttc tggagaacca tagacttccc ttactacagg gacttgcatg tcctaaagca ctggctgaag gaagccaaga ggatcactgc tgctcctttt ttctagagga aatgtttgtc tacgtggtaa gatatgacct agccctttta ggtaagcgaa ctggtatgtt agtaacgtgt acaaagttta ggttcagacc ccgggagtct tgggcacgtg ggtctcgggt cactggtttt gactttaggg ctttgttaca gatgtgtgac caaggggaaa atgtgcatga caacactaga ggtatgggcg aagccagaaa gaagggaagt tttggctgaa gtaggagtct tggtgagatt ttgctctgat gcatggtgtg aactttctga gcctcttgtt tttcctcagc tgactccata ttttcctact tgtggcagcg actgcatccg acataaagga acagttgtgc tctgcccaca aacaggcgtc cctttccctc tggataacaa caaaagcaag ccgggaggct ggctgcctct cctcctgctg tctctgctgg tggccacatg ggtgctggtg gcagggatct atctaatgtg gaggcacgaa aggatcaaga agacttcctt ttctaccacc acactactgc cccccattaa ggttcttgtg gtttacccat ctgaaatatg tttccatcac acaatttgtt acttcactga atttcttcaa aaccattgca gaagtgaggt catccttgaa aagtggcaga aaaagaaaat agcagagatg ggtccagtgc agtggcttgc cactcaaaag aaggcagcag acaaagtcgt cttccttctt tccaatgacg tcaacagtgt gtgcgatggt acctgtggca agagcgaggg cagtcccagt gagaactctc aagacctctt cccccttgcc tttaaccttt tctgcagtga tctaagaagc cagattcatc tgcacaaata cgtggtggtc tactttagag agattgatac aaaagacgat tacaatgctc tcagtgtctg ccccaagtac cacctcatga aggatgccac tgctttctgt gcagaacttc tccatgtcaa gcagcaggtg tcagcaggaa aaagatcaca agcctgccac gatggctgct gctccttgta gcccacccat gagaagcaag agaccttaaa ggcttcctat cccaccaatt acagggaaaa aacgtgtgat gatcctgaag cttactatgc agcctacaaa cagccttagt aattaaaaca ttttatacca ataaaatttt caaatattgc taactaatgt agcattaact aacgattgga aactacattt acaacttcaa agctgtttta tacatagaaa tcaattacag ttttaattga aaactataac cattttgata atgcaacaat aaagcatctt cagccaaaca tctagtcttc catagaccat gcattgcagt gtacccagaa ctgtttagct aatattctat gtttaattaa tgaatactaa ctctaagaac ccctcactga ttcactcaat agcatcttaa gtgaaaaacc ttctattaca tgcaaaaaat cattgttttt aagataacaa aagtagggaa taaacaagct gaacccactt ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa

I.M.A.G.E. Consortium Clone ID numbers and the corresponding GenBank accession numbers of sequences identified as belonging to the I.M.A.G.E. Consortium and UniGene clusters, are listed below. Also included are sequences that are not identified as having a Clone ID number but still identified as being those of IL17RB. The sequences include those of the "sense" and complementary strands sequences corresponding to IL17RB. The sequence of each GenBank accession number is presented in the Sequence Listing.

TABLE-US-00003 TABLE (I) Clone ID numbers GenBank accession numbers 2985728 AW675096, AW673932, BC000980 5286745 BI602183 5278067 BI458542 5182255 BI823321 924000 AA514396 3566736 BF110326 3195409 BE466508 3576775 BF740045 2772915 AW299271 1368826 AA836217 1744837 AI203628 2285564 AI627783 2217709 AI744263 2103651 AI401622 2419487 AI826949 3125592 BE047352 2284721 AI911549 3643302 BF194822 1646910 AI034244 1647001 AI033911 3323709 BF064177 1419779 AA847767 2205190 AI538624 2295838 AI913613 2461335 AI942234 2130362 AI580483 2385555 AI831909 2283817 AI672344 2525596 AW025192 454687 AA677205 1285273 AA721647 3134106 BF115018 342259 W61238, W61239 1651991 AI032064 2687714 AW236941 3302808 BG057174 2544461 AW058532 122014 T98360, T98361 2139250 AI470845 2133899 AI497731 121300 T96629, T96740 162274 H25975, H25941 3446667 BE539514, BX282554 156864 R74038, R74129 4611491 BG433769 4697316 BG530489 429376 AA007528, AA007529 5112415 BI260259 701357 AA287951, AA287911 121909 T97852, T97745 268037 N40294 1307489 AA809841 1357543 AA832389 48442 H14692 1302619 AA732635 1562857 AA928257 1731938 AI184427 1896025 AI298577 2336350 AI692717 1520997 AA910922 240506 H90761 2258560 AI620122 1569921 AI793318, AA962325, AI733290 6064627 BQ226353 299018 W04890 5500181 BM455231 2484011 BI492426 4746376 BG674622 233783 BX111256 1569921 BX117618 450450 AA682806 1943085 AI202376 2250390 AI658949 4526156 BG403405 3249181 BE673417 2484395 AW021469 30515867 CF455736 2878155 AW339874 4556884 BG399724 3254505 BF475787 3650593 BF437145 233783 H64601 None AF212365, AF208110, AF208111, AF250309, (mRNA AK095091 sequences) None BM983744, CB305764, BM715988, BM670929, BI792416, BI715216, N56060, CB241389, AV660618, BX088671, CB154426, CA434589, CA412162, CA314073, BF921554, BF920093, AV685699, AV650175, BX483104, CD675121, BE081436, AW970151, AW837146, AW368264, D25960, AV709899, BX431018, AL535617, AL525465, BX453536, BX453537, AV728945, AV728939, AV727345

In one preferred embodiment, any sequence, or unique portion thereof, of the following IL17BR sequence, identified by AF208111 or AF208111.1, may be used in the practice of the invention.

SEQ ID NO:3 (Sequence for IL17BR):

TABLE-US-00004 CGGCGATGTCGCTCGTGCTGATAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAG AGCCGACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATG ATCTAATCCCCGGAGACTTGAGGGACCTCCGAGTAGAACCTGTTACAACTAGTGTTGCAA CAGGGGACTATTCAATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCC GCTTGTTGAAGGCCACCAAGATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCT GTGTGAGGTGCAATTACACAGAGGCCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAAT GGACATTTTCCTATATCGGCTTCCCTGTAGAGCTGAACACAGTCTATTTCATTGGGGCCC ATAATATTCCTAATGCAAATATGAATGAAGATGGCCCTTCCATGTCTGTGAATTTCACCT CACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAAAGTGTGTCAAGGCCGGAAGCC TGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACAGTAGAAGTGAACTTCA CAACCACTCCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCACTATCATCGGGT TTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGATTCCAG TGACTGGGGATAGTGAAGGTGCTACGGTGCAGGTAAAGTTCAGTGAGCTGCTCTGGGGAG GGAAGGGACATAGAAGACTGTTCCATCATTCATTGCTTTTAAGGATGAGTTCTCTCTTGT CAAATGCACTTCTGCCAGCAGACACCAGTTAAGTGGCGTTCATGGGGGTTCTTTCGCTGC AGCCTCCACCGTGCTGAGGTCAGGAGGCCGACGTGGCAGTTGTGGTCCCTTTTGCTTGTA TTAATGGCTGCTGACCTTCCAAAGCACTTTTTATTTTCATTTTCTGTCACAGACACTCAG GGATAGCAGTACCATTTTACTTCCGCAAGCCTTTAACTGCAAGATGAAGCTGCAAAGGGT TTGAAATGGGAAGGTTTGAGTTCCAGGCAGCGTATGAACTCTGGAGAGGGGCTGCCAGTC CTCTCTGGGCCGCAGCGGACCCAGCTGGAACACAGGAAGTTGGAGCAGTAGGTGCTCCTT CACCTCTCAGTATGTCTCTTTCAACTCTAGTTTTTGAAGTGGGGACACAGGAAGTCCAGT GGGGACACAGCCACTCCCCPAAGAATAAGGAACTTCCATGCTTCATTCCCTGGCATAAAA AGTGNTCAAACACACCAGAGGGGGCAGGCACCAGCCAGGGTATGATGGGTACTACCCTTT TCTGGAGAACCATAGACTTCCCTTACTACAGGGACTTGCATGTCCTAAAGCACTGGCTGA AGGAAGCCAAGAGGATCACTGCTGCTCCTTTTTTGTAGAGGAAATGTTTGTGTACGTGGT AAGATATGACCTAGCCCTTTTAGGTAAGCGAACTGGTATGTTAGTAACGTGTACAAAGTT TAGGTTCAGACCCCGGGAGTCTTGGGCATGTGGGTCTCGGGTCACTGGTTTTGACTTTAG GGCTTTGTTACAGATGTGTGACCAAGGGGAAAATGTGCATGACAACACTAGAGGTAGGGG CGAAGCCAGAAAGAAGGGAAGTTTTGGCTGAAGTAGGAGTCTTGGTGAGATTTTGCTGTG ATGCATGGTGTGAACTTTCTGAGCCTCTTGTTTTTCCTCAGCTGACTCCATATTTTCCTA CTTGTGGCAGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCG TCCCTTTCCCTCTGGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGC TGTCTCTGCTGGTGGCCACATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACG AAAGGATCAAGAAGACTTCCTTTTCTACCACCACACTACTGCCCCCCATTAAGGTTCTTG TGGTTTACCCATCTGAAATATGTTTCCATCACACAATTTGTTACTTCACTGAATTTCTTC AAAACCATTGCAGAAGTGAGGTCATCCTTGAAAAGTGGCAGAAAAAGAAAATAGCAGAGA TGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCAGCAGACAAAGTCGTCTTCCTTC TTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAAGAGCGAGGGCAGTCCCA GTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCAGTGATCTAAGAA GCCAGATTCATCTGCACAAATACGTGGTGGTCTACTTTAGAGAGATTGATACAAAAGACG ATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACTTCATGAAGGATGCCACTGCTTTCT GTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCC ACGATGGCTGCTGCTCCTTGTAGCCCACCCATGAGAAGCAAGAGACCTTAAAGGCTTCCT ATCCCACCAATTACAGGGAAAAAACGTGTGATGATCCTGAAGCTTACTATGCAGCCTACA AACAGCCTTAGTAATTAAAACATTTTATACCAATAAAATTTTCAAATATTACTAACTAAT GTAGCATTAACTAACGATTGGAAACTACATTTACAACTTCAAAGCTGTTTTATACATAGA AATCAATTACAGCTTTAATTGAAAACTGTAACCATTTTGATAATGCAACAATAAAGCATC TTCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

In another set of preferred embodiments of the invention, any sequence, or unique portion thereof, of the CACNA1D sequences of the I.M.A.G.E. Consortium cluster NM_000720, as well as the UniGene Homo sapiens cluster Hs.399966, may be used. Similarly, any sequence encoding all or a part of the protein encoded by any CACNA1D sequence disclosed herein may be used. The consensus sequence of the I.M.A.G.E. Consortium cluster is as follows, with the assigned coding region (ending with a termination codon) underlined and preceded by the 5' untranslated and/or non-coding region and followed by the 3' untranslated and/or non-coding region:

SEQ ID NO:4 (Consensus Sequence for CACNA1D, Identified as NM_000720 or NM_000720.1)

TABLE-US-00005 agaataaggg cagggaccgc ggctcctatc tcttggtgat ccccttcccc attccgcccc cgcctcaacg cccagcacag tgccctgcac acagtagtcg ctcaataaat gttcgtggat gatgatgatg atgatgatga aaaaaatgca gcatcaacgg cagcagcaag cggaccacgc gaacgaggca aactatgcaa gaggcaccag acttcctctt tctggtgaag gaccaacttc tcagccgaat agctccaagc aaactgtcct gtcttggcaa gctgcaatcg atgctgctag acaggccaag gctgcccaaa ctatgagcac ctctgcaccc ccacctgtag gatctctctc ccaaagaaaa cgtcagcaat acgccaagag caaaaaacag ggtaactcgt ccaacagccg acctgcccgc gcccttttct gtttatcact caataacccc atccgaagag cctgcattag tatagtggaa tggaaaccat ttgacatatt tatattattg gctatttttg ccaattgtgt ggccttagct atttacatcc cattccctga agatgattct aattcaacaa atcataactt ggaaaaagta gaatatgcct tcctgattat ttttacagtc gagacatttt tgaagattat agcgtatgga ttattgctac atcctaatgc ttatgttagg aatggatgga atttactgga ttttgttata gtaatagtag gattgtttag tgtaattttg gaacaattaa ccaaagaaac agaaggcggg aaccactcaa gcggcaaatc tggaggcttt gatgtcaaag ccctccgtgc ctttcgagtg ttgcgaccac ttcgactagt gtcaggggtg cccagtttac aagttgtcct gaactccatt ataaaagcca tggttcccct ccttcacata gcccttttgg tattatttgt aatcataatc tatgctatta taggattgga actttttatt ggaaaaatgc acaaaacatg tttttttgct gactcagata tcgtagctga agaggaccca gctccatgtg cgttctcagg gaatggacgc cagtgtactg ccaatggcac ggaatgtagg agtggctggg ttggcccgaa cggaggcatc accaactttg ataactttgc ctttgccatg cttactgtgt ttcagtgcat caccatggag ggctggacag acgtgctcta ctgggtaaat gatgcgatag gatgggaatg gccatgggtg tattttgtta gtctgatcat ccttggctca tttttcgtcc ttaacctggt tcttggtgtc cttagtggag aattctcaaa ggaaagagag aaggcaaaag cacggggaga tttccagaag ctccgggaga agcagcagct ggaggaggat ctaaagggct acttggattg gatcacccaa gctgaggaca tcgatccgga gaatgaggaa gaaggaggag aggaaggcaa acgaaatact agcatgccca ccagcgagac tgagtctgtg aacacagaga acgtcagcgg tgaaggcgag aaccgaggct gctgtggaag tctctggtgc tggtggagac ggagaggcgc ggccaaggcg gggccctctg ggtgtcggcg gtggggtcaa gccatctcaa aatccaaact cagccgacgc tggcgtcgct ggaaccgatt caatcgcaga agatgtaggg ccgccgtgaa gtctgtcacg ttttactggc tggttatcgt cctggtgttt ctgaacacct taaccatttc ctctgagcac tacaatcagc cagattggtt gacacagatt caagatattg ccaacaaagt cctcttggct ctgttcacct gcgagatgct ggtaaaaatg tacagcttgg gcctccaagc atatttcgtc tctcttttca accggtttga ttgcttcgtg gtgtgtggtg gaatcactga gacgatcctg gtggaactgg aaatcatgtc tcccctgggg atctctgtgt ttcggtgtgt gcgcctctta agaatcttca aagtgaccag gcactggact tccctgagca acttagtggc atccttatta aactccatga agtccatcgc ttcgctgttg cttctgcttt ttctcttcat tatcatcttt tccttgcttg ggatgcagct gtttggcggc aagtttaatt ttgatgaaac gcaaaccaag cggagcacct ttgacaattt ccctcaagca cttctcacag tgttccagat cctgacaggc gaagactgga atgctgtgat gtacgatggc atcatggctt acgggggccc atcctcttca ggaatgatcg tctgcatcta cttcatcatc ctcttcattt gtggtaacta tattctactg aatgtcttct tggccatcgc tgtagacaat ttggctgatg ctgaaagtct gaacactgct cagaaagaag aagcggaaga aaaggagagg aaaaagattg ccagaaaaga gagcctagaa aataaaaaga acaacaaacc agaagtcaac cagatagcca acagtgacaa caaggttaca attgatgact atagagaaga ggatgaagac aaggacccct atccgccttg cgatgtgcca gtaggggaag aggaagagga agaggaggag gatgaacctg aggttcctgc cggaccccgt cctcgaagga tctcggagtt gaacatgaag gaaaaaattg cccccatccc tgaagggagc gctttcttca ttcttagcaa gaccaacccg atccgcgtag gctgccacaa gctcatcaac caccacatct tcaccaacct catccttgtc ttcatcatgc tgagcagcgc tgccctggcc gcagaggacc ccatccgcag ccactccttc cggaacacga tactgggtta ctttgactat gccttcacag ccatctttac tgttgagatc ctgttgaaga tgacaacttt tggagctttc ctccacaaag gggccttctg caggaactac ttcaatttgc tggatatgct ggtggttggg gtgtctctgg tgtcatttgg gattcaatcc agtgccatct ccgttgtgaa gattctgagg gtcttaaggg tcctgcgtcc cctcagggcc atcaacagag caaaaggact taagcacgtg gtccagtgcg tcttcgtggc catccggacc atcggcaaca tcatgatcgt cactaccctc ctgcagttca tgtttgcctg tatcggggtc cagttgttca aggggaagtt ctatcgctgt acggatgaag ccaaaagtaa ccctgaagaa tgcaggggac ttttcatcct ctacaaggat ggggatgttg acagtcctgt ggtccgtgaa cggatctggc aaaacagtga tttcaacttc gacaacgtcc tctctgctat gatggcgctc ttcacagtct ccacgtttga gggctggcct gcgttgctgt ataaagccat cgactcgaat ggagagaaca tcggcccaat ctacaaccac cgcgtggaga tctccatctt cttcatcatc tacatcatca ttgtagcttt cttcatgatg aacatctttg tgggctttgt catcgttaca tttcaggaac aaggagaaaa agagtataag aactgtgagc tggacaaaaa tcagcgtcag tgtgttgaat acgccttgaa agcacgtccc ttgcggagat acatccccaa aaacccctac cagtacaagt tctggtacgt ggtgaactct tcgcctttcg aatacatgat gtttgtcctc atcatgctca acacactctg cttggccatg cagcactacg agcagtccaa gatgttcaat gatgccatgg acattctgaa catggtcttc accggggtgt tcaccgtcga gatggttttg aaagtcatcg catttaagcc taaggggtat tttagtgacg cctggaacac gtttgactcc ctcatcgtaa tcggcagcat tatagacgtg gccctcagcg aagcggaccc aactgaaagt gaaaatgtcc ctgtcccaac tgctacacct gggaactctg aagagagcaa tagaatctcc atcacctttt tccgtctttt ccgagtgatg cgattggtga agcttctcag caggggggaa ggcatccgga cattgctgtg gacttttatt aagtcctttc aggcgctccc gtatgtggcc ctcctcatag ccatgctgtt cttcatctat gcggtcattg gcatgcagat gtttgggaaa gttgccatga gagataacaa ccagatcaat aggaacaata acttccagac gtttccccag gcggtgctgc tgctcttcag gtgtgcaaca ggtgaggcct ggcaggagat catgctggcc tgtctcccag ggaagctctg tgaccctgag tcagattaca accccgggga ggagtataca tgtgggagca actttgccat tgtctatttc atcagttttt acatgctctg tgcatttctg atcatcaatc tgtttgtggc tgtcatcatg gataatttcg actatctgac ccgggactgg tctattttgg ggcctcacca tttagatgaa ttcaaaagaa tatggtcaga atatgaccct gaggcaaagg gaaggataaa acaccttgat gtggtcactc tgcttcgacg catccagcct cccctggggt ttgggaagtt atgtccacac agggtagcgt gcaagagatt agttgccatg aacatgcctc tcaacagtga cgggacagtc atgtttaatg caaccctgtt tgctttggtt cgaacggctc ttaagatcaa gaccgaaggg aacctggagc aagctaatga agaacttcgg gctgtgataa agaaaatttg gaagaaaacc agcatgaaat tacttgacca agttgtccct ccagctggtg atgatgaggt aaccgtgggg aagttctatg ccactttcct gatacaggac tactttagga aattcaagaa acggaaagaa caaggactgg tgggaaagta ccctgcgaag aacaccacaa ttgccctaca ggcgggatta aggacactgc atgacattgg gccagaaatc cggcgtgcta tatcgtgtga tttgcaagat gacgagcctg aggaaacaaa acgagaagaa gaagatgatg tgttcaaaag aaatggtgcc ctgcttggaa accatgtcaa tcatgttaat agtgatagga gagattccct tcagcagacc aataccaccc accgtcccct gcatgtccaa aggccttcaa ttccacctgc aagtgatact gagaaaccgc tgtttcctcc agcaggaaat tcggtgtgtc ataaccatca taaccataat tccataggaa agcaagttcc cacctcaaca aatgccaatc tcaataatgc caatatgtcc aaagctgccc atggaaagcg gcccagcatt gggaaccttg agcatgtgtc tgaaaatggg catcattctt cccacaagca tgaccgggag cctcagagaa ggtccagtgt gaaaagaacc cgctattatg aaacttacat taggtccgac tcaggagatg aacagctccc aactatttgc cgggaagacc cagagataca tggctatttc agggaccccc actgcttggg ggagcaggag tatttcagta gtgaggaatg ctacgaggat gacagctcgc ccacctggag caggcaaaac tatggctact acagcagata cccaggcaga aacatcgact ctgagaggcc ccgaggctac catcatcccc aaggattctt ggaggacgat gactcgcccg tttgctatga ttcacggaga tctccaagga gacgcctact acctcccacc ccagcatccc accggagatc ctccttcaac tttgagtgcc tgcgccggca gagcagccag gaagaggtcc cgtcgtctcc catcttcccc catcgcacgg ccctgcctct gcatctaatg cagcaacaga tcatggcagt tgccggccta gattcaagta aagcccagaa gtactcaccg agtcactcga cccggtcgtg ggccacccct ccagcaaccc ctccctaccg ggactggaca ccgtgctaca cccccctgat ccaagtggag cagtcagagg ccctggacca ggtgaacggc agcctgccgt ccctgcaccg cagctcctgg tacacagacg agcccgacat ctcctaccgg actttcacac cagccagcct gactgtcccc agcagcttcc ggaacaaaaa cagcgacaag cagaggagtg cggacagctt ggtggaggca gtcctgatat ccgaaggctt gggacgctat gcaagggacc caaaatttgt gtcagcaaca aaacacgaaa tcgctgatgc ctgtgacctc accatcgacg agatggagag tgcagccagc accctgctta atgggaacgt gcgtccccga gccaacgggg atgtgggccc cctctcacac cggcaggact atgagctaca ggactttggt cctggctaca gcgacgaaga gccagaccct gggagggatg aggaggacct ggcggatgaa atgatatgca tcaccacctt gtagccccca gcgaggggca gactggctct ggcctcaggt ggggcgcagg agagccaggg gaaaagtgcc tcatagttag gaaagtttag gcactagttg ggagtaatat tcaattaatt agacttttgt ataagagatg tcatgcctca agaaagccat aaacctggta ggaacaggtc ccaagcggtt gagcctggca gagtaccatg cgctcggccc cagctgcagg aaacagcagg ccccgccctc tcacagagga tgggtgagga ggccagacct gccctgcccc attgtccaga tgggcactgc tgtggagtct gcttctccca tgtaccaggg caccaggccc acccaactga aggcatggcg gcggggtgca ggggaaagtt aaaggtgatg acgatcatca cacctcgtgt cgttacctca gccatcggtc tagcatatca gtcactgggc ccaacatatc catttttaaa ccctttcccc caaatacact gcgtcctggt tcctgtttag ctgttctgaa ata

I.M.A.G.E. Consortium Clone ID numbers and the corresponding GenBank accession numbers of sequences identified as belonging to the I.M.A.G.E. Consortium and UniGene clusters, are listed below. Also included are sequences that are not identified as having a Clone ID number but still identified as being those of CACNA1D. The sequences include those of the "sense" and complementary strands sequences corresponding to CACNA1D. The sequence of each GenBank accession number is presented in the Sequence Listing.

TABLE-US-00006 TABLE (II) Clone ID numbers GenBank accession numbers 5676430 BM128550 5197948 BI755471 6027638 BQ549084, BQ549571 2338956 AI693324 36581 R25307, R46658 49630 H29256, H29339 4798765 BG716371 2187310 AI537488 838231 AA458692 2111614 AI393327 2183482 AI520947 1851007 AI248998 1675503 AI075844 2434923 AI869807 2434924 AI869800 1845827 AI243110 2511756 AI955764 628568 AA192669, AA192157 2019331 AI361691 2337381 AI914244 2503579 AW008769 2503626 AW008794 1160989 AA877582 1653475 AI051972 1627755 AI017959 287750 N79331, N62240 1867677 AI240933 1618303 AI015031 1881344 AI290994 1408031 AA861160 1557035 AA915941 956303 AA493341 2148234 AI467998 1499899 AA885585 1647592 AI033648 2341185 AI697633 981603 AA523647 6281678 BQ710377 6278348 BQ706920 5876024 BQ016847 6608849 CA943595 5440464 BM008196 5209489 BI769856 5183025 BI758971 880540 AA468565 757337 AA437099 6608849 CA867864 461797 AA682690 434787 AA701888 6151588 BU182632 6295618 BQ898429 6300779 BQ711800 434811 AA703120 1568025 AA978315 3220210 BE550599 3214121 BE502741 3009312 AW872382 2733394 AW444663 2872156 AW341279 30514550 CF456750 2718456 AW139850 2543682 AW029633 2492730 AI963788 2545866 AI951788 2272081 AI680744 2152336 AI601252 2146429 AI459166 1274498 AA885750 2272081 BX092736 287750 BX114568 3233645 BE672659 289209 N78509, N73668 277086 N46744, N39597 3272340 BF439267 3273859 BF436153 3568401 BF110611 None M76558, AF088004, M83566 (mRNA sequences) None CB410657, BQ372430, BQ366601, BQ324528, BQ318830, AL708030, BM509161, N85902, BQ774355, CA774243, CA436347, CA389011, BU679327, BU608029, BU073743, BE175413, AW969248, AI908115, BF754485, BI015409, BG202552, BF883669, BF817590, BF807128, BF806160, BF805244, BF805235, BF805080, T27949, BE836638, BE770685, BE769065,

In one preferred embodiment, any sequence, or unique portion thereof, of the following CACNA1D sequence, identified by AF088004 or AF088004.1, may be used in the practice of the invention.

SEQ ID NO:5 (Sequence for CACNA1D):

TABLE-US-00007 TTTTTTTTTTTTTTTTTTTTTCTTACAAAGAAAAATTTAATATTCGATGAGAGGTTGAAC CAGGCTTAAAGCAGACATACTAGGAAATGGTGCAGCCTGTAAGAATGCCAGTTTGTAAGT ACTGACTTTGGAAAAGATCATCGCCTCTATCAGACACTTAGGGTCCTGGTCTGGCAATTT TGGCCTGATGTGATGCCACAAGACCCAACAGAGAGAGACACAGAGTCCAGGATAATGTTG ACAGTGGTGTAGCCCTTTAGGAGAAATGGCGCTCCCTGCGGCTGGTATTAGGTTACCATT GGCACCGAAGGAACCAGGAGGATAAGAATATCCATAATTTCAGAGCTGCCCTGGCACAGT ACCTGCCCCGTCGGAGGCTCTCACTGGCAAATGACAGCTCTGTGCAAGGAGCACTCCCAA GTATAAAAATTATTACACAGTTTTATTCTGAAGAACATTTTGCATTTTAATAAAAAAGGA TTTATGTCAGGAAAGAGTCATTTACAAACCTTGAAGTGTTTTTGCCTGGATCAGAGTAAG AATGTCTTAAGAAGAGGTTTGTAAGGTCTTCATAACAAAGTGGTGTTTGTTATTTACAAA AAAAAAAAAAAAAAAAATTAACAGGTTGTCTGTATACTATTAAAAATTTTGGACCAAAAA AAAAAAAAAAAAAAA

In another set of preferred embodiments of the invention, any sequence, or unique portion thereof, of the HOXB13 sequences of the I.M.A.G.E. Consortium cluster NM_006361, as well as the UniGene Homo sapiens cluster Hs.66731, may be used. Similarly, any sequence encoding all or a part of the protein encoded by any HOXB13 sequence disclosed herein may be used. The consensus sequence of the I.M.A.G.E. Consortium cluster is as follows, with the assigned coding region (ending with a termination codon) underlined and preceded by the 5' untranslated and/or non-coding region and followed by the 3' untranslated and/or non-coding region:

SEQ ID NO:6 (Consensus Sequence for HOXB13, Identified as NM_006361 or NM_006361.2)

TABLE-US-00008 cgaatgcagg cgacttgcga gctgggagcg atttaaaacg ctttggattc ccccggcctg ggtggggaga gcgagctggg tgccccctag attccccgcc cccgcacctc atgagccgac cctcggctcc atggagcccg gcaattatgc caccttggat ggagccaagg atatcgaagg cttgctggga gcgggagggg ggcggaatct ggtcgcccac tcccctctga ccagccaccc agcggcgcct acgctgatgc ctgctgtcaa ctatgccccc ttggatctgc caggctcggc ggagccgcca aagcaatgcc acccatgccc tggggtgccc caggggacgt ccccagctcc cgtgccttat ggttactttg gaggcgggta ctactcctgc cgagtgtccc ggagctcgct gaaaccctgt gcccaggcag ccaccctggc cgcgtacccc gcggagactc ccacggccgg ggaagagtac cccagtcgcc ccactgagtt tgccttctat ccgggatatc cgggaaccta ccacgctatg gccagttacc tggacgtgtc tgtggtgcag actctgggtg ctcctggaga accgcgacat gactccctgt tgcctgtgga cagttaccag tcttgggctc tcgctggtgg ctggaacagc cagatgtgtt gccagggaga acagaaccca ccaggtccct tttggaaggc agcatttgca gactccagcg ggcagcaccc tcctgacgcc tgcgcctttc gtcgcggccg caagaaacgc attccgtaca gcaaggggca gttgcgggag ctggagcggg agtatgcggc taacaagttc atcaccaagg acaagaggcg caagatctcg gcagccacca gcctctcgga gcgccagatt accatctggt ttcagaaccg ccgggtcaaa gagaagaagg ttctcgccaa ggtgaagaac agcgctaccc cttaagagat ctccttgcct gggtgggagg agcgaaagtg ggggtgtcct ggggagacca gaaacctgcc aagcccaggc tggggccaag gactctgctg agaggcccct agagacaaca cccttcccag gccactggct gctggactgt tcctcaggag cggcctgggt acccagtatg tgcagggaga cggaacccca tgtgacaggc ccactccacc agggttccca aagaacctgg cccagtcata atcattcatc ctcacagtgg caataatcac gataaccagt

I.M.A.G.E. Consortium Clone ID numbers and the corresponding GenBank accession numbers of sequences identified as belonging to the I.M.A.G.E. Consortium and UniGene clusters, are listed below. Also included are sequences that are not identified as having a Clone ID number but still identified as being those of HOXB13. The sequences include those of the "sense" and complementary strands sequences corresponding to HOXB13. The sequence of each GenBank accession number is presented in the Sequence Listing.

TABLE-US-00009 TABLE (III) Clone ID numbers GenBank accession numbers 4250486 BF676461, BC007092 5518335 BM462617 4874541 BG752489 4806039 BG778198 3272315 CB050884, CB050885 4356740 BF965191 6668163 BU930208 1218366 AA807966 2437746 AI884491 1187697 AA652388 3647557 BF446158 1207949 AA657924 1047774 AA644637 3649397 BF222357 971664 AA527613 996191 AA533227 813481 AA456069, AA455572, BX117624 6256333 BQ673782 2408470 AI814453 2114743 AI417272 998548 AA535663 2116027 AI400493 3040843 AW779219 1101311 AA594847 1752062 AI150430 898712 AA494387 1218874 AA662643 2460189 AI935940 986283 AA532530 1435135 AA857572 1871750 AI261980 3915135 BE888751 2069668 AI378797 667188 AA234220, AA236353 1101561 AA588193 1170268 AI821103, AI821851, AA635855 2095067 AI420753 4432770 BG180547 783296 AA468306, AA468232 3271646 CB050115, CB050116 1219276 AA661819 30570598 CF146837 30570517 CF146763 30568921 CF144902 3099071 CF141511 3096992 CF139563 3096870 CF139372 3096623 CF139319 3096798 CF139275 30572408 CF122893 2490082 AI972423 2251055 AI918975 2419308 AI826991 2249105 AI686312 2243362 AI655923 30570697 CF146922 3255712 BF476369 3478356 BF057410 3287977 BE645544 3287746 BE645408 3621499 BE388501 30571128 CF147366 30570954 CF147143 None BT007410, BC007092, U57052, U81599 (mRNA sequences) None CB120119, CB125764, AU098628, CB126130, BI023924, BM767063, BM794275, BQ363211, BM932052, AA357646, AW609525, CB126919, AW609336, AW609244, BF855145, AU126914, CB126449, AW582404, BX641644

In one preferred embodiment, any sequence, or unique portion thereof, of the following HOXB13 sequence, identified by BC007092 or BC007092.1, may be used in the practice of the invention.

SEQ ID NO:7 (Sequence for HOXB13):

TABLE-US-00010 GGATTCCCCCGGCCTGGGTGGGGAGAGCGAGCTGGGTGCCCCCTAGATTCCCCGCCCCCG CACCTCATGAGCCGACCCTCGGCTCCATGGAGCCCGGCAATTATGCCACCTTGGATGGAG CCAAGGATATCGAAGGCTTGCTGGGAGCGGGAGGGGGGCGGAATCTGGTCGCCCACTCCC CTCTGACCAGCCACCCAGCGGCGCCTACGCTGATGCCTGCTGTCAACTATGCCCCCTTGG ATCTGCCAGGCTCGGCGGAGCCGCCAAAGCAATGCCACCCATGCCCTGGGGTGCCCCAGG GGACGTCCCCAGCTCCCGTGCCTTATGGTTACTTTGGAGGCGGGTACTACTCCTGCCGAG TGTCCCGGAGCTCGCTGAAACCCTGTGCCCAGGCAGCCACCCTGGCCGCGTACCCCGCGG AGACTCCCACGGCCGGGGAAGAGTACCCCAGCCGCCCCACTGAGTTTGCCTTCTATCCGG GATATCCGGGAACCTACCAGCCTATGGCCAGTTACCTGGACGTGTCTGTGGTGCAGACTC TGGGTGCTCCTGGAGAACCGCGACATGACTCCCTGTTGCCTGTGGACAGTTACCAGTCTT GGGCTCTCGCTGGTGGCTGGAACAGCCAGATGTGTTGCCAGGGAGAACAGAACCCACCAG GTCCCTTTTGGAAGGCAGCATTTGCAGACTCCAGCGGGCAGCACCCTCCTGACGCCTGCG CCTTTCGTCGCGGCCGCAAGAAACGCATTCCGTACAGCAAGGGGCAGTTGCGGGAGCTGG AGCGGGAGTATGCGGCTAACAAGTTCATCACCAAGGACAAGAGGCGCAAGATCTCGGCAG CCACCAGCCTCTCGGAGCGCCAGATTACCATCTGGTTTCAGAACCGCCGGGTCAAAGAGA AGAAGGTTCTCGCCAAGGTGAAGAACAGCGCTACCCCTTAAGAGATCTCCTTGCCTGGGT GGGAGGAGCGAAAGTGGGGGTGTCCTGGGGAGACCAGGAACCTGCCAAGCCCAGGCTGGG GCCAAGGACTCTGCTGAGAGGCCCCTAGAGACAACACCCTTCCCAGGCCACTGGCTGCTG GACTGTTCCTCAGGAGCGGCCTGGGTACCCAGTATGTGCAGGGAGACGGAACCCCATGTG ACAGCCCACTCCACCAGGGTTCCCAAAGAACCTGGCCCAGTCATAATCATTCATCCTGAC AGTGGCAATAATCACGATAACCAGTACTAGCTGCCATGATCGTTAGCCTCATATTTTCTA TCTAGAGCTCTGTAGAGCACTTTAGAAACCGCTTTCATGAATTGAGCTAATTATGAATAA ATTTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

Sequences identified by SEQ ID NO. are provided using conventional representations of a DNA strand starting from the 5' phosphate linked end to the 3' hydroxyl linked end. The assignment of coding regions is generally by comparison to available consensus sequence(s) and therefore may contain inconsistencies relative to other sequences assigned to the same cluster. These have no effect on the practice of the invention because the invention can be practiced by use of shorter segments (or combinations thereof) of sequences unique to each of the three sets described above and not affected by inconsistencies. As non-limiting examples, a segment of IL17BR, CACNA1D, or HOXB13 nucleic acid sequence composed of a 3' untranslated region sequence and/or a sequence from the 3' end of the coding region may be used as a probe for the detection of IL17BR, CACNA1D, or HOXB13 expression, respectively, without being affected by the presence of any inconsistency in the coding regions due to differences between sequences. Similarly, the use of an antibody which specifically recognizes IL17BR, CACNA1D, or HOXB13 protein to detect its expression would not be affected by the presence of any inconsistency in the representation of the coding regions provided above.

As will be appreciated by those skilled in the art, some of the above sequences include 3' poly A (or poly T on the complementary strand) stretches that do not contribute to the uniqueness of the disclosed sequences. The invention may thus be practiced with sequences lacking the 3' poly A (or poly T) stretches. The uniqueness of the disclosed sequences refers to the portions or entireties of the sequences which are found only in IL17BR, CACNA1D, or HOXB13 nucleic acids, including unique sequences found at the 3' untranslated portion of the genes. Preferred unique sequences for the practice of the invention are those which contribute to the consensus sequences for each of the three sets such that the unique sequences will be useful in detecting expression in a variety of individuals rather than being specific for a polymorphism present in some individuals. Alternatively, sequences unique to an individual or a subpopulation may be used. The preferred unique sequences are preferably of the lengths of polynucleotides of the invention as discussed herein.

To determine the (increased or decreased) expression levels of the above described sequences in the practice of the present invention, any method known in the art may be utilized. In one preferred embodiment of the invention, expression based on detection of RNA which hybridizes to polynucleotides containing the above described sequences is used. This is readily performed by any RNA detection or amplification+detection method known or recognized as equivalent in the art such as, but not limited to, reverse transcription-PCR (optionally real-time PCR), the methods disclosed in U.S. patent application Ser. No. 10/062,857 entitled "Nucleic Acid Amplification" filed on Oct. 25, 2001 as well as U.S. Provisional Patent Application 60/298,847 (filed Jun. 15, 2001) and 60/257,801 (filed Dec. 22, 2000), the methods disclosed in U.S. Pat. No. 6,291,170, and quantitative PCR. Methods to identify increased RNA stability (resulting in an observation of increased expression) or decreased RNA stability (resulting in an observation of decreased expression) may also be used. These methods include the detection of sequences that increase or decrease the stability of mRNAs containing the IL17BR, CACNA1D, or HOXB13 sequences disclosed herein. These methods also include the detection of increased mRNA degradation.

In particularly preferred embodiments of the invention, polynucleotides having sequences present in the 3' untranslated and/or non-coding regions of the above disclosed sequences are used to detect expression or non-expression of IL17BR, CACNA1D, or HOXB13 sequences in breast cells in the practice of the invention. Such polynucleotides may optionally contain sequences found in the 3' portions of the coding regions of the above disclosed sequences. Polynucleotides containing a combination of sequences from the coding and 3' non-coding regions preferably have the sequences arranged contiguously, with no intervening heterologous sequence(s).

Alternatively, the invention may be practiced with polynucleotides having sequences present in the 5' untranslated and/or non-coding regions of IL17BR, CACNA1D, or HOXB13 sequences in breast cells to detect their levels of expression. Such polynucleotides may optionally contain sequences found in the 5' portions of the coding regions. Polynucleotides containing a combination of sequences from the coding and 5' non-coding regions preferably have the sequences arranged contiguously, with no intervening heterologous sequence(s). The invention may also be practiced with sequences present in the coding regions of IL17BR, CACNA1D, or HOXB13.

Preferred polynucleotides contain sequences from 3' or 5' untranslated and/or non-coding regions of at least about 16, at least about 18, at least about 20, at least about 22, at least about 24, at least about 26, at least about 28, at least about 30, at least about 32, at least about 34, at least about 36, at least about 38, at least about 40, at least about 42, at least about 44, or at least about 46 consecutive nucleotides. The term "about" as used in the previous sentence refers to an increase or decrease of 1 from the stated numerical value. Even more preferred are polynucleotides containing sequences of at least or about 50, at least or about 100, at least about or 150, at least or about 200, at least or about 250, at least or about 300, at least or about 350, or at least or about 400 consecutive nucleotides. The term "about" as used in the preceding sentence refers to an increase or decrease of 10% from the stated numerical value.

Sequences from the 3' or 5' end of the above described coding regions as found in polynucleotides of the invention are of the same lengths as those described above, except that they would naturally be limited by the length of the coding region. The 3' end of a coding region may include sequences up to the 3' half of the coding region. Conversely, the 5' end of a coding region may include sequences up the 5' half of the coding region. Of course the above described sequences, or the coding regions and polynucleotides containing portions thereof, may be used in their entireties.

Polynucleotides combining the sequences from a 3' untranslated and/or non-coding region and the associated 3' end of the coding region are preferably at least or about 100, at least about or 150, at least or about 200, at least or about 250, at least or about 300, at least or about 350, or at least or about 400 consecutive nucleotides. Preferably, the polynucleotides used are from the 3' end of the gene, such as within about 350, about 300, about 250, about 200, about 150, about 100, or about 50 nucleotides from the polyadenylation signal or polyadenylation site of a gene or expressed sequence. Polynucleotides containing mutations relative to the sequences of the disclosed genes may also be used so long as the presence of the mutations still allows hybridization to produce a detectable signal.

In another embodiment of the invention, polynucleotides containing deletions of nucleotides from the 5' and/or 3' end of the above disclosed sequences may be used. The deletions are preferably of 1-5, 5-10, 10-15, 15-20, 20-25, 25-30, 30-35, 35-40, 40-45, 45-50, 50-60, 60-70, 70-80, 80-90, 90-100, 100-125, 125-150, 150-175, or 175-200 nucleotides from the 5' and/or 3' end, although the extent of the deletions would naturally be limited by the length of the disclosed sequences and the need to be able to use the polynucleotides for the detection of expression levels.

Other polynucleotides of the invention from the 3' end of the above disclosed sequences include those of primers and optional probes for quantitative PCR. Preferably, the primers and probes are those which amplify a region less than about 350, less than about 300, less than about 250, less than about 200, less than about 150, less than about 100, or less than about 50 nucleotides from the from the polyadenylation signal or polyadenylation site of a gene or expressed sequence.

In yet another embodiment of the invention, polynucleotides containing portions of the above disclosed sequences including the 3' end may be used in the practice of the invention. Such polynucleotides would contain at least or about 50, at least or about 100, at least about or 150, at least or about 200, at least or about 250, at least or about 300, at least or about 350, or at least or about 400 consecutive nucleotides from the 3' end of the disclosed sequences.

The invention thus also includes polynucleotides used to detect IL17BR, CACNA1D, or HOXB13 expression in breast cells. The polynucleotides may comprise a shorter polynucleotide consisting of sequences found in the above provided SEQ ID NOS in combination with heterologous sequences not naturally found in combination with IL17BR, CACNA1D, or HOXB13 sequences.

As non-limiting examples, a polynucleotide comprising one of the following sequences may be used in the practice of the invention.

TABLE-US-00011 SEQ ID NO: 8: CAATTACAGGGAAAAAACGTGTGATGATCCTGAAGCTTACTATGCAGCCTACAAACAGCC SEQ ID NO: 9: GCTCTCACTGGCAAATGACAGCTCTGTGCAAGGAGCACTCCCAAGTATAAAAATTATTAC SEQ ID NO: 10: GATCGTTAGCCTCATATTTTCTATCTAGAGCTCTGTAGAGCACTTTAGAAACCGCTTTCA

Stated differently, the invention may be practiced with a polynucleotide consisting of the sequence of SEQ ID NOS:8, 9 or 10 in combination with one or more heterologous sequences that are not normally found with SEQ ID NOS:8, 9 or 10. Alternatively, the invention may also be practiced with a polynucleotide consisting of the sequence of SEQ ID NOS:8, 9 or 10 in combination with one or more naturally occurring sequences that are normally found with SEQ ID NOS:8, 9 or 10.

Polynucleotides with sequences comprising SEQ ID NOS:8 or 9, either naturally occurring or synthetic, may be used to detect nucleic acids which are over expressed in breast cancer cells that are responsive, and those which are not over expressed in breast cancer cells that are non-responsive, to treatment with TAM or another "antiestrogen" agent against breast cancer. Polynucleotides with sequences comprising SEQ ID NO:10, either naturally occurring or synthetic, may be used to detect nucleic acids which are under expressed in breast cancer cells that are responsive, and those which are not under expressed in breast cancer cells that are non-responsive, to treatment with TAM or another "antiestrogen" agent against breast cancer.

Additional sequences that may be used in polynucleotides as described above for SEQ ID NOS:8 and 9 are the following, wherein SEQ ID NOs:33 is complementary to a portion of IL17BR sequences disclosed herein:

TABLE-US-00012 SEQ ID NO: 11: TGCCTAATTTCACTCTCAGAGTGAGGCAGGTAACTGGGGCTCCACTGGGTCACTCTGAGA SEQ ID NO: 12: TTGGAAGCAGAGTCCCTCTAAAGGTAACTCTTGTGGTCACTCAATATTGTATTGGCATTT SEQ ID NO: 13: ACGTTAGACTTTTGCTGGCATTCAAGTCATGGCTAGTCTGTGTATTTAATAAATGTGTGT SEQ ID NO: 14: CTGGTCAGCCACTCTGACTTTTCTACCACATTAAATTCTCCATTACATCTCACTATTGGT SEQ ID NO: 15: TACAACTTCTGAATGCTGCACATTCTTCCAAAATGATCCTTAGCACAATCTATTGTATGA SEQ ID NO: 16: GGGATGGCCTTTAGGCCACAGTAGTGTCTGTGTTAAGTTCACTAAATGTGTATTTAATGA SEQ ID NO: 17: CTCAAAGTGCTAAAGCTATGGTTGACTGCTCTGGTGTTTTTATATTCATTCGTGCTTTAG SEQ ID NO: 32: CTGAAGCTTACTATGCAGCCTACAA SEQ ID NO: 33: TCCAATCGTTAGTTAATGCTACATTAGTT SEQ ID NO: 34: CAGCCTTAGTAATTAAAAC

Additional sequences that may be used in polynucleotides as described above for SEQ ID NO:10 are the following, wherein SEQ ID NOs:36 is complementary to a portion of IL17BR sequences disclosed herein:

TABLE-US-00013 SEQ ID NO: 18: CTATGGGGATGGTCCACTGTCACTGTTTCTCTGCTGTTGCAAATACATGGATAACACATT SEQ ID NO: 19: ACTGGAAAAGCAGATGGTCTGACTGTGCTATGGCCTCATCATCAAGACTTTCAATCCTAT SEQ ID NO: 20: ACGCCAAGCTCTTCAGTGAAGACACGATGTTATTAAAAGCCTGTTTTAGGGACTGCAAAA SEQ ID NO: 21: TTTTTGTAAAATCTTTAACCTTCCCTTTGTTCTTCATGTACACGCTGAACTGCAATTCTT SEQ ID NO: 22: AACCTGGGGCATTTAGGGCAGAGGACAAAAGGATGTCAGCAATTGCTTGGGCTGCTTGGC SEQ ID NO: 23: CTGGAACCTCTGGACTCCCCATGCTCTAACTCCCACACTCTGCTATCAGAAACTTAAACT SEQ ID NO: 24: AACCCCAGAACCATCTAAGACATGGGATTCAGTGATCATGTGGTTCTCCTTTTAACTTAC SEQ ID NO: 25: GGCCATGTGCCATGGTATTTGGGTCCTGGGAGGGTGGGTGAAATAAAGGCATACTGTCTT SEQ ID NO: 26: GTGTAGGCAGTCATGGCACCAAAGCCACCAGACTGACAAATGTGTATCAGATGCTTTTGT SEQ ID NO: 27: GAAAACCTCTTCAAAAGACAAAAAGCTGGCACTGCATTCTCTCTCTGTAGCAGGACAGAA SEQ ID NO: 28: CACATCTTTAGGGTCAGTGAACAATGGGGCACATTTGGCACTAGCTTGAGCCCAACTCTG SEQ ID NO: 29: GCCTTAATTTCCTCATCTGAAAACTGGAAGGCCTGACTTGACTTGTTGAGCTTAAGATCC SEQ ID NO: 30: CTTCAGGGGAGGATCAAGCTTTGAACCAAAGCCAATCACTGGCTTGATTTGTGTTTTTTA SEQ ID NO: 31: ACAAGTTTTCACTGAATGAGCATGGCAGTGCCACTCAAGAAAATGAATCTCCAAAGTATC SEQ ID NO: 35: GCCATGATCGTTAGCCTCATATT SEQ ID NO: 36: CAATTCATGAAAGCGGTTTCTAAAG SEQ ID NO: 37: TCTATCTAGAGCTCTGTAGAGC

Additionally, polynucleotides containing other sequences, particularly unique sequences, present in naturally occurring nucleic acid molecules comprising SEQ ID NOS:8-37 may be used in the practice of the invention.

Other polynucleotides for use in the practice of the invention include those that have sufficient homology to those described above to detect expression by use of hybridization techniques. Such polynucleotides preferably have about or 95%, about or 96%, about or 97%, about or 98%, or about or 99% identity with IL17BR, CACNA1D, or HOXB13 sequences as described herein. Identity is determined using the BLAST algorithm, as described above. The other polynucleotides for use in the practice of the invention may also be described on the basis of the ability to hybridize to polynucleotides of the invention under stringent conditions of about 30% v/v to about 50% formamide and from about 0.01M to about 0.15M salt for hybridization and from about 0.01M to about 0.15M salt for wash conditions at about 55 to about 65.degree. C. or higher, or conditions equivalent thereto.

In a further embodiment of the invention, a population of single stranded nucleic acid molecules comprising one or both strands of a human IL17BR or CACNA1D sequence is provided as a probe such that at least a portion of said population may be hybridized to one or both strands of a nucleic acid molecule quantitatively amplified from RNA of a breast cancer cell. The population may be only the antisense strand of a human IL17BR or CACNA1D sequence such that a sense strand of a molecule from, or amplified from, a breast cancer cell may be hybridized to a portion of said population. The population preferably comprises a sufficiently excess amount of said one or both strands of a human IL17BR or CACNA1D sequence in comparison to the amount of expressed (or amplified) nucleic acid molecules containing a complementary IL17BR or CACNA1D sequence from a normal breast cell. This condition of excess permits the increased amount of nucleic acid expression in a breast cancer cell to be readily detectable as an increase.

Alternatively, the population of single stranded molecules is equal to or in excess of all of one or both strands of the nucleic acid molecules amplified from a breast cancer cell such that the population is sufficient to hybridize to all of one or both strands. Preferred cells are those of a breast cancer patient that is ER+ or for whom treatment with tamoxifen or one or more other "antiestrogen" agent against breast cancer is contemplated. The single stranded molecules may of course be the denatured form of any IL17BR and/or CACNA1D sequence containing double stranded nucleic acid molecule or polynucleotide as described herein.

The population may also be described as being hybridized to IL17BR or CACNA1D sequence containing nucleic acid molecules at a level of at least twice as much as that by nucleic acid molecules of a normal breast cell. As in the embodiments described above, the nucleic acid molecules may be those quantitatively amplified from a breast cancer cell such that they reflect the amount of expression in said cell.

The population is preferably immobilized on a solid support, optionally in the form of a location on a microarray. A portion of the population is preferably hybridized to nucleic acid molecules quantitatively amplified from a non-normal or abnormal breast cell by RNA amplification. The amplified RNA may be that derived from a breast cancer cell, as long as the amplification used was quantitative with respect to IL17BR or CACNA1D containing sequences.

In another embodiment of the invention, expression based on detection of DNA status may be used. Detection of the HOXB13 DNA as methylated, deleted or otherwise inactivated, may be used as an indication of decreased expression as found in non-normal breast cells. This may be readily performed by PCR based methods known in the art. The status of the promoter regions of HOXB13 may also be assayed as an indication of decreased expression of HOXB13 sequences. A non-limiting example is the methylation status of sequences found in the promoter region.

Conversely, detection of the DNA of a sequence as amplified may be used for as an indication of increased expression as found in non-normal breast cells. This may be readily performed by PCR based, fluorescent in situ hybridization (FISH) and chromosome in situ hybridization (CISH) methods known in the art.

A preferred embodiment using a nucleic acid based assay to determine expression is by immobilization of one or more of the sequences identified herein on a solid support, including, but not limited to, a solid substrate as an array or to beads or bead based technology as known in the art. Alternatively, solution based expression assays known in the art may also be used. The immobilized sequence(s) may be in the form of polynucleotides as described herein such that the polynucleotide would be capable of hybridizing to a DNA or RNA corresponding to the sequence(s).

The immobilized polynucleotide(s) may be used to determine the state of nucleic acid samples prepared from sample breast cancer cell(s), optionally as part of a method to detect ER status in said cell(s). Without limiting the invention, such a cell may be from a patient suspected of being afflicted with, or at risk of developing, breast cancer. The immobilized polynucleotide(s) need only be sufficient to specifically hybridize to the corresponding nucleic acid molecules derived from the sample (and to the exclusion of detectable or significant hybridization to other nucleic acid molecules).

In yet another embodiment of the invention, a ratio of the expression levels of two of the disclosed genes may be used to predict response to treatment with TAM or another SERM. Preferably, the ratio is that of two genes with opposing patterns of expression, such as an underexpressed gene to an overexpressed gene, in correlation to the same phenotype. Non-limiting examples include the ratio of HOXB13 over IL17BR or the ratio of HOXB13 over CACNA1D. This aspect of the invention is based in part on the observation that such a ratio has a stronger correlation with TAM treatment outcome than the expression level of either gene alone. For example, the ratio of HOXB13 over IL17BR has an observed classification accuracy of 77%.

As a non-limiting example, the Ct values from Q-PCR based detection of gene expression levels may be used to derive a ratio to predict the response to treatment with one or more "antiestrogen" agent against breast cancer.

Additional Embodiments of the Invention

In embodiments where only one or a few genes are to be analyzed, the nucleic acid derived from the sample breast cancer cell(s) may be preferentially amplified by use of appropriate primers such that only the genes to be analyzed are amplified to reduce contaminating background signals from other genes expressed in the breast cell. Alternatively, and where multiple genes are to be analyzed or where very few cells (or one cell) is used, the nucleic acid from the sample may be globally amplified before hybridization to the immobilized polynucleotides. Of course RNA, or the cDNA counterpart thereof may be directly labeled and used, without amplification, by methods known in the art.

Sequence expression based on detection of a presence, increase, or decrease in protein levels or activity may also be used. Detection may be performed by any immunohistochemistry (IHC) based, bodily fluid based (where a IL17BR, CACNA1D, and/or HOXB13 polypeptide is found in a bodily fluid, such as but not limited to blood), antibody (including autoantibodies against the protein where present) based, ex foliate cell (from the cancer) based, mass spectroscopy based, and image (including used of labeled ligand where available) based method known in the art and recognized as appropriate for the detection of the protein. Antibody and image based methods are additionally useful for the localization of tumors after determination of cancer by use of cells obtained by a non-invasive procedure (such as ductal lavage or fine needle aspiration), where the source of the cancerous cells is not known. A labeled antibody or ligand may be used to localize the carcinoma(s) within a patient.

Antibodies for use in such methods of detection include polyclonal antibodies, optionally isolated from naturally occurring sources where available, and monoclonal antibodies, including those prepared by use of IL17BR, CACNA1D, and/or HOXB13 polypeptides as antigens. Such antibodies, as well as fragments thereof (including but not limited to F.sub.ab fragments) function to detect or diagnose non-normal or cancerous breast cells by virtue of their ability to specifically bind IL17BR, CACNA1D, or HOXB13 polypeptides to the exclusion of other polypeptides to produce a detectable signal. Recombinant, synthetic, and hybrid antibodies with the same ability may also be used in the practice of the invention. Antibodies may be readily generated by immunization with a IL17BR, CACNA1D, or HOXB13 polypeptide, and polyclonal sera may also be used in the practice of the invention.

Antibody based detection methods are well known in the art and include sandwich and ELISA assays as well as Western blot and flow cytometry based assays as non-limiting examples. Samples for analysis in such methods include any that contain IL17BR, CACNA1D, or HOXB13 polypeptides. Non-limiting examples include those containing breast cells and cell contents as well as bodily fluids (including blood, serum, saliva, lymphatic fluid, as well as mucosal and other cellular secretions as non-limiting examples) that contain the polypeptides.

The above assay embodiments may be used in a number of different ways to identify or detect the response to treatment with TAM or another "antiestrogen" agent against breast cancer based on gene expression in a breast cancer cell sample from a patient. In some cases, this would reflect a secondary screen for the patient, who may have already undergone mammography or physical exam as a primary screen. If positive from the primary screen, the subsequent needle biopsy, ductal lavage, fine needle aspiration, or other analogous minimally invasive method may provide the sample for use in the assay embodiments before, simultaneous with, or after assaying for ER status. The present invention is particularly useful in combination with non-invasive protocols, such as ductal lavage or fine needle aspiration, to prepare a breast cell sample.

The present invention provides a more objective set of criteria, in the form of gene expression profiles of a discrete set of genes, to discriminate (or delineate) between breast cancer outcomes. In particularly preferred embodiments of the invention, the assays are used to discriminate between good and poor outcomes after treatment with tamoxifen or another "antiestrogen" agent against breast cancer. Comparisons that discriminate between outcomes after about 10, about 20, about 30, about 40, about 50, about 60, about 70, about 80, about 90, about 100, or about 150 months may be performed.

While good and poor survival outcomes may be defined relatively in comparison to each other, a "good" outcome may be viewed as a better than 50% survival rate after about 60 months post surgical intervention to remove breast cancer tumor(s). A "good" outcome may also be a better than about 60%, about 70%, about 80% or about 90% survival rate after about 60 months post surgical intervention. A "poor" outcome may be viewed as a 50% or less survival rate after about 60 months post surgical intervention to remove breast cancer tumor(s). A "poor" outcome may also be about a 70% or less survival rate after about 40 months, or about a 80% or less survival rate after about 20 months, post surgical intervention.

In another embodiment of the invention based on the expression of a few genes, the isolation and analysis of a breast cancer cell sample may be performed as follows: (1) Ductal lavage or other non-invasive procedure is performed on a patient to obtain a sample. (2) Sample is prepared and coated onto a microscope slide. Note that ductal lavage results in clusters of cells that are cytologically examined as stated above. (3) Pathologist or image analysis software scans the sample for the presence of atypical cells. (4) If atypical cells are observed, those cells are harvested (e.g. by microdissection such as LCM). (5) RNA is extracted from the harvested cells. (6) RNA is assayed, directly or after conversion to cDNA or amplification therefrom, for the expression of IL17BR, CACNA1D, and/or HOXB13 sequences.

With use of the present invention, skilled physicians may prescribe or withhold treatment with TAM or another "antiestrogen" agent against breast cancer based on prognosis determined via practice of the instant invention.

The above discussion is also applicable where a palpable lesion is detected followed by fine needle aspiration or needle biopsy of cells from the breast. The cells are plated and reviewed by a pathologist or automated imaging system which selects cells for analysis as described above.

The present invention may also be used, however, with solid tissue biopsies, including those stored as an FFPE specimen. For example, a solid biopsy may be collected and prepared for visualization followed by determination of expression of one or more genes identified herein to determine the breast cancer outcome. As another non-limiting example, a solid biopsy may be collected and prepared for visualization followed by determination of HOXB13, IL17BR and/or CACNA1D expression. One preferred means is by use of in situ hybridization with polynucleotide or protein identifying probe(s) for assaying expression of said gene(s).

In an alternative method, the solid tissue biopsy may be used to extract molecules followed by analysis for expression of one or more gene(s). This provides the possibility of leaving out the need for visualization and collection of only cancer cells or cells suspected of being cancerous. This method may of course be modified such that only cells that have been positively selected are collected and used to extract molecules for analysis. This would require visualization and selection as a prerequisite to gene expression analysis. In the case of an FFPE sample, cells may be obtained followed by RNA extraction, amplification and detection as described herein.

In a further alternative to all of the above, the sequence(s) identified herein may be used as part of a simple PCR or array based assay simply to determine the response to treatment with TAM or another "antiestrogen" agent against breast cancer by use of a sample from a non-invasive or minimally invasive sampling procedure. The detection of sequence expression from samples may be by use of a single microarray able to assay expression of the disclosed sequences as well as other sequences, including sequences known not to vary in expression levels between normal and non-normal breast cells, for convenience and improved accuracy.

Other uses of the present invention include providing the ability to identify breast cancer cell samples as having different responses to treatment with TAM or another "antiestrogen" agent against breast cancer for further research or study. This provides an advance based on objective genetic/molecular criteria.

The genes identified herein also may be used to generate a model capable of predicting the breast cancer survival and recurrence outcomes of an ER+ breast cell sample based on the expression of the identified genes in the sample. Such a model may be generated by any of the algorithms described herein or otherwise known in the art as well as those recognized as equivalent in the art using gene(s) (and subsets thereof) disclosed herein for the identification of breast cancer outcomes. The model provides a means for comparing expression profiles of gene(s) of the subset from the sample against the profiles of reference data used to build the model. The model can compare the sample profile against each of the reference profiles or against a model defining delineations made based upon the reference profiles. Additionally, relative values from the sample profile may be used in comparison with the model or reference profiles.

In a preferred embodiment of the invention, breast cell samples identified as normal and cancerous from the same subject may be analyzed, optionally by use of a single microarray, for their expression profiles of the genes used to generate the model. This provides an advantageous means of identifying survival and recurrence outcomes based on relative differences from the expression profile of the normal sample. These differences can then be used in comparison to differences between normal and individual cancerous reference data which was also used to generate the model.

Articles of Manufacture

The materials and methods of the present invention are ideally suited for preparation of kits produced in accordance with well known procedures. The invention thus provides kits comprising agents (like the polynucleotides and/or antibodies described herein as non-limiting examples) for the detection of expression of the disclosed sequences. Such kits, optionally comprising the agent with an identifying description or label or instructions relating to their use in the methods of the present invention, are provided. Such a kit may comprise containers, each with one or more of the various reagents (typically in concentrated form) utilized in the methods, including, for example, pre-fabricated microarrays, buffers, the appropriate nucleotide triphosphates (e.g., dATP, dCTP, dGTP and dTTP; or rATP, rCTP, rGTP and UTP), reverse transcriptase, DNA polymerase, RNA polymerase, and one or more primer complexes of the present invention (e.g., appropriate length poly(T) or random primers linked to a promoter reactive with the RNA polymerase). A set of instructions will also typically be included.

The methods provided by the present invention may also be automated in whole or in part. All aspects of the present invention may also be practiced such that they consist essentially of a subset of the disclosed genes to the exclusion of material irrelevant to the identification of breast cancer survival outcomes via a cell containing sample.

Having now generally described the invention, the same will be more readily understood through reference to the following examples which are provided by way of illustration, and are not intended to be limiting of the present invention, unless specified.

EXAMPLES

Example 1

General Methods

Patient and Tumor Selection Criteria and Study Design

Patient inclusion criteria for this study were: Women diagnosed at the Massachusetts General Hospital (MGH) between 1987 and 2000 with ER positive breast cancer, treatment with standard breast surgery (modified radical mastectomy or lumpectomy) and radiation followed by five years of systemic adjuvant tamoxifen; no patient received chemotherapy prior to recurrence. Clinical and follow-up data were derived from the MGH tumor registry. There were no missing registry data and all available medical records were reviewed as a second tier of data confirmation.

All tumor specimens collected at the time of initial diagnosis were obtained from frozen and formalin fixed paraffin-embedded (FFPE) tissue repositories at the Massachusetts General Hospital. Tumor samples with greater than 20% tumor cells were selected with a median of greater than 75% for all samples. Each sample was evaluated for the following features: tumor type (ductal vs. lobular), tumor size, and Nottingham combined histological grade. Estrogen and progesterone receptor expression were determined by biochemical hormone binding analysis and/or by immunohistochemical staining as described (Long, A. A. et al. "High-specificity in-situ hybridization. Methods and application." Diagn Mol Pathol 1, 45-57 (1992)); receptor positivity was defined as greater than 3 fmol/mg tumor tissue (Long et al.) and greater than 1% nuclear staining for the biochemical and immunohistochemical assays, respectively.

Study design is as follows: A training set of 60 frozen breast cancer specimens was selected to identify gene expression signatures predictive of outcome or response, in the setting of adjuvant tamoxifen therapy. Tumors from responders were matched to the non-responders with respect to TNM staging and tumor grade. Differential gene expression identified in the training set was validated in an independent group of 20 invasive breast tumors with formalin fixed paraffin-embedded (FFPE) tissue samples.

LCM, RNA Isolation and Amplification

With each frozen tumor sample within the 60-case cohort, RNA was isolated from both a whole tissue section of 8 .mu.m in thickness and a highly enriched population of 4,000-5,000 malignant epithelial cells acquired by laser capture microdissection using a PixCell IIe LCM system (Arcturus, Mountain View, Calif.). From each tumor sample within the 20-case test set, RNA was isolated from four 8 .mu.m-thick FFPE tissue sections. Isolated RNA was subjected to one round of T7 polymerase in vitro transcription using the RiboAmp.TM. kit (frozen samples) or another system for FFPE samples according to manufacturer's instructions (Arcturus Bioscience, Inc., Mountain View, Calif. for RiboAmP.TM.). Labeled cRNA was generated by a second round of T7-based RNA in vitro transcription in the presence of 5-[3-Aminoallyl]uridine 5'-triphosphate (Sigma-Aldrich, St. Louis, Mo.). Universal Human Reference RNA (Stratagene, San Diego, Calif.) was amplified in the same manner. The purified aRNA was later conjugated to Cy5 (experimental samples) or Cy3 (reference sample) dye (Amersham Biosciences).

Microarray Analysis

A custom designed 22,000-gene oligonucleotide (60mer) microarray was fabricated using ink-jet in-situ synthesis technology (Agilent Technologies, Palo Alto, Calif.). Cy5-labeled sample RNA and Cy3-labeled reference RNA were co-hybridized at 65.degree. C., 1.times. hybridization buffer (Agilent Technologies). Slides were washed at 37.degree. C. with 0.1.times.SSC/0.005% Triton X-102. Image analysis was performed using Agilent's image analysis software. Raw Cy5/Cy3 ratios were normalized using intensity-dependent non-linear regression.

A data matrix consisting of normalized Cy5/Cy3 ratios from all samples were median centered for each gene. The variance of each gene over all samples was calculated and the top 25% high variance genes (5,475) selected for further analysis. Identification and permutation testing for significance of differential gene expression were performed using BRB ArrayTools, developed by Dr. Richard Simon and Amy Peng (see http site at linus.nci.nih.gov/BRB-ArrayTools.html). Hierarchical cluster analysis was performed with GeneMaths software (Applied-Maths, Belgium) using cosine correlation and complete linkage. All other statistical procedures (two-sample t-test, receiver operating characteristic analysis, multivariate logistic regression and survival analysis) were performed in the open source R statistical environment (see http site at www.r-project.org). Statistical test of significance of ROC curves was by the method of DeLong ("Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach." Biometrics 44, 837-45 (1988)). Disease free survival was calculated from the date of diagnosis. Events were scored as the first distant metastasis, and patients remaining disease-free at the last follow-up were censored. Survival curves were calculated by the Kaplan-Meier estimates and compared by log-rank tests.

Real-Time Quantitative PCR Analysis

Real-time PCR was performed on 59 of the 60-case training samples (one case was excluded due to insufficient materials) and the 20-case validation samples. Briefly, 2 .mu.g of amplified RNA was converted into double stranded cDNA. For each case 12 ng of cDNA in triplicates was used for real-time PCR with an ABI 7900HT (Applied Biosystems) as described (Gelmini, S. et al. "Quantitative polymerase chain reaction-based homogeneous assay with fluorogenic probes to measure c-erbB-2 oncogene amplification." Clin Chem 43, 752-8 (1997)). The sequences of the PCR primer pairs and fluorogenic MGB probe (5' to 3'), respectively, that were used for each gene are as follows:

TABLE-US-00014 HoxB13 (SEQ ID NO: 38) TTCATCCTGACAGTGGCAATAATC, (SEQ ID NO: 39) CTAGATAGAAAATATGAGGCTAACGATCAT, (SEQ ID NO: 40) VIC-CGATAACCAGTACTAGCTG; IL17BR (SEQ ID NO: 41) GCATTAACTAACGATTGGAAACTACATT, (SEQ ID NO: 42) GGAAGATGCTTTATTGTTGCATTATC, (SEQ ID NO: 43) VIC-ACAACTTCAAAGCTGTTTTA.

Relative expression levels of HOXB13 in normal, DCIS and IDC samples were calculated as follows. First, all CT values are adjusted by subtracting the highest CT (40) among all samples, then relative expression=1/2^CT.

In Situ Hybridization

Dig-labeled RNA probes were prepared using DIG RNA labeling kit (SP6/T7) from Roche Applied Science, following the protocol provided with the kit. In situ hybridization was performed on frozen tissue sections as described (Long et al.).

TABLE-US-00015 TABLE 1 Patients and tumor characteristics of training set. Sample ID Tumor type Size Grade Nodes ER PR Age DFS Status 1389 D 1.7 2 0/1 Pos Pos 80 94 0 648 D 1.1 2 0/15 Pos ND 62 160 0 289 D 3 2 0/15 Pos ND 75 63 1 749 D 1.8 2 2/9 Pos Pos 61 137 0 420 D/L 2 3 ND Pos Pos 72 58 1 633 D 2.7 3 0/11 Pos ND 61 20 1 662 D 1 3 6/11 Pos Pos 79 27 1 849 D 2 1 0/26 Pos Neg 75 23 1 356 D 1 2 2/20 Pos ND 58 24 1 1304 D 2 3 0/14 Pos Pos 57 20 1 1419 D 2.5 2 1/8 Pos Pos 59 86 0 1093 D 1 3 1/14 Pos Pos 66 85 0 1047 D/L 2.6 2 0/18 Pos Neg 70 128 0 1037 D/L 1.5 2 0/4 Pos Pos 85 83 0 319 D 4 2 1/13 Pos ND 67 44 1 25 D 3.5 2 0/9 Neg Pos 62 75 1 180 D 1.6 2 2/19 Pos Pos 69 169 0 687 D 3.5 3 3/16 Pos ND 73 142 0 856 D 1.6 2 0/16 Pos Pos 73 88 0 1045 D 2.5 3 1/12 Pos Neg 73 121 0 1205 D 2.7 2 1/19 Pos Pos 71 88 0 1437 D 1.7 2 2/22 Pos Pos 67 89 0 1507 D 3.7 3 0/40 Pos Pos 70 70 0 469 D 1 1 0/19 Pos ND 66 161 0 829 D 1.2 2 0/9 Pos ND 69 136 0 868 D 3 3 0/13 Pos Pos 65 130 0 1206 D 4.1 3 0/15 Pos Neg 84 56 1 843 D 3.4 2 11/20 Pos Neg 76 122 1 342 D 3 2 9/21 Pos ND 62 102 1 1218 D 4.5 1 3/16 Pos Pos 62 10 1 547 D/L 1.5 2 ND Pos ND 74 129 1 1125 D 2.6 2 0/18 Pos Pos 54 123 0 1368 D 2.6 2 ND Pos Pos 82 63 0 605 D 2.2 2 6/18 Pos ND 70 110 0 59 L 3 2 33/38 Pos ND 70 21 1 68 D 3 2 0/17 Pos ND 53 38 1 317 D 1.2 3 1/10 Pos Pos 71 5 1 374 D 1 3 0/15 Pos Neg 57 47 1 823 D 2 2 0/6 Pos Pos 51 69 1 280 D 2.2 3 0/12 Pos ND 66 44 1 651 D 4.7 3 10/13 Pos ND 48 137 1 763 D 1.8 2 0/14 Pos Pos 63 118 0 1085 D 4.7 2 0/8 Pos Pos 48 101 1 1363 D 2.1 2 0/15 Pos Pos 56 114 0 295 D 3.5 2 3/21 Pos Pos 52 118 1 871 D 4 3 0/16 Pos Neg 61 6 1 1343 D 2.5 3 ND Pos Pos 79 21 1 140 L >2.0 2 18/28 Pos ND 63 43 1 260 D/L 0.9 2 1/13 Pos ND 73 42 1 297 D 0.8 2 1/16 Pos Pos 66 169 0 1260 D 3.5 2 0/14 Pos Pos 58 79 0 1405 D 1 3 ND Pos Pos 81 95 0 518 L 5.5 2 3/20 Pos ND 68 156 0 607 D 1.2 2 5/14 Pos Pos 76 114 0 638 D 2 2 1/24 Pos Pos 67 148 0 655 D 2 3 ND Pos Pos 73 143 0 772 D 2.5 2 0/18 Pos Pos 68 69 1 878 D/L 1.6 2 0/9 Pos Neg 76 138 0 1279 D 2 2 0/12 Pos Pos 68 102 0 1370 D 2 2 ND Pos Pos 73 61 0 Abbreviations: D, ductal; L, lobular; D/L. ductal and lobular features; pos, positive; neg, negative; ND, not determined; ER, estrogen receptor; PR, progesterone receptor; DFS, disease-free survival (number of months); status = 1, recurred; status = 0, disease-free.

Example 2

Identification of Differentially Expressed Genes

Gene expression profiling was performed using a 22,000-gene oligonucleotide microarray as described above. In the initial analysis, isolated RNA from frozen tumor-tissue sections taken from the archived primary biopsies were used. The resulting expression dataset was first filtered based on overall variance of each gene with the top 5,475 high-variance genes (75th percentile) selected for further analysis. Using this reduced dataset, t-test was performed on each gene comparing the tamoxifen responders and non-responders, leading to identification of 19 differentially expressed genes at the P value cutoff of 0.001 (Table 2). The probability of selecting this many or more differentially expressed genes by chance was estimated to be 0.04 by randomly permuting the patient class with respect to treatment outcome and repeating the t-test procedure 2,000 times. This analysis thus demonstrated the existence of statistically significant differences in gene expression between the primary breast cancers of tamoxifen responders and non-responders.

TABLE-US-00016 TABLE 2 19-gene signature identified by t-test in the Sections dataset Mean in Fold Parametric p- Mean in non- difference value responders responders of means GB acc Description 1 1.96E-05 0.759 1.317 0.576 AW006861 SCYA4|small inducible cytokine A4 2 2.43E-05 1.31 0.704 1.861 AI240933 ESTs 3 8.08E-05 0.768 1.424 0.539 X59770 IL1R2|interleukin 1 receptor, type II 4 9.57E-05 0.883 1.425 0.62 AB000520 APS|adaptor protein with pleckstrin homology and src homology 2 domains 5 9.91E-05 1.704 0.659 2.586 AF208111 IL17BR|interleukin 17B receptor 6 0.0001833 0.831 1.33 0.625 AI820604 ESTs 7 0.0001935 0.853 1.459 0.585 AI087057 DOK2|docking protein 2, 56 kD 8 0.0001959 1.29 0.641 2.012 AJ272267 CHDH|choline dehydrogenase 9 0.0002218 1.801 0.943 1.91 N30081 ESTs, Weakly similar to I38022 hypothetical protein [H. sapiens] 10 0.0004234 1.055 2.443 0.432 AI700363 ESTs 11 0.0004357 0.451 1.57 0.287 AL117406 ABCC11|ATP-binding cassette, sub- family C (CFTR/MRP), member 11 12 0.0004372 1.12 3.702 0.303 BC007092 HOXB13|homeo box B13 13 0.0005436 0.754 1.613 0.467 M92432 GUCY2D|guanylate cyclase 2D, membrane (retina-specific) 14 0.0005859 1.315 0.578 2.275 AL050227 Homo sapiens mRNA; cDNA DKFZp586M0723 (from clone DKFZp586M0723) 15 0.000635 1.382 0.576 2.399 AW613732 Homo sapiens cDNA FLJ31137 fis, clone IMR322001049 16 0.0008714 0.794 1.252 0.634 BC007783 SCYA3|small inducible cytokine A3 17 0.0008912 2.572 1.033 2.49 X81896 C11orf25|chromosome 11 open reading frame 25 18 0.0009108 0.939 1.913 0.491 BC004960 MGC10955|hypothetical protein MGC10955 19 0.0009924 1.145 0.719 1.592 AK027250 Homo sapiens cDNA: FLJ23597 fis, clone LNG15281

To refine our analysis to the tumor cells and circumvent potential variability attributable to stromal cell contamination, the same cohort was reanalyzed following laser-capture microdissection (LCM) of tumor cells within each tissue section. Using variance based gene filtering and t-test screening identical to that utilized for the whole tissue section dataset, 9 differentially expressed gene sequences were identified with P<0.001 (Table 3).

TABLE-US-00017 TABLE 3 9-gene signature identified by t-test in the LCM dataset Mean in Fold Parametric Mean in non- difference p-value responders responders of means GB acc Description 1 2.67E-05 1.101 4.891 0.225 BC007092 HOXB13|homeo box B13 2 0.0003393 1.045 2.607 0.401 AI700363 ESTs 3 0.0003736 0.64 1.414 0.453 NM_014298 QPRT|quinolinate phosphoribosyltransferase (nicotinate- nucleotide pyrophosphorylase (carboxylating)) 4 0.0003777 1.642 0.694 2.366 AF208111 IL17BR|interleukin 17B receptor 5 0.0003895 0.631 1.651 0.382 AF033199 ZNF204|zinc finger protein 204 6 0.0004524 1.97 0.576 3.42 AI688494 FLJ13189|hypothetical protein FLJ13189 7 0.0005329 1.178 0.694 1.697 AI240933 ESTs 8 0.0007403 0.99 1.671 0.592 AL157459 Homo sapiens mRNA; cDNA DKFZp434B0425 (from clone DKFZp434B0425) 9 0.0007739 0.723 1.228 0.589 BC002480 FLJ13352|hypothetical protein FLJ13352

Only 3 genes were identified as differentially expressed in both the LCM and whole tissue section analyses: the homeobox gene HOXB13 (identified twice as AI700363 and BC007092), the interleukin 17B receptor IL17BR (AF208111), and the voltage-gated calcium channel CACNA1D (AI240933). HOXB13 was differentially overexpressed in tamoxifen nonresponsive cases, whereas IL17BR and CACNA1D were overexpressed in tamoxifen responsive cases. Based on their identification as tumor-derived markers significantly associated with clinical outcome in two independent analyses, the utility of each of these genes was evaluated by itself and in combination with the others.

To define the sensitivity and specificity of HOXB13, IL17BR and CACNA1D expression as markers of clinical outcome, Receiver Operating Characteristic (ROC) analysis (Pepe, M. S. "An interpretation for the ROC curve and inference using GLM procedures." Biometrics 56, 352-9 (2000)) was used. For data derived from whole tissue sections, the Area Under the Curve (AUC) values for IL17BR, HOXB13 and CACNA1D were 0.79, 0.67 and 0.81 for IL17BR, HOXB13 and CACNA1D, respectively (see Table 4 and FIG. 1, upper portion). ROC analysis of the data generated from the microdissected tumor cells yielded AUC values of 0.76, 0.8, and 0.76 for these genes (see Table 4 and FIG. 1, lower portion).

TABLE-US-00018 TABLE 4 ROC analysis of using IL17BR, CACNA1D and HOXB13 expression to predict tamoxifen response Tissue Sections LCM AUC P value AUC P value IL17BR 0.79 1.58E-06 0.76 2.73E-05 CACNA1D 0.81 3.02E-08 0.76 1.59E-05 HOXB13 0.67 0.012 0.79 9.94E-07 ESR1 0.55 0.277 0.63 0.038 PGR 0.63 0.036 0.63 0.033 ERBB2 0.69 0.004 0.64 0.027 EGFR 0.56 0.200 0.61 0.068 AUC, area under the curve; P values are AUC >0.5.

A statistical test of significance indicated that these AUC values are all significantly greater than 0.5, the expected value from the null model that predicts clinical outcome randomly. Therefore, these three genes have potential utility for predicting clinical outcome of adjuvant tamoxifen therapy. As comparison, markers that are currently useful in evaluating the likelihood of response to tamoxifen were analyzed in comparison. The levels of ER (gene symbol ESR1) and progesterone receptor (PR, gene symbol PGR) are known to be positively correlated with tamoxifen response (see Fernandez, M. D., et al. "Quantitative oestrogen and progesterone receptor values in primary breast cancer and predictability of response to endocrine therapy." Clin Oncol 9, 245-50 (1983); Ferno, M. et al. "Results of two or five years of adjuvant tamoxifen correlated to steroid receptor and S-phase levels." South Sweden Breast Cancer Group, and South-East Sweden Breast Cancer Group. Breast Cancer Res Treat 59, 69-76 (2000); Nardelli, G. B., et al. "Estrogen and progesterone receptors status in the prediction of response of breast cancer to endocrine therapy (preliminary report)." Eur J Gynaecol Oncol 7, 151-8 (1986); and Osborne, C. K., et al. "The value of estrogen and progesterone receptors in the treatment of breast cancer." U 46, 2884-8 (1980)).

In addition, growth factor signaling pathways (EGFR, ERBB2) are thought to negatively regulate estrogen-dependent signaling, and hence contribute to loss of responsiveness to tamoxifen (see Dowsett, M. "Overexpression of HER-2 as a resistance mechanism to hormonal therapy for breast cancer." Endocr Relat Cancer 8, 191-5 (2001)). ROC analysis of these genes confirmed their correlation with clinical outcome, but with AUC values ranging only from 0.55 to 0.69, reaching statistical significance for PGR and ERBB2 (see Table 4).

The LCM dataset is particularly relevant, since EGFR, ERBB2, ESR1 and PGR are currently measured at the tumor cell level using either immunohistochemistry or fluorescence in situ hybridization. As individual markers of clinical outcome, HOXB13, IL17BR and CAC1D all outperformed ESR1, PGR, EGFR and ERBB2 (see Table 4).

Example 3

Identification of the HOXB13:IL17BR Expression Ratio

HOXB13:IL17BR expression ratio was identified as a robust composite predictor of outcome as follows. Since HOXB13 and IL17BR have opposing patterns of expression, the expression ratio of HOXB13 over IL17BR was examined to determine whether it provides a better composite predictor of tamoxifen response. Indeed, both t-test and ROC analyses demonstrated that the two-gene ratio had a stronger correlation with treatment outcome than either gene alone, both in the whole tissue sections and LCM datasets (see Table 5). AUC values for HOXB13:IL17BR reached 0.81 for the tissue sections dataset and 0.84 for the LCM dataset. Pairing HOXB13 with CACNA1D or analysis of all three markers together did not provide additional predictive power.

TABLE-US-00019 TABLE 5 HOXB13:IL17BR ratio is a stronger predictor of treatment outcome t-test ROC t-statistic P value AUC P value Tissue IL17BR 4.15 1.15E-04 0.79 1.58E-06 Section HOXB13 -3.57 1.03E-03 0.67 0.01 HOXB13:IL17BR -4.91 1.48E-05 0.81 1.08E-07 LCM IL17BR 3.70 5.44E-04 0.76 2.73E-05 HOXB13 -4.39 8.00E-05 0.79 9.94E-07 HOXB13:IL17BR -5.42 2.47E-06 0.84 4.40E-11 AUC, area under the curve; P values are AUC > 0.5.

The HOXB13/IL7BR ratio was compared to well-established prognostic factors for breast cancer, such as patient age, tumor size, grade and lymph node status (see Fitzgibbons, P. L. et al. "Prognostic factors in breast cancer. College of American Pathologists Consensus Statement 1999." Arch Pathol Lab Med 124, 966-78 (2000)). Univariate logistic regression analysis indicated that only tumor size was marginally significant in this cohort (P=0.04); this was not surprising given that the responder group was closely matched to the non-responder group with respect to tumor size, tumor grade and lymph node status during patient selection. Among the known positive (ESR1 and PGR) and negative (ERBB2 and EGFR) predictors of tamoxifen response, ROC analysis of the tissue sections data indicated that only PGR and ERBB2 were significant (see Table 4). Therefore, a comparison of logistic regression models containing the HOXB13:IL17BR ratio either by itself or in combination with tumor size, and expression levels of PGR and ERBB2, were made (see Table 6). The HOXB13:IL17BR ratio alone was a highly significant predictor (P=0.0003) and had an odds ratio of 10.2 (95% CI 2.9-35.6). In the multivariate model, HOXB13:IL17BR ratio is the only significant variable (P=0.002) with an odds ratio of 7.3 (95% CI 2.1-26). Thus, the expression ratio of HOXB13:IL17BR is a strong independent predictor of treatment outcome in the setting of adjuvant tamoxifen therapy.

TABLE-US-00020 TABLE 6 Logistic Regression Analysis Univariate Model Predictor Odds Ratio 95% CI P Value HOXB13:IL17BR 10.17 2.9-35.6 0.0003 Multivariate Model Predictors Odds Ratio 95% CI P Value Tumor size 1.5 0.7-3.5 0.3289 PGR 0.8 0.3-1.8 0.5600 ERBB2 1.7 0.8-3.8 0.1620 HOXB13:IL17BR 7.3 2.1-26.3 0.0022 All predictors are continuous variables. Gene expression values were from microarray measurements. Odds ratio is the inter-quartile odds ratio, based on the difference of a predictor from its lower quartile (0.25) to its upper quartile (0.75); CI, confidence interval.

Example 4

Independent Validation of HOXB13: IL17BR Expression Ratio

The reduction of a complex microarray signature to a two-gene expression ratio allows the use of simpler detection strategies, such as real-time quantitative PCR (RT-QPCR) analysis. The HOXB13:IL7BR expression ratio by RT-QPCR using frozen tissue sections that were available from 59 of the 60 training cases were analyzed (FIG. 2, part a). RT-QPCR data were highly concordant with the microarray data of frozen tumor specimens (correlation coefficient r=0.83 for HOXB13, 0.93 for IL17BR). In addition, the PCR-derived HOXB13:IL17BR ratios, represented as .DELTA.CTs, where CT is the PCR amplification cycles to reach a predetermined threshold amount (e.g., FIG. 2, parts a and b) and .DELTA.CT is the CT difference between HOXB13 and IL17BR, were highly correlated with the microarray-derived data (r=0.83) and with treatment outcome (t test P=0.0001, FIG. 2, part c). Thus, conventional RT-QPCR analysis for the expression ratio of HOXB13 to IL17BR appears to be equivalent to microarray-based analysis of frozen tumor specimens.

To validate the predictive utility of HOXB13:IL17BR expression ratio in an independent patient cohort, 20 additional ER-positive early-stage primary breast tumors from women treated with adjuvant tamoxifen only at MGH between 1991 and 2000, and for which medical records and paraffin-embedded tissues were available, were identified. Of the 20 archival cases, 10 had recurred with a median time to recurrence of 5 years, and 10 had remained disease-free with a median follow up of 9 years (see Table 7 for details).

TABLE-US-00021 TABLE 7 Patient and tumor characteristics of the validation set. Sample Tumor Type Size Grade Nodes ER PR Age DFS Status Test 1 D 1.9 3 0/10 Pos Pos 69 15 1 Test 2 D 1.7 3 0/19 Pos Pos 61 117 1 Test 3 D 1.7 2 0/26 Pos ND 65 18 1 Test 4 D 1.2 2 0/19 Pos Pos 63 69 1 Test 5 D 1.7 2 2/2 Pos Pos 60 52 1 Test 6 D 1.1 1 0/10 Pos Pos 54 59 1 Test 7 D >1.6 2 0/17 Pos Neg 66 32 1 Test 8 L 2.6 1 2 0/14 Pos Pos 58 67 1 Test 9 D 1.2 2 ND Pos Pos 93 58 1 Test 10 D 4 3 0/20 Pos Pos 66 27 1 Test 11 D 1.1 2 0/19 Pos Pos 64 97 0 Test 12 D 2.7 2 0/10 Pos Pos 66 120 0 Test 13 D 0.9 1 0/22 Pos Pos 66 123 0 Test 14 D 2.1 2 0/16 Pos Pos 57 83 0 Test 15 D 0.8 1 2 0/8 Pos Pos 74 80 0 Test 16 D 1 2 0/13 Pos Pos 74 93 0 Test 17 D 1.6 2 0/29 Pos Pos 66 121 0 Test 18 L 1.5 1 2 0/8 Pos Pos 65 25 0 Test 19 D 1.5 3 0/16 Pos Pos 60 108 0 Test 20* L 4 1 2 0/19 Pos Pos 60 108 0 Abbreviations: Same as supplemental Table 1. *Patient received tamoxifen for 2 years.

RNA was extracted from formalin-fixed paraffin-embedded (FFPE) whole tissue sections, linearly amplified, and used as template for RT-QPCR analysis. Consistent with the results of the training cohort, the HOXB13:IL17BR expression ratio in this independent patient cohort was highly correlated with clinical outcome (t test P=0.035) with higher HOXB13 expression (lower .DELTA.CTs) correlating with poor outcome (FIG. 2, part d). To test the predictive accuracy of the HOXB13:IL17BR ratio, the RT-QPCR data from the frozen tissue sections (n=59) was used to build a logistic regression model. In this training set, the model predicted treatment outcome with an overall accuracy of 76% (P=0.000065, 95% confidence interval 63%-86%). The positive and negative predictive values were 78% and 75%, respectively. Applying this model to the 20 independent patients in the validation cohort, treatment outcome for 15 of the 20 patients was correctly predicted (overall accuracy 75%, P=0.04, 95% confidence interval 51%-91%), with positive and negative predictive values of 78% and 73%, respectively.

Kaplan-Meier analysis of the patient groups as predicted by the model resulted in significantly different disease-free survival curves in both the training set and the independent test set (FIG. 2, parts e and f).

ADDITIONAL REFERENCES

Ma, X. J. et al. Gene expression profiles of human breast cancer progression. Proc Natl Acad Sci USA 100, 5974-9 (2003). Nicholson, R. I. et al. Epidermal growth factor receptor expression in breast cancer: association with response to endocrine therapy. Breast Cancer Res Treat 29, 117-25 (1994).

All references cited herein, including patents, patent applications, and publications, are hereby incorporated by reference in their entireties, whether previously specifically incorporated or not.

Having now fully described this invention, it will be appreciated by those skilled in the art that the same can be performed within a wide range of equivalent parameters, concentrations, and conditions without departing from the spirit and scope of the invention and without undue experimentation.

While this invention has been described in connection with specific embodiments thereof, it will be understood that it is capable of further modifications. This application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains and as may be applied to the essential features hereinbefore set forth.

SEQUENCE LISTINGS

1

41212077DNAHomo sapiens 1agcgcagcgt gcgggtggcc tggatcccgc gcagtggccc ggcgatgtcg ctcgtgctgc 60taagcctggc cgcgctgtgc aggagcgccg taccccgaga gccgaccgtt caatgtggct 120ctgaaactgg gccatctcca gagtggatgc tacaacatga tctaatcccc ggagacttga 180gggacctccg agtagaacct gttacaacta gtgttgcaac aggggactat tcaattttga 240tgaatgtaag ctgggtactc cgggcagatg ccagcatccg cttgttgaag gccaccaaga 300tttgtgtgac gggcaaaagc aacttccagt cctacagctg tgtgaggtgc aattacacag 360aggccttcca gactcagacc agaccctctg gtggtaaatg gacattttcc tacatcggct 420tccctgtaga gctgaacaca gtctatttca ttggggccca taatattcct aatgcaaata 480tgaatgaaga tggcccttcc atgtctgtga atttcacctc accaggctgc ctagaccaca 540taatgaaata taaaaaaaag tgtgtcaagg ccggaagcct gtgggatccg aacatcactg 600cttgtaagaa gaatgaggag acagtagaag tgaacttcac aaccactccc ctgggaaaca 660gatacatggc tcttatccaa cacagcacta tcatcgggtt ttctcaggtg tttgagccac 720accagaagaa acaaacgcga gcttcagtgg tgattccagt gactggggat agtgaaggtg 780ctacggtgca gctgactcca tattttccta cttgtggcag cgactgcatc cgacataaag 840gaacagttgt gctctgccca caaacaggcg tccctttccc tctggataac aacaaaagca 900agccgggagg ctggctgcct ctcctcctgc tgtctctgct ggtggccaca tgggtgctgg 960tggcagggat ctatctaatg tggaggcacg aaaggatcaa gaagacttcc ttttctacca 1020ccacactact gccccccatt aaggttcttg tggtttaccc atctgaaata tgtttccatc 1080acacaatttg ttacttcact gaatttcttc aaaaccattg cagaagtgag gtcatccttg 1140aaaagtggca gaaaaagaaa atagcagaga tgggtccagt gcagtggctt gccactcaaa 1200agaaggcagc agacaaagtc gtcttccttc tttccaatga cgtcaacagt gtgtgcgatg 1260gtacctgtgg caagagcgag ggcagtccca gtgagaactc tcaagacctc ttcccccttg 1320cctttaacct tttctgcagt gatctaagaa gccagattca tctgcacaaa tacgtggtgg 1380tctactttag agagattgat acaaaagacg attacaatgc tctcagtgtc tgccccaagt 1440accacctcat gaaggatgcc actgctttct gtgcagaact tctccatgtc aagcagcagg 1500tgtcagcagg aaaaagatca caagcctgcc acgatggctg ctgctccttg tagcccaccc 1560atgagaagca agagacctta aaggcttcct atcccaccaa ttacagggaa aaaacgtgtg 1620atgatcctga agcttactat gcagcctaca aacagcctta gtaattaaaa cattttatac 1680caataaaatt ttcaaatatt gctaactaat gtagcattaa ctaacgattg gaaactacat 1740ttacaacttc aaagctgttt tatacataga aatcaattac agttttaatt gaaaactata 1800accattttga taatgcaaca ataaagcatc ttcagccaaa catctagtct tccatagacc 1860atgcattgca gtgtacccag aactgtttag ctaatattct atgtttaatt aatgaatact 1920aactctaaga acccctcact gattcactca atagcatctt aagtgaaaaa ccttctatta 1980catgcaaaaa atcattgttt ttaagataac aaaagtaggg aataaacaag ctgaacccac 2040ttttaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 207723105DNAHomo sapiens 2agcgcagcgt gcgggtggcc tggatcccgc gcagtggccc ggcgatgtcg ctcgtgctgc 60taagcctggc cgcgctgtgc aggagcgccg taccccgaga gccgaccgtt caatgtggct 120ctgaaactgg gccatctcca gagtggatgc tacaacatga tctaatcccc ggagacttga 180gggacctccg agtagaacct gttacaacta gtgttgcaac aggggactat tcaattttga 240tgaatgtaag ctgggtactc cgggcagatg ccagcatccg cttgttgaag gccaccaaga 300tttgtgtgac gggcaaaagc aacttccagt cctacagctg tgtgaggtgc aattacacag 360aggccttcca gactcagacc agaccctctg gtggtaaatg gacattttcc tacatcggct 420tccctgtaga gctgaacaca gtctatttca ttggggccca taatattcct aatgcaaata 480tgaatgaaga tggcccttcc atgtctgtga atttcacctc accaggctgc ctagaccaca 540taatgaaata taaaaaaaag tgtgtcaagg ccggaagcct gtgggatccg aacatcactg 600cttgtaagaa gaatgaggag acagtagaag tgaacttcac aaccactccc ctgggaaaca 660gatacatggc tcttatccaa cacagcacta tcatcgggtt ttctcaggtg tttgagccac 720accagaagaa acaaacgcga gcttcagtgg tgattccagt gactggggat agtgaaggtg 780ctacggtgca ggtaaagttc agtgagctgc tctggggagg gaagggacat agaagactgt 840tccatcattc attgctttta aggatgagtt ctctcttgtc aaatgcactt ctgccagcag 900acaccagtta agtggcgttc atgggggctc tttcgctgca gcctccaccg tgctgaggtc 960aggaggccga cgtggcagtt gtggtccctt ttgcttgtat taatggctgc tgaccttcca 1020aagcactttt tattttcatt ttctgtcaca gacactcagg gatagcagta ccattttact 1080tccgcaagcc tttaactgca agatgaagct gcaaagggtt tgaaatggga aggtttgagt 1140tccaggcagc gtatgaactc tggagagggg ctgccagtcc tctctgggcc gcagcggacc 1200cagctggaac acaggaagtt ggagcagtag gtgctccttc acctctcagt atgtctcttt 1260caactctagt ttttgaggtg gggacacagg aggtccagtg ggacacagcc actccccaaa 1320gagtaaggag cttccatgct tcattccctg gcataaaaag tgctcaaaca caccagaggg 1380ggcaggcacc agccagggta tgatggctac tacccttttc tggagaacca tagacttccc 1440ttactacagg gacttgcatg tcctaaagca ctggctgaag gaagccaaga ggatcactgc 1500tgctcctttt ttctagagga aatgtttgtc tacgtggtaa gatatgacct agccctttta 1560ggtaagcgaa ctggtatgtt agtaacgtgt acaaagttta ggttcagacc ccgggagtct 1620tgggcacgtg ggtctcgggt cactggtttt gactttaggg ctttgttaca gatgtgtgac 1680caaggggaaa atgtgcatga caacactaga ggtatgggcg aagccagaaa gaagggaagt 1740tttggctgaa gtaggagtct tggtgagatt ttgctctgat gcatggtgtg aactttctga 1800gcctcttgtt tttcctcagc tgactccata ttttcctact tgtggcagcg actgcatccg 1860acataaagga acagttgtgc tctgcccaca aacaggcgtc cctttccctc tggataacaa 1920caaaagcaag ccgggaggct ggctgcctct cctcctgctg tctctgctgg tggccacatg 1980ggtgctggtg gcagggatct atctaatgtg gaggcacgaa aggatcaaga agacttcctt 2040ttctaccacc acactactgc cccccattaa ggttcttgtg gtttacccat ctgaaatatg 2100tttccatcac acaatttgtt acttcactga atttcttcaa aaccattgca gaagtgaggt 2160catccttgaa aagtggcaga aaaagaaaat agcagagatg ggtccagtgc agtggcttgc 2220cactcaaaag aaggcagcag acaaagtcgt cttccttctt tccaatgacg tcaacagtgt 2280gtgcgatggt acctgtggca agagcgaggg cagtcccagt gagaactctc aagacctctt 2340cccccttgcc tttaaccttt tctgcagtga tctaagaagc cagattcatc tgcacaaata 2400cgtggtggtc tactttagag agattgatac aaaagacgat tacaatgctc tcagtgtctg 2460ccccaagtac cacctcatga aggatgccac tgctttctgt gcagaacttc tccatgtcaa 2520gcagcaggtg tcagcaggaa aaagatcaca agcctgccac gatggctgct gctccttgta 2580gcccacccat gagaagcaag agaccttaaa ggcttcctat cccaccaatt acagggaaaa 2640aacgtgtgat gatcctgaag cttactatgc agcctacaaa cagccttagt aattaaaaca 2700ttttatacca ataaaatttt caaatattgc taactaatgt agcattaact aacgattgga 2760aactacattt acaacttcaa agctgtttta tacatagaaa tcaattacag ttttaattga 2820aaactataac cattttgata atgcaacaat aaagcatctt cagccaaaca tctagtcttc 2880catagaccat gcattgcagt gtacccagaa ctgtttagct aatattctat gtttaattaa 2940tgaatactaa ctctaagaac ccctcactga ttcactcaat agcatcttaa gtgaaaaacc 3000ttctattaca tgcaaaaaat cattgttttt aagataacaa aagtagggaa taaacaagct 3060gaacccactt ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa 310532856DNAHomo sapiensmisc_feature(1325)..(1325)a or g or c or t/u 3cggcgatgtc gctcgtgctg ataagcctgg ccgcgctgtg caggagcgcc gtaccccgag 60agccgaccgt tcaatgtggc tctgaaactg ggccatctcc agagtggatg ctacaacatg 120atctaatccc cggagacttg agggacctcc gagtagaacc tgttacaact agtgttgcaa 180caggggacta ttcaattttg atgaatgtaa gctgggtact ccgggcagat gccagcatcc 240gcttgttgaa ggccaccaag atttgtgtga cgggcaaaag caacttccag tcctacagct 300gtgtgaggtg caattacaca gaggccttcc agactcagac cagaccctct ggtggtaaat 360ggacattttc ctatatcggc ttccctgtag agctgaacac agtctatttc attggggccc 420ataatattcc taatgcaaat atgaatgaag atggcccttc catgtctgtg aatttcacct 480caccaggctg cctagaccac ataatgaaat ataaaaaaaa gtgtgtcaag gccggaagcc 540tgtgggatcc gaacatcact gcttgtaaga agaatgagga gacagtagaa gtgaacttca 600caaccactcc cctgggaaac agatacatgg ctcttatcca acacagcact atcatcgggt 660tttctcaggt gtttgagcca caccagaaga aacaaacgcg agcttcagtg gtgattccag 720tgactgggga tagtgaaggt gctacggtgc aggtaaagtt cagtgagctg ctctggggag 780ggaagggaca tagaagactg ttccatcatt cattgctttt aaggatgagt tctctcttgt 840caaatgcact tctgccagca gacaccagtt aagtggcgtt catgggggtt ctttcgctgc 900agcctccacc gtgctgaggt caggaggccg acgtggcagt tgtggtccct tttgcttgta 960ttaatggctg ctgaccttcc aaagcacttt ttattttcat tttctgtcac agacactcag 1020ggatagcagt accattttac ttccgcaagc ctttaactgc aagatgaagc tgcaaagggt 1080ttgaaatggg aaggtttgag ttccaggcag cgtatgaact ctggagaggg gctgccagtc 1140ctctctgggc cgcagcggac ccagctggaa cacaggaagt tggagcagta ggtgctcctt 1200cacctctcag tatgtctctt tcaactctag tttttgaagt ggggacacag gaagtccagt 1260ggggacacag ccactcccca aagaataagg aacttccatg cttcattccc tggcataaaa 1320agtgntcaaa cacaccagag ggggcaggca ccagccaggg tatgatgggt actacccttt 1380tctggagaac catagacttc ccttactaca gggacttgca tgtcctaaag cactggctga 1440aggaagccaa gaggatcact gctgctcctt ttttgtagag gaaatgtttg tgtacgtggt 1500aagatatgac ctagcccttt taggtaagcg aactggtatg ttagtaacgt gtacaaagtt 1560taggttcaga ccccgggagt cttgggcatg tgggtctcgg gtcactggtt ttgactttag 1620ggctttgtta cagatgtgtg accaagggga aaatgtgcat gacaacacta gaggtagggg 1680cgaagccaga aagaagggaa gttttggctg aagtaggagt cttggtgaga ttttgctgtg 1740atgcatggtg tgaactttct gagcctcttg tttttcctca gctgactcca tattttccta 1800cttgtggcag cgactgcatc cgacataaag gaacagttgt gctctgccca caaacaggcg 1860tccctttccc tctggataac aacaaaagca agccgggagg ctggctgcct ctcctcctgc 1920tgtctctgct ggtggccaca tgggtgctgg tggcagggat ctatctaatg tggaggcacg 1980aaaggatcaa gaagacttcc ttttctacca ccacactact gccccccatt aaggttcttg 2040tggtttaccc atctgaaata tgtttccatc acacaatttg ttacttcact gaatttcttc 2100aaaaccattg cagaagtgag gtcatccttg aaaagtggca gaaaaagaaa atagcagaga 2160tgggtccagt gcagtggctt gccactcaaa agaaggcagc agacaaagtc gtcttccttc 2220tttccaatga cgtcaacagt gtgtgcgatg gtacctgtgg caagagcgag ggcagtccca 2280gtgagaactc tcaagacctc ttcccccttg cctttaacct tttctgcagt gatctaagaa 2340gccagattca tctgcacaaa tacgtggtgg tctactttag agagattgat acaaaagacg 2400attacaatgc tctcagtgtc tgccccaagt accacttcat gaaggatgcc actgctttct 2460gtgcagaact tctccatgtc aagcagcagg tgtcagcagg aaaaagatca caagcctgcc 2520acgatggctg ctgctccttg tagcccaccc atgagaagca agagacctta aaggcttcct 2580atcccaccaa ttacagggaa aaaacgtgtg atgatcctga agcttactat gcagcctaca 2640aacagcctta gtaattaaaa cattttatac caataaaatt ttcaaatatt actaactaat 2700gtagcattaa ctaacgattg gaaactacat ttacaacttc aaagctgttt tatacataga 2760aatcaattac agctttaatt gaaaactgta accattttga taatgcaaca ataaagcatc 2820ttccaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 285647193DNAHomo sapiens 4agaataaggg cagggaccgc ggctcctatc tcttggtgat ccccttcccc attccgcccc 60cgcctcaacg cccagcacag tgccctgcac acagtagtcg ctcaataaat gttcgtggat 120gatgatgatg atgatgatga aaaaaatgca gcatcaacgg cagcagcaag cggaccacgc 180gaacgaggca aactatgcaa gaggcaccag acttcctctt tctggtgaag gaccaacttc 240tcagccgaat agctccaagc aaactgtcct gtcttggcaa gctgcaatcg atgctgctag 300acaggccaag gctgcccaaa ctatgagcac ctctgcaccc ccacctgtag gatctctctc 360ccaaagaaaa cgtcagcaat acgccaagag caaaaaacag ggtaactcgt ccaacagccg 420acctgcccgc gcccttttct gtttatcact caataacccc atccgaagag cctgcattag 480tatagtggaa tggaaaccat ttgacatatt tatattattg gctatttttg ccaattgtgt 540ggccttagct atttacatcc cattccctga agatgattct aattcaacaa atcataactt 600ggaaaaagta gaatatgcct tcctgattat ttttacagtc gagacatttt tgaagattat 660agcgtatgga ttattgctac atcctaatgc ttatgttagg aatggatgga atttactgga 720ttttgttata gtaatagtag gattgtttag tgtaattttg gaacaattaa ccaaagaaac 780agaaggcggg aaccactcaa gcggcaaatc tggaggcttt gatgtcaaag ccctccgtgc 840ctttcgagtg ttgcgaccac ttcgactagt gtcaggggtg cccagtttac aagttgtcct 900gaactccatt ataaaagcca tggttcccct ccttcacata gcccttttgg tattatttgt 960aatcataatc tatgctatta taggattgga actttttatt ggaaaaatgc acaaaacatg 1020tttttttgct gactcagata tcgtagctga agaggaccca gctccatgtg cgttctcagg 1080gaatggacgc cagtgtactg ccaatggcac ggaatgtagg agtggctggg ttggcccgaa 1140cggaggcatc accaactttg ataactttgc ctttgccatg cttactgtgt ttcagtgcat 1200caccatggag ggctggacag acgtgctcta ctgggtaaat gatgcgatag gatgggaatg 1260gccatgggtg tattttgtta gtctgatcat ccttggctca tttttcgtcc ttaacctggt 1320tcttggtgtc cttagtggag aattctcaaa ggaaagagag aaggcaaaag cacggggaga 1380tttccagaag ctccgggaga agcagcagct ggaggaggat ctaaagggct acttggattg 1440gatcacccaa gctgaggaca tcgatccgga gaatgaggaa gaaggaggag aggaaggcaa 1500acgaaatact agcatgccca ccagcgagac tgagtctgtg aacacagaga acgtcagcgg 1560tgaaggcgag aaccgaggct gctgtggaag tctctggtgc tggtggagac ggagaggcgc 1620ggccaaggcg gggccctctg ggtgtcggcg gtggggtcaa gccatctcaa aatccaaact 1680cagccgacgc tggcgtcgct ggaaccgatt caatcgcaga agatgtaggg ccgccgtgaa 1740gtctgtcacg ttttactggc tggttatcgt cctggtgttt ctgaacacct taaccatttc 1800ctctgagcac tacaatcagc cagattggtt gacacagatt caagatattg ccaacaaagt 1860cctcttggct ctgttcacct gcgagatgct ggtaaaaatg tacagcttgg gcctccaagc 1920atatttcgtc tctcttttca accggtttga ttgcttcgtg gtgtgtggtg gaatcactga 1980gacgatcctg gtggaactgg aaatcatgtc tcccctgggg atctctgtgt ttcggtgtgt 2040gcgcctctta agaatcttca aagtgaccag gcactggact tccctgagca acttagtggc 2100atccttatta aactccatga agtccatcgc ttcgctgttg cttctgcttt ttctcttcat 2160tatcatcttt tccttgcttg ggatgcagct gtttggcggc aagtttaatt ttgatgaaac 2220gcaaaccaag cggagcacct ttgacaattt ccctcaagca cttctcacag tgttccagat 2280cctgacaggc gaagactgga atgctgtgat gtacgatggc atcatggctt acgggggccc 2340atcctcttca ggaatgatcg tctgcatcta cttcatcatc ctcttcattt gtggtaacta 2400tattctactg aatgtcttct tggccatcgc tgtagacaat ttggctgatg ctgaaagtct 2460gaacactgct cagaaagaag aagcggaaga aaaggagagg aaaaagattg ccagaaaaga 2520gagcctagaa aataaaaaga acaacaaacc agaagtcaac cagatagcca acagtgacaa 2580caaggttaca attgatgact atagagaaga ggatgaagac aaggacccct atccgccttg 2640cgatgtgcca gtaggggaag aggaagagga agaggaggag gatgaacctg aggttcctgc 2700cggaccccgt cctcgaagga tctcggagtt gaacatgaag gaaaaaattg cccccatccc 2760tgaagggagc gctttcttca ttcttagcaa gaccaacccg atccgcgtag gctgccacaa 2820gctcatcaac caccacatct tcaccaacct catccttgtc ttcatcatgc tgagcagcgc 2880tgccctggcc gcagaggacc ccatccgcag ccactccttc cggaacacga tactgggtta 2940ctttgactat gccttcacag ccatctttac tgttgagatc ctgttgaaga tgacaacttt 3000tggagctttc ctccacaaag gggccttctg caggaactac ttcaatttgc tggatatgct 3060ggtggttggg gtgtctctgg tgtcatttgg gattcaatcc agtgccatct ccgttgtgaa 3120gattctgagg gtcttaaggg tcctgcgtcc cctcagggcc atcaacagag caaaaggact 3180taagcacgtg gtccagtgcg tcttcgtggc catccggacc atcggcaaca tcatgatcgt 3240cactaccctc ctgcagttca tgtttgcctg tatcggggtc cagttgttca aggggaagtt 3300ctatcgctgt acggatgaag ccaaaagtaa ccctgaagaa tgcaggggac ttttcatcct 3360ctacaaggat ggggatgttg acagtcctgt ggtccgtgaa cggatctggc aaaacagtga 3420tttcaacttc gacaacgtcc tctctgctat gatggcgctc ttcacagtct ccacgtttga 3480gggctggcct gcgttgctgt ataaagccat cgactcgaat ggagagaaca tcggcccaat 3540ctacaaccac cgcgtggaga tctccatctt cttcatcatc tacatcatca ttgtagcttt 3600cttcatgatg aacatctttg tgggctttgt catcgttaca tttcaggaac aaggagaaaa 3660agagtataag aactgtgagc tggacaaaaa tcagcgtcag tgtgttgaat acgccttgaa 3720agcacgtccc ttgcggagat acatccccaa aaacccctac cagtacaagt tctggtacgt 3780ggtgaactct tcgcctttcg aatacatgat gtttgtcctc atcatgctca acacactctg 3840cttggccatg cagcactacg agcagtccaa gatgttcaat gatgccatgg acattctgaa 3900catggtcttc accggggtgt tcaccgtcga gatggttttg aaagtcatcg catttaagcc 3960taaggggtat tttagtgacg cctggaacac gtttgactcc ctcatcgtaa tcggcagcat 4020tatagacgtg gccctcagcg aagcggaccc aactgaaagt gaaaatgtcc ctgtcccaac 4080tgctacacct gggaactctg aagagagcaa tagaatctcc atcacctttt tccgtctttt 4140ccgagtgatg cgattggtga agcttctcag caggggggaa ggcatccgga cattgctgtg 4200gacttttatt aagtcctttc aggcgctccc gtatgtggcc ctcctcatag ccatgctgtt 4260cttcatctat gcggtcattg gcatgcagat gtttgggaaa gttgccatga gagataacaa 4320ccagatcaat aggaacaata acttccagac gtttccccag gcggtgctgc tgctcttcag 4380gtgtgcaaca ggtgaggcct ggcaggagat catgctggcc tgtctcccag ggaagctctg 4440tgaccctgag tcagattaca accccgggga ggagtataca tgtgggagca actttgccat 4500tgtctatttc atcagttttt acatgctctg tgcatttctg atcatcaatc tgtttgtggc 4560tgtcatcatg gataatttcg actatctgac ccgggactgg tctattttgg ggcctcacca 4620tttagatgaa ttcaaaagaa tatggtcaga atatgaccct gaggcaaagg gaaggataaa 4680acaccttgat gtggtcactc tgcttcgacg catccagcct cccctggggt ttgggaagtt 4740atgtccacac agggtagcgt gcaagagatt agttgccatg aacatgcctc tcaacagtga 4800cgggacagtc atgtttaatg caaccctgtt tgctttggtt cgaacggctc ttaagatcaa 4860gaccgaaggg aacctggagc aagctaatga agaacttcgg gctgtgataa agaaaatttg 4920gaagaaaacc agcatgaaat tacttgacca agttgtccct ccagctggtg atgatgaggt 4980aaccgtgggg aagttctatg ccactttcct gatacaggac tactttagga aattcaagaa 5040acggaaagaa caaggactgg tgggaaagta ccctgcgaag aacaccacaa ttgccctaca 5100ggcgggatta aggacactgc atgacattgg gccagaaatc cggcgtgcta tatcgtgtga 5160tttgcaagat gacgagcctg aggaaacaaa acgagaagaa gaagatgatg tgttcaaaag 5220aaatggtgcc ctgcttggaa accatgtcaa tcatgttaat agtgatagga gagattccct 5280tcagcagacc aataccaccc accgtcccct gcatgtccaa aggccttcaa ttccacctgc 5340aagtgatact gagaaaccgc tgtttcctcc agcaggaaat tcggtgtgtc ataaccatca 5400taaccataat tccataggaa agcaagttcc cacctcaaca aatgccaatc tcaataatgc 5460caatatgtcc aaagctgccc atggaaagcg gcccagcatt gggaaccttg agcatgtgtc 5520tgaaaatggg catcattctt cccacaagca tgaccgggag cctcagagaa ggtccagtgt 5580gaaaagaacc cgctattatg aaacttacat taggtccgac tcaggagatg aacagctccc 5640aactatttgc cgggaagacc cagagataca tggctatttc agggaccccc actgcttggg 5700ggagcaggag tatttcagta gtgaggaatg ctacgaggat gacagctcgc ccacctggag 5760caggcaaaac tatggctact acagcagata cccaggcaga aacatcgact ctgagaggcc 5820ccgaggctac catcatcccc aaggattctt ggaggacgat gactcgcccg tttgctatga 5880ttcacggaga tctccaagga gacgcctact acctcccacc ccagcatccc accggagatc 5940ctccttcaac tttgagtgcc tgcgccggca gagcagccag gaagaggtcc cgtcgtctcc 6000catcttcccc catcgcacgg ccctgcctct gcatctaatg cagcaacaga tcatggcagt 6060tgccggccta gattcaagta aagcccagaa gtactcaccg agtcactcga cccggtcgtg 6120ggccacccct ccagcaaccc ctccctaccg ggactggaca ccgtgctaca cccccctgat 6180ccaagtggag cagtcagagg ccctggacca ggtgaacggc agcctgccgt ccctgcaccg 6240cagctcctgg tacacagacg agcccgacat ctcctaccgg actttcacac cagccagcct 6300gactgtcccc agcagcttcc ggaacaaaaa cagcgacaag cagaggagtg cggacagctt 6360ggtggaggca gtcctgatat ccgaaggctt gggacgctat gcaagggacc caaaatttgt 6420gtcagcaaca aaacacgaaa tcgctgatgc ctgtgacctc accatcgacg agatggagag 6480tgcagccagc accctgctta atgggaacgt gcgtccccga gccaacgggg atgtgggccc 6540cctctcacac cggcaggact atgagctaca ggactttggt cctggctaca gcgacgaaga 6600gccagaccct gggagggatg aggaggacct ggcggatgaa atgatatgca tcaccacctt 6660gtagccccca gcgaggggca gactggctct ggcctcaggt ggggcgcagg agagccaggg 6720gaaaagtgcc tcatagttag gaaagtttag gcactagttg ggagtaatat tcaattaatt 6780agacttttgt ataagagatg tcatgcctca agaaagccat aaacctggta

ggaacaggtc 6840ccaagcggtt gagcctggca gagtaccatg cgctcggccc cagctgcagg aaacagcagg 6900ccccgccctc tcacagagga tgggtgagga ggccagacct gccctgcccc attgtccaga 6960tgggcactgc tgtggagtct gcttctccca tgtaccaggg caccaggccc acccaactga 7020aggcatggcg gcggggtgca ggggaaagtt aaaggtgatg acgatcatca cacctcgtgt 7080cgttacctca gccatcggtc tagcatatca gtcactgggc ccaacatatc catttttaaa 7140ccctttcccc caaatacact gcgtcctggt tcctgtttag ctgttctgaa ata 71935675DNAHomo sapiens 5tttttttttt tttttttttt tcttacaaag aaaaatttaa tattcgatga gaggttgaac 60caggcttaaa gcagacatac taggaaatgg tgcagcctgt aagaatgcca gtttgtaagt 120actgactttg gaaaagatca tcgcctctat cagacactta gggtcctggt ctggcaattt 180tggcctgatg tgatgccaca agacccaaca gagagagaca cagagtccag gataatgttg 240acagtggtgt agccctttag gagaaatggc gctccctgcg gctggtatta ggttaccatt 300ggcaccgaag gaaccaggag gataagaata tccataattt cagagctgcc ctggcacagt 360acctgccccg tcggaggctc tcactggcaa atgacagctc tgtgcaagga gcactcccaa 420gtataaaaat tattacacag ttttattctg aagaacattt tgcattttaa taaaaaagga 480tttatgtcag gaaagagtca tttacaaacc ttgaagtgtt tttgcctgga tcagagtaag 540aatgtcttaa gaagaggttt gtaaggtctt cataacaaag tggtgtttgt tatttacaaa 600aaaaaaaaaa aaaaaaatta acaggttgtc tgtatactat taaaaatttt ggaccaaaaa 660aaaaaaaaaa aaaaa 67561270DNAHomo sapiens 6cgaatgcagg cgacttgcga gctgggagcg atttaaaacg ctttggattc ccccggcctg 60ggtggggaga gcgagctggg tgccccctag attccccgcc cccgcacctc atgagccgac 120cctcggctcc atggagcccg gcaattatgc caccttggat ggagccaagg atatcgaagg 180cttgctggga gcgggagggg ggcggaatct ggtcgcccac tcccctctga ccagccaccc 240agcggcgcct acgctgatgc ctgctgtcaa ctatgccccc ttggatctgc caggctcggc 300ggagccgcca aagcaatgcc acccatgccc tggggtgccc caggggacgt ccccagctcc 360cgtgccttat ggttactttg gaggcgggta ctactcctgc cgagtgtccc ggagctcgct 420gaaaccctgt gcccaggcag ccaccctggc cgcgtacccc gcggagactc ccacggccgg 480ggaagagtac cccagtcgcc ccactgagtt tgccttctat ccgggatatc cgggaaccta 540ccacgctatg gccagttacc tggacgtgtc tgtggtgcag actctgggtg ctcctggaga 600accgcgacat gactccctgt tgcctgtgga cagttaccag tcttgggctc tcgctggtgg 660ctggaacagc cagatgtgtt gccagggaga acagaaccca ccaggtccct tttggaaggc 720agcatttgca gactccagcg ggcagcaccc tcctgacgcc tgcgcctttc gtcgcggccg 780caagaaacgc attccgtaca gcaaggggca gttgcgggag ctggagcggg agtatgcggc 840taacaagttc atcaccaagg acaagaggcg caagatctcg gcagccacca gcctctcgga 900gcgccagatt accatctggt ttcagaaccg ccgggtcaaa gagaagaagg ttctcgccaa 960ggtgaagaac agcgctaccc cttaagagat ctccttgcct gggtgggagg agcgaaagtg 1020ggggtgtcct ggggagacca gaaacctgcc aagcccaggc tggggccaag gactctgctg 1080agaggcccct agagacaaca cccttcccag gccactggct gctggactgt tcctcaggag 1140cggcctgggt acccagtatg tgcagggaga cggaacccca tgtgacaggc ccactccacc 1200agggttccca aagaacctgg cccagtcata atcattcatc ctcacagtgg caataatcac 1260gataaccagt 127071356DNAHomo sapiens 7ggattccccc ggcctgggtg gggagagcga gctgggtgcc ccctagattc cccgcccccg 60cacctcatga gccgaccctc ggctccatgg agcccggcaa ttatgccacc ttggatggag 120ccaaggatat cgaaggcttg ctgggagcgg gaggggggcg gaatctggtc gcccactccc 180ctctgaccag ccacccagcg gcgcctacgc tgatgcctgc tgtcaactat gcccccttgg 240atctgccagg ctcggcggag ccgccaaagc aatgccaccc atgccctggg gtgccccagg 300ggacgtcccc agctcccgtg ccttatggtt actttggagg cgggtactac tcctgccgag 360tgtcccggag ctcgctgaaa ccctgtgccc aggcagccac cctggccgcg taccccgcgg 420agactcccac ggccggggaa gagtacccca gccgccccac tgagtttgcc ttctatccgg 480gatatccggg aacctaccag cctatggcca gttacctgga cgtgtctgtg gtgcagactc 540tgggtgctcc tggagaaccg cgacatgact ccctgttgcc tgtggacagt taccagtctt 600gggctctcgc tggtggctgg aacagccaga tgtgttgcca gggagaacag aacccaccag 660gtcccttttg gaaggcagca tttgcagact ccagcgggca gcaccctcct gacgcctgcg 720cctttcgtcg cggccgcaag aaacgcattc cgtacagcaa ggggcagttg cgggagctgg 780agcgggagta tgcggctaac aagttcatca ccaaggacaa gaggcgcaag atctcggcag 840ccaccagcct ctcggagcgc cagattacca tctggtttca gaaccgccgg gtcaaagaga 900agaaggttct cgccaaggtg aagaacagcg ctacccctta agagatctcc ttgcctgggt 960gggaggagcg aaagtggggg tgtcctgggg agaccaggaa cctgccaagc ccaggctggg 1020gccaaggact ctgctgagag gcccctagag acaacaccct tcccaggcca ctggctgctg 1080gactgttcct caggagcggc ctgggtaccc agtatgtgca gggagacgga accccatgtg 1140acagcccact ccaccagggt tcccaaagaa cctggcccag tcataatcat tcatcctgac 1200agtggcaata atcacgataa ccagtactag ctgccatgat cgttagcctc atattttcta 1260tctagagctc tgtagagcac tttagaaacc gctttcatga attgagctaa ttatgaataa 1320atttggaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 1356860DNAHomo sapiens 8caattacagg gaaaaaacgt gtgatgatcc tgaagcttac tatgcagcct acaaacagcc 60960DNAHomo sapiens 9gctctcactg gcaaatgaca gctctgtgca aggagcactc ccaagtataa aaattattac 601060DNAHomo sapiens 10gatcgttagc ctcatatttt ctatctagag ctctgtagag cactttagaa accgctttca 601160DNAHomo sapiens 11tgcctaattt cactctcaga gtgaggcagg taactggggc tccactgggt cactctgaga 601260DNAHomo sapiens 12ttggaagcag agtccctcta aaggtaactc ttgtggtcac tcaatattgt attggcattt 601360DNAHomo sapiens 13acgttagact tttgctggca ttcaagtcat ggctagtctg tgtatttaat aaatgtgtgt 601460DNAHomo sapiens 14ctggtcagcc actctgactt ttctaccaca ttaaattctc cattacatct cactattggt 601560DNAHomo sapiens 15tacaacttct gaatgctgca cattcttcca aaatgatcct tagcacaatc tattgtatga 601660DNAHomo sapiens 16gggatggcct ttaggccaca gtagtgtctg tgttaagttc actaaatgtg tatttaatga 601760DNAHomo sapiens 17ctcaaagtgc taaagctatg gttgactgct ctggtgtttt tatattcatt cgtgctttag 601860DNAHomo sapiens 18ctatggggat ggtccactgt cactgtttct ctgctgttgc aaatacatgg ataacacatt 601960DNAHomo sapiens 19actggaaaag cagatggtct gactgtgcta tggcctcatc atcaagactt tcaatcctat 602060DNAHomo sapiens 20acgccaagct cttcagtgaa gacacgatgt tattaaaagc ctgttttagg gactgcaaaa 602160DNAHomo sapiens 21tttttgtaaa atctttaacc ttccctttgt tcttcatgta cacgctgaac tgcaattctt 602260DNAHomo sapiens 22aacctggggc atttagggca gaggacaaaa ggatgtcagc aattgcttgg gctgcttggc 602360DNAHomo sapiens 23ctggaacctc tggactcccc atgctctaac tcccacactc tgctatcaga aacttaaact 602460DNAHomo sapiens 24aaccccagaa ccatctaaga catgggattc agtgatcatg tggttctcct tttaacttac 602560DNAHomo sapiens 25ggccatgtgc catggtattt gggtcctggg agggtgggtg aaataaaggc atactgtctt 602660DNAHomo sapiens 26gtgtaggcag tcatggcacc aaagccacca gactgacaaa tgtgtatcag atgcttttgt 602760DNAHomo sapiens 27gaaaacctct tcaaaagaca aaaagctggc actgcattct ctctctgtag caggacagaa 602860DNAHomo sapiens 28cacatcttta gggtcagtga acaatggggc acatttggca ctagcttgag cccaactctg 602960DNAHomo sapiens 29gccttaattt cctcatctga aaactggaag gcctgacttg acttgttgag cttaagatcc 603060DNAHomo sapiens 30cttcagggga ggatcaagct ttgaaccaaa gccaatcact ggcttgattt gtgtttttta 603160DNAHomo sapiens 31acaagttttc actgaatgag catggcagtg ccactcaaga aaatgaatct ccaaagtatc 603225DNAHomo sapiens 32ctgaagctta ctatgcagcc tacaa 253329DNAHomo sapiens 33tccaatcgtt agttaatgct acattagtt 293419DNAHomo sapiens 34cagccttagt aattaaaac 193523DNAHomo sapiens 35gccatgatcg ttagcctcat att 233625DNAHomo sapiens 36caattcatga aagcggtttc taaag 253722DNAHomo sapiens 37tctatctaga gctctgtaga gc 223824DNAArtificial SequencePCR primer 38ttcatcctga cagtggcaat aatc 243930DNAArtificial SequencePCR primer 39ctagatagaa aatatgaggc taacgatcat 304019DNAArtificial SequenceFlurogenic MGB probe 40cgataaccag tactagctg 194128DNAArtificial SequencePCR primer 41gcattaacta acgattggaa actacatt 284226DNAArtificial SequencePCR Primer 42ggaagatgct ttattgttgc attatc 264320DNAArtificial SequenceFlurogenic MGB probe 43acaacttcaa agctgtttta 2044561DNAHomo sapiensmisc_feature(475)..(475)a or g or c or t/u 44ccggcgatgt cgctcgtgct gctaagcctg gccgcgctgt gcaggagcgc cgtaccccga 60gagccgaccg ttcaatgtgg ctctgaaact gggccatctc cagagtggat gctacaacat 120gatctaatcc cgggagactt gagggacctc cgagtagaac ctgttacaac tagtgttgca 180acaggggact attcaatttt gatgaatgta agctgggtac tccgggcaga tgccagcatc 240cgcttgttga aggccaccaa gatttgtgtg acgggcaaaa gcaacttcca gtcctacagc 300tgtgtgaggt gcaattacac agaggccttc cagactcaga ccagaccctc tggtggtaaa 360tggacatttt cctacatcgg cttccctgta gagctgaaca cagtctattt cattggggcc 420cataatattc ctaatgcaaa tatgaatgaa gatggccctt ccatgtctgt gaatntcacc 480tcaccaggct gcctagacca cataatgaaa tataaaaaaa agtgtgtcaa ggccggaagc 540ctgtgggatc cgaacatcac t 56145467DNAHomo sapiens 45tttttttttt ttttttttta aaagtgggtt cagcttgttt attccctact tttgttatct 60taaaaacaat gattttttgc atgtaataga aggtttttca cttaagatgc tattgagtga 120atcagtgagg ggttcttaga gttagtattc attaattaaa catagaatat tagctaaaca 180gttctgggta cactgcaatg catggtctat ggaagactag atgtttggct gaagatgctt 240tattgttgca ttatcaaaat ggttatagtt ttcaattaaa actgtaattg atttctatgt 300ataaaacagc tttgaagttg taaatgtagt ttccaatcgt tagttaatgc tacattagtt 360agcaatattt gaaaatttta ttggtataaa atgttttaat tactaaggct gtttgtaggc 420tgcatagtaa gcttcaggat catcacacgt tttttccctg taattgg 467462042DNAHomo sapiens 46ggcccggcga tgtcgctcgt gctgctaagc ctggccgcgc tgtgcaggag cgccgtaccc 60cgagagccga ccgttcaatg tggctctgaa actgggccat ctccagagtg gatgctacaa 120catgatctaa tcccgggaga cttgagggac ctccgagtag aacctgttac aactagtgtt 180gcaacagggg actattcaat tttgatgaat gtaagctggg tactccgggc agatgccagc 240atccgcttgt tgaaggccac caagatttgt gtgacgggca aaagcaactt ccagtcctac 300agctgtgtga ggtgcaatta cacagaggcc ttccagactc agaccagacc ctctggtggt 360aaatggacat tttcctacat cggcttccct gtagagctga acacagtcta tttcattggg 420gcccataata ttcctaatgc aaatatgaat gaagatggcc cttccatgtc tgtgaatttc 480acctcaccag gctgcctaga ccacataatg aaatataaaa aaaagtgtgt caaggccgga 540agcctgtggg atccgaacat cactgcttgt aagaagaatg aggagacagt agaagtgaac 600ttcacaacca ctcccctggg aaacagatac atggctctta tccaacacag cactatcatc 660gggttttctc aggtgtttga gccacaccag aagaaacaaa cgcgagcttc agtggtgatt 720ccagtgactg gggatagtga aggtgctacg gtgcagctga ctccatattt tcctacttgt 780ggcagcgact gcatccgaca taaaggaaca gttgtgctct gcccacaaac aggcgtccct 840ttccctctgg ataacaacaa aagcaagccg ggaggctggc tgcctctcct cctgctgtct 900ctgctggtgg ccacatgggt gctggtggca gggatctatc taatgtggag gcacgaaagg 960atcaagaaga cttccttttc taccaccaca ctactgcccc ccattaaggt tcttgtggtt 1020tacccatctg aaatatgttt ccatcacaca atttgttact tcactgaatt tcttcaaaac 1080cattgcagaa gtgaggtcat ccttgaaaag tggcagaaaa agaaaatagc agagatgggt 1140ccagtgcagt ggcttgccac tcaaaagaag gcagcagaca aagtcgtctt ccttctttcc 1200aatgacgtca acagtgtgtg cgatggtacc tgtggcaaga gcgagggcag tcccagtgag 1260aactctcaag acctcttccc ccttgccttt aaccttttct gcagtgatct aagaagccag 1320attcatctgc acaaatacgt ggtggtctac tttagagaga ttgatacaaa agacgattac 1380aatgctctca gtgtctgccc caagtaccac ctcatgaagg atgccactgc tttctgtgca 1440gaacttctcc atgtcaagca gcaggtgtca gcaggaaaaa gatcacaagc ctgccacgat 1500ggctgctgct ccttgtagcc cacccatgag aagcaagaga ccttaaaggc ttcctatccc 1560accaattaca gggaaaaaac gtgtgatgat cctgaagctt actatgcagc ctacaaacag 1620ccttagtaat taaaacattt tataccaata aaattttcaa atattgctaa ctaatgtagc 1680attaactaac gattggaaac tacatttaca acttcaaagc tgttttatac atagaaatca 1740attacagttt taattgaaaa ctataaccat tttgataatg caacaataaa gcatcttcag 1800ccaaacatct agtcttccat agaccatgca ttgcagtgta cccagaactg tttagctaat 1860attctatgtt taattaatga atactaactc taagaacccc tcactgattc actcaatagc 1920atcttaagtg aaaaaccttc tattacatgc aaaaaatcat tgtttttaag ataacaaaag 1980tagggaataa acaagctgaa cccactttta aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2040aa 204247842DNAHomo sapiens 47agcggagctg cgggtggcct ggatcccgcg cagtggcccg gcgatgtcgc tcgtgctgct 60aagcctggcc acgctgtgca ggagcgccgt accccgagag ccgaccgttc aatgtggctc 120tgaaactgtg gacattttcc tatatcggct tccctgtaga gctgaaaaca gtctatttca 180ttggggccca taatattcct aatgcaaata tgaatgaaga tggcccttcc atgtctgtga 240atttcacctc accaggctgc ctagaccaca taatgaaata taaaaaaagt gtgtcaaggc 300cggaagcctg tgggatccga acatcactgc ttgtaagaag aatgaggaga cagtagaagt 360gaacttcaca accactcccc tgggaaacag atacatggct catccaacac agcactatca 420tcgggttttc tcaggtgttt gagccacacc agaagaaaca aacgcgagct tcagtggtga 480ttccagtgac tggggatagt gaaggtgcta cggtgcagct gactccatat tttcctactt 540gtggcagcga ctgcatccga cataaaggaa cagttgtgct ctgcccacaa acaggcgtcc 600ctttcccctc tggataacaa caaaagcaag ccgggaggct ggctgcctct cctcctgctg 660tctctgctgg ttggccacat tgggtgctgg tggcagggat ctatctaatg tggaggcacg 720aaaggatcca gaagacttcc ttttctacca caaactactg cccccattaa ggtcctgtgg 780ttacccatct tgaaatatgt tcctcacaca atttgttact tcactgaatt cttcaaaacc 840tg 84248788DNAHomo sapiensmisc_feature(675)..(675)a or g or c or t/u 48agcggagcgt gcgggtggcc tggatcccgc gcagtggccc ggcgatgtcg ctcgtgctgc 60taagcctggc cacgctgtgc aggagcgccg taccccgaga gccgaccgtt caatgtggct 120ctgaaactgt ggacattttc ctatatcggc ttccctgtag agctgaaaac agtctatttc 180attggggccc ataatattcc taatgcaaat atgaatgaag atggcccttc catgtctgtg 240aatttcacct caccaggctg cctagaccac ataatgaaat ataaaaaaaa gtgtgtcaag 300gccggaagcc tgtgggatcc gaacatcact gcttgtaaga agaatgagga gacagtagaa 360gtgaacttca caaccactcc cctgggaaac agatacatgg ctcatccaac acagcactat 420catcgggttt tctcaggtgt ttgagccaca ccagaagaaa caaacgcgag cttcagtggt 480gattccagtg actggggata gtgaaggtgc tacggtgcag ctgactccat attttcctac 540ttgtggcagc gactgcatcc gacataaagg aacagttgtg ctctgcccac aaacaggcgt 600ccctttccct ctggataaca acaaaagcaa gccgggaggc tggctgcctc tcctcctgct 660gtctctgctg gtggncacat tgggtgctgg tggcagggat ctatctaatg tggaggcacg 720aaaggatcag aagacttcct tttctaccac cacatactgc cccccattaa ggttcttgtg 780gtttaccc 78849946DNAHomo sapiens 49ggcgatgtcg ctcgtgctgc taagcctggc cgcgctgtgc aggagcgccg taccccgaga 60gccgaccgtt caatgtggct ctgaaactgg gccatctcca gagtggatgc tacaacatga 120tctaatcccg ggagacttga gggacctccg agtagaacct gttacaacta gtgttgcaac 180aggggactat tcaattttga tgaatgtaag ctgggtactc cgggcagatg ccagcatccg 240cttgttgaag gccaccaaga tttgtgtgac gggcaaaagc aacttccagt cctacagctg 300tgtgaggtgc aattacacag aggccttcca gactcagacc agaccctctg gtggtaaatg 360gacattttcc tatatcggct tccctgtaga gctgaacaca gtctatttca ttggggccca 420taatattcct aatgcaaata tgaatgaaga tggcccttcc atgtctgtga atttcacctc 480accaggaagc ctgtgggatc cgaacatcac tgcttgtaag aaagaatgag gagacagtag 540aagtgaactt cacaaccact cccctgggaa acagatacat ggctcttatc caacacagca 600ctatcatcgg gtttctcagg tgtttgagcc acaccagaag aaacaaacgc gagcttcagt 660ggtgattcca gtgactgggg atagtgaagg tgctacggtg cagctgactc catattttcc 720tacttgtggc agcgactgca atccgacata aaggaacagt tgtgctctgc ccacaaacag 780gcgtcccttt ccctcttgga tagcaacaga agcaagccgg gaggctggtg cctcttcttc 840tggtgtctct gctggtggca cattgagtgc tggtggcagg atccatctaa tgtggaggcc 900ccaaaggacc aggaaagact tcctttatta gcaccaagta ttgccc 94650488DNAHomo sapiens 50tggctgaaga tgctttattg ttgcattatc aaaatggtta tagttttcaa ttaaaactgt 60aattgatttc tatgtataaa acagctttga agttgtaaat gtagtttcca atcgttagtt 120aatgctacat tagttagcaa tatttgaaaa ttttattggt ataaaatgtt ttaattacta 180aggctgtttg taggctgcat agtaagcttc aggatcatca cacgtttttt ccctgtaatt 240ggtgggatag gaagccttta aggtctcttg cttctcatgg gtgggctaca aggagcagca 300gccatcgtgg caggcttgtg atctttttcc tgctgacacc tgctacttga catggagaag 360ttctgcacag aaagcagtgg catccttcat gaggtggtac ttggggcaga cactgagagc 420attgtaatcg tcttttgtat caatctctct aaagtagacc accacgtatt tgtgcagatg 480aatctggc 48851509DNAHomo sapiens 51tttgtttggc tgaagatgct ttattgttgc attatcaaaa tggttatagt tttcaattaa 60aactgtaatt gatttctatg tataaaacag ctttgaagtt gtaaatgtag tttccaatcg 120ttagttaatg ctacattagt tagcaatatt tgaaaatttt attggtataa aatgttttaa 180ttactaaggc tgtttgtagg ctgcatagta agcttcagga tcatcacacg ttttttccct 240gtaattggtg ggataggaag cctttaaggt ctcttgcttt tcatgggtgg gctacaagga 300gcagcagcca tcgtggcagg cttgtgatct ttttcctgct gacacctgct gcttgacatg 360gagaagttct gcacagaaag cagtggcatc cttcatgagg tggtacttgg ggcagacact 420gagagcattg taatcgtctt ttgtatcaat ctctctaaag tagaccacca cgtatttgtg 480cagatgaatc tggcttctta gatcactgc 50952502DNAHomo sapiens 52tggcatgaga tgctatattg ttgcattatc aaaatgggtt tagtcttcaa ttaacactgt 60aattgatttc tatgtataaa acagctttga agttgtaaat gtggtttcca atcgtcagtt 120aatgctacat tagttagcaa tatttgaaaa ttttattggt ataaaatgtt ttaattacta 180aggctgtttg taggctgcat agtaagcttc aggatcatca cacgtttttt ccctgtaatt 240ggtgggatag gaagccttta aggtctcttg cttctcatgg gtgggctaca aggagcagca 300gccatcgtgg caggcttgtg atctttttcc tgctgacacc tgctgcttga catggagaag 360ttctgcacag aaagcagtgg catccttcat gaggtggtac ttggggcaga cactgagagc 420attgtaatcg tcttttgtat caatctctct aaagtagacc accacgtatt tgtgcagatg 480aatctggctt cttagatcac tg 50253460DNAHomo sapiens 53gtttggctga agatgcttta ttgttgcatt atcaaaatgg ttatagtttt caattaaaac 60tgtaattgat ttctatgtat aaaacacgct ttgaagttgt aaatgtagtt tccaatcgtt 120agttaatgct acattagtta gcaatatttg aaaattttat tggtataaaa tgttttaatt 180actaaggctg tttgtaggct gcatagtaag cttcaggatc atcacacgtt ttttccctgt 240aattggtggg ataggaagcc tttaaggtct cttgcttctc atgggtgggc tacaaggagc 300agcagccatc gtggcaggct tgtgatcttt ttcctgctga cacctgctgc ttgacatgga 360gaagttctgc acagaaagca gtggcatcct tcatgaggtg gtacttgggg cagacactga 420gagcattgta atcgtctttt gtatcaatct ctctaaagta

46054510DNAHomo sapiens 54tggctgaaga tgctttattg ttgcattatc aaaatggtta tagttttcaa ttaaaactgt 60aattgatttc tatgtataaa acagcgttga agttgtaaat gtagtttcca atcgttagtt 120aatgctacat tagttagcaa tatttgaaaa ttttattggt ataaaatgtt ttaattacta 180aggctgtttg taggctgcat agtaagcttc aggatcatca cacgtttttt ccctgtaatt 240ggtgggatag gaagccttta aggtctcttg cttctcatgg gtgggctaca aggagcagca 300gccatcgtgg caggcttgtg atctttttcc tgctgacacc tgctgcttga catggagaag 360ttctgcacag aaagcagtgg catccttcat gaggtggtac ttggggcaga cactgagagc 420attgtaatcg tcttttgtat caatctctct aaagtagacc accacgtatt tgtgcagatg 480aatctggctt cttagatcac tgcagaaaag 51055465DNAHomo sapiens 55tttttttttt acaacttcaa agctgtttta tacatagaaa tcaattacag ttttaattga 60aaactataac cattttgata atgcaacaat aaagcatctt cagccaaaca tctagtcttc 120catagaccat gcattgcagt gtacccagaa ctgtttagct aatattctat gtttaattaa 180tgaatactaa ctctaagaac ccctcactga ttcactcaat agcatcttaa gtgaaaaacc 240ttctattaca tgcaaaaaat cattgttttt aagataacaa aagtagggaa taaacaagct 300gaacccactt ttactggacc aaatgatcta ttatatgtgt accacttgta tgatttggta 360tttgcataag accttccctc tacaaactag attcatatct tgattcttgt acaggtgcct 420tttaacatga acaacaaaat acccacaaac ttgtctactt ttgcc 46556506DNAHomo sapiens 56tagtaattaa aacattttat accaataaaa ttttcaaata ttgctaacta atgtagcatt 60aactaacgat tggaaactac atttacaact tcaaagctgt tttatacata gaaatcaatt 120acagttttaa ttgaaaacta taaccatttt gataatgcaa caataaagca tcttcagcca 180aacatctagt cttccataga ccatgcattg cagtgtaccc agaactgttt agctaatatt 240ctatgtttaa ttaatgaata ctaactctaa gaacccctca ctgattcact caatagcatc 300ttaagtgaaa aaccttctat tacatgcaaa aaatcattgt ttttaagata acaaaagtag 360ggaataaaca agctgaaccc acttttactg gaccaaatga tctattatat gtgtaaccac 420ttgtatgatt tggtatttgc ataagacctt ccctctacaa actagattca tatcttgatt 480cttgtacagg tgccttttaa catgaa 50657427DNAHomo sapiens 57tttttttttt ttttttagca atatttgaaa attttattgg tataaaatgt tttaattact 60aaggctgttt gtaggctgca tagtaagctt caggatcatc acacgttttt tccctgtaat 120tggtgggata ggaagccttt aaggtctctt gcttctcatg ggtgggctac aaggagcagc 180agccatcgtg gcaggcttgt gatctttttc ctgctgacac ctgctacttg acatggagaa 240gttctgcaca gaaagcagtg gcatccttca tgaggtggta cttggggcag acactgagag 300cattgtaatc gtcttttgta tcaatctctc taaagtagac caccacgtat ttgtgcagat 360gaatctggct tcttagatca ctgcagaaaa ggttaaaggc aagggggaag aggtcttgag 420agttctc 42758467DNAHomo sapiensmisc_feature(434)..(434)a or g or c or t/u 58ttaaagtggg ttcagcttgt ttattcccta cttttgttat cttaaaaaca atgatttttt 60gcatgtaata gaaggttttt cacttaagat gctattgagt gaatcagtga ggggttctta 120gagttagtat tcattaatta aacatagaat attagctaaa cagttctggg tacactgcaa 180tgcatggtct atggaagact agatgtttgg ctgaagatgc tttattgttg cattatcaaa 240atggttacag ttttcaatta aagctgtaat tgatttctat gtataaaaca gctttgaagt 300tgtaaatgta gtttccaatc gttagttaat gctacattag ttagcaatat ttgaaaattt 360tattggtata aaatgtttta attactaagg ctgtttgtag gctgcatagt aagcttcagg 420atcatcacac gttntttccc tgtaattggt gggataggaa gccttta 46759420DNAHomo sapiens 59agttagcaat atttgaaaat tttattggta taaaatgttt taattactaa ggctgtttgt 60aggctgcata gtaagcttca ggatcatcac acgttttttc cctgtaattg gtgggatagg 120aagcctttaa ggtctcttgc ttctcatggg tgggctacaa ggagcagcag ccatcgtggc 180aggcttgtga tctttttcct gctgacacct gctacttgac atggagaagt tctgcacaga 240aagcagtggc atccttcatg aggtggtact tggggcagac actgagagca ttgtaatcgt 300cttttgtatc aatctctcta aagtagacca ccacgtattt gtgcagatga atctggcttc 360ttagatcact gcagaaaagg ttaaaggcaa gggggaagag gtcttgagag ttctcactgg 42060434DNAHomo sapiens 60ttggctgaag atgctttatt gttgcattat caaaatggtt atagttttca attaaaactg 60taattgattt ctatgtataa aacagctttg aagttgtaaa tgtagtttcc aatcgttagt 120taatgctaca ttagttagca atatttgaaa attttattgg tataaaatgt tttaattact 180aaggctgttt gtaggcttgc atagaagctt caggatcatc acacgttttt tccctgtaat 240tggtgggata ggaagccttt aaggtctctt gcttctcatg ggtgggctac aaggagcagc 300agccatcgtg gcaggcttgt gatctttttc ctgctgacac ctgctgcttg acatggagaa 360gttctgcaca gaaagcagtg gcatccttca tgaggtggta cttggggcag acactgagag 420cattgtaatc gtct 43461416DNAHomo sapiens 61tttttttttt agcaatattt gaaaatttta ttggtataaa atgttttaat tactaaggct 60gtttgtaggc tgcatagtaa gcttcaggat catcacacgt tttttccctg taattggtgg 120gataggaagc ctttaaggtc tcttgcttct catgggtggg ctacaaggag cagcagccat 180cgtggcaggc ttgtgatctt tttcctgctg acacctgcta cttgacatgg agaagttctg 240cacagaaagc agtggcatcc ttcatgaggt ggtacttggg gcagacactg agagcattgt 300aatcgtcttt tgtatcaatc tctctaaagt agaccaccac gtatttgtgc agatgaatct 360ggcttcttag atcactgcag aaaaggttaa aggcaagggg gaagaggtct tgagag 41662414DNAHomo sapiens 62tttggctgaa gatgctttat tgttgcatta tcaaaatggt tacagttttc aattaaagct 60gtaattgatt tctatgtata aaacagcttt gaagttgtaa atgtagtttc caatcgttag 120ttaatgctac attagttagc aatatttgaa aattttattg gtataaaatg ttttaattac 180taaggctgtt tgtaggctgc atagtaagct tcaggatcat cacacgtttt ttccctgtaa 240ttggtgggat aggaagcctt taaggtctct tgcttctcat gggtgggcta caaggagcag 300cagccatcgt ggcaggcttg tgatcttttt cctgctgaca cctgctgctt gacatggaga 360agttctgcac agaaagcagt ggcatccttc atgaggtggt acttggggca gaca 41463409DNAHomo sapiens 63ttctctggct gaagatgctt tattgttgca ttatcaaaat ggttacagtt ttcaattaaa 60gctgtaattg atttctatgt ataaaacagc tttgaagttg taaatgtagt ttccaatcgt 120tagttaatgc tacattagtt agcaatattt gaaaatttta ttggtataaa atgttttaat 180tactaaggct gtttgtaggc tgcatagtaa gcttcaggat catcacacgt tttttccctg 240taattggtgg gataggaagc ctttaaggtc tcttgcttct catgggtggg ctacaaggag 300cagcagccat cgtggcaggc ttgtgatctt tttcctgctg acacctgctg cttgacatgg 360agaagttctg cacagaaagc agtggcatcc ttcatgaggt ggtacttgg 40964414DNAHomo sapiens 64tttttttttt ttttttacaa ccttgaaagc tgttttatac atagaaatca attacagttt 60taattgaaaa ctataaccat tttgataatg caacaataaa gcatcttcag ccaaacatct 120agtcttccat agaccatgca ttgcagtgta cccagaactg tttagctaat attctatgtt 180taattaatga atactaactc taagaacccc tcactgattc actcaatagc atcttaagtg 240aaaaaccttc tattacatgc aaaaaatcat tgtttttaag ataacaaaag tagggaataa 300acaagctgaa cccactttta ctggaccaaa tgatctatta tatgtgtaac cacttgtatg 360atttggattt gcataagacc ttccctctac aaactagatt catatcttga ttct 41465414DNAHomo sapiens 65tttttttttt ttttttacaa ctgcaaagct gttttataca tagaaatcaa ttacagtttt 60aattgaaaac tataaccatt ttgataatgc aacaataaag catcttcagc caaacatcta 120gtcttccata gaccatgcat tgcagtgtac ccagaactgt ttagctaata ttctatgttt 180aattaatgaa tactaactct aagaacccct cactgattca ctcaatagca tcttaagtga 240aaaaccttct attacatgca aaaaatcatt gtttttaaga taacaaaagt agggaataaa 300caagctgaac ccacttttac tggaccaaat gatctattat atgtgtaacc acttgtatga 360tttggtattt gcataagacc ttccctctac aaactagatt catatcttga ttct 41466484DNAHomo sapiens 66ttttttagtt agcaatattt gaaaatttta ttggtataaa atgttttaat tactaaggct 60gtttgtaggc tgcatagtaa gcttcaggat catcacacgt tttttccctg taattggtgg 120gataggaagc ctttaaggtc tcttgcttct catgggtggg ctacaaggag cagcagccat 180cgtggcaggc ttgtgatctt tttcctgctg acacctgcta cttgacatgg agaagttctg 240cacagaaagc agtggcatcc ttcatgaggt ggtacttggg gcagacactg agagcattgt 300aatcgtcttt tgtatcaatc tctctaaagt agaccaccac gtatttgtgc agatgaatct 360ggcttcttag atcactgcag aaaaggttaa aggcaagggg gaagaggtct tgagagttct 420cactgggact gccctcgctc ttgccacagg taccatcgca cacactgttg acgtcattgg 480aaag 48467398DNAHomo sapiens 67ggctgaagat gctttattgt tgcattatca aaatggttat agttttcaat taaaactgta 60attgatttct atgtataaaa cagctttgaa gttgtaaatg tagtttccaa tcgttagtta 120atgctacatt agttagcaat atttgaaaat tttattggta taaaatgttt taattactaa 180ggctgtttgt aggctgcata gtaagcttca ggatcatcac acgttttttc cctgtaattg 240gtgggatagg aagcctttaa ggtctcttgc ttctcatggg tgggctacaa ggagcagcag 300ccatcgtggc aggcttgtga tctttttcct gctgacacct gctgcttgac atggagaagt 360tctgcacaga aagcagtggc atccttcatg aggtggta 39868401DNAHomo sapiens 68ttggctgaag atgctttatt gttgcattat caaaatggtt acagttttca attaaagctg 60taattgattt ctatgtataa aacagctttg aagttgtaaa tgtagtttcc aatcgttagt 120taatgctaca ttagttagca atatttgaaa attttattgg tataaaatgt tttaattact 180aaggctgttt gtaggctgca tagtaagctt caggatcatc acacgttttt tccctgtaat 240tggtgggata ggaagccttt aaggtctctt gcttctcatg ggtgggctac aaggagcagc 300agccatcgtg gcaggcttgt gatctttttc ctgctgacac ctgctgcttg acatggagaa 360gttctgcaca gaaagcagtg gcatccttca tgaggtggta c 40169392DNAHomo sapiensmisc_feature(228)..(228)a or g or c or t/u 69ttggctgaag atgctttatt gttgcattat caaaatggtt atagttttca attaaaactg 60taattgattt ctatgtataa aacagctttg aagttgtaaa tgtagtttcc aatcgttagt 120taatgctaca ttagttagca atatttgaaa attttattgg tataaaatgt tttaattact 180aaggctgttt gtaggctgca tagtaagctt caggatcatc acacgttntt tccctgtaat 240tggtgggata ggaagccttt aaggtctctt gcttctcatg ggtgggctac aaggagcagc 300agccatcgtg gcaggcttgt gatctttttc ctgctgacac ctgctgcttg acatggagaa 360gttctgcaca gaaagcagtg gcatccttca tg 39270386DNAHomo sapiens 70gtttggctga agatgcttta ttgttgcatt atcaaaatgg ttatagtttt caattaaaac 60tgtaattgat ttctatgtat aaaacagctt tgaagttgta aatgtagttt ccaatcgtta 120gttaatgcta cattagttag caatatttga aaattttatt ggtataaaat gttttaatta 180ctaaggctgt ttgtaggctg catagtaagc ttcaggatca tcacacgttt tttccctgta 240attggtggga taggaagcct ttaaggtctc ttgcttctca tgggtgggct acaaggagca 300gcagccatcg tggcagcttg gtgatctttt tcctgctgac acctgctgct tgacatgaag 360aagttctgca cagaaagcag tggcat 38671386DNAHomo sapiens 71gtttggctga agatgcttta ttgttgcatt atcaaaatgg ttatagtttt caattaaaac 60tgtaattgat ttctatgtat aaaacagctt tgaagttgta aatgtagttt ccaatcgtta 120gttaatgcta cattagttag caatatttga aaattttatt ggtataaaat gttttaatta 180ctaaggctgt ttgtaggctg catagtaagc ttcaggatca tcacacgttt tttccctgta 240attggtggga taggaagcct ttaaggtctc ttgcttctca tgggtgggct acaaggagca 300gcagccatcg tggcaggctt ggatcttttt cctgctgaca cctgctgctt gacattggaa 360agttctgcac agaaagcagt ggcatc 38672386DNAHomo sapiens 72ttttggctga tgatgcttta ttgttgcatt atcaaaatgg ttacagtttt caattaaagc 60tgtaattgat ttctatgtat aaaacagctt tgaagttgta aatgtagttt ccaatcgtta 120gttaatgcta cattagttag caatatttga aaattttatt ggtataaaat gttttaatta 180ctaaggctgt ttgtaggctg catagtaagc ttcaggatca tcacacgttt tttccctgta 240attggtggga taggaagcct ttaaggtctc ttgcttctca tgggtgggct acaaggagca 300gcagccatcg tggcaggctt gtgatctttt tcctgctgac acctgctgct tgacatggag 360aagttctgca cagaaagcag tggcat 38673373DNAHomo sapiens 73ggctgaagat gctttattgt tgcattatca aaatggttac agttttcaat taaagctgta 60attgatttct atgtataaaa cagctttgaa gttgtaaatg tagtttccaa tcgttagtta 120atgctacatt agttagcaat atttgaaaat tttattggta taaaatgttt taattactaa 180ggctgtttgt aggctgcata gtaagcttca ggatcatcac acgttttttc cctgtaattg 240gtgggatagg aagcctttaa ggtctcttgc ttctcatggg tgggctacaa ggagcagcag 300ccatcgtggc aggcttgtga tctttttcct gctgacacct gctgcttgac atggagaagt 360tctgcacaga aag 37374403DNAHomo sapiens 74gattggctgt tttatgcttt attgttgcat tatcaaaatg gttatagttt tcaattaaaa 60ctgtaattga tttctatgta taaaacagct ttgaagttgt aaatgtagtt tccaatcgtt 120agttaatgct acattagtta gcaatatttg aaaattttat tggtataaaa tgttttaatt 180actaaggctg tttgtaggct gcatagtaag cttcaggatc atcacacgtt ttttccctgt 240tattggtggg ataggaagcc tttaaggtct cttgcttctc atgggtgggc tacaaggagc 300agcagccatc gtggcaggct tgtgatcttt ttcctgctga cacctgctgc ttgacatgga 360gaagttctgc acaaaaagca gtggcatcct tcatgaggtg gta 40375457DNAHomo sapiens 75gcaatatttt aaaattttat tggtataaaa tgttttaatt actaaggctg tttgtaggct 60gcatagtaag cttcaggatc atcacacgtt ttttccctgt aattggtggc ataggaagcc 120tttaaggtct cttgcttctc atggtgtggg ctacaaggag cagcagccat cgtggcaggc 180ttgtgatctt tttcctgctg acacctgctg cttgacatgg agaagttctg cacagaaagc 240agtggcatcc ttcatgaggt ggtacttggg gcagacactg agagcattgt aatcgtcttt 300tgtatcaatc tctctaaagt agaccaccac gtatttgtgc agatgaatct ggcttcttag 360atcactgcag aaaaggttaa aggcaagggg gaagaggtct tgagagttct cactgggact 420gccctcgctc ttgccacagg taccatcgca cacactg 45776365DNAHomo sapiens 76tttttttttt acaacttcaa agctgtttta tacatagaaa tcaattacag ttttaattga 60aaactataac cattttgata atgcaacaat aaagcatctt cagccaaaca tctagtcttc 120catagaccat gcattgcagt gtacccagaa ctgtttagct aatattctat gtttaattaa 180tgaatactaa ctctaagaac ccctcactga ttcactcaat agcatcttaa gtgaaaaacc 240ttctattaca tgcaaaaaat cattgttttt aagataacaa aagtagggaa taaacaagct 300gaacccactt ttactggacc aaatgatcta ttatatgtgt aaccacttgt atgatttggt 360atttg 36577356DNAHomo sapiensmisc_feature(277)..(277)a or g or c or t/umisc_feature(322)..(322)a or g or c or t/u 77gtttcgctga agatgcttta ttgttgcatt atcaaaatgg ttatagtttt caattaaaac 60tgtaattgat ttctatgtat aaaacagctt tgaagttgta aatgtagttt ccaatcgtta 120gttaatgcta cattagttag caatatttga aaattttatt ggtataaaat gttttaatta 180ctaaggctgt ttgtaggctg catagtaagc ttaaggccca tcacacgttt tttccctgta 240attggtggga taggaagcct ttaaggtctc ttgcttntca tgggtgggct acaaggagca 300gcagccatcg tggcaggctt gngatctttt tcctgctggc ccctgctgct tgacat 35678413DNAHomo sapiensmisc_feature(1)..(1)a or g or c or t/umisc_feature(264)..(264)a or g or c or t/u 78naaagcactg gctgaaggaa gccaagagga tcactgctgc tccttttttc tagaggaaat 60gtttgtctac gtggtaagat atgacctagc ccttttaggt aagcgaactg gtatgttagt 120aacgtgtaca aagtttaggt tcagaccccg ggagtcttgg gcacgtgggt ctcgggtcac 180tggttttgac tttagggctt tgttacagat gtgtgaccaa ggggaaaatg tgcatgacaa 240cactagaggt atgggcgaca cganaacgaa cgggaagttt tggctgaagt aggagtcttg 300gtgagatttt gctctgatgc atggtgtgaa ctttctgagc ctcttgtttt tcctcaagct 360gactccatat tttcctactt gtggcagcga ctgcatccga cataaaggaa cag 41379394DNAHomo sapiens 79tagcaatatt tgaaaatttt attggtataa aatgttttaa ttactaaggc tgtttgtagg 60ctgcatagta agcttcagga tcatcacacg ttttttccct gtaattggtg ggataggaag 120cctttaaggt ctcttgcttc tcatgggtgg gctacaagga gcagcagcca tcgtggcagg 180cttgtgatct ttttcctgct gacacctgct gcttgacatg gagaagttct gcacagaaag 240cagtggcatc cttcatgagg tggtacttgg ggcagacact gagagcattg taatcgtctt 300ttgtatcaat ctctctaaag tagaccacca cgtatttgtg cagatgaatc tggcttctta 360gatcactgca gaaaaggtta aaggcaaggg ggga 39480437DNAHomo sapiens 80agcaatattt gaaaatttta ttggtataaa atgttttaat tactaaggct gtttgtaggc 60tgcatagtaa gcttcaggat catcacacgt tttttccctg taattggtgg cataggaagc 120ctttaaggtc tcttgcttct catgggtggg ctacaaggag cagcagccat cgtggcaggc 180ttgtgatctt tttcctgctg acacctgctg cttgacatgg agaagttctg cacagaaagc 240agtggcatcc ttcatgaggt ggtacttggg gcagacactg agagcattgt aatcgtcttt 300tgtatcaatc tctctaaagt agaccaccac gtatttgtgc agatgaatct ggcttcttag 360atcactgcag aaaaggttaa aggcaagggg gaagaggtct tgagagttct cactgggact 420gccctcgctc ttgccac 43781321DNAHomo sapiens 81tttttttttt tagcaatatt tgaaaatttt attggtataa aatgttttaa ttactaaggc 60tgtttgtagg ctgcatagta agcttcagga tcatcacacg ttttttccct gtaattggtg 120ggataggaag cctttaaggt ctcttgcttc tcatgggtgg gctacaagga gcagcagcca 180tcgtggcagg cttgtgatct ttttcctgct gacacctgct gcttgacatg gagaagttct 240gcacaaaaag cagtggcatc cttcatgagg tggtacttgg ggcagacact gagagcattg 300taatcgtctt ttgtatcaat c 32182321DNAHomo sapiens 82tttttttttt tagcaatatt tgaaaatttt attggtataa aatgttttaa ttactaaggc 60tgtttgtagg ctgcatagta agcttcagga tcatcacacg ttttttccct gtaattggtg 120ggataggaag cctttaaggt ctcttgcttc tcatgggtgg gctacaagga gcagcagcca 180tcgtggcagg cttgtgatct ttttcctgct gacacctgct gcttgacatg gagaagttct 240gcacaaaaag cagtggcatc cttcatgagg tggtacttgg ggcagacact gagagcattg 300taatcgtctt ttgtatcaat c 32183314DNAHomo sapiens 83ttttatacat agaaatcaat tacagcttta attgaaaact ataaccattt tgataatgca 60acaataaagc atcttcagcc aaacatctag tcttccatag accatgcatt gcagtgtacc 120cagaactgtt tagctaatat tctatgttta attaatgaat actaactcta agaacccctc 180actgattcac tcaatagcat cttaagtgaa aaaccttcta ttacatgcaa aaaatcattg 240tttttaagat aacaaaagta gggaataaac aagctgaacc cacttttact ggaccaaatg 300atctattata tgtg 31484286DNAHomo sapiens 84ggctgaagat gctttattgt tgcattatca aaatggttat agttttcaat taaaactgta 60attgatttct atgtataaaa cagctttgaa gttgtaaatg tagtttccaa tcgttagtta 120atgctacatt agttagcaat atttgaaaat tttattggta taaaatgttt taattactaa 180ggctgtttgt aggctgcata gtaagcttca ggatcatcac acgttttttc ccctgtatgg 240gtgggatagg aagcctttaa ggtctcttgc ttctcatggg tgggct 28685333DNAHomo sapiensmisc_feature(2)..(2)a or g or c or t/umisc_feature(8)..(8)a or g or c or t/umisc_feature(24)..(25)a or g or c or t/umisc_feature(27)..(28)a or g or c or t/umisc_feature(30)..(31)a or g or c or t/umisc_feature(39)..(39)a or g or c or t/umisc_feature(48)..(48)a or g or c or t/umisc_feature(51)..(51)a or g or c or t/umisc_feature(67)..(67)a or g or c or t/umisc_feature(75)..(75)a or g or c or t/umisc_feature(80)..(80)a or g or c or t/umisc_feature(85)..(87)a or g or c or t/umisc_feature(95)..(95)a or g or c or t/umisc_feature(98)..(98)a or g or c or t/umisc_feature(106)..(106)a or g or c or t/umisc_feature(109)..(109)a or g or c or t/umisc_feature(123)..(123)a or g or c or t/umisc_feature(128)..(128)a or g or c or t/umisc_feature(144)..(146)a or g or c or t/umisc_feature(180)..(180)a or g or c or t/umisc_feature(191)..(191)a or g or c or

t/umisc_feature(217)..(217)a or g or c or t/umisc_feature(234)..(234)a or g or c or t/umisc_feature(307)..(307)a or g or c or t/umisc_feature(327)..(327)a or g or c or t/u 85tnaggaanga gaagaagcga gatnnanntn nagaaatang tggtggcnta ntttagagag 60attgatncaa aagcngattn caatnnnctc agtgnctncc caagtnccnc ctcatgaagg 120atncactnct ttctgtgcag actnnncatg tcaagcagca ggtgtcagca ggaaaaagan 180cacaagctcc ncgatggctg ctgctccttg tagcccncca tgagaagcaa gagncttaaa 240ggcttcctat cccaccaatt acagggaaaa acgtgtgatg acctgagctt actatgcagc 300ctacaancag ccttagtaat taaaccnttt att 33386522DNAHomo sapiensmisc_feature(1)..(1)a or g or c or t/umisc_feature(3)..(4)a or g or c or t/umisc_feature(161)..(161)a or g or c or t/umisc_feature(231)..(231)a or g or c or t/umisc_feature(299)..(299)a or g or c or t/umisc_feature(339)..(339)a or g or c or t/umisc_feature(445)..(445)a or g or c or t/umisc_feature(467)..(467)a or g or c or t/umisc_feature(490)..(490)a or g or c or t/umisc_feature(516)..(516)a or g or c or t/u 86nannatgaag atgctttatt gttgcattat caaaatggtt acagttttca attaaagctg 60taattgattt ctatgtataa aacagctttg aagttgtaaa tgtagtttcc aatcgttagt 120taatgctaca ttagttagca atatttgaaa attttattgg nataaaatgt tttaattact 180aaggctgttt gtaggctgca tagtaagctt caggatcatc acacgttttt nccctgtaat 240tgggtgggga tagggaagcc ctttaagggt ctcttgcttc tcatggggtg gggcctacna 300agggagcagc cagcccatcg tggccagggc cttgtgganc ctttttccct gcctggacac 360cctgcctgcc ttggaccatg gggaggaagg ttctggcacc aggaaagcca ggtggcccat 420cccttccatg agggtggggt acttnggggg gccaggacca ctgaggngcc attggtaatc 480cgtccttttn gtatccaatc ccctcctaag gtaggncccc cc 52287277DNAHomo sapiensmisc_feature(240)..(240)a or g or c or t/u 87ttttgtgggt tcagcttgtt tattccctac ttttgttatc ttaaaaacaa tgattttttg 60catgtaatag aaggtttttc acttaagatg ctattgagtg aatcagtgag gggttcttag 120agttagtatt cattaattaa acatagaata ttagctaaac agttctgggt acactgcaat 180gcatggtcta tggaagacta gatgtttggc tgaagatgct tttattgttg cattatcaan 240atggtttata gttttcaatt aaaactgtaa ttgattt 27788265DNAHomo sapiensmisc_feature(211)..(211)a or g or c or t/umisc_feature(220)..(220)a or g or c or t/umisc_feature(250)..(250)a or g or c or t/u 88ggctgaagat gctttattgt tgcattatca aaatggttat agttttcaat taaaactgta 60attgatttct atgtataaaa cagctttgaa gttgtaaatg tagtttccaa tcgttagtta 120atgctacatt agttagcaat atttgaaaat tttattggta taaaatgttt taattactaa 180ggctgtttgt aggctgcata gtaagcttaa ngatcatacn cacgtttttc cctgaatttg 240gtgggataan gaagccttta aaggt 26589350DNAHomo sapiensmisc_feature(17)..(17)a or g or c or t/umisc_feature(72)..(72)a or g or c or t/umisc_feature(326)..(326)a or g or c or t/umisc_feature(341)..(341)a or g or c or t/u 89ttgaaaattt tattggnata aaatgtttta attactaagg ctgtttgtag gctgcatagt 60aagcttcagg ancatcacac gttttttccc tgtaattggt ggcataggaa gcctttaagg 120tctcttgctt ctcatgggtg ggctacaagg agcagcagcc atcgtggcag gcttgtgatc 180tttttcctgc tgacacctgc tgcttgacat ggagaagttc tgcacagaaa gcagtggcat 240ccttcatgag gtggtacttg gggcagacac tgagagcatt gtaatcgtct tttgtatcaa 300tctctctaaa gtagaccacc accgtntttg tgcagatgga ntctggcttc 35090452DNAHomo sapiensmisc_feature(227)..(227)a or g or c or t/umisc_feature(230)..(230)a or g or c or t/umisc_feature(234)..(234)a or g or c or t/umisc_feature(429)..(429)a or g or c or t/u 90aggcactatc atcgggtttt ctcaggtgtt tgagccacac cagaagaaac aaacgcgagc 60ttcagtggtg attccagtga ctggggatag tgaaggtgct acggtgcagc tgactccata 120ttttcctact tgtggcagcg actgcatccg acataaagga acagttgtgc tctgcccaca 180aacaggcgtc cctttccctc tggataacaa caaaagcaag ccggganggn ctgncctctc 240ctcctgctgt ctctgctggt ggccacatgg gtgctggtgg cagggatcta tctaatgtgg 300aggcacgaaa ggatcaagaa gacttccttt tctaaccacc acattactgc cccccattta 360aggttcttgt ggttttaccc atctggaaat atgttttccc ttcacacatt tgtttatttc 420attgatttnt ttcaaaacct tggcaggagt tt 45291465DNAHomo sapiensmisc_feature(22)..(22)a or g or c or t/umisc_feature(403)..(403)a or g or c or t/umisc_feature(415)..(415)a or g or c or t/umisc_feature(437)..(437)a or g or c or t/umisc_feature(449)..(449)a or g or c or t/u 91gggtccagtg cagtggcttg cntgcagaaa gaaggcagca gacaaagtcg tcttccttct 60ttccaatgac gtcaacagtg tgtgcgatgg tacctgtggc aagagcgagg gcagtcccag 120tgagaactct caagacctct tcccccttgc ctttaacctt ttctgcagtg atctaagaag 180ccagattcat ctgcacaaat acgtggtggt ctactttaga gagattgata caaaagacga 240ttacaatgct ctcagtgtct gccccaagta ccacctcatg aaggatgcca ctgctttctg 300tgcagaactt ctccatgtca agcagcaggt gtcagcagga aaaagattca caagcctgcc 360acgatggctg cttgcttcct ttgtagccca cccatgagga agncaagaga ccttnaaagg 420gttccttttc ccatcanttt acaggggana aaacgtgtga tgatc 46592440DNAHomo sapiensmisc_feature(13)..(13)a or g or c or t/umisc_feature(16)..(16)a or g or c or t/umisc_feature(18)..(19)a or g or c or t/umisc_feature(77)..(77)a or g or c or t/umisc_feature(175)..(175)a or g or c or t/umisc_feature(277)..(277)a or g or c or t/umisc_feature(330)..(330)a or g or c or t/umisc_feature(336)..(336)a or g or c or t/umisc_feature(420)..(420)a or g or c or t/u 92ttttgtttgg ctnatntnnt tcttattgtt gcattatcaa aatggttata gttttcaatt 60aaaactgtaa ttgattncta tgtataaaac agctttgaag ttgtaaatgt agtttccaat 120cgttagttaa tgctacatta gttagcaata tttgaaaatt ttattggtat aaaangtttt 180aattactaag gctgtttgta ggctgcatag taagcttcag gatcatcaca cgtttttccc 240ctgtaattgg tgggatagga agcctttaag gtctctngct tctcatgggt gggctacaag 300gagcagcagc catcgtggca ggcttgtgan cttttncctg ctgacacctg ctgcttgaca 360tgggagaagt tctgcacaga aaggcagtgg gcatccttca tgaggtgggt acttgggggn 420cagacactga ggagcattgt 44093641DNAHomo sapiens 93actcaaaaga aggcagcaga caaagtcgtc ttccttcttt ccaatgacgt caacagtgtg 60tgcgatggta cctgtggcaa gagcgagggc agtcccagtg agaactctca agacctcttc 120ccccttgcct ttaacctttt ctgcagtgat ctaagaagcc agattcatct gcacaaatac 180gtggtggtct actttagaga gattgataca aaagacgatt acagtgctct cagtgtctgc 240cccaagtacc acctcatgaa ggatgccact gctttctgtg cagaacttct ccatgtcaag 300cagcaggtgt cagcaggaaa aagatcacaa gcctgccacg atggccgctg ctccttgtag 360cccacccatg agaagcaaga gaccttaaag gcttcctatc ccaccaatta cagggaaaaa 420acgtgtgatg atcctgaagc ttactatgca gcctacaaac agccttagta attaaaacat 480tttataccaa taaaattttc aaatatgcta actaatgtag cattaactaa cgattggaaa 540ctacatttac aacttcaaag ctgttttata catagaaatc aattacagct ttaattgaaa 600actgtaacca ttttgataat gcaacaataa agcatcttca g 64194468DNAHomo sapiens 94gtccagtgca gtggcttgcc actcaaaaga aggcagcaga caaagtcgtc ttccttcttt 60ccaatgacgt caacagtgtg tgcgatggta cctgtggcaa gagcgagggc agtcccagtg 120agaactctca agacctcttc ccccttgcct ttaacctttt ctgcagtgat ctaagaagcc 180agattcatct gcacaaatac gtggtggtct actttagaga gattgataca aaagacgatt 240acagtgctct cagtgtctgc cccaagtacc acctcatgaa ggatgccact gctttctgtg 300cagaacttct ccatgtcaag cagcaggtgt cagcaggaaa aagatcacaa gcctgccacg 360atggccgctg ctccttgtag cccacccatg agaagcaaga gaccttaaag gcttcctatc 420ccaccaatta caggggaaaa aacgtgtgat gatcctgaag cttactat 46895507DNAHomo sapiensmisc_feature(215)..(215)a or g or c or t/umisc_feature(427)..(427)a or g or c or t/umisc_feature(438)..(438)a or g or c or t/umisc_feature(445)..(445)a or g or c or t/umisc_feature(454)..(454)a or g or c or t/umisc_feature(459)..(459)a or g or c or t/umisc_feature(471)..(471)a or g or c or t/umisc_feature(477)..(477)a or g or c or t/umisc_feature(486)..(486)a or g or c or t/umisc_feature(499)..(499)a or g or c or t/umisc_feature(507)..(507)a or g or c or t/u 95tattgttgca ttatcaaaat ggttatagtt ttcaattaaa actgtaattg atttctatgt 60ataaaacagc tttgaagttg taaatgtagt ttccaatcgt tagttaatgc tacattagtt 120agcaatattt gaaaatttta ttggtataaa atgttttaat tactaaggct gtttgtaggc 180tgcatagtaa gcttcaggat catcacacgt ttttnccctg taattgggtg gggataggga 240agcctttaag gtctcttgct tctcatgggg tggggctaca agggaggcag gcagccatcg 300tgggcagggc ttgtgatctt tttccctgct gacacctgct gcttgacatg gggggaaggt 360tctggcacag aaagcagtgg gcatccttca tgagggtggt acttgggggg cagacactga 420ggaggcnttg taaatcgnct ttttngtatc caanctctnc taaagtaggg nccaccncgt 480tttttnttgc aggtggatnc ggggctn 50796440DNAHomo sapiensmisc_feature(22)..(22)a or g or c or t/umisc_feature(24)..(24)a or g or c or t/umisc_feature(364)..(364)a or g or c or t/umisc_feature(382)..(382)a or g or c or t/umisc_feature(414)..(414)a or g or c or t/umisc_feature(424)..(424)a or g or c or t/umisc_feature(430)..(430)a or g or c or t/u 96gggtccagtg cagtggcttg cntncaaaag aaggcagcag acaaagtcgt cttccttctt 60tccaatgacg tcaacagtgt gtgcgatggt acctgtggca agagcgaggg cagtcccagt 120gagaactctc aagacctctt cccccttgcc tttaaccttt tctgcagtga tctaagaagc 180cagattcatc tgcacaaata cgtggtggtc tactttagag agattgatac aaaagacgat 240tacaatgctc tcagtgtctg ccccaagtac cacctcatga aggatgccac tgctttctgt 300gcagaacttc tccatgtcaa gcagcaggtg tcagcaggaa aaagatcaca agcctgccac 360gatngctgct gctccttgta gnccacccat gagaagcaag tgacctttaa aggntttcct 420attnccaccn atttacaggg 44097630DNAHomo sapiens 97gactagatgt ttggctgaag atgctttatt gttgcattat caaaatggtt atagttttca 60attaaaactg taattgattt ctatgtataa aacagctttg aagttgtaaa tgtagtttcc 120aatcgttagt taatgctaca ttagttagca atatttgaaa attttattgg tataaaatgt 180tttaattact aaggctgttt gtaggctgca tagtaagctt caggatcatc acacgttttt 240tccctgtaat tggtgggata ggaagccttt aaggtctctt gcttctcatg ggtgggctac 300aaggagcagc agccatcgtg gcaggcttgt gatctttttc ctgctgacac ctgctgcttg 360acatggagaa gttctgcaca gaaagcagtg gcatccttca tgaggtggta cttggggcag 420acactgagag cattgtaatc gtcttttgta tcaatctctc taaagtagac caccacgtat 480ttgtgcagat gaatctggct tcttagatca ctgcagaaaa ggttaaaggc aagggggaag 540aggtcttgag agttctcact gggactgccc tcgctcttgc cacaggtacc atcgcacaca 600ctgttgacgt cattggaaaa gaaggaagac 63098788DNAHomo sapiens 98gagttctcac tgggactgcc ctcgctcttg ccacaggtac catcgcacac actgttgacg 60tcattggaaa gaaggaagac gaccttgtct gctaccttct tttgagtggc aagccactgc 120actggaccca tctctgctat tttctttttc tgccactttt caaggatgac ctcacttctg 180caatggtttt gaagaaattc agtgaagtaa caaattgtgt gatggaaaca tatttcagat 240gggtaaacca caagaacctt aatggggggc agtagtgtgg tggtagaaaa ggaagtcttc 300ttgatccttt ctgtgagagg agaaaagcat ttgttatctg tgaatagcaa acagcaggct 360ttcactctgt aaaccatccc tgacaaatga tcccttgcta gagaatgtca gctgagcacc 420aagggccttg ttagtgacag caaggaaaaa catcctgatg ttccttttga acacatcacc 480tgaaacacac tgatgcttaa accttaactt tttttttttg ggggacatag tctcactctg 540tcgcccaggc tggagtgcgt gggagaggac ctcggaaaga ctggcaagca tccgcataca 600agggagtaac agcacaatac tccgtgaact tcggagccct ccaaaggaat actcaagggc 660gggtaaagga tggcaagggt cgacggagag cccacgagga gagcggaagg tagagaggag 720acaagcataa gacgcgagag gaactccaag gcggggccaa agagagaaac cacggtcacc 780aacagaag 78899307DNAHomo sapiensmisc_feature(34)..(34)a or g or c or t/umisc_feature(263)..(263)a or g or c or t/umisc_feature(270)..(270)a or g or c or t/umisc_feature(279)..(279)a or g or c or t/umisc_feature(306)..(306)a or g or c or t/u 99agaagccaga ttcatctgca caaatacgtg gtgntctact ttagagagat tgatacaaaa 60gacgattaca atgctctcag tgtctgcccc aagtaccacc tcatgaagga tgccactgct 120ttctgtgcag aacttctcca tgtcaagcag caggtgtcag caggaaaaag atcacaagcc 180tgccacgatg gctgctgctc cttgtagccc acccatgaga agcaagagac cttaaaggct 240tcctatccca ccaattacag ggnaaaaacn gtagtgatna tccctgacag cttactatgc 300cagccnt 307100335DNAHomo sapiensmisc_feature(67)..(67)a or g or c or t/umisc_feature(315)..(315)a or g or c or t/umisc_feature(333)..(333)a or g or c or t/u 100ttggctgaag atgctttatt gttgcattat caaaatcggt tacagttttc aattaaagct 60gtaattngat ttctatgtat aaaacagctt tgaagttgta aatgtagttt ccaatcgtta 120gttaatgcta cattagttag caatatttga aaattttatt ggtataaaat gttttaatta 180ctaaggctgt ttgtaggctg catagtaagc ttcaggatca tcacacgttt tttccctgta 240attgggtggg ataggaagcc tttaaggtct cttgcttctc attgggtggg ctacaaggag 300cagcagccat ccgtnggcaa ggctttgtgg atnct 335101639DNAHomo sapiens 101ggaagagaaa gatcgtccag aggttccatc gcacacactg tatgacgtca ttggaaatga 60aggaagacga ctttgtctgc tggcttcttg tgagtggcaa gccactgcag tggacccatc 120tctgctattt tctttattct gccacttttc aaggatgacc tcacttctgc aatggttttg 180aagaaagttc agtgaagtaa caaattgtgt gatggaaaca tatttcagat gggtaaacca 240caagaacctt aatggggggc agtagtgtgg tggtagaaaa ggaagtcttc ttgatccttt 300ctgtgagagg agaaaagcat tagttatctg tgaacagcaa acagcaggca tttcacatct 360gtaaaccatc cctgacaaat gatcccttgc tagagaatgt cagctgagca ccaaggggcc 420ttgttagtga cagcaaggac aaaacatcct gatgttcctt ttgaacacat cagctgaaac 480acactgatgc tctaaaccgt taactattta ttaatggggg aacataggtc tcaactcatg 540tacgaccagg ctggagtgca gtggggttga acatcgacag acatagcaaa ccaccgatca 600ctagggaaac aacgcacaga actccagact taaaacacc 639102477DNAHomo sapiensmisc_feature(365)..(365)a or g or c or t/u 102attcggcacc tggggggcag acactgagag cattgtaatc gtcttttgta tcaatctctc 60taaagtagac caccacgtat ttgtgcagat gaatctggct tcttagatca ctgcagaaaa 120ggttaaaggc aagggggaag aggtcttgag agttctcact gggactgccc tcgctcttgc 180cacaggtacc atcgcacaca ctgttgacgt cattggaaag aaggaagacg actttgtctg 240ctgccttctt ttgagtggca agccactgca ctggacccat ctctgctatt ttctttttct 300gccacttttc aaggatgacc tcacttctgc aatggttttg aagaaattca gtgaagtaac 360aaatntgtgt gatggaaaca tatttcagat gggtaaacca caagaacctt aatggggggc 420agtagtgtgg tggtagaaaa ggaagtcttc ttgatccttt ctgtgagagg agaaagc 477103432DNAHomo sapiens 103ttttgatggt ccacttccat ttaatgaatt agtaaatatc ttttctcatg attttaatta 60catttttttc tctagcttac tttattataa tacagcacat aatacaccta acatgcaaaa 120tatgtgttaa ttggctgttt atgttattgg taagacttcc agtcaacagt aggctattag 180aagttaagtt gtgggaaaat caaaggttat aggagatttt caactgcatg cagggccggt 240gccctcccca ctgtgttgtt caagggtcag ctgtactctc taagggcttt gctaacttca 300aaacatggag tatttgaata cagaaaccag agcatttaca tactcagctc aaggcagagc 360tattaaaaaa actcctcttc tccatatgta ggaaaggaaa tacaaatgca tcctttgagt 420catttgtgat gt 432104316DNAHomo sapiensmisc_feature(68)..(68)a or g or c or t/umisc_feature(70)..(70)a or g or c or t/umisc_feature(74)..(74)a or g or c or t/umisc_feature(120)..(120)a or g or c or t/umisc_feature(140)..(140)a or g or c or t/umisc_feature(211)..(211)a or g or c or t/umisc_feature(289)..(289)a or g or c or t/u 104aacagttgtg ctctgcccac aaacaggcgt ccctttccct ctggataaca acaaaagcaa 60gccgggangn ctgncgctct cctcctgctg tctctgctgg tggccacatg ggtgctggtn 120gcagggatct atctaatgtn gaggcacgaa agggatcaag aggacttcct tttctaccac 180cacactactg ccccccatta aggttcttgt nggtttaccc atctggaaat atgtttccat 240cacacaattt gttacttcac tggaatttct tcaaaaccat tggcaggang tgagggtcat 300ccttggaaaa gtgggc 316105401DNAHomo sapiensmisc_feature(274)..(274)a or g or c or t/u 105cctcacttct gcaatggttt tgaagaaatt cagtgaagta acaaattgtg tgatggaaac 60atatttcaga tgggtaaacc acaagaacct taatgggggg cagtagtgtg gtggtagaaa 120aggaagtctt cttgatcctt tcgtgcctcc acattagata gatccctgcc accagcaccc 180atgtggccac cagcagagac agcaggagga gaggcagcca gcctcccggc tttgcttttg 240ttgttatcca gaggggaaag gggacgcctg tttntggggc agagcacaac tgtttccctc 300gtgcccgaat tctttgggcc ttcgaggggc caaatttccc tattaggtga ggtcgtattt 360taaatttcgg taattcatgg tcataggctt gtttttcccc g 401106516DNAHomo sapiensmisc_feature(400)..(400)a or g or c or t/umisc_feature(462)..(462)a or g or c or t/umisc_feature(483)..(483)a or g or c or t/umisc_feature(509)..(509)a or g or c or t/u 106gtttcaacac aattttggat cagctgcctg tttgcaaaaa cataatatat ttctgttaaa 60cagttcttca cctaacagca tattgctctt ataactggta gagctgtttc aaaggaagtt 120ggtttctggt ccaagttttg acctaaacca tgtccatctt ctattaccag cacttacaag 180cactgtgaaa actgatcatg acaaataagt aaaatttgct acattaaaca tattgcctca 240gccattacta agcgtccact tgtaaagctg gacacagttt ttactttatg cttcattttg 300attttttatc cgtaagacat aaattagaag gcatgaggtg gccctttaag gataatctgc 360aaatatacac attttaaata gtcatccatc tggaaatcgn tccaccattc caggggaagg 420attccaggta ttggtgctgt ggtggaaata aagcattccc cngggaaaaa aaccatttta 480tgnctaaata attaccacca ttaacctcnt ggggtt 516107187DNAHomo sapiens 107gaatactaac tctaagaacc cctcactgat tcactcaata gcatcttaag tgaaaaacct 60tctattacat gcaaaaaatc attgttttta agataacaaa agtagggaat aaacaagctg 120aacccacttt tactggacca aatgatctat tatatgtgta accacttgta tgatttggga 180tttgcat 187108156DNAHomo sapiens 108ttttttacaa cttcaaagct gttttataca tagaaatcaa ttacagtttt aattgaaaac 60tataaccatt ttgataatgc aacaataaag catcttcagc caaacatcta gtcttccata 120gaccatgcat tgcagtgtac ccagaactgt ttagct 156109491DNAHomo sapiensmisc_feature(478)..(478)a or g or c or t/umisc_feature(491)..(491)a or g or c or t/u 109ctgagtgtga tggtgtaagc ctgtggtccc agctactagg gaggctgaga tgggattaca 60ggtgtgagcc acggcgcctg gcctaaaagc atctttttct ttaacgcaga ggttatgttg 120tattattagc ataaatgttt ttttctggga atgcttattt cacacagcac aatactgaat 180cttctctgga atgtggatcg atttcagatg gatgactatt aaaatgtgta tatttgcaga 240ttatccttaa agggccacct catgccttct aatttatgtc ttacggataa aaaatcaaaa 300tgaagcataa agtaaaaact

gtgtccagct ttacaagtgg acgcttagta atggctgagg 360caatatgttt aatgtagcca aattttactt atttgtccat gatccagttt ttcacagtgc 420ttgttaagtg ctggtaatta ggaaggtggg acatgggtta ggtcaaaact tgggaccnga 480aaccaacttg n 491110270DNAHomo sapiens 110tttttttttt acaacttcaa agctgtttta tacatagaaa tcaattacag ttttaattga 60aaactataac cattttgata atgcaacaat aaagcatctt cagccaaaca tctagtcttc 120catagaccat gcattgcatt gtacccagaa ctgtttagct aatattctat gtttaattaa 180tgaatactaa ctctaagaac ccctcactga ttcactcaat agcatcttaa gtgaaaaacc 240ttctattaca tgcaaaaaat cattggtttt 270111478DNAHomo sapiens 111ttttctgagt aagaacaggc tttatttgta aaaccactcg tgactcttta caaagcagga 60tacacagaag ggaaaaaaat acacagtgca aaatggatgt tctgagtgcc acaaggatct 120gctgaaaaaa gccaaagatg taagatggct gggtatatat gagaatgaat atttcactat 180attctgattc aattaccagt ctcagtggcc caggatgagc ttttggtgtg gtcacatggc 240caacatttgg ataacaaatg aggaataatg gtaccgcctc actagtgcct gagaacagca 300tgttctggaa aatgtctctg gagttagaga tgtgttagct ttttcattac agatggagaa 360atacaatgtt tacacaacag tccaggggtg gggtcaaaag ttggaaggtg tcattagacg 420cagccaaata aagtgaagac aacccaggtg actggcagcc ctgacttgtg cgtgggcg 478112263DNAHomo sapiens 112tttctgagta agaacaggct ttatttgtaa aaccactcgt gactctttac aaagcaggat 60acacagaagg gaaaaaaata cacagtgcaa aatggatgtt ctgagtgcca caaggatctg 120ctgaaaaaag ccaaagatgt aagatggctg ggtatatatg agaatgaata tttcactata 180ttctgattca attaccagtc tcagtggccc aggatgagct ttggtggtgg tcacatggcc 240aacatttgga taacaaatga gga 263113388DNAHomo sapiens 113gagatggagg tctcgctttg tgacgtagcc tggtcttgag cgatcctttt gccttggcct 60tgccaaagtg ctgggattgg aggcatgagc cactgcaccc acccctgttt tttttttaag 120taaaccatta taataactca tttataaaaa ggttacttca agagggcttt caacttaaga 180attattttca ttttgaacat gaaaagttaa atagtaacta agaaactgag aactctgaca 240gtgacctcta ataggtaact ttaggcaaaa gtagacaagt ttgtgggtat tttgttgttc 300atgttaaaag gcacctgtac aagaatcaag atatgaatct agtttgtaga gggaaggtct 360tatgcaaata ccaaatcata caagtggt 388114450DNAHomo sapiens 114agagatgttg gtctcgcttt gtgacgtagc ctgggcttga gcgatccttt tgccttggcc 60ttgccaaagt gctgggattg gaggcatgag ccactgcacc cacccctgtt ttttttttaa 120gtaaaccatt ataataactc atttataaaa aggttacttc aagagggctt tcaacttaag 180aattattttc attttgaaca tgaaaagtta aatagtaact aagaaactga gaactctgac 240agtgacctct aataggtaac tttaggcaaa agtagacaag tttgtgggta ttttgttgtt 300catgttaaaa ggcacctgta caagaatcaa gatatgaatc tagtttgtag agggaaggtc 360ttatgcaaat accaaatcat acaagtggtt acacatataa tagatcattt ggtccagtaa 420aagtgggttc agcttgttta ttccctactt 450115162DNAHomo sapiens 115gagatggagg tctcgctttg tgacgtagcc tggtcttgag cgatcctttt gccttggctt 60gcaaagtgct gggattggag gcatgagcac tgcacccacc cctgtttttt tttttaagta 120aaccattata ataactcatt tataaaaagg ttacttcaag ag 162116392DNAHomo sapiensmisc_feature(117)..(117)a or g or c or t/umisc_feature(345)..(345)a or g or c or t/umisc_feature(378)..(378)a or g or c or t/umisc_feature(388)..(388)a or g or c or t/u 116ttcactcaat agcatcttaa gtgaaaaacc ttctattaca tgcaaaaaat cattgttttt 60aagataacaa aagtagggaa taaacaagct gaacccactt ttactggacc aaatgancta 120ttatatgtat aaccacttgt atgatttggt atttgcataa gaccttccct ctacaaacta 180gattcatatc ttgattcttg tacaggtgcc tttttaatat tctgtgatga aatcgttcac 240agtcagagta catgtctgct gcatatggga aatagggact gttgttctga gggacaaggc 300actcaattca gccgtaaagg ctgacccggg ctactttttt tccangggaa tacaattttt 360ttaccttgga ataaaatngg gcccgacngg ac 392117428DNAHomo sapiens 117tttttttttt tgagtaagaa caggctttat ttgtaaaacc actcgtgact ctttacaaag 60caggatacac agaagggaaa aaaatacaca gtgcaaaatg gatgttctga gtgccacaag 120gatctgctga aaaaaagcca aagatgtaag atggctgggt atatatgaga atgaatattt 180cactatattc tgattcaatt accagtctca gtggcccagg atgagctttt ggtgtggtca 240catggccaac atttggataa caaatgagga ataatggtac cgcctcacta gtgcctgaga 300acagcatgtt ctggaaaatg tctctggagt tagagatgtg ttagcttttt cattacagat 360ggagaaatac aatgtttaca caacagtcca ggggtggggt caaaagttgg aaggtgtcat 420tagacgca 428118430DNAHomo sapiens 118aaatttttaa cttttaatag ttaaaatagt taactattgg tatggtagga aatgataaag 60tagactagta tctgtataca ttttctgcat ttatgacata cctttttctt catttttttc 120aatattttaa ttgaaaagtt catccgagtt tcatctaagt tttttcaaag tgatacaaat 180ctccaaaaaa ttttccaata tatgtattga aaaaatccag gtgtaagtgg ctctgcgcag 240tccaaacctg tgttgttcaa gggtcaactg tgtatgaatc caagcgaaag cttttcttaa 300cacctcataa gaactatttt ttaaaaaaca ggaactagca tagagtaacc atcacaggta 360aagtgtaatt tgttatcagc catcttttgc ccatttcagt actggtagaa ggctcaatgg 420taaaaataaa 430119368DNAHomo sapiens 119tttttttttt tttttttttt ttttctgact gtcccgtttt tatttttacc attgagcctt 60ctaccagtac tgaaatgggc aaaagatggc tgataacaaa ttacacttta cctgtgatgg 120ttactctatg ctagttcctg ttttttaaaa aatagttctt atgaggtgtt aagaaaagct 180ttcgcttgga ttcatacaca gttgaccctt gaacaacaca ggtttggact gcgcagacca 240cttacacctg gattttttca atacatatat tggaaaattt tttggggatt tgtatcactt 300tgaaaaaact tagatgaaac tcggatggac ttttccatta aaatattgga aaaaatgaag 360aaaaaggt 368120435DNAHomo sapiens 120tttttttttt tttttttttt ttttctgact ggcccgtttt tatttttacc attgagcctt 60ctaccagtac tgaaatgggc aaaagatggc tgataacaaa ttacacttta cctgggatgg 120ttactctatg ctagttcctg ttttttaaaa aatagttctt atgaggggtt aaaaaaagct 180ttcgcttgga ttcatacaca gttgaccctt gaacaacaca ggtttggact gcgcagagcc 240acttacacct ggattttttc aatacatata ttggaaaatt ttttggagat ttgtatcact 300ttgaaaaaac ttagatgaaa ctcggatgaa cttttcaatt aaaatattga aaaaaatgaa 360gaaaaaggta tgtcataaat gcagaaaatg tatacagata ctagtctact ttatcatttc 420ctaccatacc aatag 435121880DNAHomo sapiensmisc_feature(788)..(788)a or g or c or t/u 121taaaggaaca gttgtgctct gcccacaaac aggcgtccct ttccctctgg ataacagtaa 60gtgcccagta acttcaacca gatgatcaaa gtggctcaca cacagtcact gccccccact 120cagtatgtgg aagggttgtg tgtatgtggg cagtgcaagg ggtcgctgcc tgtgtacact 180gaactggggt gcagagaaag ccaacagtgc tgtcccagag aacctagaat ctgagtaaga 240acaggcttta tttgtaaaac cactcgtgac tctttacaaa gcaggataca cagaagggaa 300aaaaatacac agtgcaaaat ggatgttctg agtgccacaa ggatctgctg aaaaaagcca 360aagatgtaag atggctgggt atatatgaga atgaatattt cactatattc tgattcaatt 420accagtctca gtggcccagg atgagctttt ggtgtggtca catggccaac atttggataa 480caaatgagga ataatggtac cgcctcacta gtgcctgaga acagcatgtt ctggaaaatg 540tctctggagt tagagatgtg ttagcttttt cattacagat ggagaaatac aatgtttaca 600caacagtcca ggggtggggg tcaaaagttg gaaggtgtca ttagacgcag ccaaataaag 660tgaagaccac ccaggtgact ggcagccctg acttgtgcgt gggcgaaacc ttacagattc 720ctggggcact ctgtgcctga acttacctgg atggtctttg tgaggcgggt gggcacttat 780cctccatnaa tggtcagtct aacaagaccg gcctgtaaaa atggcatcta ataggggcta 840tggaatggaa aacagttggt acccagaaat aactttaatt 880122437DNAHomo sapiensmisc_feature(34)..(34)a or g or c or t/umisc_feature(192)..(192)a or g or c or t/u 122gacagtctgg gagcccagag ctctgggagg agtngggaaa atgctgcttc ctgctgcttg 60cttctaggca cctgcttccg ccatctcact taccatggct agagatgggg gtgagactgg 120ggaaggacaa aagcagggaa cagataaggg atggaaatca gaagggaata tagaaagaac 180tctggatatg cnagaaatgc cggtacctga gcattttgta tcaatgggag taccctctgt 240aactgctcag taggttacaa atgaagagtc caccagtatt agaaacaatt taaacttgcc 300agtaccaact gggatgtgtg ccttcaattt gaaaattgta tgttttattt tttaaatttg 360gttaacagca ttaatttata gagtatttga tgtcatttat ggttcccgag gtgtttccaa 420cacaattttt gggatca 4371231107DNAHomo sapiensmisc_feature(893)..(893)a or g or c or t/u 123cttttaatag ttaaaatagt taactattgg tatggtagga aatgataaag tagactagta 60tctgtataca ttttctgcat ttatgacata cctttttctt catttttttc aatattttaa 120ttgaaaagtt catccgagtt tcatctaagt tttttcaaag tgatacaaat ctccaaaaaa 180ttttccaata tatgtattga aaaaatccag gtgtaagtgg ctctgcgcag tccaaacctg 240tgttgttcaa gggtcaactg tgtatgaatc caagcgaaag cttttcttaa cacctcataa 300gaactatttt ttaaaaaaca ggaactagca tagagtaacc atcacaggta aagtgtaatt 360tgttatcagc catcttttgc ccatttcagt actggtagaa ggctcaatgg taaaaataaa 420aacgggacag tcagaagatc tggaagtcct gaccctgctt tcacctggca tgtgtaatcc 480agtcatgctc gtatcagtct ctgtaggagc acttgaaggt attacataaa tgctatctaa 540ctctgggaaa cgccaacatg tgattgcctc cagaggaatc ttctttaaaa aaaaattcaa 600aatgttattt ccttactagg atgtctttaa agaattataa cccttaccgt gcctccacat 660tagatagatc cctgccacca gcacccatgt ggccaccagc agagacagca ggaggagagg 720cagccagcct cccggcttgc ttttgtctgg aaaaaaacaa agcttattca cctttggaaa 780aaaatccaca cttatctctt aatttaaaaa ctaagacttg gtatacttta tagagggtta 840tttatttttt attatttttt agttttgaga cagagtctcg ctttgttgcc tangctggag 900tgcagtggcg caatctcggt tcactgcagc ctccgttctc cggggttcaa ggcatgctgg 960ctcagcctcc tgtatagctg gggattaaag gcatgtgttc acgcggccca gccccttttg 1020taaaagattt agatcccttt taaaaccatc agtcaggagg ctcctttaaa aagtctggcc 1080atctaatctt ttttccccca aaagggg 1107124290DNAHomo sapiens 124tttttttttt tcttttttct gagtaagaac aggctttatt tgtaaaacca ctcgtgactc 60tttacaaagc aggatacaca gaagggaaaa aaatacacag tgcaaaatgg atgttctgag 120tgccacaagg atctgctgaa aaaagccaaa gatgtaagat ggctgggtat atatgagaat 180gaatatttca ctatattctg attcaattac cagtctcagt ggcccaggat gagcttttgg 240tgtggtcaca tggccaacat ttggataaca aatgaggaat aatctcgtgc 290125812DNAHomo sapiens 125aatttataga gtattgatgt catttatgtt tctgaggtgt ttcaacacaa ttttggatca 60gctgcctgtt tgcaaaaaca taatatattt ctgttaaaca gttcttcacc taacagcata 120ttgctcttat aactggtaga gctgtttcaa aggaagttgg tttctggtcc aagttttgac 180ctaaaccatg tccatcttct attaccagca cttacaagca ctgtgaaaac tgatcatgac 240aaataagtaa aatttgctac attaaacata ttgcctcagc cattactaag cgtccacttg 300taaagctgga cacagttttt actttatgct tcattttgat tttttatccg taagacataa 360attagaaggc atgaggtggc cctttaagga taatctgcaa atatacacat tttaatagtc 420atccatctga aatcgatcca cattccagag aagattcagt attgtgctgt gtgaaataag 480cattcccaga aaaaaaacat ttatgctaat aatacaacat aacctctgca ttaaagaaaa 540agatgctttt aggccaggcg ccgtggctca cgcctgtaat ccctgcactt tgagaggctg 600aggtgggtgg atcatgaggt caggagatca agaccatcct ggctaacagg gtgaaacccc 660gtctctactg gggatataac aaagttagct gggtgtggtg gtgggtgctt gtggtcccag 720ctactcagga ggctgaggca ggagaatggc gtgaacccgg aaggcagagg ttgtagtgac 780gcgaggttca cgccactgca ttccagtctg gg 812126679DNAHomo sapiensmisc_feature(8)..(8)a or g or c or t/u 126caggaagnta agaacagtcc taaaatctct ttggcttctt tgtcctgata tgcaccggca 60ttttcacagt aggaactagg gtttctgtcc agtttttttg gttctttaag gaattaatgt 120tattctgggt acaactgctt acatacatag cacatataga tgacattttt acaggccgtc 180ttgttagact gacatacatg gaggatagtg ccacccgcct cacaagaaca tcaggtaagc 240tcaggcacag agtgcccagg aatctgtaag gcttcgccca cgcacaagtc agggctgcca 300gtcacctggg ttgtcttcac tttatttggc tgcgtctaat gacaccttcc aacttttgac 360cccacccctg gactgttgtg taaacattgt atttctccat ctgtaatgaa aaagctaaca 420catctctaac tccagagaca ttttccagaa catgctgttc tcaggcacta gtgaggcggt 480accattattc ctcatttgtt atccaaatgt tggccatgtg accacaccaa aagctcatcc 540tgggccactg agactggtaa ttgaatcaga atatagtgaa atattcattc tcatatatac 600ccagccatct tacatctttg gcttttttca gcagatcctt gtggcactca gaacatccat 660tttgcactgt gtatttttt 679127449DNAHomo sapiens 127aaatttttaa cttttaatag ttaaaatagt taactattgg tatggtagga aatgataaag 60tagactagta tctgtataca ttttctgcat ttatgacata cctttttctt catttttttc 120aatattttaa ttgaaaagtt catccgagtt tcatctaagt tttttcaaag tgatacaaat 180ctccaaaaaa ttttccaata tatgtattga aaaaatccag gtgtaagtgg ctctgcgcag 240tccaaacctg tgttgttcaa gggtcaactg tgtatgaatc caagcgaaag cttttcttaa 300cacctcataa gaactatttt ttaaaaaaca ggaactagca tagagtaacc atcacaggta 360aagtgtaatt tgttatcagc catcttttgc ccatttcagt actggtagaa ggctcaatgg 420taaaaataaa aacgggacag tcagaaaaa 449128396DNAHomo sapiens 128tctgagtaag aacaggcttt atttgtaaaa ccactcgtga ctctttacaa agcaggatac 60acagaaggga aaaaaataca cagtgcaaaa tggatgttct gagtgccaca aggatctgct 120gaaaaaagcc aaagatgtaa gatggctggg tatatatgag aatgaatatt tcactatatt 180ctgattcaat taccagtctc agtggcccag gatgagcttt tggtgtggtc acatggccaa 240catttggata acaaatgagg aataatggta ccgcctcact agtgcctgag aacagcatgt 300tctggaaaat gtctctggag ttagagatgt gttagctttt tcattacaga tggagaaata 360caatgtttac acaacagtcc aggggtgggg tcaaag 396129232DNAHomo sapiens 129ctgactgtcc cgtttttatt tttaccattg agccttctac cagtactgaa atgggcaaaa 60gatggctgat aacaaattac actttacctg tgatggttac tctatgctag ttcctgtttt 120ttaaaaaata gttcttatga ggtgttaaga aaagctttcg cttggattca tacacagttg 180acccttgaac aacacaggtt tggactgcgc agagccaccc tcgtgccgaa tt 232130185DNAHomo sapiens 130ctgactgtcc cgtttttatt tttaccattg agccttctac cagtactgaa atgggcaaaa 60gatggctgat aacaaattac actttacctg tgatggttac tctatgctag ttcctgtttt 120ttaaaaaata gttcttatga ggtgttaaga aaagctttcg cttggattca tacacagttg 180accct 185131726DNAHomo sapiens 131ggaaatgata aagtagacta gtatctgtat acattttctg catttatgac ataccttttt 60cttcattttt ttcaatattt taattgaaaa gttcatccga gtttcatcta agttttttca 120aagtgataca aatctccaaa aaattttcca atatatgtat tgaaaaaatc caggtgtaag 180tggctctgcg cagtccaaac ctgtgttgtt caagggtcaa ctgtgtatga atccaagcga 240aagcttttct taacacctca taagaactat tttttaaaaa acaggaacta gcatagagta 300accatcacag gtaaagtgta atttgttatc agccatcttt gcccatttca gtactggtag 360aaggctcaat ggtaaaaata aaaacgggac agtcagaaga tctggaagtc ctgaccctgc 420tttcacctgg catgtgtaat ccagtcatgc tcgtatcagt ctctgtagga gcacttgaag 480gtattacata aatgctatct aactctggga aacgccaaca tgtgattgcc tccagaggaa 540tcttctttaa aaaaaaattc aaaatgttat ttccttacta ggatgtcttt aaagaattat 600aacccttacc gtgcctccac attagataga tccctgcaac agacccatgt ggcaccagca 660gagacagcag gaggagaggc agcagctccc ggttgtttgt ctggaaaaac aaaggttatc 720actttg 726132185DNAHomo sapiens 132ctgactgtcc cgtttttatt tttaccattg agccttctac cagtactgaa atgggcaaaa 60gatggctgat aacaaattac actttacctg tgatggttac tctatgctag ttcctgtttt 120ttaaaaaata gttcttatga ggtgttaaga aaagctttcg cttggattca tacacagttg 180accct 185133291DNAHomo sapiens 133gcacgagatt attcctcatt tgttatccaa atgttggcca tgtgaccaca ccaaaagctc 60atcctgggcc actgagactg gtaattgaat cagaatatag tgaaatattc attctcatat 120atacccagcc atcttacatc tttggctttt ttcagcagat ccttgtggca ctcagaacat 180ccattttgca ctgtgtattt ttttcccttc tgtgtatcct gctttgtaaa gagtcacgag 240tggttttaca aataaagcct gttcttactc agaaaaaaaa aaaaaaaaaa a 291134795DNAHomo sapiensmisc_feature(1)..(2)a or g or c or t/umisc_feature(770)..(770)a or g or c or t/u 134nnttgaacag gcgtgacggt ccggattccc gggatgttgt gctctgccca caaacaggcg 60tccctttccc tctggataac aacaaaagca agccgggagg ctggctgcct ctcctcctgc 120tgtctctgct ggtggccaca tgggtgctgg tggcagggat ctatctaatg tggaggcacg 180aaaggatcaa gaagacttcc ttttctacca ccacactact gccccccatt aaggttcttg 240tggtttaccc atctgaaata tgtttccatc acacaatttg ttacttcact gaatttcttc 300aaaaccattg cagaagtgag gtcatccttg aaaagtggca gaaaaagaaa atagcagaga 360tgggtccagt gcagtggctt gccactcaaa agaaggcagc agacaaagtc gtcttccttc 420tttccaatga cgtcaacagt gtgtgcgatg gtacctgtgg caagagcgag ggcagtccca 480gtgagaactc tcaagacctc ttcccccttg cctttaacct tttctgcagt gatctaagaa 540gccagattca tctgcacaaa tacgtggtgg tctactttag agagattgat acaaaagacg 600attacaatgc tctcagtgtc tgccccaagt accacctcat gaaggatgcc actgctttct 660gtgcagaact tctccatgtc aagcagcagg tgtcagcagg aaaaagatca caagcctgcc 720acgatggctg ctgctccttg tagcccaccc atgagaagca agagaccttn aaggcttcct 780atcccaccat tacag 795135387DNAHomo sapiens 135tttttttttt tttctgagta agaacaggct ttatttgtaa aaccactcgt gactctttac 60aaagcaggat acacagaagg gaaaaaaata cacagggcaa aatggatgtt ctgagtgcca 120caaggatctg ctgaaaaaag ccaaagatgt aagatggctg ggtatatatg agaatgaata 180tttcactata ttctgattca attaccagtc tcagtggccc aggatgagct tttggtgtgg 240tcacatggcc aacatttgga taacaaatga ggaataatgg taccgcctca ctagtgcctg 300agaacagcat gttctggaaa atgtctctgg agttagagat gtgttagctt tttcattaca 360gatggagaaa tacaatgttt acacaac 387136561DNAHomo sapiens 136catgatgttc agtatgatca gttaacctta acctctgagc atcctgaagc aaaatctaaa 60taatgcagct attaccactg gtggtccagg ctctggtgaa gccctctgag cccaggagga 120agagaaagca ttgtccagag gtaggaacac agtctgggag cccagagctc tgggaggagt 180gggaaaatgc tgcttcctgc tgcttgcttc taggcacctg cttccgccat ctcacttacc 240atggctagag atgggggtga gactggggaa ggacaaaagc agggaacaga taagggatgg 300aaatcagaag ggaatataga aagaactctg gatgtggaga aatgccggta cctgagcatt 360ttgtatcaat gggagtaccc tctgtaactg ctcagtaggt tacaaatgaa gagtccacca 420gtattagaaa caatttaaac ttgccagtac caactgggat gtgtgccttc aatttgaaaa 480ttgtatgttt tattttttaa atttgttaac agcattaatt tatagagtat tgatgtcatt 540tatgtttctg aggtgtttca a 561137476DNAHomo sapiens 137tctgagtaag aacaggcttt atttgtaaaa ccactcgtga ctctttacaa agcaggatac 60acagaaggga aaaaaataca cagtgcaaaa tggatgttct gagtgccaca aggatctgct 120gaaaaaagcc aaagatgtaa gatggctggg tatatatgag aatgaatatt tcactatatt 180ctgattcaat taccagtctc agtggcccag gatgagcttt tggtgtggtc acatggccaa 240catttggata acaaatgagg aataatggta ccgcctcact agtgcctgag aacagcatgt 300tctggaaaat gtctctggag ttagagatgt gttagctttt tcattacaga tggagaaata 360caatgtttac acaacagtcc aggggtgggg tcaaaagttg gaaggtgtca ttagacgcag

420ccaaataaag tgaagacaac ccaggtgact ggcagccctg acttgtgcgt gggcga 476138186DNAHomo sapiens 138ctgactgtcc cgtttttatt tttaccattg agccttctac cagtactgaa atgggcaaaa 60gatggctgat aacaaattac actttacctg tgatggttac tctatgctag tatcctgttt 120tttaaaaaat agttcttatg aggtgttaag aaaagctttc gcttggattc atacacagtt 180gaccct 186139456DNAHomo sapiensmisc_feature(255)..(255)a or g or c or t/umisc_feature(260)..(260)a or g or c or t/umisc_feature(307)..(307)a or g or c or t/umisc_feature(350)..(350)a or g or c or t/umisc_feature(406)..(406)a or g or c or t/umisc_feature(431)..(431)a or g or c or t/umisc_feature(446)..(446)a or g or c or t/u 139aggaagttaa gaacagtcct aaaatctctt tggcttcttt gtcctgatat gcaccggcat 60tttcacagta ggaactaggg tttctgtcca gtttttttgg ttctttaagg aattaatgtt 120attctgggta caactgctta catacatagc acatatagat gacattttta caggccgtct 180tgttagactg acatacatgg aggatagtgc cacccgcctc acaagaacat caggtaagct 240caggcacaga gtccnagggn atctgtaagg gcttcgccca cgcacaagtc agggctgcca 300gtcaccnggg ttgtcttcac tttatttggg ctgcgtctaa tgacaccttn ccaacttttt 360gaccccaccc tggggcttgt tgtgtaaacc attgttattt ctcccntctg taatggaaaa 420aggttaacac ntttttaact tccggngaca tttttc 4561401816DNAHomo sapiens 140gcacgagcga tgtcgctcgt gctgctaagc ctggccgcgc tgtgcaggag cgccgtaccc 60cgagagccga ccgttcaatg tggctctgaa actgggccat ctccagagtg gatgctacaa 120catgatctaa tccccggaga cttgagggac ctccgagtag aacctgttac aactagtgtt 180gcaacagggg actattcaat tttgatgaat gtaagctggg tactccgggc agatgccagc 240atccgcttgt tgaaggccac caagatttgt gtgacgggca aaagcaactt ccagtcctac 300agctgtgtga ggtgcaatta cacagaggcc ttccagactc agaccagacc ctctggtggt 360aaatggacat tttcctacat cggcttccct gtagagctga acacagtcta tttcattggg 420gcccataata ttcctaatgc aaatatgaat gaagatggcc cttccatgtc tgtgaatttc 480acctcaccag gctgcctaga ccacataatg aaatataaaa aaaagtgtgt caaggccgga 540agcctgtggg atccgaacat cactgcttgt aagaagaatg aggagacagt agaagtgaac 600ttcacaacca ctcccctggg aaacagatac atggctctta tccaacacag cactatcatc 660gggttttctc aggtgtttga gccacaccag aagaaacaaa cgcgagcttc agtggtgatt 720ccagtgactg gggatagtga aggtgctacg gtgcagctga ctccatattt tcctacttgt 780ggcagcgact gcatccgaca taaaggaaca gttgtgctct gcccacaaac aggcgtccct 840ttccctctgg ataacaacaa aagcaagccg ggaggctggc tgcctctcct cctgctgtct 900ctgctggtgg ccacatgggt gctggtggca gggatctatc taatgtggag gcacgaaagg 960atcaagaaga cttccttttc taccaccaca ctactgcccc ccattaaggt tcttgtggtt 1020tacccatctg aaatatgttt ccatcacaca atttgttact tcactgaatt tcttcaaaac 1080cattgcagaa gtgaggtcat ccttgaaaag tggcagaaaa agaaaatagc agagatgggt 1140ccagtgcagt ggcttgccac tcaaaagaag gcagcagaca aagtcgtctt ccttctttcc 1200aatgacgtca acagtgtgtg cgatggtacc tgtggcaaga gcgagggcag tcccagtgag 1260aactctcaag actcttcccc ttgcctttaa ccttttctgc agtgatctaa gaagccagat 1320tcatctgcac aaatacgtgg tggtctactt tagagagatt gatacaaaag acgattacaa 1380tgctctcagt gtctgcccca agtaccacct catgaaggat gccactgctt tctgtgcaga 1440acttctccat gtcaagtagc aggtgtcagc aggaaaaaga tcacaagcct gccacgatgg 1500ctgctgctcc ttgtagccca cccatgagaa gcaagagacc ttaaaggctt cctatcccac 1560caattacagg gaaaaaacgt gtgatgatcc tgaagcttac tatgcagcct acaaacagcc 1620ttagtaatta aaacatttta taccaataaa attttcaaat attgctaact aatgtagcat 1680taactaacga ttggaaacta catttacaac ttcaaagctg ttttatacat agaaatcaat 1740tacagtttta attgaaaact ataaccattt tgataatgca acaataaagc atcttcagcc 1800aaaaaaaaaa aaaaaa 18161411828DNAHomo sapiens 141cggcgatgtc gctcgtgctg ataagcctgg ccgcgctgtg caggagcgcc gtaccccgag 60agccgaccgt tcaatgtggc tctgaaactg ggccatctcc agagtggatg ctacaacatg 120atctaatccc cggagacttg agggacctcc gagtagaacc tgttacaact agtgttgcaa 180caggggacta ttcaattttg atgaatgtaa gctgggtact ccgggcagat gccagcatcc 240gcttgttgaa ggccaccaag atttgtgtga cgggcaaaag caacttccag tcctacagct 300gtgtgaggtg caattacaca gaggccttcc agactcagac cagaccctct ggtggtaaat 360ggacattttc ctatatcggc ttccctgtag agctgaacac agtctatttc attggggccc 420ataatattcc taatgcaaat atgaatgaag atggcccttc catgtctgtg aatttcacct 480caccaggctg cctagaccac ataatgaaat ataaaaaaaa gtgtgtcaag gccggaagcc 540tgtgggatcc gaacatcact gcttgtaaga agaatgagga gacagtagaa gtgaacttca 600caaccactcc cctgggaaac agatacatgg ctcttatcca acacagcact atcatcgggt 660tttctcaggt gtttgagcca caccagaaga aacaaacgcg agcttcagtg gtgattccag 720tgactgggga tagtgaaggt gctacggtgc agctgactcc atattttcct acttgtggca 780gcgactgcat ccgacataaa ggaacagttg tgctctgccc acaaacaggc gtccctttcc 840ctctggataa caacaaaagc aagccgggag gctggctgcc tctcctcctg ctgtctctgc 900tggtggccac atgggtgctg gtggcaggga tctatctaat gtggaggcac gaaaggatca 960agaagacttc cttttctacc accacactac tgccccccat taaggttctt gtggtttacc 1020catctgaaat atgtttccat cacacaattt gttacttcac tgaatttctt caaaaccatt 1080gcagaagtga ggtcatcctt gaaaagtggc agaaaaagaa aatagcagag atgggtccag 1140tgcagtggct tgccactcaa aagaaggcag cagacaaagt cgtcttcctt ctttccaatg 1200acgtcaacag tgtgtgcgat ggtacctgtg gcaagagcga gggcagtccc agtgagaact 1260ctcaagacct cttccccctt gcctttaacc ttttctgcag tgatctaaga agccagattc 1320atctgcacaa atacgtggtg gtctacttta gagagattga tacaaaagac gattacaatg 1380ctctcagtgt ctgccccaag taccacttca tgaaggatgc cactgctttc tgtgcagaac 1440ttctccatgt caagcagcag gtgtcagcag gaaaaagatc acaagcctgc cacgatggct 1500gctgctcctt gtagcccacc catgagaagc aagagacctt aaaggcttcc tatcccacca 1560attacaggga aaaaacgtgt gatgatcctg aagcttacta tgcagcctac aaacagcctt 1620agtaattaaa acattttata ccaataaaat tttcaaatat tactaactaa tgtagcatta 1680actaacgatt ggaaactaca tttacaactt caaagctgtt ttatacatag aaatcaatta 1740cagctttaat tgaaaactgt aaccattttg ataatgcaac aataaagcat cttccaaaaa 1800aaaaaaaaaa aaaaaaaaaa aaaaaaaa 18281422856DNAHomo sapiensmisc_feature(1325)..(1325)a or g or c or t/u 142cggcgatgtc gctcgtgctg ataagcctgg ccgcgctgtg caggagcgcc gtaccccgag 60agccgaccgt tcaatgtggc tctgaaactg ggccatctcc agagtggatg ctacaacatg 120atctaatccc cggagacttg agggacctcc gagtagaacc tgttacaact agtgttgcaa 180caggggacta ttcaattttg atgaatgtaa gctgggtact ccgggcagat gccagcatcc 240gcttgttgaa ggccaccaag atttgtgtga cgggcaaaag caacttccag tcctacagct 300gtgtgaggtg caattacaca gaggccttcc agactcagac cagaccctct ggtggtaaat 360ggacattttc ctatatcggc ttccctgtag agctgaacac agtctatttc attggggccc 420ataatattcc taatgcaaat atgaatgaag atggcccttc catgtctgtg aatttcacct 480caccaggctg cctagaccac ataatgaaat ataaaaaaaa gtgtgtcaag gccggaagcc 540tgtgggatcc gaacatcact gcttgtaaga agaatgagga gacagtagaa gtgaacttca 600caaccactcc cctgggaaac agatacatgg ctcttatcca acacagcact atcatcgggt 660tttctcaggt gtttgagcca caccagaaga aacaaacgcg agcttcagtg gtgattccag 720tgactgggga tagtgaaggt gctacggtgc aggtaaagtt cagtgagctg ctctggggag 780ggaagggaca tagaagactg ttccatcatt cattgctttt aaggatgagt tctctcttgt 840caaatgcact tctgccagca gacaccagtt aagtggcgtt catgggggtt ctttcgctgc 900agcctccacc gtgctgaggt caggaggccg acgtggcagt tgtggtccct tttgcttgta 960ttaatggctg ctgaccttcc aaagcacttt ttattttcat tttctgtcac agacactcag 1020ggatagcagt accattttac ttccgcaagc ctttaactgc aagatgaagc tgcaaagggt 1080ttgaaatggg aaggtttgag ttccaggcag cgtatgaact ctggagaggg gctgccagtc 1140ctctctgggc cgcagcggac ccagctggaa cacaggaagt tggagcagta ggtgctcctt 1200cacctctcag tatgtctctt tcaactctag tttttgaagt ggggacacag gaagtccagt 1260ggggacacag ccactcccca aagaataagg aacttccatg cttcattccc tggcataaaa 1320agtgntcaaa cacaccagag ggggcaggca ccagccaggg tatgatgggt actacccttt 1380tctggagaac catagacttc ccttactaca gggacttgca tgtcctaaag cactggctga 1440aggaagccaa gaggatcact gctgctcctt ttttgtagag gaaatgtttg tgtacgtggt 1500aagatatgac ctagcccttt taggtaagcg aactggtatg ttagtaacgt gtacaaagtt 1560taggttcaga ccccgggagt cttgggcatg tgggtctcgg gtcactggtt ttgactttag 1620ggctttgtta cagatgtgtg accaagggga aaatgtgcat gacaacacta gaggtagggg 1680cgaagccaga aagaagggaa gttttggctg aagtaggagt cttggtgaga ttttgctgtg 1740atgcatggtg tgaactttct gagcctcttg tttttcctca gctgactcca tattttccta 1800cttgtggcag cgactgcatc cgacataaag gaacagttgt gctctgccca caaacaggcg 1860tccctttccc tctggataac aacaaaagca agccgggagg ctggctgcct ctcctcctgc 1920tgtctctgct ggtggccaca tgggtgctgg tggcagggat ctatctaatg tggaggcacg 1980aaaggatcaa gaagacttcc ttttctacca ccacactact gccccccatt aaggttcttg 2040tggtttaccc atctgaaata tgtttccatc acacaatttg ttacttcact gaatttcttc 2100aaaaccattg cagaagtgag gtcatccttg aaaagtggca gaaaaagaaa atagcagaga 2160tgggtccagt gcagtggctt gccactcaaa agaaggcagc agacaaagtc gtcttccttc 2220tttccaatga cgtcaacagt gtgtgcgatg gtacctgtgg caagagcgag ggcagtccca 2280gtgagaactc tcaagacctc ttcccccttg cctttaacct tttctgcagt gatctaagaa 2340gccagattca tctgcacaaa tacgtggtgg tctactttag agagattgat acaaaagacg 2400attacaatgc tctcagtgtc tgccccaagt accacttcat gaaggatgcc actgctttct 2460gtgcagaact tctccatgtc aagcagcagg tgtcagcagg aaaaagatca caagcctgcc 2520acgatggctg ctgctccttg tagcccaccc atgagaagca agagacctta aaggcttcct 2580atcccaccaa ttacagggaa aaaacgtgtg atgatcctga agcttactat gcagcctaca 2640aacagcctta gtaattaaaa cattttatac caataaaatt ttcaaatatt actaactaat 2700gtagcattaa ctaacgattg gaaactacat ttacaacttc aaagctgttt tatacataga 2760aatcaattac agctttaatt gaaaactgta accattttga taatgcaaca ataaagcatc 2820ttccaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 28561431583DNAHomo sapiens 143atgtcgctcg tgctgctaag cctggccgcg ctgtgcagga gcgccgtacc ccgagagccg 60accgttcaat gtggctctga aactgggcca tctccagagt ggatgctaca acatgatcta 120atcccgggag acttgaggga cctccgagta gaacctgtta caactagtgt tgcaacaggg 180gactattcaa ttttgatgaa tgtaagctgg gtactccggg cagatgccag catccgcttg 240ttgaaggcca ccaagatttg tgtgacgggc aaaagcaact tccagtccta cagctgtgtg 300aggtgcaatt acacagaggc cttccagact cagaccagac cctctggtgg taaatggaca 360ttttcctata tcggcttccc tgtagagctg aacacagtct atttcattgg ggcccataat 420attcctaatg caaatatgaa tgaagatggc ccttccatgt ctgtgaattt cacctcacca 480ggctgcctag accacataat gaaatataaa aaaaagtgtg tcaaggccgg aagcctgtgg 540gatccgaaca tcactgcttg taagaagaat gaggagacag tagaagtgaa cttcacaacc 600actcccctgg gaaacagata catggctctt atccaacaca gcactatcat cgggttttct 660caggtgtttg agccacacca gaagaaacaa acgcgagctt cagtggtgat tccagtgact 720ggggatagtg aaggtgctac ggtgcagctg actccatatt ttcctacttg tggcagcgac 780tgcatccgac ataaaggaac agttgtgctc tgcccacaaa caggcgtccc tttccctctg 840gataacaaca aaagcaagcc gggaggctgg ctgcctctcc tcctgctgtc tctgctggtg 900gccacatggg tgctggtggc agggatctat ctaatgtgga ggcacgaaag gatcaagaag 960acttcctttt ctaccaccac actactgccc cccattaagg ttcttgtggt ttacccatct 1020gaaatatgtt tccatcacac aatttgttac ttcactgaat ttcttcaaaa ccattgcaga 1080agtgaggtca tccttgaaaa gtggcagaaa aagaaaatag cagagatggg tccagtgcag 1140tggcttgcca ctcaaaagaa ggcagcagac aaagtcgtct tccttctttc caatgacgtc 1200aacagtgtgt gcgatggtac ctgtggcaag agcgagggca gtcccagtga gaactctcaa 1260gacctcttcc cccttgcctt taaccttttc tgcagtgatc taagaagcca gattcatctg 1320cacaaatacg tggtggtcta ctttagagag attgatacaa aagacgatta caatgctctc 1380agtgtctgcc ccaagtacca cctcatgaag gatgccactg ctttctgtgc agaacttctc 1440catgtcaagc agcaggtgtc agcaggaaaa agatcacaag cctgccacga tggctgctgc 1500tccttgtagc ccacccatga gaagcaagag accttaaagg gttccttttc ccatcattta 1560caggggaaaa acgtgtgatg atc 15831442584DNAHomo sapiens 144catattagag tctacagata tgcctttctt acagcaatcc tgcacccaca taaaagctac 60attttcaata caagattaaa aggtattctg caaaatgtgc aaggttttca tgtctgctgg 120tgtagctgta gtgatggctt catgaatttt tttctttttt gactatggtc cttacgctgg 180attcatttat cttgaaatgg tgaacaatca cagctgcaga ccctcaattt atggtacata 240tcaagcaatt tggctttttt tcttgtaatg aaaaaaaaaa gttttttttg ctttttttca 300tgacactgct tcttgggagc actgccagca ttactagtgg cacttcgtat gggtcctaag 360gtgttattga aggtttacga tattgcacta aacacgaaaa ataccagaga accactggag 420atacttttta ctgtgatatg taatttactg gagacaggaa ctgctcgttt ggagatggtt 480agcatcacag ggtgttttaa gtcgatactt gcaacccttg agctcaccac agtagcaaca 540ggaggtggct aggaaattat tcacagcagg acagtacgca ctgcaattaa ttgtatgcag 600ttatgattta ataccacatc tttatgctca cgtttctctc aactgtgaat ggtgccatgt 660acagttggta tgtgtgtgtt taagttttga taaattttta acttttaata gttaaaatag 720ttaactattg gtatggtagg aaatgataaa gtagactagt atctgtatac attttctgca 780tttatgacat acctttttct tcattttttt caatatttta attgaaaagt tcatccgagt 840ttcatctaag ttttttcaaa gtgatacaaa tctccaaaaa attttccaat atatgtattg 900aaaaaatcca ggtgtaagtg gctctgcgca gtccaaacct gtgttgttca agggtcaact 960gtgtatgaat ccaagcgaaa gcttttctta acacctcata agaactattt tttaaaaaac 1020aggaactagc atagagtaac catcacaggt aaagtgtaat ttgttatcag ccatcttttg 1080cccatttcag tactggtaga aggctcaatg gtaaaaataa aaacgggaca gtcagaagat 1140ctggaagtcc tgaccctgct ttcacctggc atgtgtaatc cagtcatgct cgtatcagtc 1200tctgtaggag cacttgaagg tattacataa atgctatcta actctgggaa acgccaacat 1260gtgattgcct ccagaggaat cttctttaaa aaaaaattca aaatgttatt tccttactag 1320gatgtcttta aagaattata acccttaccg tgcctccaca ttagatagat ccctgccacc 1380agcacccatg tggccaccag cagagacagc aggaggagag gcagccagcc tcccggcttg 1440cttttgtctg gaaaaaacaa agcttattca cctttggaaa acaaatccac acttatctct 1500taatttaaaa actaagactt ggtatacttt atagaggttt atttattttt tattattttt 1560tagttttgag acagagtctc gctttgttgc ctaggctgga gtgcagtggc gcaatctcgg 1620ttcactgcag cctccgtctc ccgggttcaa gcaatgctgc ctcagcctcc tgagtagctg 1680ggattacagg catgtgtcac cgcgcccagc cactttgtag agatttagat ccctttaaaa 1740ccatcagtca gaagctcttt agatagtctg ccaatcatat ctttttccct agagtgtgca 1800ggtcttgcat tagattctca aaagggatat gggacccagg aagttaagaa cagtcctaaa 1860atctctttgg cttctttgtc ctgatatgca ccggcatttt cacagtagga actagggttt 1920ctgtccagtt tttttggttc tttaaggaat taatgttatt ctgggtacaa ctgcttacat 1980acatagcaca tatagatgac atttttacag gccgtcttgt tagactgaca tacatggagg 2040atagtgccac ccgcctcaca agaacatcag gtaagctcag gcacagagtg cccaggaatc 2100tgtaaggctt cgcccacgca caagtcaggg ctgccagtca cctgggttgt cttcacttta 2160tttggctgcg tctaatgaca ccttccaact tttgacccca cccctggact gttgtgtaaa 2220cattgtattt ctccatctgt aatgaaaaag ctaacacatc tctaactcca gagacatttt 2280ccagaacatg ctgttctcag gcactagtga ggcggtacca ttattcctca tttgttatcc 2340aaatgttggc catgtgacca caccaaaagc tcatcctggg ccactgagac tagtaattga 2400atcagaatat agtgaaatat tcattctcat atatacccag ccatcttaca tctttggctt 2460ttttcagcag atccttgtgg cactcagaac atccattttg cactgtgtat ttttttccct 2520tctgtgtatc ctgctttgta aagagtcacg agtggtttta caaataaagc ctgttcttac 2580tcag 2584145665DNAHomo sapiens 145tttttttttt ttttttctga gtaagaacag gctttatttg taaaaccact cgtgactctt 60tacaaagcag gatacacaga agggaaaaaa atacacagtg caaaatggat gttctgagtg 120ccacaaggat ctgctgaaaa aagccaaaga tgtaagatgg ctgggtatat atgagaatga 180atatttcact atattctgat tcaattacca gtctcagtgg cccaggatga gcttttggtg 240tggtcacatg gccaacattt ggataacaaa tgaggaataa tggtaccgcc tcactagtgc 300ctgagaacag catgttctgg aaaatgtctc tggagttaga gatgtgttag ctttttcatt 360acagatggag aaatacaatg tttacacaac agtccagggg tggggtcaaa agttggaagg 420tgtcattaga cgcagccaaa taaagtgaag acaacccagg tgactggcag ccctgacttg 480tgcgtgggcg aagccttaca gattcctggg cactctgtgc ctgagcttac ctgatgttct 540tgtgaggcgg gtggcactat cctccatgta tgtcagtcta acaagacggc ctgtaaaaat 600gtcatctata tgtgctatgt atgtaagcag ttgtacccag aataacatta atcctcgtgc 660cgaat 665146664DNAHomo sapiensmisc_feature(613)..(613)a or g or c or t/u 146tttttttttt tttttgttgg gctgaagatg ctttattatt gcattatcaa aatggttata 60gttttcaatt aaaactgtaa ttgatttcta tgtataaaac agctttgaag ttgtaaatgt 120agtttccaat cgttagttaa tgctacatta gttagcaata tttgaaaatt ttattggtat 180aaaatgtttt aattactaag gctgtttgta ggctgcatag taagcttcag gatcatcaca 240cgttttttcc ctgtaattgg tgggatagga agcctttaag gtctcttgct tctcatgggt 300gggctacaag gagcagcagc catcgtggca ggcttgtgat ctttttcctg ctgacacctg 360ctgcttgaca tggagaagtt ctgcacagaa agcagtggca tccttcatga ggtggtactt 420ggggcagaca ctgagagcat tgtaatcgtc ttttgtatca atctctctaa agtagaccac 480cacgtatttg tgcagatgaa tctggcttct tagatcactg cagaaaaggt taaaggcaag 540ggggaagagg tcttgagagt tctcactggg actgccctcg ctcttgccac aggtaccatc 600gcacacactg ttnacgtcat tggaaagaag gaagacgact ttgtctgctg ccttcttttg 660agtg 664147739DNAHomo sapiens 147tggtttttgt ttttttttca ttttctgttg gattacagaa aaagaatggg acccattcag 60gtctcgattt ccaaaggtaa agatggaagg ctgggcagac tggcttttgt tacctgacat 120gccgtagggt gagcttagag gaagaaagaa aacaattttt atttggccaa aacagaacaa 180atgctgaaaa ggaaatcttg tttttttcct aaagccaaat agaaatgatt tgggtataat 240ttaagagtcc ttgtgttgta cagatatggt gactgatgta gttattaata ctaccaactt 300agtcatcaag cctcaatttt cctttacctg aaggattaag tgaaagcttt tggagttcat 360gatgttcagt atgatcagtt aaccttaacc tctgagcatc ctgaagcaaa atctaaataa 420tgcagctatt accactggtg gtccaggctc tggtgaagcc ctctgagccc aggaggaaga 480gaaagcattg tccagaggta ggaacacagt ctgggagccc agagctctgg gaggagtggg 540aaaatgctgc ttcctgctgc ttgcttctag gcacctgctt ccgccatctc acttaccatg 600gctagagatg ggggtgagac tggggaagga cacaagcagg gaacagataa gggatggaaa 660tcagaaggga atatagaaag aactctggat gtggagacat gccggtacct gagcattttg 720tatcaatggg agtacctct 739148657DNAHomo sapiens 148tttttttttt ttttttttgg ctgaagatgc tttattgttg cattatcaaa atggttacag 60ttttcaatta aagctgtaat tgatttctat gtataaaaca gctttgaagt tgtaaatgta 120gtttccaatc gttagttaat gctacattag ttagcaatat ttgaaaattt tattggtata 180aaatgtttta attactaagg ctgtttgtag gctgcatagt aagcttcagg atcatcacac 240gtttttttcc ctgtaattgg tgggatagga agcctttaag gtctcttgct tctcatgggt 300gggctacaag gagcagcagc catcgtggca ggcttgtgat ctttttcctg ctgacacctg 360ctgcttgaca tggagaagtt ctgcacagaa agcagtggca tccttcatga ggtggtactt 420ggggcagaca ctgagagcat tgtaatcgtc ttttgtatca atctctctaa agtagaccac 480cacgtatttg tgcagatgaa tctggcttct tagatcactg cagaaaaggt taaaggcaag 540ggggaagagg tcttgagagt tctcactggg acttgcctcg ctcttgccac aggtaccatc 600gcacacactg ttgacgtcat

tggaaagaaa gaagacgact ttgtctgctg ccttctt 657149102DNAHomo sapiens 149gctgaagatg ctttattgtt gcattatcaa aatggttaca gttttcaatt aaagctgtaa 60ttgatttcta tgtataaaac agctttgaag ttgtaaatgt ag 102150187DNAHomo sapiens 150cacgcgtccg attttatacc aataaaattt tcaaatattg ctaactaatg tagcattaac 60taacgattgg aaactacatt tacaacttca aagctgtttt atacatagaa atcaattaca 120gctttaattg aaaactgtaa ccattttgat aatgcaacaa taaagcatct tcagccaaaa 180aaaaaaa 187151361DNAHomo sapiens 151agaaaaagaa aatagcagag atgggtccag tgcagtggct tgcataaaaa agaaggcagc 60agacaaagtc gtcttccttc tttccaatga cgtcaacagt gtgtgcgatg gtacctgtgg 120caagagcgag ggcagtccca gtgagaactc tcaagacctc ttcccccctt gcctttaacc 180ttttctgcag tgatctaaga agccagattc atctgcacaa atacgtggtg gtctacttta 240gagagattga tacaaaagac gattacaatg ctctcagtgt ctgccccaag taccacctca 300tgaaggatgc cactgctttc tgtgcagaac ttctccatgt caagcagcag gtttcagcag 360g 361152783DNAHomo sapiensmisc_feature(707)..(707)a or g or c or t/u 152tttttttttt ttttgtttgg ctgaagatgc tttattgttg cattatcaaa atggttacag 60ttttcaatta aagctgtaat tgatttctat gtataaaaca gctttgaagt tgtaaatgta 120gtttccaatc gttagttaat gctacattag ttagcaatat ttgaaaattt tattggtata 180aaatgtttta attactaagg ctgtttgtag gctgcatagt aagcttcagg atcatcacac 240gttttttccc tgtaattggt gggataggaa gcctttaagg tctcttgctt ctcatgggtg 300ggctacaagg agcagcagcc atcgtggcag gcttgtgatc tttttcctgc tgacacctgc 360tgcttgacat ggagaagttc tgcacagaaa gcagtggcat ccttcatgag gtggtacttg 420gggcagacac tgagagcatt gtaatcgtct tttgtatcaa tctctctaaa gtagaccacc 480acgtatttgt gcagatgaat ctggcttctt agatcactgc agaaaaggtt aaaggcaagg 540gggaagaggt cttgagagtt ctcactggga ctgccctcgc tcttgccaca ggtaccatcg 600cacacactgt tgacgtcatt ggaaagaagg aagacgactt tgtctgctgc cttcttttga 660gtggcaagcc actgcactgg acccatctct gctattttct ttttctngca cttttcaagg 720atgactcact tctgcaatgg tttttgagaa ttcagtgaag tacaaatgtg tgatggaaca 780tat 783153399DNAHomo sapiens 153cgctcgtgct gctaagcctg gccgcgctgt gcaggagcgc cgtaccccga gagccgaccg 60ttcaatgtgg ctctgaaact gggccatctc cagagtggat gctacaacat gatctaatcc 120ccggagactt gagggacctc cgagtagaac ctgttacaac tagtgttgca acaggggact 180attcaatttt gatgaatgta agctgggtac tccgggcaga tgccacacca gaagaaacaa 240acgcgagctt cagtggtgat tccagtgact ggggatagtg aaggtgctac ggtgcagctg 300actccatatt ttcctacttg tggcagcgac tgcatccgac ataaaggaac agttgtgctc 360tgcccacaaa caggcgtccc tttccctctg gataacaac 399154518DNAHomo sapiensmisc_feature(503)..(503)a or g or c or t/u 154gctgagtgtg atggtgtaag cctgtggtcc cagctactag ggaggctgag atgggattac 60aggtgtgagc cacggcgcct ggcctaaaag catctttttc tttaacgcag aggttatgtt 120gtattattag cataaatgtt tttttctggg aatgcttatt tcacacagca caatactgaa 180tcttctctgg aatgtggatc gatttcagat ggatgactat taaaatgtgt atatttgcag 240attatcctta aagggccacc tcatgccttc taatttatgt cttacggata aaaaatcaaa 300atgaagcata aagtaaaaac tgtgtccagc tttacaagtg gacgcttagt aatggctgag 360gcaatatgtt taatgtagca aattttactt atttgtcatg atcagttttc acagtgcttg 420taagtgctgg taatagaaga tggacatggt ttaggtcaaa acttggacca gaaaccaact 480tcctttgaaa cagctctacc agntataaga gcaatatg 518155490DNAHomo sapiens 155ctgttgacgt cattggaaag aaggaagacg actttgtctg ctgccttctt ttgagtggca 60agccactgca ctggacccat ctctgctatt ttctttttct gccacttttc aaggatgacc 120tcacttctgc aatggttttg aagaaattca gtgaagtaac aaattgtgtg atggaaacat 180atttcagatg ggtaaaccac aagaacctta atggggggca gtagtgtggt ggtagaaaag 240gaagtcttct tgatcctttc tgtgagagga gaaaagcatt tgttatctgt gaacagcaaa 300cagcaggctt tcactctgta aaccatccct gacaaatgat cccttgctag agaatgtcag 360ctgagcacca agggccttgt tagtgacagc aaggaaaaac atcctgatgt tccttttgaa 420cacatcacct gaaacacact gatgcttaaa ccttaacttt tttttttttg gagacacagt 480ctcactctgt 490156421DNAHomo sapiens 156tttttttttt ttttttttct gagtaagaac aggctttatt tgtaaaacca ctcgtgactc 60tttacaaagc aggatacaca gaagggaaaa aaatacacag tgcaaaatgg atgttctgag 120tgccacaagg atctgctgaa aaaagccaaa gatgtaagat ggctgggtat atatgagaat 180gaatatttca ctatattctg attcaattac cagtctcagt ggcccaggat gagcttttgg 240tgtggtcaca tggccaacat ttggataaca aatgaggaat aatggtaccg cctcactagt 300gcctgagaac agcatgttct ggaaaatgtc tctggagtta gagatgtgtt agctttttca 360ttacagatgg agaaatacaa tgtttacaca acagtccagg ggtggggtca aaagttggaa 420g 421157547DNAHomo sapiens 157tttttttttt ttttttttgg ctgaagatgc tttattgttg cattatcaaa atggttatag 60ttttcaatta aaactgtaat tgatttctat gtataaaaca gctttgaagt tgtaaatgta 120gtttccaatc gttagttaat gctacattag ttagcaatat ttgaaaattt tattggtata 180aaatgtttta attactaagg ctgtttgtag gctgcatagt aagcttcagg atcatcacac 240gttttttccc tgtaattggt gggataggaa gcctttaagg tctcttgctt ctcatgggtg 300ggctacaagg agcagcagcc atcgtggcag gcttgtgatc tttttcctgc tgacacctgc 360tgcttgacat ggagaagttc tgcacagaaa gcagtggcat ccttcatgag gtggtacgtg 420gggcagacac tgagagcatt gtaatcgtct tttgtatcaa tctctctaaa gtagaccacc 480acgtatttgt gcagatgaat ctggcttctt agatcactgc agaaaaggtt aaaggcaagg 540gggaaga 547158644DNAHomo sapiens 158tttttttttt ttttttttga aagggtcagg acttccagat cttctgactg tcccgttttt 60atttttacca ttgagccttc taccagtact gaaatgggca aaagatggct gataacaaat 120tacactttac ctgtgatggt tactctatgc tagttcctgt tttttaaaaa atagttctta 180tgaggtgtta agaaaagctt tcgcttggat tcatacacag ttgacccttg aacaacacag 240gtttggactg cgcagagcca cttacacctg gattttttca atacatatat tggaaaattt 300tttggagatt tgtatcactt tgaaaaaact tagatgaaac tcggatgaac ttttcaatta 360aaatattgaa aaaaatgaag aaaaaggtat gtcataaatg cagaaaatgt atacagatac 420tagtctactt tatcatttcc taccatacca atagttaact attttaacta ttaaaagtta 480aaaatttatc aaaacttaaa cacacacata ccaactgtac atggcaccat tcacagttga 540gagaaacgtg agcataaaga tgtggtatta aatcataact gcatacaatt aattgcagtg 600cgtactgtcc tgctgtgaat atttcctagc cctcgtgccg aatc 644159470DNAHomo sapiens 159gtgggtgacc gtggcttgcc actcaaaaga aggcagcaga caaagtcgtc ttccttcttt 60ccaatgacgt caacagtgtg tgcgatggta cctgtggcaa gagcgagggc agtcccagtg 120agaactctca agacctcttc ccccttgcct ttaacctttt ctgcagtgat ctaagaagcc 180agattcatct gcacaaatac gtggtggtct actttagaga gattgataca aaagacgatt 240acaatgctct cagtgtctgc cccaagtacc acctcatgaa ggatgccact gctttctgtg 300cataacttct ccatgtcaag cagcaggtgt cagcaggaaa aagatcacaa gcctgccacg 360atggctgctg ctccttgtag cccacccatg agaagcaaga gaccttaaag gcttcctatc 420ccaccaatta cagggaaaaa aacgtgtgat gatcctgaag ccacggtcaa 470160499DNAHomo sapiens 160tagaggatcc cggtcgacgg tggttcagtg atcatcacac tttttccctg taataggtgg 60gataggaagc ctttaaggtc tcttgcttct catgggtggg ctacaaggag cagcagccat 120cgtggcaggc ttgtgatctt tttcctgctg acacctgctg cttgacatgg agaagttatg 180cacagaaagc agtggcatcc ttcatgaggt ggtacttggg gcagacactg agagcattgt 240aatcgtcttt tgtatcaatc tctctaaagt agaccaccac gtatttgtgc agatgaatct 300ggcttcttag atcactgcag aaaaggttaa aggcaagggg gaagaggtct tgagagttct 360cactgggact gccctcgctc ttgccacagg taccatcgca cacactgttg acgtcattgg 420aaagaaggaa gacgactttg tctgctgcct tcttttgagt ggcaagccac ggtcaaccca 480caagccacgg tcaacccac 499161615DNAHomo sapiens 161tctacgtggt aagatatgac ctagcccttt taggtaagcg aactggtatg ttagtaacgt 60gtacaaagtt taggttcaga ccccgggagt cttgggcatg tgggtctcgg gtcactggtt 120ttgactttag ggctttgtta cagatgtgtg accaagggga aaatgtgcat gacaacacta 180gaggtagggg cgaagccaga aagaagggaa gttttggctg aagtaggagt cttgcgactg 240catccgacat aaaggaacag ttgtgctctg cccacaaaca ggcgtccctt tccctctgga 300taacaacaaa agcaagccgg gaggctggct gcctctcctc ctgctgtctc tgctggtggc 360cacatgggtg ctggtggcag ggatctatct aatgtggagg cacgaaagga tcaagaagac 420ttccttttct accaccacac tactgccccc cattaaggtt cttgtggttt acccatctga 480aatatgtttc catcacacaa tttgttactt cactgaattt cttcaaaacc attgcagaag 540tgaggtcatc cttgaaagtg gcagagtagc agagatgggt ccagtgcagt ggcttgccac 600tcgtgcgatg gtctt 615162636DNAHomo sapiensmisc_feature(50)..(50)a or g or c or t/umisc_feature(203)..(203)a or g or c or t/umisc_feature(323)..(323)a or g or c or t/umisc_feature(463)..(463)a or g or c or t/umisc_feature(467)..(467)a or g or c or t/umisc_feature(502)..(502)a or g or c or t/umisc_feature(507)..(507)a or g or c or t/umisc_feature(595)..(595)a or g or c or t/umisc_feature(600)..(600)a or g or c or t/u 162ggcacgagca ctggctgaag gaagccaaga ggatcactgc tgctcctttn ttctagagga 60aatgtttgtc tacgtggtaa gatatgacct agccctttta ggtaagcgaa ctggtatgtt 120agtaacgtgt acaaagttta ggttcagacc ccgggagtct tgggcatgtg ggtctcgggt 180cactggtttt gactttaggg ctntgttaca gatgtgtgac caaggggaaa atgtgcatga 240caacactaga gctgactcca tattttccta cttgtggcag cgactgcatc cgacataaag 300gaacagttgt gctctgccca canacaggcg tccctttccc tctggataac aacataagca 360agccgggagg ctggctgcct ctcctcctgc tgtctctgct ggtggcacat gggtgctggt 420ggagggatct atctaatgtg gaggcacgga tcaagaagac ttncttntct accaccacac 480tactggcccc aataagggtc tngtggntac cccatctgaa tatgttcata cacaatttgt 540actcactgaa ttctcaaaac attgagagtg aggcatcctg aaagtgcgaa aaganatgcn 600aatggtcagt gcatgctgca ctagcagcat ggactt 636163676DNAHomo sapiens 163gatcccgcgc agtggcccgg cgatgtcgct cgtgctgcta agcctggccg cgctgtgcag 60gagcgccgta ccccgagagc cgaccgttca atgtggctct gaaactgggc catctccaga 120gtggatgcta caacatgatc taatccccgg agacttgagg gacctccgag tagaacctgt 180tacaactagt gttgcaacag gggactattc aattttgatg aatgtaagct gggtactccg 240ggcagatgcc agcatccgct tgttgaaggc caccaagatt tgtgtgacgg gcaaaagcaa 300cttccagtcc tacagctgtg tgaggtgcaa ttacacagag gccttccaga ctcagaccag 360accctctggt ggtaaatgga cattttccta catcggcttc cctgtagagc tgaacacagt 420ctatttcatt ggggcccata atattcctaa tgcaaatatg aatgaagatg gcccttccat 480gtctgtgaat ttcacctcac caggctgcct agaccacata atgaaatata aaaaaaagtg 540tgtcaaggcc ggaagcctgt gggatccgaa catcactgct tgtaagaaga atgaggagac 600agtagaagtg aacttcacaa ccactcccct gggaaacaga tacatggctc ttatccaaca 660cagcactatc attcgg 676164722DNAHomo sapiens 164gtcttgcatt agattctcaa aagggatatg ggacccagga agttaagaac agtcctaaaa 60tctctttggc ttctttgtcc tgatatgcac cggcattttc acagtaggaa ctagggtttc 120tgtccagttt ttttggttct ttaaggaatt aatgttattc tgggtacaac tgcttacata 180catagcacat atagatgaca tttttacagg ccgtcttgtt agactgacat acatggagga 240tagtgccacc cgcctcacaa gaacatcagg taagctcagg cacagagtgc ccaggaatct 300gtaaggcttc gcccacgcac aagtcagggc tgccagtcac ctgggttgtc ttcactttat 360ttggctgcgt ctaatgacac cttccaactt ttgaccccac ccctggactg ttgtgtaaac 420attgtatttc tccatctgta atgaaaaagc taacacatct ctaactccag agacattttc 480cagaacatgc tgttctcagg cactagtgag gcggtaccat tattcctcat ttgttatcca 540aatgttggcc atgtgaccac accaaaagct catcctgggc cactgagact ggtaattgaa 600tcagaatata gtgaaatatt cattctcata tatacccagc catcttacat ctttggcttt 660tttcagcaga tccttgtggc actcagaaca tccattttgc actgtgtatt tttttccctt 720ct 722165335DNAHomo sapiens 165tgtgtaactc tcaagacctc ttcccccttg cctttaacct tttctgcagt gatctaagaa 60gccagattca tctgcacaaa tacgtggtgg tctactttag agagattgat acaaaagacg 120attacaatgc tctcagtgtc tgccccaagt accacctcat ggaggatgcc actgctttct 180gtgcagaact tctccatgtc aagtagcagg tgtcagcagg aaaaagatca caagcctgcc 240acgatggctg ctgctccttg tagcccaccc atgagaagca agagacctta aaggcttcct 300atcccaccaa ttacagggaa aaaacgtgtg atgat 335166680DNAHomo sapiensmisc_feature(591)..(591)a or g or c or t/u 166ctgaaatatg tttccatcac acaatttgtt acttcactga atttcttcaa aaccattgca 60gaagtgaggt catccttgaa aagtggcaga aaaagaaaat agcagagatg ggtccagtgc 120agtggcttgc cactcaaaag aaggcagcag acaaagtcgt cttccttctt tccaatgacg 180tcaacagtgt gtgcgatggt acctgtggca agagcgaggg cagtcccagt gagaactctc 240aagacctctt cccccttgcc tttaaccttt tctgcagtga tctaagaagc cagattcatc 300tgcacaaata cgtggtggtc tactttagag agattgatac aaaagacgat tacaatgctc 360tcagtgtctg ccccaagtac cacctcatga aggatgccac tgctttctgt gcagaacttc 420tccatgtcaa gtagcaggtg tcagcaggaa aaagatcaca agcctgccac gatggctgct 480gctccttgta gcccacccat gagaagcaag agaccttaaa ggcttcctat cccaccaatt 540acagggaaaa aaacgtgtga tgatccctga agcttactat gcagcctaca nacagcctta 600gtaataaaac attttatcca ataaaatttc aaattttgct taactatgtg cataaactac 660gattgaaaac tctttacact 680167491DNAHomo sapiens 167cattgtggtt gcagctgcat agtaagcttc aggatcatca cacgtttttt ccctgtaatt 60ggtgggatag gaagccttta aggtctcttg cttctcatgg gtgggctaca aggagcagca 120gccatcgtgg caggcttgtg atctttttcc tgctgacacc tgctgcttga catggagaag 180ttctgcacag aaagcagtgg catccttcat gaggtggtac ttggggcaga cactgagagc 240attgtaatcg tcttttgtat caatctccct aaagtagacc accacgtatt tgtgcagatg 300aatctggctt cttagatcac tgcagaaaag gttaaaggca agggggaaga ggtcttgaga 360gttctcactg ggactgccct cgctcttgcc acaggtacca tcgcacacac tgttgacgtc 420attggaaaga aggaagacga ctttgtctgc tgccttcttt tgagtggcaa gccactgcac 480tggacccatc t 491168533DNAHomo sapiens 168gtgaataagc tttgtttttt ccagacaaaa gcaagccagg aggctggctg cctctcctcc 60tgctgtctct gctggtggcc acatggttgc tggtggcagg gatctatcta atgtggaggc 120acggtaaggg ttataattct ttaaagtcat cctagtaagg aaataacatt tggaattttt 180ttttaaagaa gattcctctg gaggcaatca cctgttggcg tttcccagag ttagatagca 240tttatgtaat accttcaagt gctcctacag agactgatac gagcatgact ggattacaca 300tgccaggtga aagcagggcc aggacttcca gatcttctga ctgtcccgtt tttattttta 360ccattgagcc ttctaccaga actgaaatgg gcaaaagatg gctgataaca aattacactt 420tacctgtgat ggttactcta tgctagttcc tgtttttaaa aaaatagttc ttatgaggtg 480tcaagaaaag ctttcgcttg gattcataca cagttgaccc ttgaacaaca cag 533169218DNAHomo sapiensmisc_feature(119)..(120)a or g or c or t/umisc_feature(199)..(199)a or g or c or t/u 169gatcctgaag cttactatgc agcctacaaa cagccttagt aattaaaaca ttttatacca 60ataaaatttt caaatattgc taactaatgt agcattaact aacgattgga aactacatnn 120acaacttcaa agctgtttta tacatagaaa tcaattacag ctttaattga aaactataac 180cattttgata atgcaacant aaagcatctt cagccaaa 218170703DNAHomo sapiensmisc_feature(554)..(554)a or g or c or t/umisc_feature(703)..(703)a or g or c or t/u 170gcaacttcca gtcctacagc tgtgtgaggt gcaattacac agaggccttc cagactcaga 60ccagaccctc tggtggtaaa tggacatttt cctatatcgg cttccctgta gagctgaaca 120cagtctattt cattggggcc cataatattc ctaatgcaaa tatgaatgaa gatggccctt 180ccatgtctgt gaatttcacc tcaccaggct gcctagacca cataatgaaa tataaaaaaa 240agtgtgtcaa ggccggaagc ctgtgggatc cgaacatcac tgcttgtaag aagaatgagg 300agacagtaga agtgaacttc acaaccactc ccctgggaaa cagatacatg gctcttatcc 360aacacagcac tatcatcggg ttttctcagg tgtttgagcc acaccagaag aaacaaacgc 420gagcttcagt ggtgattcca gtgactgggg atagtgaagg tgctacggtg cagctgactc 480catattttcc tacttgtggc agcgactgca tccgacataa aggaacagtt gtgctctgcc 540cacaaacagg cgtncctttt cctctggata acaacaaaag caagccggga ggcttggctg 600ctctccttct gctggccttt gctgtggcca cattggtgct ggtggcaggg atctatctaa 660tgtggatgca cgtctcgtgg tttacccatc tgaaatatgt tcn 703171893DNAHomo sapiensmisc_feature(798)..(798)a or g or c or t/umisc_feature(805)..(805)a or g or c or t/u 171atttttcctc ttgtggcagc gactggcatc cgacataaag gaacagttgt gctctgccca 60caaacaggcg tccctttccc tctggataac aacaaaagca agccgggagg ctggctgcct 120ctcctcctgc tgtctctgct ggtggccaca tgggtgctgg tggcagggat ctatctaatg 180tggaggcacg aaaggatcaa gaagacttcc ttttctacca ccacactact gccccccatt 240aaggttcttg tggtttaccc atctgaaata tgtttccatc acacaatttg ttacttcact 300gaatttcttc aaaaccattg cagaagtgag gtcatccttg aaaagtggca gaaaaagaaa 360atagcagaga tgggtccagt gcagtggctt gccactcaaa agaaggcagc agacaaagtc 420gtcttccttc tttccaatga cgtcaacagt gtgtgcgatg gtacctgtgg caagagcgag 480ggcagtccca gtgagaactc tcaagacctc ttcccccttg cctttaacct tttctgcagt 540gatctaagaa gccagattca tctgcacaaa tacgtggtgg tctactttag agagattgat 600acaaaagacg attacaatgc tctcagtgtc tgccccaagt accacctcat gaaggatgcc 660actgctttct gtgcagaact tctccatgtc aagcagcagg tgtcagcagg aaaaagatca 720caagcctgcc acgatggctg ctgctccttg tagcccaccc atgagaagca agagacctta 780aggcttctat cccaccanta caggnaaaaa cgtgtgatga tcctgaagct tactatgcag 840cctacaacag gcttagtatt aaaacattta tacccataaa ttttcaaatt gct 893172959DNAHomo sapiens 172taggtgacac tatagaacaa gtttgtacaa aaaagcaggc tggtaccggt ccggaattcc 60cgggatagtg gmccggcgak gtcgctcgtg ctgctaagcc tggccgcgct gtgcaggagc 120gccgtacccc gagagccgac cgttcaatgt ggctctgaaa ctgggccatc tccaragtgg 180atgskacaac atgatctaat cccgggagac ttgagggacc tccgagtaga acctgttaca 240actagtgttg caacagggga ctattcaatt ttgatgaatg taagctgggt actccgggsa 300gatgccagca tccgcttgtt gaaggccacc aagatttgtg tgamgggcaa aagcaacwtc 360cagtcctaca gcwgtgtgag gtagcaatta cacagagagc acatatccag actctagacc 420agaccctctg gwggtaaatg gacattttcc tatatcggct tccctgtaga gctgaacaca 480gtctatattc attggggccc awaatawwcc taatgcaaat atgaatgaag atggcccttc 540catgtctgtg aatttcacct caccaggctg cctagaccac ataatgaaat awaaaaaaaa 600gtgtgtcaag gccggaagcc tgtgggatcc gaacatcact gcttgtaaga agaatgarga 660gacagtagaa gtgaacttca caaccactcc cctgggaaac agatamatkg ctcttatcca 720acacarmact atcatcgggt tttctcaggt gtttgagcca caccagaaga aacaaacgcg 780agcttcagtg gtgattccag tgactgggga tagtgaaggt gctacggtgc agctgactcc 840atattttcct acttgtggca gcgwctgcat ccgacataaa ggaacagttg tgctctgccc 900acaaacaggc gtccctttyc ctctggataa caacaaaagc aacygggags tggytgyct 9591731200DNAHomo sapiensmisc_feature(15)..(15)a or g or c or

t/umisc_feature(35)..(35)a or g or c or t/umisc_feature(43)..(43)a or g or c or t/umisc_feature(45)..(45)a or g or c or t/umisc_feature(48)..(48)a or g or c or t/u 173waatwakadd ratanhtgaa aactataacc atttntgata atngnaanaa taaagcatct 60tcagccaaac atctagtctt ccatagacca tgcattgcag tgtacccaga wctgtttagc 120taatattcta tgtttaatta atgaatacta actctaagaa cccctcactg attcactcaa 180tagcatctta agtgaaaaac cttctattac atgcaaaaaa tcattgtttt taagataaca 240aaagtaggga ataaacaagc tgaacccact tttactggac caaatgatct attatatgtg 300taaccacttg tatgatttgg tatttgcata agaccttccc tctacaaact agattcatat 360cttgattctt gtacaggtgc cttttaacat gaacaacaaa atacccacaa acttgtctac 420ttttgcctaa agttacctat tagaggtcac tgtsagagtk ctcagtttct tagttactat 480ttaastttts atgttcaaaa tgaaaataat tctkaagtkg aaagsgctct tgaagtaacc 540tttttataaa tgagttatta taatggttta cttaaataaa avagaggggk ttttgcggtg 600gctcatgcct ccaatcccag cactttggca aggccaaggc aaaavgatcg ctcaagacca 660ggctacgtca caaagcgaga cctccatctc tacaaaagat ttaaaaaatt agctgagtgt 720gatggtgtga gcctgtggtc ccagctacta gggaggctga gatgggagga tcacttgagc 780cctggaggtc aagggtgcag taaacggtga ttgtgccact gcactccatc ctgggtgaga 840gcagaccctg tctaaaacaa acaaacgaaa aaacccccac agaatgacag aacataaaag 900atgcacattt tgtcttccaa ctttttactc ttctaaaagc atctttttta aattttttaa 960attttttttt ttttgagaca gagtttcact ctgtcacaca ggctggagtg mgtggcgtga 1020ctcggctcac tamaactctg cytccggggt yacscatctc ctgcwcagct cctgagaagc 1080kggayamagg mccacacaaa ccagtaaytt tatwttttga aaaagggtty acctgtasma 1140graggctgaa tccgacmaar tmaccmccac yycaaadgag gawaagkgkr smggscbggc 1200174899DNAHomo sapiensmisc_feature(483)..(483)a or g or c or t/umisc_feature(485)..(485)a or g or c or t/u 174ttatgggggg cagtagtgtg gtggtagaaa aggaagtctt cttgatcctt tcgtgcctcc 60cattagatag atccctgcca ccagcaccca tgtggccacc agcagagaca gcaggaggag 120aggcagccag cctcccggct tgcttttgtt gttatccaga gggaaaggga cgcctgtttg 180tgggcagagc acaactgttc ctttatgtcg gatgcagtcg ctgccacaag taggaaaata 240tggagtcagc tgcaccgtag caccttcact atccccagtc actggaatca ccactgaagc 300tcgcgtttgt ttcttctggt gtggctcaaa cacctgagaa aacccgatga tagtgctgtg 360ttggataaga gccatgtatc tgtttcccag gggagtggtt gtgaagttca cttctactgt 420ctcctcattc ttcttacaag cagtgatgtt cggatcccac aggcttccgg ccttgacaca 480ctntntttta tatttcatta tgtggtctag gcagcctggt gaggtgaaat tcacagacat 540ggaagggcca tcttcattca tatttgcatt aggaatatta tgggccccaa tgaaatagac 600tgtgttcagc tctacagggg aagccgatat aggaaaatgt ccatttacca ccagagggtc 660tggtctgagt cttgaaggcc ttttgtgtta ttgcacctta cacagctgtt agactgggaa 720gttgcttttg ccccgcacac aaatcttgtg ggccttcaac agcggatgct gccatttgcc 780ccgaagtccc cagctcaatt cattaaaaat tgaataggcc ccttgtggca accctagttg 840gtacagggtt ttacttgggg ggcccctcta agtttccccg ggatataaac aaagtgtgg 899175877DNAHomo sapiens 175ttatgggggg cagtagtgtg gtggtagaaa aggaagtctt cttgatcctt tcgtgcctcc 60acattagata gatccctgcc accagcaccc atgtggccac cagcagagac agcaggagga 120gaggcagcca gcctcccggc ttgcttttgt tgttatccag agggaaaggg acgcctgttt 180gtgggcagag cacaactgtt cctttatgtc ggatgcagtc gctgccacaa gtaggaaaat 240atggagtcag ctgcaccgta gcaccttcac tatccccagt cactggaatc accactgaag 300ctcgcgtttg tttcttctgg tgtggctcaa acacctgaga aaacccgatg atagtgctgt 360gttggataag agccatgtat ctgtttccca ggggagtggt tgtgaagttc acttctactg 420tctcctcatt cttcttacaa gcagtgatgt tcggatccca caggcttccg gccttgacac 480actttttttt atatttcatt atgtggtcta ggcagcctgg tgaggtgaaa ttcacagaca 540tggaagggcc atcttcattc atatttgcat taggaatatt atgggcccca atgaaataga 600ctgtgttcag ctctacaggg aagccgatat aggaaaatgt ccatttacca ccagagggtc 660tggtctgagt ctggaaggcc tctgtgtaat tgcacctcac acagctgtag gactgggagt 720tgcttttgcc cgtacacaaa tcttgttggc cttcaacaag cggatgctgg catctggcgg 780gggtacccag cttacattca tcaaaattga atagtcccct tgttgcaaca ctagtttgta 840aacaggttct actccggggg tcccctcagt ctcccgg 877176419DNAHomo sapiens 176caaatatgaa tgaagatggc ccttccatgt ctgtgaattt cacctcacca ggctgcctag 60accacataat gaaatataaa aaaaagtgtg tcaaggccgg aagcctgtgg gatccgaaca 120tcactgcttg taagaagaat gaggagacag tagaagtgaa cttcacaacc actcccctgg 180gaaacagata catggctctt atccaacaca gcactatcat cgggttttct caggtgtttg 240agccacacca gaagaaacaa acgcgagctt cagtggtgat tccagtgact ggggatagtg 300aaggtgctac ggtgcaactg actccatatt ttcctacttg tggcagcgac tgcatccgac 360ataaaggaac agttgtgctc tgcccacaaa caggcgtccc tttccctctg gataacaac 419177420DNAHomo sapiens 177gcaaatatga atgaagatgg cccttccatg tctgtgaatt tcacctcacc aggctgccta 60gaccacataa tgaaatataa aaaaaagtgt gtcaaggccg gaagcctgtg ggatccgaac 120atcactgctt gtaagaagaa tgaggagaca gtagaagtga acttcacaac cactcccctg 180ggaaacagat acatggctct tatccaacac agcactatca tcgggttttc tcaggtgttt 240gagccacacc agaagaaaca aacgcgagct tcagtggtga ttccagtgac tggggatagt 300gaaggtgcta cggtgcagct gactccatat tttcctactt gtggcagcga ctgcatccga 360cataaaggaa cagttgtgct ctgcccacaa acaggcgtcc ctttccctct ggataacaac 420178676DNAHomo sapiens 178gcaaatatga atgaagatgg cccttccatg tctgtgaatt tcacctcacc aggctgccta 60gaccacataa tgaaatataa aaaaaagtgt gtcaaggccg gaagcctgtg ggatccgaac 120atcactgctt gtaagaagaa tgaggagaca gtagaagtga acttcacaac cactcccctg 180ggaaacagat acatggctct tatccaacac agcactatca tcgggttttc tcaggtgttt 240gagccacacc agaagaaaca aacgcgagct tcagtggtga ttccagtgac tggggatagt 300gaaggtgcta cggtgcagct gactccatat tttcctactt gtggcagcga ctgcatccga 360cataaaggaa cagttgtgct ctgcccacaa acaggcgtcc ctttccctct ggataacaac 420aaaagcaagc cgggaggctg gctgcctctc ctcctgctgt ctctgctggt ggccacatgg 480gtgctggtgg cagggatcta tctaatgtgg aggcacgaaa ggatcaagaa gacttccttt 540tttaccacca cactactgtc tcccattaaa gatcttgtgg tttatccatc tgaaatattg 600ttccattaca catattggta cctaactgaa attctttaaa accattgcaa attgaggtca 660ctcttgaaag ggcgtg 676179517DNAHomo sapiens 179cggctcctac cttttgcccg atccccttcc ccattccgcc cccgccccaa cgcagtgcac 60agtgccctgc acacagtagt cgctcaataa atgttcgtgg atgatgatga tgatgatgat 120gaaaaaaatg cagcatcaac ggcagcagca agcggaccac gcgaacgagg caaactatgc 180aagaggcacc agacttcctc tttctggtga aggaccaact tctcagctga atagctccaa 240gcaaactgtc ctgtcttggc aagctgcaat cgatgctgct agacaggcca aggctgccca 300aactatgagc acctctgcac ccccacctgt aggatctctc tcccaaagaa aacgtcagca 360atacgccaag agcaaaaaac agggtaactc gtccaacagc cgacctgccc gcgccctttt 420ctgtttatca ctcaataacc ccatccgaag agcctgcatt agtatagtgg aatggaaaca 480tttgacatat ttatattatt ggctattttt tgccaat 517180860DNAHomo sapiens 180gaatatgacc ctgaggcaaa gggaaggata aacaccttga tgtggtcact ctgcttcgac 60gcatccagcc tcccctgggg tttgggaagt tatgtccaca cagggtagcg tgcaagagat 120tagttgccat gaacatgcct ctcaacagtg acgggacagt catgtttaat gcaaccctgt 180ttgctttggt tcgaacggct cttaagatca agaccgaagg gaacctggag caagctaatg 240aagaacttcg ggctgtgata aagaaaattt ggaagaaaac cagcatgaaa ttacttgacc 300aagttgtccc tccagctggt gatgatgagg taaccgtggg gaagttctat gccactttcc 360tgatacagga ctactttagg aaattcaaga aacggaaaga acaaggactg gtgggaaagt 420accctgcgaa gaacaccaca attgccctac aggcgggatt aaggacactg catgacattg 480ggccagaaat ccggcgtgct atatcgtgtg atttgcaaga tgacgagcct gaggaaacaa 540aacgagaaga agaagatgat gtgttcaaaa gaaatggtgc cctgcttgga aaccatgtca 600atcatgttaa tagtgatagg agagattccc ttcagcagac caatagcacc accgtcccct 660gcattgtcca aaggccttca attccacctg caagtgatac tgagaaaccg ctgtttcctc 720cagcaggaaa ttcggggtgt cataaccatc ataaccatta attccatagg aaagcaaggt 780tcccacttca acaatgccag tctcgaatag tgccaatatg tccaaagctt gccatggtaa 840gcgggccagc attgggaacc 860181495DNAHomo sapiens 181gcacgagatt aattagactt ttgtataaga gatgtcatgc ctcaagaaag ccataaacct 60ggtaggaaca ggtcccaagc ggttgagcct ggcagagtac catgcgctcg gccccagctg 120caggaaacag caggccccgc cctctcacag aggatgggtg aggaggccag acctgccctg 180ccccattgtc cagatgggca ctgctgtgga gtctgcttct cccatgtacc agggcaccag 240gcccacccaa ctgaaggcat ggcggcgggg tgcaggggaa agttaaaggt gatgacgatc 300atcacacctg tgtcgttacc tcagccatcg gtctagcata tcagtcactg ggcccaacat 360atccattttt aaaccctttc ccacaaatac actgcgtcct ggttcctgtt tagctgttct 420gaaatacggt gtgtaagtaa gtcagaaccc agctaccagt gattattgcg agggcaatgg 480gacctcataa ataag 495182557DNAHomo sapiens 182tttttttttt ttttttttag tggggaacta caattattag gacccatgga tattgctgca 60gttcaaatac aatacagtaa ttacaaaata tagaccatct ctttacaaat acaaattata 120gtatattaca agtcatgtac agtaaatcta taattttaaa caaactagtg tatctaagtt 180tacctggttg cgagtgcatt attattccag tttacagttg cccttagcgt gacagtcaga 240aaccgaccat cggagtgata ttctcttatg taaactggcg tcacatcaca gaaaacctta 300tttatgaggt cccattgccc tcgcaataat cactggtagc tgggttctga cttacttaca 360caccgtattt cagaacagct aaacaggaac caggacgcag tgtatttgtg ggaaagggtt 420taaaaatgga tatgttgggc ccagtgactg atatgctaga ccgatggctg aggtaacgac 480acaggtgtga tgatcgtcat cacctttaac tttcccctgc accccgccgc catgccttcc 540agttgggtgg gcctggt 557183416DNAHomo sapiens 183ctctgagcac tacaatcagc cagattggtt gacacagatt caagatattg ccaacaaagt 60cctcttggct ctgttcacct gcgagatgct ggtaaaaatg tacagcttgg gcctccaagc 120atactcttgt tctctttaca accggtttga ttgcttcgtg gtgtgtggtg gaatcactga 180gacgatcttg gtggaactgg aaatcatgtc tcccctgggg atctctgtgt ttcggtgtgt 240gcgcctctta agaatcttca aagtgaccag gcactggact tccctgagca acttagtggc 300atccttatta aactccatga agtccatcgc ttcgctgttg cttctgcttt ttctcttcat 360tatcatcttt tccttgcttg ggatgcagct gtttggcggc aagtttaatt ttgatg 416184401DNAHomo sapiensmisc_feature(365)..(365)a or g or c or t/u 184accagcagac ctgactgtcc ccagcagctt ccggaacaaa aacagcgaca agagaggagt 60gcggacagtt ggtggaggca gtcctgatat ccgaagcttg ggacgctatg caagggaccc 120aaaatttgtg tcagcaacaa aacacgaaat cgctgatgcc tgtgacctca ccatcgacga 180gatggagagt gcagccagca ccctgcttaa tgggaacgtg cgtccccgag ccaacgggga 240tgtgggcccc ctctcacacc ggcagactat gagctacagg actttggtcc tgggcttaca 300gcgacgaaga gccagaccct ggggagggat tgagggagga cctgggcgga tgaattgatt 360ttgcntcacc acctttgtta ggcccccagg cgaggggcaa g 401185186DNAHomo sapiensmisc_feature(11)..(11)a or g or c or t/umisc_feature(172)..(172)a or g or c or t/u 185tttttttttt nttttttttt ttgtggaaag atgataggtt tatagtgact caaaatattt 60tagaaaaatt tctgtagtgt caagttcttt caaacttaaa attttaaccc cagaggattt 120tcgctgaata aatgagaatt ggctctattt cttctacttc tggatagccc gngtaaaaat 180actaat 186186433DNAHomo sapiensmisc_feature(45)..(45)a or g or c or t/umisc_feature(296)..(296)a or g or c or t/umisc_feature(303)..(303)a or g or c or t/umisc_feature(345)..(345)a or g or c or t/umisc_feature(366)..(366)a or g or c or t/umisc_feature(386)..(386)a or g or c or t/umisc_feature(391)..(391)a or g or c or t/umisc_feature(420)..(420)a or g or c or t/umisc_feature(428)..(428)a or g or c or t/u 186tttttttttt tttttttttt tgtggaaaga tgataggttt atagngactc aaaatatttt 60agaaaaattt ctgtagtgtc aagttctttc aaacttaaaa ttttaacccc agaggatttt 120cgctgaataa atgagaattg gctctatttc ttctacttct ggatagcccg agtaaaaata 180ctaataattt ctagatttta gtggggaact acaattatta ggacccatgg atattgctgc 240agttcaaata caatacagta attacaaaat atagaccatc tctttacaaa tacaanttat 300agnatattac aagtcatgta cagtaaatct ataattttgg acaanctagt gtatctaagt 360ttaccngggg tgcgagtgcc ttattnttcc ngtttacagt tgcccttagc gtgacagtcn 420ggaaccgncc ttc 433187331DNAHomo sapiensmisc_feature(61)..(61)a or g or c or t/u 187gcctgactgt ccccagcagc ttccggaaca aaaacagcga caagcagagg agtgcggaca 60ntttggtgga ggcagtcctg atatccgaag cttgggacgc tatgcaaggg acccaaaatt 120tgtgtcagca acaaaacacg aaatcgctga tgcctgtgac ctcaccatcg acgagatgga 180gagtgcagcc agcaccctgc ttaatgggaa cgtgcgtccc cgagccaacg gggatgtggg 240ccccctctca caccggcaga ctatgagcta caggactttg gtcctgggct acagcgacga 300agagccagac cctgggaggg atgaggagga c 331188643DNAHomo sapiens 188agcggtcgta ataatgtagt tccccactaa aatctagaaa ttattagtat ttttactcgg 60gctatccaga agtagaagaa atagagcaaa ttctcattta ttcagcgaaa atcctctggg 120gttaaaattt taagttgaaa gaacttgaca ctacagaaat ttttctaaaa tatttgagtc 180actataaacc tatcatcttt ccacaagata taccagatga ctattgcagt cttctcttgg 240gcaagagttc catgatttga tactgtacct tggatccacc atgggtgcaa ctgtcttggt 300ttgttgttga cttgaaccac cctctggtaa gtaagtgaat tacagagcag gtctagctgg 360ctgctctgcc ccttgggtat ccatagttac ggttttctct gtggcccacc caggtgtttt 420tgcatcgctg gtgcagaaat gcacaggtgg atgagatata gctgctcttg tcctctgggg 480actggtggtg ctgcttaaga aataaggggt gctggggaca gaggagcaac gtggtgatct 540ataggattgg agtgtcgggg tctgtacaaa tcgtattgtt gccttttaca aaactgtgta 600ctgtatgttc tctttgaggg cttttgtatg caattgaatg agg 643189357DNAHomo sapiens 189ttttcttgtg gaaagatgat aggtttatag tgactcaaaa tattttagaa aaatttctgt 60agtgtcaagt tctttcaaac ttaaaatttt aaccccagag gattttcgct gaataaatga 120gaattggctc tatttcttct acttctggat agcccgagta aaaatactaa taatttctag 180attttagtgg ggaactacaa ttattaggac ccatggatat tgctgcagtt caaatacaat 240acagtaatta caaaatatag accatctctt tacaaataca aattatagta tattacaagt 300catgtacagt aaatctataa ttttaaacaa acctgtgtat ctaagtttac ctggttg 357190420DNAHomo sapiens 190gacaaataaa gcaattataa atgtatctca ctttagaaca gacaaaaaaa gggcatgcta 60tggaaattgt ttaaatctca agcaacaatg ctgattaatt tctggtcaat aatcgttcta 120tagttctcct tcatgaagcc tggtgaggtt ccaggaaaca gcttgatttg ggaagcctca 180gcagaaaaga aagcatctca gaggacacat aaaatgtctg gcaacccctc ttggcggccc 240tcatccagca aagcttgtgt ggtcttggca actgtcctca ggactctgct ttcaagatga 300aagaggtgta gcttacccgc tcaatacacc aagtacaaga tttagtacga aaaatgaccc 360aaagatgacg agactgacaa gatacaccca gggcaattcc aatcccatag catcattcat 420191465DNAHomo sapiens 191tttatattat tcaccacttt gttatgaaga ccttacaaac ctcttcttaa gacattctta 60ctctgatcca ggcaaaaaca cttcaaggtt tgtaaatgac tctttcctga cataaatcct 120tttttattaa aatgcaaaat gttcttcaga ataaaactgt gtaataattt ttatacttgg 180gagtgctcct tgcacagagc tgtcatttgc cagtgagagc ctccgacagg gcaggtactg 240tgccagggca gctctgaaat tatggatatt cttatcctcc tggttccttc ggtgccaatg 300gtaacctaat accagccgca gggagcgcca tttctcctaa agggctacac cactgtcaac 360attatcctgg actctgtgtc tctctctgtt gggtcttgtg gcatcacatc aggccaaaat 420tgccagacca ggaccctaag tgtctgatag aggcgatgat ctttt 465192330DNAHomo sapiens 192tttttttttt tttttttttt tcttacaaag aaaaatttaa tattcgatga gaggttgaac 60caggcttaaa gcaaacatac taggaaatgg ggcagcctgt aagaatgcca gtttgtaagt 120actgactttg gaaaagatca tcgcctctat cagacactta gggtcctggt ctggcaattt 180tggcctgatg tgatgccaca agacccaaca gagagagaca cagagtccag gataatgttg 240acagggggta gccctttagg agaaatggcg ctccctgcgg ctggtattag gttaccattg 300gcaccgaagg aaccaggagg ataagaatat 330193502DNAHomo sapiens 193tgtaaataac aaacaccact ttgttatgaa gaccttacaa acctcttctt aagacattct 60tactctgatc caggcaaaaa cacttcaagg tttgtaaatg actctttcct gacataaatc 120cttttttatt aaaatgcaaa atgttcttca gaataaaact gtgtaataat ttttatactt 180gggagtgctc cttgcacaga gctgtcattt gccagtgaga gcctccgacg gggcaggtac 240tgtgccaggg cagctctgaa attatggata ttcttatcct cctggttcct tcggtgccaa 300tggtaaccta ataccagccg cagggagcgc catttctcct aaagggctac accactgtca 360acattatcct ggactctgtg tctctctctg ttgggtcttg tggcatcaca tcaggccaaa 420attgccagac caggacccta agtgtctgat agaggcgatg atcttttcca aagtcagtac 480ttacaaactg gcattcttac ag 502194410DNAHomo sapiens 194tttttttttt tgtaaataac aaacaccact ttgttatgaa gaccttacaa acctcttctt 60aagacattct tactctgatc caggcaaaaa cacttcaagg tttgtaaatg actctttcct 120gacataaatc cttttttatt aaaatgcaaa atgttcttca gaataaaact gtgtaataat 180ttttatactt gggagtgctc cttgcacaga gctgtcattt gccagtgaga gcctccgacg 240gggcaggtac tgtgccaggg cagctctgaa attatggata ttcttatcct cctggttcct 300tcggtgccaa tggtaaccta ataccagccg cagggagcgc catttctcct aaagggctac 360accactgtca acattatcct ggactctgtg tctctctctg ttgggtcttg 410195333DNAHomo sapiens 195gtaaataaca aacaccactt tgttatgaag accttacaaa cctcttctta agacattctt 60actctgatcc aggcaaaaac acttcaaggt ttgtaaatga ctctttcctg acataaatcc 120ttttttatta aaatgcaaaa tgttcttcag aataaaactg tgtaataatt tttatacttg 180ggagtgctcc ttgcacagag ctgtcatttg ccagtgagag cctccgacgg gcaggtactg 240tgccagggca gctctgaaat atggatattc ttacctcctg gttctttcgg tgcaaatggt 300aacctaatac cagccgcagg gagcgccatt tct 333196282DNAHomo sapiensmisc_feature(231)..(231)a or g or c or t/u 196gtaaataaca aacaccactt tgttatgaag accttacaaa cctcttctta agacattctt 60actctgatcc aggcaaaaac acttcaaggt ttgtaaatga ctctttcctg acataaatcc 120ttttttatta aaatgcaaaa tgttcttcag aataaaactg tgtaataatt tttatacttg 180ggagtgctcc ttgcacagag ctgtcatttg ccagtgagag cctccgacgg ngcaggtact 240gtgccagggc agctctgaat tatggatatt cttatcctcc tg 282197400DNAHomo sapiens 197tttttcttac aaagaaaaat ttaatattcg atgagaggtt gaaccaggct taaagcagac 60atactaggaa atggtgcagc ctgtaagaat gccagtttgt aagtactgac tttggaaaag 120atcatcgcct ctatcagaca cttagggtcc tggtctggca attttggcct gatgtgatgc 180cacaagaccc aacagagaga gacacagagt ccaggataat gttgacagtg gtgtagccct 240ttaggagaaa tggcgctccc tgcggctggt attaggttac cattggcacc gaaggaacca 300ggaggataag aatatccata atttcagagc tgccctggca cagtacctgc cccgtcggag 360gctctcactg gcaaatgaca gctctgtgca aggagcactc 400198482DNAHomo sapiens 198ttatcttgtg gaaagatgat aggtttatag tgactcaaaa tattttagaa aaatttctgt 60agtgtcaagt tctttcaaac ttaaaatttt aaccccagag gattttcgct gaataaatga 120gaattggctc tatttcttct acttctggat agcccgagta aaaatactaa taatttctag 180attttagtgg ggaactacaa ttattaggac ccatggatat tgctgcagtt caaatacaat 240acagtaatta caaaatatag accatctctt tacaaataca

aattatagga tattacaagg 300catgtacagt aaatctataa ttttaaacaa actagtgtat ctaagtttac ctggttgcga 360gtgcattatt attccagttt acagttgccc ttagcgtgac agtcagaaac cgaccatcgg 420agtgatattc tcttatgtaa actggcgtca catcacagaa aaccttattt atgaggtccc 480at 482199459DNAHomo sapiens 199gccctcacag cccaccacgc ctggccttcg cccaattctg aaacttcgta ggatagagct 60ggaaagtgcc acatggtgaa gcgagatcca gctgtctggg tggatgtcgg agtccatagg 120ctgagcagag atggttctta gtgaggttct cgctgccagt tgacggtgaa atcatagctg 180ccatttacat tttgtgagat tatgaaaaac ataagactaa agaaactaaa tgtgttattc 240ctgtggacac aaaaatgtgt gtttttcaga tggggagggg accaaaaagg aaaaacattt 300catcttaaaa ctttcctaag acaaaggaaa acaaaaaacc atgctcctac aacttcaaat 360ttttcttacc aaagaaaaat ttaatattcg atgagaggtt gaaccaggct taaagcagac 420atactaggga atgggtgcag cctgtaagaa tgccagttt 459200487DNAHomo sapiens 200gtaaataaca aacaccactt tgttatgaag accttacaaa cctcttctta agacattctt 60actctgatcc aggcaaaaac acttcaaggt ttgtaaatga ctctttcctg acataaatcc 120ttttttatta aaatgcaaaa tgttcttcag aataaaactg tgtaataatt tttatacttg 180ggagtgctcc ttgcacagag ctgtcatttg ccagtgagag cctccgacgg ggcaggtact 240gtgccagggc agctctgaaa ttatggatat tcttatcctc ctggttcctt cggtgccaat 300ggtaacctaa taccagccgc aggagcgcca tttctcctaa agggctacac cactgtcaac 360attatcctgg gactctgtgt ctctctctgt tgggtcttgt ggcatcacat caggccaaaa 420ttggccagac caggacccca agtggtctga tagaaggcga tgatcttttc caaagtcagt 480acttaca 487201445DNAHomo sapiens 201gtttaaaatt atagatttac tgtacatgac ttgtaatata ctataatttg tatttgtaaa 60gagatggtct atattttgta attactgtat tgtatttgaa ctgcagcaat atccatgggt 120cctaataatt gtagttcccc actaaaatct agaaattatt agtattttta ctcgggctat 180ccagaagtag aagaaataga gccaattctc atttattcag cgaaaatcct ctggggttaa 240aattttaagt ttgaaagaac ttgacactac agaaattttt ctaaaatatt ttgagtcact 300ataaacctat catctttcca caagatatac cagatgacta tttgcagtct tttctttggg 360caagagttcc atgattttga tactgtacct ttggatccac catgggttgc aactgtcttt 420ggttttgttt gtttgacttg aacca 445202313DNAHomo sapiens 202ttcgctgaat aaatgagaat tggctctatt tcttctactt ctggatagcc cgagtaaaaa 60tactaataat ttctagattt tagtggggaa ctacaattat taggacccat ggatattgct 120gcagttcaaa tacaatacag taattacaaa atatagacca tctctttaca aatacaaatt 180atagtatatt acaagtcatg tacagtaaat ctataatttt aaacaaacta gtgtatctaa 240gtttacctgg ttgcgagtgc attattattc cagtttacag ttgcccttag cgtgacagtc 300agaaaccgac cat 313203413DNAHomo sapiens 203ttttatcttg tggaaagatg ataggtttat agtgactcaa aatattttag aaaaatttct 60gtagtgtcaa gttctttcaa acttaaaatt ttaaccccag aggattttcg ctgaataaat 120gagaattggc tctatttctt ctacttctgg atagcccgag taaaaatact aataatttct 180agattttagt ggggaactac aattattagg acccatggat attgctgcag ttcaaataca 240atacagtaat tacaaaatat agaccatctc tttacaaata caaattatag tatattacaa 300gtcatgtaca gtaaatctat aattttaaac aaactagtgt atctaagttt acctggttgc 360gagtgcatta ttattccagt ttacagttgc ccttagcgtg acagtcagaa acc 413204476DNAHomo sapiens 204ttttatcttg tggaaagatg ataggtttat agtgactcaa aatattttag aaaaatttct 60gtagtgtcaa gttctttcaa acttaaaatt ttaaccccag aggattttcg ctgaataaat 120gagaattggc tctatttctt ctacttctgg atagcccgag taaaaatact aataatttct 180agattttagt ggggaactac aattattagg acccatggat attgctgcag ttcaaataca 240atacagtaat tacaaaatat agaccatctc tttacaaata caaattatag tatattacaa 300gtcatgtaca gtaaatctat aattttaaac aaactagtgt atctaagttt acctggttgc 360gagtgcatta ttattccagt ttacagttgc ccttagcgtg acagtcagaa accgaccatc 420ggagtgatat tctcttatgt aaactggcgt cacatcacag aaaaccttat ttattt 476205406DNAHomo sapiens 205tttttttttt agagccaatt ctcatttatt cagcgaaaat cctctggggt taaaatttta 60agtttgaaag aacttgacac tacagaaatt tttctaaaat attttgagtc actataaacc 120tatcatcttt ccacaagata taccagatga ctatttgcag tcttttcttt gggcaagagt 180tccatgattt tgatactgta cctttggatc caccatgggt tgcaactgtc tttggttttg 240tttgtttgac ttgaaccacc ctctggtaag taagtgaatt acagagcagg tccagctggc 300tgctctgccc cttgggtatc catagttacg gttttctctg tggcccaccc agggtgtttt 360ttgcatcgct ggtgcagaaa tgcacaggtg gatgagatat agctgc 406206473DNAHomo sapiens 206tttttttttg taaataacaa acaccacttt gttatgaaga ccttacaaac ctcttcttaa 60gacattctta ctctgatcca ggcaaaaaca cttcaaggtt tgtaaatgac tctttcctga 120cataaatcct tttttattaa aatgcaaaat gttcttcaga ataaaactgt gtaataattt 180ttatacttgg gagtgctcct tgcacagagc tgtcatttgc cagtgagagc ctccgacagg 240gcaggtactg tgccagggca gctctgaaat tatggatatt cttatcctcc tggttccttc 300ggtgccaatg gtaacctaat accagccgca gggagcgcca tttctcctaa agggctacac 360cactgtcaac attatcctgg actctgtgtc tctctctgtt gagtcttgtg gcatcacatc 420aggccaaaat tgccagacca ggaccctaag tgtctgatag aggcgatgat ctt 473207463DNAHomo sapiens 207tttagagcca attctcattt attcagcgaa aatcctctgg ggttaaaatt ttaagtttga 60aagaacttga cactacagaa atttttctaa aatattttga gtcactataa acctatcatc 120tttccacaag atataccaga tgactatttg cagtcttttc tttgggcaag agttccatga 180ttttgatact gtacctttgg atccaccatg ggttgcaact gtctttggtt ttgtttgttt 240gacttgaacc accctctggt aagtaagtga attacagagc aggtccagct ggctgctctg 300ccccttgggt atccatagtt acggttttct ctgtggccca cccagggtgt tttttgcatc 360gctggtgcag aaatgcacag gtggatgaga tatagctgct cttgtcctct ggggactggt 420ggtgctgctt aagaaataag gggtgctggg gacagaggag caa 463208140DNAHomo sapiens 208tttttttttt ttttgtaaat aacaaacacc actttgttat gaagacctta caaacctctt 60cttaagacat tcttactctg atccaggcaa aaacacttca aggtttgtaa atgactcttt 120cctgacataa atcctttttg 140209237DNAHomo sapiensmisc_feature(208)..(208)a or g or c or t/umisc_feature(221)..(221)a or g or c or t/umisc_feature(229)..(229)a or g or c or t/u 209acaaagaaaa atttaatatt cgatgagagg ttgaaccagg cttaaagcag acatactagg 60aaatggtgca gcctgtaaga atgccagttt gtaagtactg actttggaaa agatcatcgc 120ctctatcaga cacttagggt cctggtctgg caattttggc ctgatgtgat gccacaagac 180ccaacagaga gagacacaga gtccaggnta atattgacag naggtggang cccccct 237210292DNAHomo sapiens 210tttttttttt tttttttttt ggtccaaaat ttttaatagt atacagacaa cctgttaatt 60tttttttttt ttttttttgg aaataacaaa caccactttg ttatgaagac cttacaaacc 120tcttcttaag acattcttac tctgatccag gcaaaaacac ttcaaggttt ggaaatgact 180ctttcctgac ataaatcctt ttttattaaa atgcaaaagg ttcttcagaa taaaactgtg 240taataatttt tatacttggg agtgctcctt gcacagagct gtcatttgcc ag 292211434DNAHomo sapiens 211tttttcttac aaagaaaaat ttaatattcg atgagaggtt gaaccaggct taaagcagac 60atactaggaa atggtgcagc ctgtaagaat gccagtttgt aagtactgac tttggaaaag 120atcatcgcct ctatcagaca cttagggtcc tggtctggca attttggcct gatgtgatgc 180cacaagaccc aacagagaga gacacagagt ccaggataat gttgacagtg gtgtagccct 240ttaggagaaa tggcgctccc tgcggctggt attaggttac cattggcacc gaagagacca 300ggaggataag aatatccata atttcagagc tgccctggca cagtacctgc cccgtcggag 360gctctcactg gcaaatgaca gctctgtgca aggagcactc ccaagtataa aaattattac 420acagttttat tctg 434212451DNAHomo sapiens 212taaataacaa acaccacttt gttatgaaga ccttacaaac ctcttcttaa gacattctta 60ctctgatcca ggcaaaaaca cttcaaggtt tgtaaatgac tctttcctga cataaatcct 120tttttattaa aatgcaaaat gttcttcaga ataaaactgt gtaataattt ttatacttgg 180gagtgctcct tgcacagagc tgtcatttgc cagtgagagc ctccgacggg gcaggtactg 240tgccagggca gctctgaaat tatggatatt cttatcctcc tggttccttc ggtgccaatg 300gtaacctaat accagccgca gggagcgcca tttctcctaa agggctacac cactgtcaac 360attatcctgg actctgtgtc tctctctgtt gggtcttgtg gcatcacatc aggccaaaat 420tgccagacca ggaccctaag tgtctgatag a 451213231DNAHomo sapiens 213tttgtaaata acaaacacca ctttgttatg aagaccttac aaacctcttc ttaagacatt 60cttactctga tccaggcaaa aacacttcaa ggtttgtaaa tgactctttc ctgacataaa 120tcctttttta ttaaaatgca aaatgttctt cagaataaaa ctgtgtaata atttttatac 180ttgggagtgc tccttgcaca gagctgtcat ttgccagtga gagcctccga c 231214483DNAHomo sapiens 214ttgtaaataa caaacaccac tttgttatga agaccttaca aacctcttct taagacattc 60ttactctgat ccaggcaaaa acacttcaag gtttgtaaat gactctttcc tgacataaat 120ccttttttat taaaatgcaa aatgttcttc agaataaaac tgtgtaataa tttttatact 180tgggagtgct ccttgcacag agctgtcatt tgccagtgag agcctccgaa ggggcaggta 240ctgtgccagg gcagctctga aattatggat attcttatcc tcctggttcc ttcggtgcca 300atggtaacct aataccagcc gcaggagcgc catttctcct aaagggctac accactgtca 360acattatcct ggactctgtg tctctctctg ttgggtcttg tggcatcaca tcaggccaaa 420attgccagac caggacccta agtgtctgat agaggcgatg atcttttcca aagtcagtac 480tta 483215507DNAHomo sapiens 215gctcgacttt ttttttgggg gaacgttttc attaggttaa cagtgtttgg caagcattgg 60aaacacggaa tctcacagac agatacaggc agaaagaatc acagttcaat ccaaaagcaa 120cacactgaga ggacatcaga gtccaaacac atgcagagaa gctgtcaggg agcagctagg 180agacacgcag agttgcctca cacgtggcag caggagaagg tgcaacacgg atccgactgc 240ttacccacta aggacaccaa gaaccaggtt aaggacgaaa aatgagccaa ggatgatcag 300actaacaaaa tacacccatg gccattccca tcctatcgca tcatttaccc agtagagcac 360gtctgtccag ccctccatgg tgatgcactg aaacacagta agcatggcaa aggcaaagtt 420atcaaagttg gtgatgcctc cgttcgggcc aacccagcca ctcctacatt ccgtgccatt 480ggcagtacac tggcgtccat tccctgt 507216443DNAHomo sapiens 216tttttttttt tttttttggt ccaaaatttt taatagtata cagacaacct gttaattttt 60tttttttttt tttttgtaaa taacaaacac cactttgtta tgaagacctt acaaacctct 120tcttaagaca ttcttactct gatccaggca aaaacacttc aaggtttgta aatgactctt 180tcctgacata aatccttttt tattaaaatg caaaatgttc ttcagaataa aactgtgtaa 240taatttttat acttgggagt gctccttgca cagagctgtc atttgccagt gagagcctcc 300gacggggcag gtactgtgcc agggcagctc tgaaattatg gatattctta tcctcctggt 360tccttcggtg ccaatggtaa cctaatacca gccgcaggga gcgccatttc tcctaaaggg 420ctacaccact gtcaacatta tcc 443217305DNAHomo sapiens 217tttttttttt ttttttttct tacaaagaaa aatttaatat tcgatgagag gttgaaccag 60gcttaaagca gacatactag gaaatggtgc agcctgtaag aatgccagtt tgtaagtact 120gactttggaa aagatcatcg cctctatcag acacttaggg tcctggtctg gcaattttgg 180cctgatgtga tgccacaaga cccaacagag agagacacag agtccaggat aatgttgaca 240gtggtgtagc cctttaggag aaatggcgct ccctgcggct ggtattaggt taccattggc 300accga 305218376DNAHomo sapiens 218tgtaaataac aaacaccact tggttatgaa gaccttacaa acctcttctt aagacattct 60tactctgatc caggcaaaaa cacttcaagg tttgtaaatg actctttcct gacataaatc 120cttttttatt aaaatgcaaa atgttcttca gaataaaact gtgtaataat ttttatactt 180gggagtgctc cttgcacaga gctgtcattt gccagtgaga gcctccgacg gggcaggtac 240tgtgccaggg cagctctgaa attatggata ttcttatcct cctggttcct tcggtgccaa 300tggtaaccta ataccagccg cagggagcgc catttctcct aaagggctac accactgtca 360acattatcct ggactc 376219544DNAHomo sapiens 219attcctgtta attttgacaa gctcaacggc tgaaatctag gaatggttac taccaaaagc 60ccacccaatc cagctcattt tgctatcgtt ttataacaat taatctgcat tatatttgga 120tccagacaaa taaagcaatt ataaatgtat ctcactttac aacagacaaa aaaagggcat 180gctatggaaa ttgtttaaat ctcaagcaac aatgctgatt aatttctggt caataatcgt 240tctatagttc tccttcatga agcctggtga ggttccagga aacagcttga tttgggaagc 300ctcagcagaa aagaaagcat ctcagaggac acataaaatg tctggcaacc cctcttggcg 360gccctcatcc agcaaagctt gtgtggtctt ggcaactgtc ctcaggactc tgctttcaag 420atgaaagagg tgtagcttac ccgctcaata caccaagtac aagatttagt acgaaaaatg 480acccaaagat gacgagactg acacaataca cccagggcaa ttcaaatccc atagcatcat 540tcat 544220308DNAHomo sapiens 220ggtcgacgta tttgtaaaga gatggtctat atcttgtaat tactgtattg tatttgaact 60gcagcaatat ccatgggtcc taataattgt agttccccac taaaatctag aaattattag 120tatttttact cgggctatcc agaagtagaa gaaatagagc caattctcat ttattcagcg 180aaaatcctct ggggttaaaa ttttaagttt gaaagaactt gacactacag aaatttttct 240aaaatatttt gagtcactat aaacctatca tctttccaca agaaaaaaaa acaaaaaaaa 300agtcgacg 308221939DNAHomo sapiensmisc_feature(674)..(674)a or g or c or t/umisc_feature(684)..(684)a or g or c or t/umisc_feature(687)..(687)a or g or c or t/umisc_feature(781)..(781)a or g or c or t/umisc_feature(795)..(795)a or g or c or t/u 221caaagtactt ccccacattt agctggattt gtctttggtt tgaagaggct aatacgtgaa 60agatttgttc acagttggat gtcccctttt ctgaaccatg aagtaatatt gtgaatggag 120ttgaatgctg aggttagggt gccggaaaga ttcagggtcc ttcggtaccc tcacatggct 180tggctttggt agaacaagaa actaagctct gatttggctt taaatgagag tgctaaattt 240cctttttcta ataaagaacc tagctaaaca tttatatata cttttgaaca ctgaactttc 300ttgttgcaga gttaacagct gttgggggta gctgacagct ggatcctggt gctgttggta 360ccatggtacc tgaagtgcac aggctggtag ccacacctga cattaacaag tgagtggtaa 420cctctctgcc gctggctcac agctactgtt tccatagaaa tggctgtcgg gatcagtgga 480aacgaggtaa gtgaaagttt tcgctgatcc ttgtttccat caagctgacg tctgtttccc 540tggcaacagc agtggacagc agccaggcgc tagcaacaga ttcagtagag ctctcacttg 600tcagctgtgg ctatcatctg ttcctgacca agttcttttt ttttttttta ataatgtaca 660gaaagacctc tganggacca ggangcnact ctggccacat gtgccctcct ggatgctcgt 720tttgcaaatg gagagctgtg tgctgagttg acttctctgt ccgcagttcc ccctccactg 780nggctctggg gttgntgatg tgcaggtaaa aaaaaggagg gttgttgaag gttattagtt 840gttccaaggg gaagcctgtt gaaacctggt tgatccccaa tccctatggg gaagaaaaat 900ctctttaagg ggcttttcat gcccagagac ccaaatttt 939222966DNAHomo sapiens 222ggtggcgatt cggacgaggg caaagacttc ccccatttag ctggatttgt ctttggtttg 60aagaggctaa tacgtgaaag atttgttcac agttggatgt ccccttttct gaaccatgaa 120gtaatattgt gaatggagtt gaatgctgag gttagggtgc cggaaagatt cagggtcctt 180cggtaccctc acatggcttg gctttggtag aacaagaaac taagctctga tttggcttta 240aatgagagtg ctaaatttcc tttttctaat aaagaaccta gctaaacatt tatatatact 300tttgaacact gaactttctt gttgcagagt taacagctgt tgggggtagc tgacagctgg 360atcctggtgc tgttggtacc atggtacctg aagtgcacag gctggtagcc acacctgaca 420ttaacaagtg agtggtaacc tctctgccgc tggctcacag ctactgtttc catagaaatg 480gctgtcggga tcagtggaaa cgaggtaagt gaaagttttc gctgatcctt gtttccatca 540agctgacgtc tgtttccctg gcaacagcag tggacagcag ccaggcgcta gcaacagatt 600caggagagct ctcacttgtc agctgtggct atcatctgtt cctgaccaag ttcttttttt 660tttttttaat aatggacaga aagacctctg aggacccagg aggcacctct gggcacatgt 720gccctcctgg atgctccttt tgcagatgga gacctggggg ctgagttgac ttctctggcc 780gcagttcccc ctccacctgg ggctcctggg tggtgagggg ccaggtaaaa aaagggaagg 840tgtttgaggg tattaatggg tccccgggcg ggctgatcga atcctgggga ctccacgtcc 900ctggggggac aagaatctct tcaacggggt tttccggccg ggagccggag ttttttattc 960agcggg 966223692DNAHomo sapiens 223tttttttttt ttttttttct tgtggaaaga tgataggttt atagtgactc aaaatatttt 60agaaaaattt ctgtagtgtc aagttctttc aaacttaaaa ttttaacccc agaggatttt 120cgctgaataa atgagaattg gctctatttc ttctacttct ggatagcccg agtaaaaata 180ctaataattt ctagatttta gtggggaact acaattatta ggacccatgg atattgctgc 240agttcaaata caatacagta attacaaaat atagaccatc tctttacaaa tacaaattat 300agtatattac aagtcatgta cagtaaatct ataattttaa acaaactagt gtatctaagt 360ttacctggtt gcgagtgcat tattattcca gtttacagtt gcccttagcg tgacagtcag 420aaaccgacca tcggagtgat attctcttat gtaaactggc gtcacatcac agaaaacctt 480atttatgagg tcccattgcc ctcgcaataa tcactggtag ctgggttctg acttacttac 540acaccgtatt tcagaacagc taaacaggaa ccaggacgca gtgtatttgg gggaaagggt 600ttacaaatgg atatgttggg cccagtgact gatatgctag accgatggct gaggtaacga 660cacaggtgtg atgatcgtca tcacctttaa ct 692224595DNAHomo sapiens 224tgcaaataag gacaagctca gcggctgaaa tctacaaatg gggactacca aaagcccacc 60caatccagct cattttgcta tcgttttata acaattaatc tgcattatat ttggatccag 120acaaataaag caattataaa tgtatctcac tttagaacag acaaaaaaag ggcatgctat 180ggaaattgtt taaatctcaa gcaacaatgc tgattaattt ctggtcaata atcgttctat 240agttctcctt catgaagcct ggtgaggttc caggaaacag cttgatttgg gaagcctcag 300cagaaaagaa agcatctcag aggacacata aaatgtctgg caacccctct tggcggccct 360catccagcaa agcttgtgtg gtcttggcaa ctgtcctcag gactctgctt tcaagatgaa 420agaggtgtag cttacccgct caatacacca agtacaagat ttagtacgaa aaatgaccca 480aagatgacga gactgacaaa atacacccag ggcaattcaa atcccatagc atcattcatc 540tgcaagaaat aagatggtct cataggagtg ggttaataag aggatttaat aagga 595225999DNAHomo sapiens 225ggcaaagtac ttccccacat ttagctggat tggtctttgg tttgaagagg ctaatacgtg 60aaagatttgt tcacagttgg atgtcccctt ttctgaacca tgaagtaata ttgtgaatgg 120agttgaatgc tgacggttag ggtgccggaa agattcaggg tccttcggta ccctcacatg 180gcttggcttt ggtagaacaa gaaactaagc tctgatttgg ctttaaatga gagtgctaaa 240tttccttttt ctaataaaga acctagctaa acatttatat atacttttga acactgaact 300ttcttgtcag cagagttaac agctgtaggg ggtagctgac acggctggat cctggtgctg 360ttggtaccat ggtacctgaa gtgcacaggc tggtagccac acctgacatt aacaacgtga 420gtggtaacct ctctgccgct ggctcacagc tactgtttcc atcagaaatg gctgtcgggc 480tcacgtggaa acgaggtaag tgaaagtacg ctagatcctt gttccatcac agctgacgct 540ctgtttccca tggcaacacc cagcacggac aagccgccac gccgcataga caaccacaac 600cacgtacagc tctccacaag tcagctcgtg gctatccatc atgtccctga acaagcccac 660accacccccc cccaagcgac acagcaacga gcaccacccg gacgaaccaa aggacggacc 720cccctgcccc aacctctcgc ccatccgcga cagacccgcc aagcaaacac gacaacctaa 780caaagcagag ggacagaccc atagcgcccg ctaccggaag cgtacaccac ttcccaacag 840taaggccaaa agagcgacgc ggagcacgtg aacggataag aaaacgagag aaggcacggc 900cgcatggcaa acacaccagc aagcagcaga cagcacgtgg gcacgacaca ggacagaaag 960cagcccacct cagaggggac caacgaagag tcgcacgac 999226695DNAHomo sapiensmisc_feature(695)..(695)a or g or c or t/u 226ctgggcccaa catatccatt tttaaaccct ttcccccaaa tacactgcgt cctggttcct 60gtttagctgt tctgaaatac ggtgtgtaag taagtcagaa cccagctacc agtgattatt 120gcgagggcaa tgggacctca taaataaggt tttctgtgat

gtgacgccag tttacataag 180agaatatcac tccggtggtc ggtttctgac tgtcacgcta agggcaactg taaactggaa 240taataatgca ctcgcaacca ggtaaactta gatacactag tttgtttaaa attatagatt 300tactgtacat gacttgtaat atactataat ttgtatttgt aaagagatgg tctatatttt 360gtaattactg tattgtattt gaactgcagc aatatccatg ggtcctaata attgtagttc 420cccactaaaa tctagaaatt attagtattt ttactcgggc tatccagaag tagaagaaat 480agagccaatt ctcatttatt cagcgaaaat cctctggggt taaaatttta agtttgaaag 540aacttgacac tacagaaatt tttctaaaat attttgagtc actataaacc tatcatcttt 600ccacaagata taccagatga ctatttgcag tcttttcttt gggcaagagt tccatgattt 660tgatactgta cctttggatc caccatgggt tgcan 695227870DNAHomo sapiens 227ggaaaagaaa tactgtttta gagaaataac attttcaaca aaacatccct ggagtcagat 60tttgagttgg ggtgggctaa tcagggagtc ggggctctct gcgtgatgtc agttctatgg 120ctaactggtt tttctaaacc agccagctgc ctatcaaaac agtacaactt ttctaggaaa 180tgcaattggc aaagacactt acgatgctga gaagtacaca aggtgaaact gctccagttt 240ttctcatagc agggtcagca ggaaagcaag tggtgcccct ggtcccatct cacacaggtg 300agactgcacc gagaggtaac gtggccctca cagcccacca cgcctggcct tcgcccaatt 360ctgaaacttc gtaggataga gctggaaagt gccacatggt gaagcgagat ccagctgtct 420gggtggatgt cggagtccat aggctgagca gagatggttc ttagtgaggt tctcgctgcc 480agttgacggt gaaatcatag ctgccattta cattttgtga gattatgaaa aacataagac 540taaagaaact aaatgtgtta ttcctgtgga cacaaaaatg tgtgtttttc agatggggag 600gggaccaaaa aggaaaaaca tttcatctta aaactttcct aagacaaagg aaaacaaaaa 660accatgctct acaacttcaa atttttctta caaagaaaaa tttaatattc gatgagcagg 720ttgaaccagg cttaaagcag acatactagg aaatggtgca gcctgtaaga atgccagttt 780gtaagtactg actttggaaa agatcatcgc tctatcagac acttagggtc ctggtctggc 840cattttggcc tgatgtgatg ccaaaagacc 870228368DNAHomo sapiens 228ttttatcgtg tggaaagatg ataggtttat agtgactcaa aatattttag aaaaatttct 60gtagtgtcaa gttctttcaa acttaaaatt ttaaccccag aggattttcg ctgaataaat 120gagaattggc tctatttctt ctacttctgg atagcccgag taaaaatact aataatttct 180agattttagt ggggaactac aattattagg acccatggat attgctgcag ttcaaataca 240atacagtaat tacaaaatat agaccatctc tttacaaata caaattatag tatattacaa 300gtcatgtaca gtaaatctat tttaaacaaa ctagtgtatc taagtttacc tggttgcgag 360tgcattat 368229412DNAHomo sapiens 229cttacaaaga aaaatttaat attcgatgag aggttgaacc aggcttaaag cagacatact 60aggaaatggt gcagcctgta agaatgccag tttgtaagta ctgactttgg aaaagatcat 120cgcctctatc agacacttag ggtcctggtc tggcaatttt ggcctgatgt gatgccacaa 180gacccaacag agagagacac agagtccagg ataatgttga cagtggtgta gccctttagg 240agaaatggcg ctccctgcgg ctggtattag gttaccattg gcaccgaaga gaccaggagg 300ataagaatat ccataatttc agagctgccc tggcacagta cctgccccgt cggaggctct 360cactggcaaa tgacagctct gtgcaaggag cactcccaag tataaaaatt at 412230610DNAHomo sapiens 230ccgcgtccgg tcagatggta caagtttgtc tctataatta agacttttcc accatcacaa 60actttaaaca caaagtctaa aatcttgggc agcatagaaa ataggttcta gctaagcagg 120agttttgtcc tctaccaaga cctttcctga aaatcactta tcaagacagt ttcctgtaag 180aaaaagccat atcccagctg attttccttc ctggggccaa aatctgctat tattcggcct 240gaaagccttg atgactctgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 300gtatggatgc ttgtgtgtgt gtatggggaa tatgtgatta atgtgtgttg gctgctgttg 360tctctgattt ggctactgtt gtttctgatt taaatctaag taaatgttta attaaatgta 420tagaatgctg tctctaatgt gaccctctct ccttattaaa tcctcttatt aacccactcc 480tatgagacca tcttatttct tgcagatgaa tgatgctatg ggatttgaat tgccctgggt 540gtattttgtc agtctcgtca tctttgggtc atttttcgta ctaaatcttg tacttggtgt 600attgagcggg 610231236DNAHomo sapiensmisc_feature(195)..(195)a or g or c or t/u 231aatgcaaaat gttcttcaga ataaaactgt gtaataattt ttatacttgg gatgtgctcc 60ttgcacagag ctgtcatttg ccagtgagag cctcgacagg caggtactgt gccagggcag 120ctctgaaatt atggatattc ttatcctcct ggttccttct gtgctcaatg gtaacctaat 180accagccgca ggacncgcca tttctcctaa agggctacac cactgtcaac attatc 236232427DNAHomo sapiens 232tcagcgaaaa tcctctgggg ttaaaatttt aagtttgaaa gaacttgaca ctacagaaat 60ttttctaaaa tattttgagt cactataaac ctatcatctt tccacaagat ataccagatg 120actatttgca gtcttttctt tgggcaagag ttccatgatt ttgatactgt acctttggat 180ccaccatggg ttgcaactgt ctttggtttt gtttgtttga cttgaaccac cctctggtaa 240gtaagtaagt gaattacaga gcaggtccag ctggctgctc tgccccttgg gtatccatag 300ttacggtttt ctctgtggcc cacccagggt gttttttgca tcgctggtgc agaaatgcat 360aggtggatga gatatagctg ctcttgtcct ctggggactg gtggtgctgc ttaagaaata 420aggggtg 427233838DNAHomo sapiens 233ttttgtcagt ctcgtcatct ttgggtcatt tttcgtacta aatcttgtac ttggtgtatt 60gagcgggcac agtggctcac gcctataatc ccagcacttt cggaggccga ggcagctgga 120ccacccgaga tcaggagttt gagaccagcc tgactaaggc agtgaaaccc tgtctctact 180aaaaatacaa aaattagcca ggcatggtgg cgcatgcctg taatcccagc tacttgggag 240gctgaggcag gagaatcact tgaaccaggg aggtggagat tgcagtgagc caagactgca 300ccattgcatt ccagcctggg tgacaagagc aaaactccat ctcaaaaaaa aaaaaaaaaa 360aaaaaaaaaa agacttttct ctcattcaac actttaccag catctactga cagaaaatgg 420acaattgaat ttcctccaat atatatacct ctgatatgtc tgctttgtaa aagagtagtg 480taattgctta caacattgaa aaggttgtta ttggggtcct ggggtagcca ggatatcggc 540atgatttgtc accatattca gaataaaact gtactgcaat agtgagttaa ttccatatct 600tggccaacag agaatttttg gccagtggct actaaggcac acggaagtcc agtctaaaag 660ggacagggga ggactctttg tagatagttc ttatgattaa aaaataactt cctatgtgtt 720gtagtgatga tttaagctga cagaatgcta aagacacccc ttatgattac ctggtagcaa 780agtaccttcc ccacatttaa cctggatttg cccttttggg tttgaaagag gctaaata 838234904DNAHomo sapiens 234ggtgggattc ggcacgaggg caagacttcc ccacatttag ctggatttgt ctttggtttg 60aagaggctaa tacgtgaaag atttgttcac agttggatgt ccccttttct gaaccatgaa 120gtaatatttg tgatatggag ttcgaatggc tgaggtctag gtgtgccgag aaagattcag 180ggtccttcgg taccctcaca tggcttggct ttggtagaac aagaaactaa gctctgattt 240ggctttaaat gagagtgcta aatttccttt ttctaataaa gaacctagct aaacatttat 300atatactttt gaacactgaa ctttcttgtt gcagagttaa cagctgttgg gggtagctga 360cagctggatc ctggtgctgt tggtaccatg gtacctgaag tgcacaggct ggtagccaca 420cctgacatta acaagtgagt ggtaacctct ctgccgctgg ctcacagcta ctgtttccat 480agaaatggct gtcgggatca gtggaaacga ggtaagtgaa agttttcgct gatccttgtt 540tccatcaagc tgacgtctgt ttccctggca acagcagtgg acagcagcca ggcgctagca 600acagattcag tagagctctc acttgtcagc tgtggctatc atctgttcct gaccaagttc 660tttttttttt ttttaataat gtacagaaag acctctgagg acccaggagg cacctctggc 720cacatgtgcc ctcctggatg ctcgttttgc agatggagag ctgtgtgctg agttgacttc 780tctgtccgca gttccccctc cacctgtgct ctgggttgtt gatgtgccag ttaaaacagg 840gaggctgctt cagggtatta gtgttgccaa ggggaggctg ttgaaatctg gttgatccca 900aatc 904235935DNAHomo sapiens 235caaagtactt ccccacattt agctggattt gtctttggtt tgaagaggct aatacgtgaa 60agatttgttc acagttggat gtcccctttt ctgaaccatg aagtaatatt gtgaatggag 120ttgaatgctg aggttagggt gccggaaaga ttcagggtcc ttcggtaccc tcacatggct 180tggctttggt agaacaagaa actaagctct gatttggctt taaatgagag tgctaaattt 240cctttttcta ataaagaacc tagctaaaca tttatatata cttttgaaca ctgaactttc 300ttgttgcaga gttaacagct gttgggggta gctgacagct ggatcctggt gctgttggta 360ccatggtacc tgaagtgcac aggctggtag ccacacctga cattaacaag tgagtggtaa 420cctctctgcc gctggctcac agctactgtt tccatagaaa tggctgtcgg gatcagtgga 480aacgaggtaa gtgaaagttt tcgctgatcc ttgtttccat caagctgacg tctgtttccc 540tggcaacagc agtggacagc agccaggcgc tagcaacaga ttcagtagag ctctcacttg 600tcagctgtgg ctatcatctg ttcctgacca agttcttttt ttttttttta ataatgtaca 660gaaagacctc tgaggaccca gggagcacct ctggccacat gtgccctcct gaatgctcgt 720tttgcaaatg gagagctgtg tgctgagttg acttctctgt ccgcaggtcc ccctccaact 780gtgctcctgg gttgtgatgt gcagggttaa accagggaag ctgttgaagg gtattagtgt 840tgccagggaa aggctgttga attctggttg atcccaaatc cctaggggga agagaaatcc 900cttacgagtg gtttttcatg gccaggaacc ctata 935236382DNAHomo sapiens 236tcagcgaaaa tcctctgggg ttaaaatttt aagtttgaaa gaacttgaca ctacagaaat 60ttttctaaaa tattttgagt cactataaac ctatcatctt tccacaagat ataccagatg 120actatttgca gtcttttctt tgggcaagag ttccatgatt ttgatactgt acctttggat 180ccaccatggg ttgcaactgt ctttggtttt gtttgtttga cttgaaccac cctctggtaa 240gtaagtaagt gaattacaga gcaggtccag ctggctgctc tgccccttgg gtatccatag 300ttacggtttt ctctgtggcc cacccagggt gttttttgca tcgctggtgc agaaatgcat 360aggtggatga gatatagctg ct 382237461DNAHomo sapiens 237gtatatcttg tggaaagatg ataggtttat agtgactcaa aatattttag aaaaatttct 60gtagtgtcaa gttctttcaa acttaaaatt ttaaccccag aggattttcg ctgaataaat 120gagaattggc tctatttctt ctacttctgg atagcccgag taaaaatact aataatttct 180agattttagt ggggaactac aattattagg acccatggat atagctgcag ttcaaataca 240atacagtaat tacaaaatat agaccatctc tttacaaata caaattatag tatattacaa 300gtcatgtaca gtaaatctat aattttaaac aaactagtgt atctaagttt accaggttgc 360gagtgcatta ttattccagt ttacagttgc ccttagcgtg acagtcagaa accgaccatc 420ggagtgatat tctcttatgt aaacaggcgt cacatcacag a 461238557DNAHomo sapiens 238tttttttttt tgtggaaaga tgataggttt atagtgactc aaaatatttt agaaaaattt 60ctgtagtgtc aagttctttc aaacttaaaa ttttaacccc agaggatttt cgctgaataa 120atgagaattg gctctatttc ttctacttct ggatagcccg agtaaaaata ctaataattt 180ctagatttta gtggggaact acaattatta ggacccatgg atattgctgc agttcaaata 240caatacagta attacaaaat atagaccatc tctttacaaa tacaaattat agtatattac 300aagtcatgta cagtaaatct ataattttaa acaaactagt gtatctaagt ttacctggtt 360gcgagtgcat tattattcca gtttacagtt gcccttagcg tgacagtcag aaaccgacca 420tcggagtgat attctcttat gtaaactggc gtcacatcac agaaaacctt atttatgagg 480tcccattgcc ctcgcaataa tcactggtag ctgggttctg acttacttac acaccgtatt 540tcagaacagc taaacag 557239481DNAHomo sapiens 239tttggtatat cttgtggaaa gatgataggt ttatagtgac tcaaaatatt ttagaaaaat 60ttctgtagtg tcaagttctt tcaaacttaa aattttaacc ccagaggatt ttcgctgaat 120aaatgagaat tggctctatt tcttctactt ctggatagcc cgagtaaaaa tactaataat 180ttctagattt tagtggggaa ctacaattat taggacccat ggatattgct gcagttcaaa 240tacaatacag taattacaaa atatagacca tctctttaca aatacaaatt atagtatatt 300acaagtcatg tacagtaaat ctataatttt aaacaaacta gtgtatctaa gtttacctgg 360ttgcgagtgc attattattc cagtttacag ttgcccttag cgtgacagtc agaaaccgac 420catcggagtg atattctctt atgtaaactg gcgtcacatc acagaaaacc ttatttatga 480g 481240466DNAHomo sapiens 240tttttttgtg gaaagatgat aggtttatag tgactcaaaa tattttagaa aaatttctgt 60agtgtcaagt tctttcaaac ttaaaatttt aaccccagag gattttcgct gaataaatga 120gaattggctc tatttcttct acttctggat agcccgagta aaaatactaa taatttctag 180attttagtgg ggaactacaa ttattaggac ccatggatat tgctgcagtt caaatacaat 240acagtaatta caaaatatag accatctctt tacaaataca aattatagta tattacaagt 300catgtacagt aaatctataa ttttaaacaa actagtgtat ctaagtttac ctggttgcga 360gtgcattatt attccagttt acagttgccc ttagcgtgac agtcagaaac cgaccatcgg 420agtgatattc tcttatgtaa actggcgtca catcacagaa aacctt 466241353DNAHomo sapiens 241cggccgccaa cttttttgaa tgagtgaagt gccaggtacc atgagaaaac cctagctggt 60aaagatcaaa cctgagttag ttctaaattc acatacggat tttttttgca tgacgaaatc 120tattctcttt ttcctgacaa cttctccacc tagatgtttg ggaaagttgc catgagagat 180aacaaccaga tcaataggaa caataacttc cagacgtttc cccaggcggt gctgctgctc 240ttcaggtgac tgcaactggc ttgggcggtg ctcctgggca ggggggtccg ctaggcgtgg 300gtccagaggg acggaggaca caggttatta aagcagtgtg cctttctcag ttg 353242526DNAHomo sapiens 242taaataacta acaccatttt gttatgaaga ccttacaaac ctcttcttaa gacattctta 60ctctgatcca ggcaaaaaca cttcaaggtt tgtaaatgac tctttcctga cataaatcct 120ttttttatta aaatgcaaaa tgttcttcag aataaaactg tgtaataatt tttatacttg 180ggagtgctcc ttgcacagag ctgtcatttg ccagtgagag cctccgacgg ggcaggtact 240gtgccagggc agctctgaaa ttatggatat tcttatcctc ctggttcctt cggtgccaat 300ggtaacctaa taccagccgc agggagcgcc atttctccta aagggctaca ccactgtcaa 360cattatcctg gactctgtgt ctctctctgt tgggtcttgt ggcatcacat caggccaaaa 420ttgccagacc aggaccctaa gtgtctgata gaggcgatga tcttttccaa agtcagtact 480tacaaactgg cattcttaca ggctgcacca tttcctagta tgtctg 526243750DNAHomo sapiens 243acttttctag gaaatgcaat tggcaaagac acttacgatg ctgagaagta cacaaggtga 60aactgctcca gtttttctca tagcagggtc agcaggaaag caagtggtgc ccctggtccc 120atctcacaca ggtgagactg caccgagagg taacgtggcc ctcacagccc accacgcctg 180gccttcgccc aattctgaaa cttcgtagga tagagctgga aagtgccaca tggtgaagcg 240agatccagct gtctgggtgg atgtcggagt ccataggctg agcagagatg gttcttagtg 300aggttctcgc tgccagttga cggtgaaatc atagctgcca tttacatttt gtgagattat 360gaaaaacata agactaaaga aactaaatgt gttattcctg tggacacaaa aatgtgtgtt 420tttcagatgg ggaggggacc aaaaaggaaa aacatttcat cttaaaactt tcctaagaca 480aaggaaaaca aaaaaccatg ctctacaact tcaaattttt cttacaaaga aaaatttaat 540attcgatgag aggttgaacc aggcttaaag cagacatact aggaaatggt gcagcctgta 600agaatgccag tttgtaagta ctgactttgg aaaagatcat cgcctctatc agacacttag 660ggtcctggtc tggcaatttt ggcctgatgt gatgccacaa gacccaacag agagagacac 720agagtccagg ataatgttga cagtggtgta 750244432DNAHomo sapiens 244tttttttttt tttttttaga agaaatagag ccaattctca tttattcagc gaaaatcctc 60tggggttaaa attttaagtt tgaaagaact tgacactaca gaaatttttc taaaatattt 120tgagtcacta taaacctatc atctttccac aagatatacc agatgactat ttgcagtctt 180ttctttgggc aagagttcca tgattttgat actgtacctt tggatccacc atgggttgca 240actgtctttg gttttgtttg tttgacttga accaccctct ggtaagtaag tgaattacag 300agcaggtcca gctggctgct ctgccccttg ggtatccata gttacggttt tctctgtggc 360ccacccaggg tgttttttgc atcgctggtg cagaaatgca caggtggatg agatatagct 420gctcttgtcc tc 432245502DNAHomo sapiens 245ttatcttgtg gaaagatgat aggtttatag tgactcaaaa tattttagaa aaatttctgt 60agtgtcaagt tctttcaaac ttaaaatttt aaccccagag gattttcgct gaataaatga 120gaattggctc tatttcttct acttctggat agcccgagta aaaatactaa taatttctag 180attttagtgg ggaactacaa ttattaggac ccatggatat tgctgcagtt caaatacaat 240acagtaatta caaaatatag accatctctt tacaaataca aattatagta tattacaagt 300catgtacagt aaatctataa ttttaaacaa actagtgtat ctaagtttac ctggttgcga 360gtgcattatt attccagttt acagttgccc ttagcgtgac agtcagaaac cgaccatcgg 420agtgatattc tcttatgtaa actggcgtca catcacagaa aaccttattt atgaggtccc 480attgccctcg caataatcac tg 502246356DNAHomo sapiens 246tttttcttgt ggaaagatga taggtttata gtgactcaaa atattttaga aaaatttctg 60tagtgtcaag ttctttcaaa cttaaaattt taaccccaga ggattttcgc tgaataaatg 120agaattggct ctatttcttc tacttctgga tagcccgagt aaaaatacta ataatttcta 180gattttagtg gggaactaca attattagga cccatggata ttgctgcagt tcaaatacaa 240tacagtaatt acaaaatata gaccatctct ttacaaatac aaattatagt atattacaag 300tcatgtacag taaatctata attttaaaca aactagtgta tctaagttta cctggt 356247442DNAHomo sapiens 247atcttgtgga aagatgatag gtttatagtg actcaaaata ttttagaaaa atttctgtag 60tgtcaagttc tttcaaactt aaaattttaa ccccagagga ttttcgctga ataaatgaga 120attggctcta tttcttctac ttctggatag cccgagtaaa aatactaata atttctagat 180tttagtgggg aacctacaat tattaggacc catggatatt gctgcagttc aaatacaata 240cagtaattac aaaatataga ccatctcttt acaaatacaa attatagtat attacaagtc 300atgtacagta aatctataat tttaaacaaa ctagtgtatc taagtttacc tggttgcgag 360tgcattatta ttccagttta cagttgccct tagcgtgaca gtcagaaacc gaccatcgga 420gtgatattct cttatgtaaa ct 442248552DNAHomo sapiens 248ttttcttcaa ataattacaa gctcagcggc tgaaatctac aaatggggac taccaaaagc 60ccacccaatc cagctcattt tgctatcgtt ttataacaat taatctgcat tatatttgga 120tccagacaaa taaagcaatt ataaatgtat ctcactttag aacagacaaa aaaagggcat 180gctatggaaa ttgtttaaat ctcaagcaac aatgctgatt aatttctggt caataatcgt 240tctatagttc tccttcatga agcctggtga ggttccaggg aaacagcttg atttgggaag 300cctcagcaga aaagaaagca tctcagagga cacataaaat gtctggcaac ccctcttggc 360ggccctcatc cagcaaagct tgtgtggtct tggcaactgt cctcaggact ctgctttcaa 420gatgaaagag gtgtagctta cccgctcaat acaccaagta caagatttag tacgaaaaat 480gacccaaaga tgacgagact gacaaaatac acccagggca attcaaatcc catagcatca 540ttcatctgca ag 552249491DNAHomo sapiens 249tttgtaaata acaaacacca ctttgttatg aagaccttac aaacctcttc ttaagacatt 60cttactctga tccaggcaaa aacacttcaa ggtttgtaaa tgactctttc ctgacataaa 120tcctttttta ttaaaatgca aaatgttctt cagaataaaa ctgtgtaata atttttatac 180ttgggagtgc tccttgcaca gagctgtcat ttgccagtga gagcctccga cggggcaggt 240actgtgccag ggcagctctg aaattatgga tattcttatc ctcctggttc cttcggtgcc 300aatggtaacc taataccagc cgcagggagc gccatttctc ctaaagggct acaccactgt 360caacattatc ctggactctg tgtctctctc tgttgggtct tgtggcatca catcaggcca 420aaattgccag accaggaccc taagtgtctg atagaggcga tgatcttttc caaagtcagt 480acttacaaac t 491250401DNAHomo sapiens 250tttttttttg gtccaaaatt tttaatagta tacagacaac ctgttaattt tttttttttt 60tttttttgta aataacaaac accactttgt tatgaagacc ttacaaacct cttcttaaga 120cattcttact ctgatccagg caaaaacact tcaaggtttg taaatgactc tttcctgaca 180taaatccttt tttattaaaa tgcaaaatgt tcttcagaat aaaactgtgt aataattttt 240atacttggga gtgctccttg cacagagctg tcatttgcca gtgagagcct ccgacggggc 300aggtactgtg ccagggcagc tctgaaatta tggatattct tatcctcctg gttccttcgg 360tgccaatggt aacctaatac cagccgcagg gagcgccatt t 401251387DNAHomo sapiens 251tcgacagcta ccagtgatta ttgcgagggc aatgggacct cataaataag gttttctgtg 60atgtgacgcc atttacataa gagaatatca ctccgatggt cggtttctga ctgtcacgct 120aagggcaact gtaaactgga ataataatgc actcgcaacc aggtaaactt agatacacta 180gtttgtttaa aattatagat ttactgtaca tgacttgtaa tatactataa tttgtatttg 240taaagagatg gtctatattt tgtaattact gtattgtatt tgaactgcag caatatccat 300gggtcctaat aattgtagtt ccccactaaa atctagaaat tattagtatt tttactcggg 360ctatccagaa gtagaagaaa tagagcc

387252474DNAHomo sapiens 252gaatatgtga ttaatgtgtg ttggctgctg ttgtctctga tttggctact gttgtttctg 60atttaaatct aagtaaatgt ttaattaaat gtatagaatg ctgtctctaa tgtgaccctc 120tctccttatt aaatcctctt attaacccac tcctatgaga ccatcttatt tcttgcagat 180gaatgatgct atgggatttg aattgccctg ggtgtatttt gtcagtctcg tcatctttgg 240gtcatttttc gtactaaatc ttgtacttgg tgtattgagc gggtaagcta cacctctttc 300atcttgaaag cagagtcctg aggacagttg ccaagaccac acaagctttg ctggatgagg 360gccgccaaga ggggttgcca gacattttat gtgtcctctg agatgctttc ttttctgctg 420aggcttccca aatcaagctg tttcctggaa cctcaccagg cttcatgaag gaga 474253594DNAHomo sapiens 253tttgtaaata acaaacacca ctttgttatg aagaccttac aaacctcttc ttaagacatt 60cttactctga tccaggcaaa aacacttcaa ggtttgtaaa tgactctttc ctgacataaa 120tcctttttta ttaaaatgca aaatgttctt cagaataaaa ctgtgtaata atttttatac 180ttgggagtgc tccttgcaca gagctgtcat ttgccagtga gagcctccga cggggcaggt 240actgtgccag ggcagctctg aaattatgga tattcttatc ctcctggttc cttcggtgcc 300aatggtaacc taataccagc cgcagggagc gccatttctc ctaaagggct acaccactgt 360caacattatc ctggactctg tgtctctctc tgttgggtct tgtggcatca catcaggcca 420aaattgccag accaggaccc taagtgtctg atagaggcga tgatcttttc caaagtcagt 480acttacaaac tggcattctt acaggctgca ccatttccta gtatgtctgc tttaagcctg 540gttcaacctc tcatcgaata ttaaattttt ctttgtaaga aaaaaaaaaa aaaa 594254548DNAHomo sapiens 254tttgtaaata acaaacacca ctttgttatg aagaccttac aaacctcttc ttaagacatt 60cttactctga tccaggcaaa aacacttcaa ggtttgtaaa tgactctttc ctgacataaa 120tcctttttta ttaaaatgca aaatgttctt cagaataaaa ctgtgtaata atttttatac 180ttgggagtgc tccttgcaca gagctgtcat ttgccagtga gagcctccga cagggcaggt 240actgtgccag ggcagctctg aaattatgga tattcttatc ctcctggttc cttcggtgcc 300aatggtaacc taataccagc cgcagggagc gccatttctc ctaaagggct acaccactgt 360caacattatc ctggactctg tgtctctctc tgttgggtct tgtggcatca catcaggcca 420aaattgccag accaggaccc taagtgtctg atagaggcga tgatcttttc caaagtcagt 480acttacaaac tggcattctt acaggctgca ccatttccta gtatgtctgc tttaagcctg 540gttcaacc 548255456DNAHomo sapiensmisc_feature(57)..(57)a or g or c or t/umisc_feature(324)..(324)a or g or c or t/umisc_feature(396)..(396)a or g or c or t/u 255ggagaaagga gggaaaccag gagcagccgg catgggcagt ggcagaattg gccctgntag 60agagcagagc tgatgccatc cttttggcaa atagctgaca ttttatggtg tggtgctggg 120tgagccccct gtgagggttg aacagatgtg gacaggactt gggtccaggc actagagtgg 180tgcagcctgt aagaatgcca gtttgtaagt actgactttg gaaaagatca tcgcctctat 240cagacactta gggtcctggt ctggcaattt tggcctgatg tgatgccaca agacccaaca 300gagagagaca cagagtccag gatnaatgtt gacagtggtg tagcctttag gaagaaatgg 360cgctccctgc ggctggtatt aggttaccat tggcanccga aggaacccag gaggattaag 420aatttcccta atttcagaac ttgccctggc acagta 456256434DNAHomo sapiensmisc_feature(194)..(194)a or g or c or t/umisc_feature(351)..(351)a or g or c or t/umisc_feature(401)..(401)a or g or c or t/umisc_feature(426)..(426)a or g or c or t/u 256ggtccaaaat ttttaatagt atacagacaa cctgttaatt tttttttttt ttttttttgt 60aaataacaaa caccactttg ttatgaagac cttacaaacc tcttcttaag acattcttac 120tctgatccag gcaaaaacac ttcaaggttt gtaaatcgac tctttcctga cataaatcct 180tttttattaa aatngcaaaa ttgttcttca gaataaaact gtgtaataat ttttatactt 240gggagtgctc cttgcacaga gctgtcattt gccagtgaga gcctccgacg gggcaggtac 300tgtgccaggg cagctctgaa attatggaaa ttcttatccc cctggttcct ncggtggcca 360atgggtaacc taataccagc ccgcgggaag cgccaatttc ncccaaaagg gggtaaacca 420ctggtnaaac atta 434257199DNAHomo sapiensmisc_feature(187)..(187)a or g or c or t/umisc_feature(195)..(195)a or g or c or t/u 257tttttctttt gtaaataaca aacaccactt tgttatgaag accttacaaa cctcttctta 60agacattctt actctgatcc aggcaaaaac acttcaaggt ttgtaaatga ctctttcctg 120acataaatcc ttttttatta aaatgcaaaa tgttcttcag aataaaactg tgtaataatt 180tttatangtg ggggngctc 199258459DNAHomo sapiensmisc_feature(405)..(405)a or g or c or t/umisc_feature(456)..(456)a or g or c or t/u 258acaaagaaaa atttaatatt cgatgagagg ttgaaccagg cttaaagcag acatactagg 60aaatggtgca gcctgtaaga atgccagttt gtaagtactg actttggaaa agatcatcgc 120ctctatcaga cacttagggt cctggtctgg caattttggc ctgatgtgat gccacaagac 180ccaacagaga gagacacaga gtccaggata atgttgacag tggtgtagcc ctttaggaga 240aatggcgctc cctgcggctg gtattaggtt accattggca ccgaagaacc aggaggataa 300gaatatccat aatttcagag cttgccctgg cacagtacct gccccgtcgg aggctctcac 360tgggcaaatg gacagctctg tgcaaggagc actcccaagt ataanaatta ttacacagtt 420ttattctgaa gaacattttg cattttaata aaaaangga 459259443DNAHomo sapiens 259tttttttttt ttttgggcca aaatttttaa tagtatacag acaacctgtt aatttttttt 60tttttttttt ttgtaaataa caaacaccac tttgttatga agaccttaca aacctctttt 120taagacattc ttactctgat ccaggcaaaa acacttcaag gtttgtaaat gactttttcc 180tgacataaat ccttttttat taaaatgcaa aatgttcttc agaataaaac tgtgtaataa 240tttttatact tgggagtgct ccttgcacag agctgtcatt tgccagtgag agcctccgac 300ggggcaggta ctgtgccagg gcagctctga aattatggat attcttatcc tcctggttcc 360ttcggtgcca atggtaacct aataccagcc gcagggagcg ccatttctcc taaagggcta 420caccactgtc aacattatcc tgg 443260442DNAHomo sapiens 260tttttttttg gtccaaaatt tttaatagta tacagacaac ctgttaattt tttttttttt 60tttttttgta aataacaaac accactttgt tatgaagacc ttacaaacct cttcttaaga 120cattcttact ctgatccagg caaaaacact tcaaggtttg taaatgactc tttcctgaca 180taaatccttt tttattaaaa tgcaaaatgt tcttcagaat aaaactgtgt aataattttt 240atacttggga gtgctccttg cacagagctg tcatttgcca gtgagagcct ccgacggggc 300aggtactgtg ccagggcagc tctgaaatta tggatattct tatcctcctg gttccttcgg 360tgccaatggt aacctaatac cagccgcagg gagcgccatt tctcctaaag ggctacacca 420ctgtcaacat tatcctggac tc 442261471DNAHomo sapiens 261tttgtggaaa gatgataggt ttatagtgac tcaaaatatt ttagaaaaat ttctgtagtg 60tcaagttctt tcaaacttaa aattttaacc ccagaggatt ttcgctgaat aaaatgagaa 120ttggctctat ttcttctact tctggatagc ccgagtaaaa atactaataa tttctagatt 180ttagtgggga actacaatta ttaggaccca tggatattgc tgcagttcaa atacaataca 240gtaattacaa aatatagacc atctctttac aaatacaaat tatagtatat tacaagtcat 300gtacagtaaa tctataattt taaacaaact agtgtatcta agtttacctg gttgcgagtg 360cattattatt ccagtttaca gttgccctta gcgtgacagt cagaaaccga ccatcggagt 420gatattctct tatgtaaact ggcgtcacat cacagaaaac cttatttatg a 4712627635DNAHomo sapiens 262gggcgagcgc ctccgtcccc ggatgtgagc tccggctgcc cgcggtcccg agccagcggc 60gcgcgggcgg cggcggcggg caccgggcac cgcggcgggc gggcagacgg gcgggcatgg 120ggggagcgcc gagcggcccc ggcggccggg ccggcatcac cgcggcgtct ctccgctaga 180ggaggggaca agccagttct cctttgcagc aaaaaattac atgtatatat tattaagata 240atatatacat tggattttat ttttttaaaa agtttatttt gctccatttt tgaaaaagag 300agagcttggg tggcgagcgg ttttttttta aaatcaatta tccttatttt ctgttatttg 360tccccgtccc tccccacccc cctgctgaag cgagaataag ggcagggacc gcggctccta 420cctcttggtg atccccttcc ccattccgcc cccgccccaa cgcccagcac agtgccctgc 480acacagtagt cgctcaataa atgttcgtgg atgatgatga tgatgatgat gaaaaaaatg 540cagcatcaac ggcagcagca agcggaccac gcgaacgagg caaactatgc aagaggcacc 600agacttcctc tttctggtga aggaccaact tctcagccga atagctccaa gcaaactgtc 660ctgtcttggc aagctgcaat cgatgctgct agacaggcca aggctgccca aactatgagc 720acctctgcac ccccacctgt aggatctctc tcccaaagaa aacgtcagca atacgccaag 780agcaaaaaac agggtaactc gtccaacagc cgacctgccc gcgccctttt ctgtttatca 840ctcaataacc ccatccgaag agcctgcatt agtatagtgg aatggaaacc atttgacata 900tttatattat tggctatttt tgccaattgt gtggccttag ctatttacat cccattccct 960gaagatgatt ctaattcaac aaatcataac ttggaaaaag tagaatatgc cttcctgatt 1020atttttacag tcgagacatt tttgaagatt atagcgtatg gattattgct acatcctaat 1080gcttatgtta ggaatggatg gaatttactg gattttgtta tagtaatagt aggattgttt 1140agtgtaattt tggaacaatt aaccaaagaa acagaaggcg ggaaccactc aagcggcaaa 1200tctggaggct ttgatgtcaa agccctccgt gcctttcgag tgttgcgacc acttcgacta 1260gtgtcaggag tgcccagttt acaagttgtc ctgaactcca ttataaaagc catggttccc 1320ctccttcaca tagccctttt ggtattattt gtaatcataa tctatgctat tataggattg 1380gaacttttta ttggaaaaat gcacaaaaca tgtttttttg ctgactcaga tatcgtagct 1440gaagaggacc cagctccatg tgcgttctca gggaatggac gccagtgtac tgccaatggc 1500acggaatgta ggagtggctg ggttggcccg aacggaggca tcaccaactt tgataacttt 1560gcctttgcca tgcttactgt gtttcagtgc atcaccatgg agggctggac agacgtgctc 1620tactggatga atgatgctat gggatttgaa ttgccctggg tgtattttgt cagtctcgtc 1680atctttgggt catttttcgt actaaatctt gtacttggtg tattgagcgg agaattctca 1740aaggaaagag agaaggcaaa agcacgggga gatttccaga agctccggga gaagcagcag 1800ctggaggagg atctaaaggg ctacttggat tggatcaccc aagctgagga catcgatccg 1860gagaatgagg aagaaggagg agaggaaggc aaacgaaata ctagcatgcc caccagcgag 1920actgagtctg tgaacacaga gaacgtcagc ggtgaaggcg agaaccgagg ctgctgtgga 1980agtctctgtc aagccatctc aaaatccaaa ctcagccgac gctggcgtcg ctggaaccga 2040ttcaatcgca gaagatgtag ggccgccgtg aagtctgtca cgttttactg gctggttatc 2100gtcctggtgt ttctgaacac cttaaccatt tcctctgagc actacaatca gccagattgg 2160ttgacacaga ttcaagatat tgccaacaaa gtcctcttgg ctctgttcac ctgcgagatg 2220ctggtaaaaa tgtacagctt gggcctccaa gcatatttcg tctctctttt caaccggttt 2280gattgcttcg tggtgtgtgg tggaatcact gagacgatct tggtggaact ggaaatcatg 2340tctcccctgg ggatctctgt gtttcggtgt gtgcgcctct taagaatctt caaagtgacc 2400aggcactgga cttccctgtg caacttagtg gcatccttat taaactccat gaagtccagt 2460gcttcgctgt tgcttctgct ttttctcttc attatcatct tttccttgct tgggatgcag 2520ctgtttggcg gcaagtttaa ttttgatgaa acgcaaacca agcggagcac ctttgacaat 2580ttccctcaag cacttctcac agtgttccag atcctgacag gcgaagactg gaatgctgtg 2640atgtacgatg gcatcatggc ttacgggggc ccatcctctt caggaatgat cgtctgcatc 2700tacttcatca tcctcttcat ttgtggtaac tatattctac tgaatgtctt cttggccatc 2760gctgtagaca atttggctga tgctgaaagt ctgaacactg ctcagaaaga agaagcggaa 2820gaaaaggaga ggaaaaagat tgccagaaaa gagagcctag aaaataaaaa gaacaacaaa 2880ccagaagtca accagatagc caacagtgac aacaaggtta caattgatga ctatagagaa 2940gaggatgaag acaaggaccc ctatccgcct tgcgatgtgc cagtagggga agaggaagag 3000gaagaggagg aggatgaacc tgaggttcct gccggacccc gtcctcgaag gatctcggag 3060ttgaacatga aggaaaaaat tgcccccatc cctgaaggga gcgctttctt cattcttagc 3120aagaccaacc cgatccgcgt aggctgccac aagctcatca accaccacat cttcaccaac 3180ctcatccttg tcttcatcat gctgagcagt gctgccctgg ccgcagagga ccccatccgc 3240agccactcct tccggaacac gatactgggt tactttgact atgccttcac agccatcttt 3300actgttgaga tcctgttgaa gatgacaact tttggagctt tcctccacaa aggggccttc 3360tgcaggaact acttcaattt gctggatatg ctggtggttg gggtgtctct ggtgtcattt 3420gggattcaat ccagtgccat ctccgttgtg aagattctga gggtcttaag ggtcctgcgt 3480cccctcaggg ccatcaacag agcaaaagga cttaagcacg tggtccagtg cgtcttcgtg 3540gccatccgga ccatcggcaa catcatgatc gtcaccaccc tcctgcagtt catgtttgcc 3600tgtatcgggg tccagttgtt caaggggaag ttctatcgct gtacggatga agccaaaagt 3660aaccctgaag aatgcagggg acttttcatc ctctacaagg atggggatgt tgacagtcct 3720gtggtccgtg aacggatctg gcaaaacagt gatttcaact tcgacaacgt cctctctgct 3780atgatggcgc tcttcacagt ctccacgttt gagggctggc ctgcgttgct gtataaagcc 3840atcgactcga atggagagaa catcggccca atctacaacc accgcgtgga gatctccatc 3900ttcttcatca tctacatcat cattgtagct ttcttcatga tgaacatctt tgtgggcttt 3960gtcatcgtta catttcagga acaaggagaa aaagagtata agaactgtga gctggacaaa 4020aatcagcgtc agtgtgttga atacgccttg aaagcacgtc ccttgcggag atacatcccc 4080aaaaacccct accagtacaa gttctggtac gtggtgaact cttcgccttt cgaatacatg 4140atgtttgtcc tcatcatgct caacacactc tgcttggcca tgcagcacta cgagcagtcc 4200aagatgttca atgatgccat ggacattctg aacatggtct tcaccggggt gttcaccgtc 4260gagatggttt tgaaagtcat cgcatttaag cctaaggggt attttagtga cgcctggaac 4320acgtttgact ccctcatcgt aatcggcagc attatagacg tggccctcag cgaagcagac 4380ccaactgaaa gtgaaaatgt ccctgtccca actgctacac ctgggaactc tgaagagagc 4440aatagaatct ccatcacctt tttccgtctt ttccgagtga tgcgattggt gaagcttctc 4500agcagggggg aaggcatccg gacattgctg tggactttta ttaagttctt tcaggcgctc 4560ccgtatgtgg ccctcctcat agccatgctg ttcttcatct atgcggtcat tggcatgcag 4620atgtttggga aagttgccat gagagataac aaccagatca ataggaacaa taacttccag 4680acgtttcccc aggcggtgct gctgctcttc aggtgtgcaa caggtgaggc ctggcaggag 4740atcatgctgg cctgtctccc agggaagctc tgtgaccctg agtcagatta caaccccggg 4800gaggagcata catgtgggag caactttgcc attgtctatt tcatcagttt ttacatgctc 4860tgtgcatttc tgatcatcaa tctgtttgtg gctgtcatca tggataattt cgactatctg 4920acccgggact ggtctatttt ggggcctcac catttagatg aattcaaaag aatatggtca 4980gaatatgacc ctgaggcaaa gggaaggata aaacaccttg atgtggtcac tctgcttcga 5040cgcatccagc ctcccctggg gtttgggaag ttatgtccac acagggtagc gtgcaagaga 5100ttagttgcca tgaacatgcc tctcaacagt gacgggacag tcatgtttaa tgcaaccctg 5160tttgctttgg ttcgaacggc tcttaagatc aagaccgaag ggaacctgga gcaagctaat 5220gaagaacttc gggctgtgat aaagaaaatt tggaagaaaa ccagcatgaa attacttgac 5280caagttgtcc ctccagctgg tgatgatgag gtaaccgtgg ggaagttcta tgccactttc 5340ctgatacagg actactttag gaaattcaag aaacggaaag aacaaggact ggtgggaaag 5400taccctgcga agaacaccac aattgcccta caggcgggat taaggacact gcatgacatt 5460gggccagaaa tccggcgtgc tatatcgtgt gatttgcaag atgacgagcc tgaggaaaca 5520aaacgagaag aagaagatga tgtgttcaaa agaaatggtg ccctgcttgg aaaccatgtc 5580aatcatgtta atagtgatag gagagattcc cttcagcaga ccaataccac ccaccgtccc 5640ctgcatgtcc aaaggccttc aattccacct gcaagtgata ctgagaaacc gctgtttcct 5700ccagcaggaa attcggtgtg tcataaccat cataaccata attccatagg aaagcaagtt 5760cccacctcaa caaatgccaa tctcaataat gccaatatgt ccaaagctgc ccatggaaag 5820cggcccagca ttgggaacct tgagcatgtg tctgaaaatg ggcatcattc ttcccacaag 5880catgaccggg agcctcagag aaggtccagt gtgaaaagaa cccgctatta tgaaacttac 5940attaggtccg actcaggaga tgaacagctc ccaactattt gccgggaaga cccagagata 6000catggctatt tcagggaccc ccactgcttg ggggagcagg agtatttcag tagtgaggaa 6060tgctacgagg atgacagctc gcccacctgg agcaggcaaa actatggcta ctacagcaga 6120tacccaggca gaaacatcga ctctgagagg ccccgaggct accatcatcc ccaaggattc 6180ttggaggacg atgactcgcc cgtttgctat gattcacgga gatctccaag gagacgccta 6240ctacctccca ccccagcatc ccaccggaga tcctccttca actttgagtg cctgcgccgg 6300cagagcagcc aggaagaggt cccgtcgtct cccatcttcc cccatcgcac ggccctgcct 6360ctgcatctaa tgcagcaaca gatcatggca gttgccggcc tagattcaag taaagcccag 6420aagtactcac cgagtcactc gacccggtcg tgggccaccc ctccagcaac ccctccctac 6480cgggactgga caccgtgcta cacccccctg atccaagtgg agcagtcaga ggccctggac 6540caggtgaacg gcagcctgcc gtccctgcac cgcagctcct ggtacacaga cgagcccgac 6600atctcctacc ggactttcac accagccagc ctgactgtcc ccagcagctt ccggaacaaa 6660aacagcgaca agcagaggag tgcggacagc ttggtggagg cagtcctgat atccgaaggc 6720ttgggacgct atgcaaggga cccaaaattt gtgtcagcaa caaaacacga aatcgctgat 6780gcctgtgacc tcaccatcga cgagatggag agtgcagcca gcaccctgct taatgggaac 6840gtgcgtcccc gagccaacgg ggatgtgggc cccctctcac accggcagga ctatgagcta 6900caggactttg gtcctggcta cagcgacgaa gagccagacc ctgggaggga tgaggaggac 6960ctggcggatg aaatgatatg catcaccacc ttgtagcccc cagcgagggg cagactggct 7020ctggcctcag gtggggcgca ggagagccag gggaaaagtg cctcatagtt aggaaagttt 7080aggcactagt tgggagtaat attcaattaa ttagactttt gtataagaga tgtcatgcct 7140caagaaagcc ataaacctgg taggaacagg tcccaagcgg ttgagcctgg cagagtacca 7200tgcgctcggc cccagctgca ggaaacagca ggccccgccc tctcacagag gatgggtgag 7260gaggccagac ctgccctgcc ccattgtcca gatgggcact gctgtggagt ctgcttctcc 7320catgtaccag ggcaccaggc ccacccaact gaaggcatgg cggcggggtg caggggaaag 7380ttaaaggtga tgacgatcat cacacctgtg tcgttacctc agccatcggt ctagcatatc 7440agtcactggg cccaacatat ccatttttaa accctttccc ccaaatacac tgcgtcctgg 7500ttcctgttta gctgttctga aatacggtgt gtaagtaagt cagaacccag ctaccagtga 7560ttattgcgag ggcaatggga cctcataaat aaggttttct gtgatgtgac gccagtttac 7620ataagagaat atcac 7635263637DNAHomo sapiens 263tttttttttt cttacaaaga aaaatttaat attcgatgag aggttgaacc aggcttaaag 60cagacatact aggaaatggt gcagcctgta agaatgccag tttgtaagta ctgactttgg 120aaaagatcat cgcctctatc agacacttag ggtcctggtc tggcaatttt ggcctgatgt 180gatgccacaa gacccaacag agagagacac agagtccagg ataatgttga cagtggtgta 240gccctttagg agaaatggcg ctccctgcgg ctggtattag gttaccattg gcaccgaagg 300aaccaggagg ataagaatat ccataatttc agagctgccc tggcacagta cctgccccgt 360cggaggctct cactggcaaa tgacagctct gtgcaaggag cactcccaag tataaaaatt 420attacacagt tttattctga agaacatttt gcattttaat aaaaaaggat ttatgtcagg 480aaagagtcat ttacaaacct tgaagtgttt ttgcctggat cagagtaaga atgtcttaag 540aagaggtttg taaggtcttc ataacaaagt ggtgtttgtt atttacaaaa aaaaaaaaaa 600aaaaaaatta acaggttgtc tgtatactat taaaaat 6372647193DNAHomo sapiens 264agaataaggg cagggaccgc ggctcctatc tcttggtgat ccccttcccc attccgcccc 60cgcctcaacg cccagcacag tgccctgcac acagtagtcg ctcaataaat gttcgtggat 120gatgatgatg atgatgatga aaaaaatgca gcatcaacgg cagcagcaag cggaccacgc 180gaacgaggca aactatgcaa gaggcaccag acttcctctt tctggtgaag gaccaacttc 240tcagccgaat agctccaagc aaactgtcct gtcttggcaa gctgcaatcg atgctgctag 300acaggccaag gctgcccaaa ctatgagcac ctctgcaccc ccacctgtag gatctctctc 360ccaaagaaaa cgtcagcaat acgccaagag caaaaaacag ggtaactcgt ccaacagccg 420acctgcccgc gcccttttct gtttatcact caataacccc atccgaagag cctgcattag 480tatagtggaa tggaaaccat ttgacatatt tatattattg gctatttttg ccaattgtgt 540ggccttagct atttacatcc cattccctga agatgattct aattcaacaa atcataactt 600ggaaaaagta gaatatgcct tcctgattat ttttacagtc gagacatttt tgaagattat 660agcgtatgga ttattgctac atcctaatgc ttatgttagg aatggatgga atttactgga 720ttttgttata gtaatagtag gattgtttag tgtaattttg gaacaattaa ccaaagaaac 780agaaggcggg aaccactcaa gcggcaaatc tggaggcttt gatgtcaaag ccctccgtgc 840ctttcgagtg ttgcgaccac ttcgactagt gtcaggggtg cccagtttac aagttgtcct 900gaactccatt ataaaagcca tggttcccct ccttcacata gcccttttgg tattatttgt 960aatcataatc tatgctatta taggattgga actttttatt ggaaaaatgc acaaaacatg 1020tttttttgct gactcagata tcgtagctga agaggaccca gctccatgtg cgttctcagg 1080gaatggacgc cagtgtactg ccaatggcac ggaatgtagg agtggctggg ttggcccgaa 1140cggaggcatc accaactttg

ataactttgc ctttgccatg cttactgtgt ttcagtgcat 1200caccatggag ggctggacag acgtgctcta ctgggtaaat gatgcgatag gatgggaatg 1260gccatgggtg tattttgtta gtctgatcat ccttggctca tttttcgtcc ttaacctggt 1320tcttggtgtc cttagtggag aattctcaaa ggaaagagag aaggcaaaag cacggggaga 1380tttccagaag ctccgggaga agcagcagct ggaggaggat ctaaagggct acttggattg 1440gatcacccaa gctgaggaca tcgatccgga gaatgaggaa gaaggaggag aggaaggcaa 1500acgaaatact agcatgccca ccagcgagac tgagtctgtg aacacagaga acgtcagcgg 1560tgaaggcgag aaccgaggct gctgtggaag tctctggtgc tggtggagac ggagaggcgc 1620ggccaaggcg gggccctctg ggtgtcggcg gtggggtcaa gccatctcaa aatccaaact 1680cagccgacgc tggcgtcgct ggaaccgatt caatcgcaga agatgtaggg ccgccgtgaa 1740gtctgtcacg ttttactggc tggttatcgt cctggtgttt ctgaacacct taaccatttc 1800ctctgagcac tacaatcagc cagattggtt gacacagatt caagatattg ccaacaaagt 1860cctcttggct ctgttcacct gcgagatgct ggtaaaaatg tacagcttgg gcctccaagc 1920atatttcgtc tctcttttca accggtttga ttgcttcgtg gtgtgtggtg gaatcactga 1980gacgatcctg gtggaactgg aaatcatgtc tcccctgggg atctctgtgt ttcggtgtgt 2040gcgcctctta agaatcttca aagtgaccag gcactggact tccctgagca acttagtggc 2100atccttatta aactccatga agtccatcgc ttcgctgttg cttctgcttt ttctcttcat 2160tatcatcttt tccttgcttg ggatgcagct gtttggcggc aagtttaatt ttgatgaaac 2220gcaaaccaag cggagcacct ttgacaattt ccctcaagca cttctcacag tgttccagat 2280cctgacaggc gaagactgga atgctgtgat gtacgatggc atcatggctt acgggggccc 2340atcctcttca ggaatgatcg tctgcatcta cttcatcatc ctcttcattt gtggtaacta 2400tattctactg aatgtcttct tggccatcgc tgtagacaat ttggctgatg ctgaaagtct 2460gaacactgct cagaaagaag aagcggaaga aaaggagagg aaaaagattg ccagaaaaga 2520gagcctagaa aataaaaaga acaacaaacc agaagtcaac cagatagcca acagtgacaa 2580caaggttaca attgatgact atagagaaga ggatgaagac aaggacccct atccgccttg 2640cgatgtgcca gtaggggaag aggaagagga agaggaggag gatgaacctg aggttcctgc 2700cggaccccgt cctcgaagga tctcggagtt gaacatgaag gaaaaaattg cccccatccc 2760tgaagggagc gctttcttca ttcttagcaa gaccaacccg atccgcgtag gctgccacaa 2820gctcatcaac caccacatct tcaccaacct catccttgtc ttcatcatgc tgagcagcgc 2880tgccctggcc gcagaggacc ccatccgcag ccactccttc cggaacacga tactgggtta 2940ctttgactat gccttcacag ccatctttac tgttgagatc ctgttgaaga tgacaacttt 3000tggagctttc ctccacaaag gggccttctg caggaactac ttcaatttgc tggatatgct 3060ggtggttggg gtgtctctgg tgtcatttgg gattcaatcc agtgccatct ccgttgtgaa 3120gattctgagg gtcttaaggg tcctgcgtcc cctcagggcc atcaacagag caaaaggact 3180taagcacgtg gtccagtgcg tcttcgtggc catccggacc atcggcaaca tcatgatcgt 3240cactaccctc ctgcagttca tgtttgcctg tatcggggtc cagttgttca aggggaagtt 3300ctatcgctgt acggatgaag ccaaaagtaa ccctgaagaa tgcaggggac ttttcatcct 3360ctacaaggat ggggatgttg acagtcctgt ggtccgtgaa cggatctggc aaaacagtga 3420tttcaacttc gacaacgtcc tctctgctat gatggcgctc ttcacagtct ccacgtttga 3480gggctggcct gcgttgctgt ataaagccat cgactcgaat ggagagaaca tcggcccaat 3540ctacaaccac cgcgtggaga tctccatctt cttcatcatc tacatcatca ttgtagcttt 3600cttcatgatg aacatctttg tgggctttgt catcgttaca tttcaggaac aaggagaaaa 3660agagtataag aactgtgagc tggacaaaaa tcagcgtcag tgtgttgaat acgccttgaa 3720agcacgtccc ttgcggagat acatccccaa aaacccctac cagtacaagt tctggtacgt 3780ggtgaactct tcgcctttcg aatacatgat gtttgtcctc atcatgctca acacactctg 3840cttggccatg cagcactacg agcagtccaa gatgttcaat gatgccatgg acattctgaa 3900catggtcttc accggggtgt tcaccgtcga gatggttttg aaagtcatcg catttaagcc 3960taaggggtat tttagtgacg cctggaacac gtttgactcc ctcatcgtaa tcggcagcat 4020tatagacgtg gccctcagcg aagcggaccc aactgaaagt gaaaatgtcc ctgtcccaac 4080tgctacacct gggaactctg aagagagcaa tagaatctcc atcacctttt tccgtctttt 4140ccgagtgatg cgattggtga agcttctcag caggggggaa ggcatccgga cattgctgtg 4200gacttttatt aagtcctttc aggcgctccc gtatgtggcc ctcctcatag ccatgctgtt 4260cttcatctat gcggtcattg gcatgcagat gtttgggaaa gttgccatga gagataacaa 4320ccagatcaat aggaacaata acttccagac gtttccccag gcggtgctgc tgctcttcag 4380gtgtgcaaca ggtgaggcct ggcaggagat catgctggcc tgtctcccag ggaagctctg 4440tgaccctgag tcagattaca accccgggga ggagtataca tgtgggagca actttgccat 4500tgtctatttc atcagttttt acatgctctg tgcatttctg atcatcaatc tgtttgtggc 4560tgtcatcatg gataatttcg actatctgac ccgggactgg tctattttgg ggcctcacca 4620tttagatgaa ttcaaaagaa tatggtcaga atatgaccct gaggcaaagg gaaggataaa 4680acaccttgat gtggtcactc tgcttcgacg catccagcct cccctggggt ttgggaagtt 4740atgtccacac agggtagcgt gcaagagatt agttgccatg aacatgcctc tcaacagtga 4800cgggacagtc atgtttaatg caaccctgtt tgctttggtt cgaacggctc ttaagatcaa 4860gaccgaaggg aacctggagc aagctaatga agaacttcgg gctgtgataa agaaaatttg 4920gaagaaaacc agcatgaaat tacttgacca agttgtccct ccagctggtg atgatgaggt 4980aaccgtgggg aagttctatg ccactttcct gatacaggac tactttagga aattcaagaa 5040acggaaagaa caaggactgg tgggaaagta ccctgcgaag aacaccacaa ttgccctaca 5100ggcgggatta aggacactgc atgacattgg gccagaaatc cggcgtgcta tatcgtgtga 5160tttgcaagat gacgagcctg aggaaacaaa acgagaagaa gaagatgatg tgttcaaaag 5220aaatggtgcc ctgcttggaa accatgtcaa tcatgttaat agtgatagga gagattccct 5280tcagcagacc aataccaccc accgtcccct gcatgtccaa aggccttcaa ttccacctgc 5340aagtgatact gagaaaccgc tgtttcctcc agcaggaaat tcggtgtgtc ataaccatca 5400taaccataat tccataggaa agcaagttcc cacctcaaca aatgccaatc tcaataatgc 5460caatatgtcc aaagctgccc atggaaagcg gcccagcatt gggaaccttg agcatgtgtc 5520tgaaaatggg catcattctt cccacaagca tgaccgggag cctcagagaa ggtccagtgt 5580gaaaagaacc cgctattatg aaacttacat taggtccgac tcaggagatg aacagctccc 5640aactatttgc cgggaagacc cagagataca tggctatttc agggaccccc actgcttggg 5700ggagcaggag tatttcagta gtgaggaatg ctacgaggat gacagctcgc ccacctggag 5760caggcaaaac tatggctact acagcagata cccaggcaga aacatcgact ctgagaggcc 5820ccgaggctac catcatcccc aaggattctt ggaggacgat gactcgcccg tttgctatga 5880ttcacggaga tctccaagga gacgcctact acctcccacc ccagcatccc accggagatc 5940ctccttcaac tttgagtgcc tgcgccggca gagcagccag gaagaggtcc cgtcgtctcc 6000catcttcccc catcgcacgg ccctgcctct gcatctaatg cagcaacaga tcatggcagt 6060tgccggccta gattcaagta aagcccagaa gtactcaccg agtcactcga cccggtcgtg 6120ggccacccct ccagcaaccc ctccctaccg ggactggaca ccgtgctaca cccccctgat 6180ccaagtggag cagtcagagg ccctggacca ggtgaacggc agcctgccgt ccctgcaccg 6240cagctcctgg tacacagacg agcccgacat ctcctaccgg actttcacac cagccagcct 6300gactgtcccc agcagcttcc ggaacaaaaa cagcgacaag cagaggagtg cggacagctt 6360ggtggaggca gtcctgatat ccgaaggctt gggacgctat gcaagggacc caaaatttgt 6420gtcagcaaca aaacacgaaa tcgctgatgc ctgtgacctc accatcgacg agatggagag 6480tgcagccagc accctgctta atgggaacgt gcgtccccga gccaacgggg atgtgggccc 6540cctctcacac cggcaggact atgagctaca ggactttggt cctggctaca gcgacgaaga 6600gccagaccct gggagggatg aggaggacct ggcggatgaa atgatatgca tcaccacctt 6660gtagccccca gcgaggggca gactggctct ggcctcaggt ggggcgcagg agagccaggg 6720gaaaagtgcc tcatagttag gaaagtttag gcactagttg ggagtaatat tcaattaatt 6780agacttttgt ataagagatg tcatgcctca agaaagccat aaacctggta ggaacaggtc 6840ccaagcggtt gagcctggca gagtaccatg cgctcggccc cagctgcagg aaacagcagg 6900ccccgccctc tcacagagga tgggtgagga ggccagacct gccctgcccc attgtccaga 6960tgggcactgc tgtggagtct gcttctccca tgtaccaggg caccaggccc acccaactga 7020aggcatggcg gcggggtgca ggggaaagtt aaaggtgatg acgatcatca cacctcgtgt 7080cgttacctca gccatcggtc tagcatatca gtcactgggc ccaacatatc catttttaaa 7140ccctttcccc caaatacact gcgtcctggt tcctgtttag ctgttctgaa ata 7193265243DNAHomo sapiens 265gtactgtgcc ggggcagctc tgaaattatg gatattctta tcctcctggt tccttcggtg 60ccaatggtaa cctaatacca gccgcaggga gcgccatttc tcctaaaggg ctacaccact 120gtcaacatta tcctggactc tgtgtctctc tctgttgggt cttgtggcat cacatcaggc 180caaaattgcc agaccaggac cctaagtgtc tgatagaggc gatgatcttt tcaaagtcag 240tac 243266341DNAHomo sapiensmisc_feature(9)..(9)a or g or c or t/u 266tgcagcaant ggcacggaat gtaggagtgg gtgggtggga ccgaacggag gcatcaccaa 60ctttgataac ttggcctatg ccatgcttac ggtgtttcag tgcatcacca tggagggctg 120gacagatgtg ctctactggg taaatgatgc gataggatgg gaatggccat gggcgtattt 180tgttagtctg atcatccttg gctcattttt cgtccttaac ctggttcttg gtgtccttag 240tggagaattc tcaaaggaaa gagagaaggc aaaagcacgg ggagatttcc agaagctccg 300ggagaagcag cagctggagg aggatctaaa gggctacttg g 341267406DNAHomo sapiensmisc_feature(332)..(332)a or g or c or t/u 267atgactacgg gggaagttca ttctgacctt ccagactagc tagtactata tgaaatccga 60gagacggaat gaacacggac tgatgggaaa gtaccctgcg aagaacacca caattgccct 120acaggcgtga ttaaggacac tgcatgatag ttgctccaga atgccggcgt gctatatcgt 180gtgatttgca agatgacgag cgtgaggaaa caaaacgaga agaagaagat gatgtgttca 240aaagaaatgg tgccctgctt ggaaaccatg tcaatcatgt taatagtgat aggagagatt 300cccttcagca gaccaatacc acccaccgtc cnctgcatgt ccaaaggcct tcaattccac 360ctgcaagtga tactgagaaa ccgctgttcc tccagcagga aattcg 406268236DNAHomo sapiens 268tacatctccg ctatctgtgc cgtgtaacac ggtgtccagt ctcgttaggg aggggctgct 60ggaggggtgg cccacgaccg ggtcgagtga ctcggtgagc acttctgtgc tttacttgaa 120tctaggccgg caactgccat gatctgttgc tgcattagat gcagaggcag tgccgcgcga 180tggtgaagat gggagacgac gggacctctt gctggctgct ctgccggcgc aggcac 236269586DNAHomo sapiens 269tgtcgtgact ggcgatacct ggcgttagtg tgtacatggt gttcataatt gctgctgcat 60aacattttgt gagaattaat gtgacaatgt atgtgcagtg cttagcacat agcaagtgct 120catgaatggt agccaccaag atggctgttg tcattttagt ttgcagcagt tccacttgtc 180atcattgagt tcccagggag tcccctcttc tttgggaaca gacttgctct ctgtagctcc 240attgcggtaa aaacagatga ggttaatccc tgtcccaatc attttggaga tggcgtcgtt 300tgtattccaa ttccacagcc cagttcttgt ctttgtcttc cttttattta agcagcagcc 360acacagaatt agcccttttc aaaaataaat aagattatca tcctgttttg cgtccctggg 420gtaacagact ctaacatttc tttctctttc tcttctttca gattgtctag tgtaattttg 480gaacaattaa ccaaagaaac agaaggcggg aaccactcac gcggcaaatc tggaggcttt 540gatgtcaaag ccctccgtgc ctttcgagtg ttgcgaccac ttcgaa 586270549DNAHomo sapiens 270agttcccacc tcaacaaatg ccaatctcaa taatgccaat atgtccaaag ctgcccatgg 60aaagcggccc agcattggga accttgagca tgtgtctgaa aatgggcatc attcttccca 120caagcatgac cgggagcctc agagaaggtc cagtgtgaaa aggtccgact caggagatga 180acagctccca actatttgcc gggaagaccc agagatacat ggctatttca gggaccccca 240ctgcttgggg gagcaggagt atttcagtag tgaggaatgc tacgaggatg acagctcgcc 300cacctggagc aggcaaaact atggctacta cagcagatac ccaggcagaa acatcgactc 360tgagaggccc cgaggctacc atcatcccca aggattcttg gaggacgatg actcgcccgt 420ttgctatgat tcacggagat ctccaaggag acgcctacta cctcccaccc cagcatgtga 480ggccagattt tttgtttttg ggtggaacct cccggggaac agtgtacctt tcccccaacc 540cccgctctg 549271595DNAHomo sapiens 271attcggcacg agcctccttc aactttgagt gctctgcccc ttgggtatcc atagttacgg 60ttttctctgt ggcccaccca gggtgttttt tgcatcgctg gtgcagaaat gcacaggtgg 120atgagatata gctgctcttg tcctctgggg actggtggtg ctgcttaaga aataaggggt 180gctggggaca gaggagcaac gtggtgatct ataggattgg agtgtcgggg tctgtacaaa 240tcgtattgtt gccttttaca aaactgctgt actgtatgtt ctctttgagg gcttttatat 300gcaattgact gagggctgaa gttttcatta gaatgcactc acactctgac tgtacgtcct 360gatgaaaacc cacttttgga taattagaac cgtcaaggct tcattttctg tcaacagaat 420taggccgact gtcaggttac cttggcaggg attccctgca atcaaaaaga tagatgatag 480gtagcaattt tggtccaaaa tttttaatag tatacagaca acctgttaat tttttttttt 540tttttttttg taaataacaa acaccacttt gttatgaaga ccttacaaac ctctt 595272209DNAHomo sapiens 272ggaaaactca agtccagagc aatactacgt aaaattcaga agtgagaaca tacaaaggca 60acacacaggc tgacgaagaa acagaaagaa gatactgacc tgagtttgga ttttgagatg 120gcttgactga aagaaagaca aaaagtgtta agattctggt tccgagggct tgagcacaca 180ctccccatca tttcagctgg agatttcat 209273687DNAHomo sapiensmisc_feature(632)..(632)a or g or c or t/u 273tttttttttt ttttttttat tctgaagaac attttgcatt ttaataaaaa aggatttatg 60tcaggaaaga gtcatttaca aaccttgaag tgtttttgcc tggatcagag taagaatgtc 120ttaagaagag gtttgtaagg tcttcataac aaagtggtgt ttgttattta caaaaaaaaa 180aaaaaaaaat taacaggttg tctgtatact attaaaaatt ttggaccaaa attgctacct 240atcatctatc tttttgattg cagggaatcc ctgccaaggt aacttgacag tcggcctaat 300tctgttgaca gaaaatgaag ccttgacggt tctaattatc caaaagtggg ttttcatcag 360gacgtacagt cagagtgtga gtgcattcta atgaaaactt cttcagccct cattcaattg 420catacaaaag ccctcaaaga gaacatacag tacagcagtt ttgtaaaagg caacaatacg 480atttgtacag accccgacac tccaatccta tagatcacca cgttgctcct ctgtccccag 540caccccttat ttcttaagca gcaccaccag tccccagagg acaagagcag ctatatctca 600tccacctgtg catttctgca ccagcgatgc anaaaacacc ctggggtggg ccacagagaa 660aaccgtaact atggataccc aaggggc 687274573DNAHomo sapiens 274taaataacaa acaccacttt gttatgaaga ccttacaaac ctcttcttaa gacattctta 60ctctgatcca ggcaaaaaca cttcaaggtt tgtaaatgac tctttcctga cataaatcct 120tttttattaa aatgcaaaat gttcttcaga ataaaactgt gtaataattt ttatacttgg 180gagtgctcct tgcacagagc tgtcatttgc cagtgagagc ctccgacggg gcaggtactg 240tgccagggca gctctgaaat tatggatatt cttatcctcc tggttccttc ggtgccaatg 300gtaacctaat accagccgca gggagcgcca tttctcctaa agggctacac cactgtcaac 360attatcctgg actctgtgtc tctctctgtt gggtcttgtg gcatcacatc aggccaaaat 420tgccagacca ggaccctaag tgtctgatag aggcgatgat cttttccaaa gtcagtactt 480acaaactggc attcttacag gctgcaccat ttcctagtat gtctgcttta agcctggttc 540aacctctcat cgaatattaa atttttcttt gta 573275453DNAHomo sapiens 275tttttttttt tttttcttgg ggaaagatga taggtttata gtgactcaaa atattttaaa 60aaaatttctg tagggtcaag ttctttcaaa cttaaaattt taaccccaga ggattttcgc 120tgaataaatg aaaattggct ctatttcttc aacttcggga tagcccgagt aaaaatacta 180ataatttcta aattttaggg gggaactaca attattagga cccatggata ttgctgcagt 240tcaaatacaa tacagtaatt acaaaatata gaccatctct ttacaaatac aaattatagt 300atattacaag tcatgtacag taaatctata attttaaaca aactagtgta tctaagttta 360cctggttgcg agtgcattat tattccagtt tacagttgcc cttagcgtga cagtcagaaa 420ccgaccatcg gagtgatatt ctcttatgta aac 453276415DNAHomo sapiensmisc_feature(311)..(311)a or g or c or t/u 276tgattacttg tagcaaagta cttccccaca tttagctgga tttgtctttg gtttgaagag 60gctaatacgt gaaagatttg ttcacagttg gatgtcccct tttctgaacc atgaagtaat 120attgtgaatg gagttgaatg ctgaggttag ggtgccggaa agattcaggg tccttcggta 180ccctcacatg gcttggcttt ggtagaacaa gaaactaagc tctgatttgg ctttaaatga 240gagtgctaaa tttccttttt ctaataaaga acctagctaa acatttatat atacttttga 300acactgaact ntcttgttgc agagttaaca gctgttgggg gtagctgaca gctggatcct 360ggtgctgttg gtaccatggt acctgaagtg cacaggctgg tagccacacc tgaca 415277646DNAHomo sapiensmisc_feature(43)..(43)a or g or c or t/umisc_feature(573)..(573)a or g or c or t/u 277tttttttttt tttttcttac aaagaaaaat ttaatattcg atngagaggt tgaaccaggc 60ttaaagcaga catactagga aatggtgcag cctgtaagaa tgccagtttg taagtactga 120ctttggaaaa gatcatcgcc tctatcagac acttagggtc ctggtctggc aattttggcc 180tgatgtgatg ccacaagacc caacagagag agacacagag tccaggataa tgttgacagt 240ggtgtagccc tttaggagaa atggcgctcc ctgcggctgg tattaggtta ccattggcac 300cgaaggaacc aggaggataa gaatatccat aatttcagag ctgccctggc acagtacctg 360ccccgtcgga ggctctcact ggcaaatgac agctctgtgc aaggagcact cccaagtata 420aaaattatta cacagtttta ttctgaagaa cattttgcat tttaataaaa aaggatttat 480gtcaggaaag agtcatttac aaaccttgaa gtgtttttgc ctggatcaga gtaagaatgt 540cttaagaaga ggtttgtaag gtcttcataa canagtggtg tttgttattt acaaaaaaaa 600aaaaaaaaaa aataaaaaaa aaaaaaaaaa cctcgtgccg aattct 646278668DNAHomo sapiens 278tttttttttt ttttttgtaa ataacaaaca ccactttggt tatgaagacc ttacaaacct 60cttcttaaga cattcttact ctgatccagg caaaaacact tcaaggtttg taaatgactc 120tttcctgaca taaatccttt tttattaaaa tgcaaaatgt tcttcagaat aaaactgtgt 180aataattttt atacttggga gtgctccttg cacagagctg tcatttgcca gtgagagcct 240ccgacagggc aggtactgtg ccagggcagc tctgaaatta tggatattct tatcctcctg 300gttccttcgg tgccaatggt aacctaatac cagccgcagg gagcgccatt tctcctaaag 360ggctacacca ctgtcaacat tatcctggac tctgtgtctc tctctgttgg gtcttgtggc 420atcacatcag gccaaaattg ccagaccagg accctaagtg tctgatagag gcgatgatct 480tttccaaagt cagtacttac aaactggcat tcttacaggc tgcaccattt cctagtatgt 540ctgctttaag cctggttcaa cctctcatcg aatattaaat ttttctttgt aagaaaaatt 600tgaagttgta gagcatggtt ttttgttttc ccttgtctta ggaaagtttt aagatgaaat 660gtttttcc 668279496DNAHomo sapiens 279agtacacaag gtgaaactgc tccagttttt ctcatagcag ggtcagcagg aaagcaagtg 60gtgcccctgg tcccatctca cacaggtgag actgcaccga gaggtaacgt ggccctcaca 120gcccaccacg cctggccttc gcccaattct gaaacttcgt aggatagagc tggaaagtgc 180cacatggtga agcgagatcc agctgtctgg gtggatgtcg gagtccatag gctgagcaga 240gatggttctt agtgaggttc tcgctgccag ttgacggtga aatcatagct gccatttaca 300ttttgtgaga ttatgaaaaa cataagacta aagaaactaa atgtgttatt cctgtggaca 360caaaaatgtg tgtttttcag atggggaggg gaccaaaaag gaaaaacatt tcatcttaaa 420acttccctaa gacaaaggaa aacaaaaaac catgctctac aacttcaaat ttttcttaca 480aagaaaaatt taatat 496280701DNAHomo sapiens 280agctgaggaa acaaaacgag agaagaagat gatgtgttca aaagaaatgg tgccctgctt 60ggaaaccatg tcaatcatgt taatagtgat aggagagatt cccttcagca gaccaatacc 120acccaccgtc ccctgcatgt ccaaaggcct tcaattccac ctgcaagtga tactgagaaa 180ccgctgtttc ctccagcagg aaattcggtg tgtcataacc atcataacca taattccata 240ggaaagcaag ttcccacctc aacaaatgcc aatctcaata atgccaatat gtccaaagct 300gcccatggaa agcggcccag catagggaac cttgagcatg tgtctgaaaa tgggcatcat 360tcttcccaca agcatgaccg ggagcctcag agaaggtcca gtgtgaaaag gtccgactca 420ggagatgaac agctcccaac tattggccgg gaagacccag agatacatgg ctatttcagg 480cacccccacg gcttggggga gcaggagtat ttcagtagtg aggaatgcta cgaggatgac 540agctcgccca cctggagcag gcaaaactat ggctactaca gcagataccc aggcagaaac 600atcgactctg agaggcgcga ggctacatca tcccaagatt ctggaggaga tgactcgccg 660tttgtatgat cacgagatct caagagagct atactcccac c 701281515DNAHomo sapiens 281tcttgtggaa agatgatagg tttatagtga ctcaaaatat tttagaaaaa tttctgtagg 60gtcaagttct ttcaaactta aaattttaac cccagaggat tttcgctgaa taaatgaaaa 120ttggctctat ttcttctact tctggatagc ccgagtaaaa atactaataa tttctagatt

180ttagtgggga actacaatta ttaggaccca tggatattgc tgcagttcaa atacaataca 240gtaattacaa aatatagacc atctctttac aaatccaaat tatagtatat tacaagtcat 300gtaccgtaaa tctattttaa acaaactagg gtatctaagt ttacctggtt gcaagtgcat 360tattattcca gtttacagtt gcccttagcg tgacagtcag aaaccgacca tcggagtgat 420attctcttat gtaaactggc gtcacatcac agaaaacctt atttatttgg gggaaagggt 480ttaaaaatgg atatgttggg cccagtgact gatac 515282258DNAHomo sapiens 282ggaaaagatc atcgcctcta tcagacactt agggtcctgg tctggcaatt ttggcctgat 60gtgatgccac aagacccaac agagagagac acagagtcca ggataatgtt gacagtggtg 120tagcccttta ggagaaatgg cgctccctgc ggctggtatt aggttaccat tggcaccgaa 180ggaaccagga ggataagaat atccataatt tcagagctgc cctggcacgg tacctgcccc 240gtcggaggct ctcactgg 258283510DNAHomo sapiens 283gatgcgtgat ggctgatcta gaggtatccc atggactctc atcgcagctc ctggtacaca 60gacgagcccg acatctccta ccggactttc acaccagcca gcctgactgt ccccagcagc 120ttccggaaca aaaacagcga caagcagagg agtgcggaca gcttggtgga ggcagtcctg 180atatccgaag gcttgggacg ctatgcaagg gacccaaaat ttgtgtcagc aacaaaacac 240gagatcgctg atgcctgtga cctcaccatc gacgagatgg agagtgcagc cagcaccctg 300cttaatggga acgtgcgtcc ccgagccaac ggggatgtgg gccccctctc acaccggcag 360gactatgagc tacaggactt tggtcctggc tacagcgacg aagggccaga ccctgggagg 420gatgaggagg acctggcgga tgaaatgata tgcatcacca ccttgtagcc cccagcgagg 480ggcagactgg ctctggcctc aggtggggcg 510284405DNAHomo sapiensmisc_feature(75)..(75)a or g or c or t/umisc_feature(142)..(142)a or g or c or t/u 284cgctcgttcg ctgtgccagg acaaagtcct gtagctcata gtcctgccgt gtgagagggg 60gccacatccc cgttnctcgg gacgcacgac ccattaagca gggtgctggc tgccccctcc 120atctcgtcga tggagaggtc ancaggcatc agcgatttcg tgttttgtgt gcgtgacaca 180aattttgggt cccttgcata cgcgtcccac agccttacgg agtatcagcg actgctctcc 240accaatgctg cccgcgactc ctactgcttg tccgctgttt ttggttccgg aagctgctgg 300ggacagtcag gctggctggt gtgaaagtcc ggtaggagat gtcgggctcg tctgtgtacc 360aggagctgcg gtgcagggac ggcaggctgc cgttcacctg gtccg 405285892DNAHomo sapiensmisc_feature(27)..(27)a or g or c or t/umisc_feature(35)..(35)a or g or c or t/umisc_feature(42)..(42)a or g or c or t/umisc_feature(49)..(49)a or g or c or t/umisc_feature(831)..(831)a or g or c or t/umisc_feature(835)..(835)a or g or c or t/u 285gagtttcgag cttctctttt cctaagngaa aaaanaaaga ancacaagna aaccaaataa 60ccatgttact ctgtataaaa atgctaatca gggaattctg aatcaataat gctccaatga 120aggacagaat ttaattagaa acaacactaa ccacaagagc ctagcacaac ccaaactcag 180agcttcctgg taatctcaat gcgatggatt cattacacag accatcttat taaaattctc 240atctgagagc taatcagcat tgaatgcatc atttatttta tgacaccaaa attaactgca 300gtgattcttt aagcatgggg acacgtgact cccactctca gccccgaggg atgacagcca 360agagcctggc ttctgcccaa gattccatcc gttttggtct gcagtgcatg gtcaaccatg 420atccacaaag cagcaacccg ggggctgtag ctgccgtgat gcgggggtaa gcctggcagg 480ctgcaactgt tgcagggctc ccaacacagc ccctggacaa acgcgtcagg ggaaaatagg 540gttacctggc aatctttttc ctctcctttt cttccgcttc ttctttctga gcagtgttca 600gactttcagc atcagccaaa gtgtctacag cgatggccaa gaagacattc agtagaatat 660ctaattacaa ctttttaagg gcacaacaca ctactaaatg caactacgtg cggccaacaa 720tggcaacgcc acacacctct gcatcccggg aagctgggta gtaggtgacg tccccaagtg 780ttatactcac acagcaaacc tagagtacca gagccctgct tttcaaacaa nacanaacaa 840acaaacaacc caaagtaaaa cctggtaagg gacgtcttca gaagtaaatt ac 892286425DNAHomo sapiens 286ctggctttcc catagcacgc tcggcaggaa agcaagtgat gcccctggct cccatctcac 60acaggtgaca ctgcaccgag aggtaacgtg gccctcacag cccaccacgc ctggccttcg 120cccaattctg aaacttcgta ggatagagct ggaaagtggc acatggtgaa gcgagatcca 180gctgtctggg tggatgtcgg agctccatag gctgagcaga gatggttctt agtgaggttc 240tcgctgccag ttgacggtga aatcatagct gccatttaca ttttgtgaga ttatgaaaaa 300cataagacta aagaaactaa atgtgttatt cctgtggaca caaaaatgtg tgtttttcag 360atggggaggg gaccaaaaag gaaaaacatt tcatcttaaa actttcctaa gacaaaggaa 420aacaa 425287441DNAHomo sapiensmisc_feature(10)..(10)a or g or c or t/u 287ctcagcatgn atgaaacagg atgaggttgg tgaagatgtg gtggttgatg agcttgtggc 60agcctacgcg gatcgggttg gtcttgctaa gaatgaagaa agcgctccct tcagggatgg 120gggcaatttt ttccttcatg ttcaactccg agatccttcg aggacggggt ccggcaggaa 180cctcaggttc atcctcctcc tcttcctctt cctcttcccc tacgggcaca tcgcaaggcg 240gataggggtc cttgtcttca tcctcttctc tatagtcatc aattgtaacc ttgttgtcac 300tgttggctat ctggttgact tctggtttgt tgttcttttt attttctagg ctctcttttc 360tggcaatctt tttcctctcc ttttcttccg cttcttcttt ctgagcagtg ttcagacttt 420cagcatcagc caaatggtct a 441288165DNAHomo sapiens 288tcaaagtcga aggaggatct ccgcgtggga tgctggggtg ggaggtagta ggcgtctcct 60tggagatctc cgtgaatcat agcaaacggg cgagtcatcg tcctacaaga atcctagtgg 120atgatggtag cctcggggcc tctcagagtc gatgtttctg cctgg 165289330DNAHomo sapiens 289ctcgcccgtt tgctatgagt cacggagatc tccaaggaga cgcctactac ctcccacccc 60agcatcccac cggagatcct ccttcaactt tgagtgcctg cgccggcaga gcagccagga 120agaggtcccg tcgtctccca tcttccccca tcgcacggcc ctgcctctgc atctaatgca 180gcaacagatc atggcagttg ccggcctaga ttcaagtaaa gcccagaagt actcaccgag 240tcactcgacc cggccgtggg ccacccctcc agcaacccct ccctaccggg actggacacc 300gtgctacacc ccccagatga cgccgatgta 330290401DNAHomo sapiens 290ccaggcagaa acatcgactc tgagaggccc cgaggctacc atcatcccca aggattcttg 60gaggacgatg actcgcccgt ttgctatgat tcacggagat ctccaaggag acgcctacta 120cctcccaccc cagcatccca ccggagatcc tccttcaact ttgagtgcct gcgccggcag 180agcagccagg aagaggtccc gtcgtctccc atcttccccc atcgcacggc cctgcctctg 240catctaatgc agcaacagat catggcagtt gccggcctag attcaagtaa agcccagaag 300tactcaccga gtcactcgac ccggtcgtgg gccacccctc cagcaacccc tccctaccgg 360gactggacac cgtgctacac cccccagatg acgccgatgt a 401291431DNAHomo sapiensmisc_feature(321)..(321)a or g or c or t/umisc_feature(354)..(355)a or g or c or t/umisc_feature(373)..(373)a or g or c or t/umisc_feature(401)..(401)a or g or c or t/u 291tacatcggcg tcatctgggg ggtgtagcac ggtgtccagt cccggtaggg aggggttgct 60ggaggggtgg cccacgaccg ggtcgagtga ctcggtgagt acttctgggc tttacttgaa 120tctaggccgg caactgccat gatctgttgc tgcattagat gcagaggcag ggccgtgcga 180tgggggaaga tgggagacga cgggacctct tcctggctgc tctgccggcg caggcactca 240aagttgaagg aggatctccg gtgggatgct ggggtgggag gtagtaggcg tctccttgga 300gatctccgtg aatcatagca nacgggcgag tcatcgtcct ccaagaatcc ttgnngatga 360tggtagcctc ggngcctctc agagtcgatg tttctgcctg ngtatctgct cgggcgagcc 420ggtaccgagc t 431292330DNAHomo sapiens 292tacatcggcg tcatctgggg ggtgtagcac ggtgtccagt cccggtaggg aggggttgct 60ggaggggtgg cccacgaccg ggtcgagtga ctcggtgagt acttctgggc tttacttgaa 120tctaggccgg caactgccat gatctgttgc tgcattagat gcagaggcag ggccgtgcga 180tgggggaaga tgggagacga cgggacctct tcctggctgc tctgccggcg caggcactca 240aagttgaagg aggatctccg gtgggatgct ggggtgggag gtagtaggcg tctccttgga 300gatctccgtg aatcatagca aacgggcgag 330293183DNAHomo sapiensmisc_feature(42)..(42)a or g or c or t/umisc_feature(70)..(70)a or g or c or t/umisc_feature(133)..(133)a or g or c or t/u 293gcggacagct tggtggaggc agtcctgata tccgaagcct tnggacgcta tgcaagggac 60ccaaaatttn tttcagcaac aaaacacgaa atcgctgatg cctgtaacct caccatcgac 120gagatggaga gtncagccag caccctgctt aatgggaacg tgcgtccccg agccaacggg 180gat 183294132DNAHomo sapiens 294aagaaatagg aggataagaa tatcatattt cagagctgcc ctggcacagt acctgccccg 60tcggaggctc tcactggcaa atgacagctc tgtgcaagga gcactcccaa gtataaaaat 120tattacacag tt 132295358DNAHomo sapiens 295ccattggtac gagagaaatt aggaggataa gattatctat tattctgagc tgccctggca 60cagtacctgc cccgtcggag gctctcactg gcaaatgaca gctctgtgca aggagcactc 120ccaagtataa aaattattac atagttttat tctgaagaac attttgcatt ttaataaaaa 180aggatttatg tcaggaaaga gtcatttaca taccttgaat tgtttttgcc tggatcagag 240taagaatgtc ttaagaagag gtttgtaagg tcttcataac aaagtggtgt ttgttattta 300caaaaaaaaa aaaaaaaaaa atttttatac cgggtttgtc tgtatacaaa tttctctg 358296289DNAHomo sapiens 296tccagagtag aagaaatcag ccaagtatca tttattcagc gaaaatcctc tggggattaa 60aattttaagt ttgaaagaac ttgacactac agaaattttt ctaaaatatt ttgagtcact 120ataaacctat catctttcca caagatatac cagatgacta tttgcagtct tttctttggg 180caagagttcc atgattttga tactgtacct ttggatccac catgggttgc aactgtcttt 240ggttttgttt gtttgacttg aaccaccctc tggaaagcta ctctggaaa 289297889DNAHomo sapiens 297gggattcccc cggctgggtg gggagagcga gctgggtgcc cccatagatt cccctgcccg 60aacctcatga gccgaccctc ggctccatgg agcccggaaa ttatgccacc ttggatggag 120ccaaggatat cgaaggcttg ttgggagcgg gaggggggcg gaatctggtc gcccactccc 180tctctgacca gccacccagc gcgctacgct tgatgcctgt gtcaatatgc ccccttgatc 240tgccaggctc ggggagcggc caaaagcaat gcccacccta tgctctgggg gtgcccaggg 300gactgtcccc ggctccgtgc cttatggtta ctgtggggcg gggtacatac tcctgcagag 360ttgtcccgga gctcgttgaa accttgtgcc gaggagagcc accctggcgg tacccgggaa 420gactccccag ggcgggaaga gtaccccagc ggcccaatga gttgtgcttc tatcgggata 480tccgggacct accaggccta tgtgcaggta ctggacgtgt cctgtgctgc agactctggg 540tgtccgtgga gcaccggaca ttggctcgct gtggcctgtg gccggtacca gtcttgggct 600ctcggtgtgt ggctggacac gccggttgtg ttcgcgggag accgcaccca ccaggttcct 660ttgggagggc cgctttgcag actccggggg aggcccctct gaggcggggc cttttcgggg 720gggcgaagaa agctttccga cgcaggcgct tgcggagctg gcgggacatc gggacacttc 780acccagcgaa gcgcggcttg gggcccctct gggcgcggtc tcggttgaca ccggcgaaga 840gtttcgggag aggcccatat cttctgggga gggcgttgcg tcgcccccg 8892981356DNAHomo sapiens 298ggattccccc ggcctgggtg gggagagcga gctgggtgcc ccctagattc cccgcccccg 60cacctcatga gccgaccctc ggctccatgg agcccggcaa ttatgccacc ttggatggag 120ccaaggatat cgaaggcttg ctgggagcgg gaggggggcg gaatctggtc gcccactccc 180ctctgaccag ccacccagcg gcgcctacgc tgatgcctgc tgtcaactat gcccccttgg 240atctgccagg ctcggcggag ccgccaaagc aatgccaccc atgccctggg gtgccccagg 300ggacgtcccc agctcccgtg ccttatggtt actttggagg cgggtactac tcctgccgag 360tgtcccggag ctcgctgaaa ccctgtgccc aggcagccac cctggccgcg taccccgcgg 420agactcccac ggccggggaa gagtacccca gccgccccac tgagtttgcc ttctatccgg 480gatatccggg aacctaccag cctatggcca gttacctgga cgtgtctgtg gtgcagactc 540tgggtgctcc tggagaaccg cgacatgact ccctgttgcc tgtggacagt taccagtctt 600gggctctcgc tggtggctgg aacagccaga tgtgttgcca gggagaacag aacccaccag 660gtcccttttg gaaggcagca tttgcagact ccagcgggca gcaccctcct gacgcctgcg 720cctttcgtcg cggccgcaag aaacgcattc cgtacagcaa ggggcagttg cgggagctgg 780agcgggagta tgcggctaac aagttcatca ccaaggacaa gaggcgcaag atctcggcag 840ccaccagcct ctcggagcgc cagattacca tctggtttca gaaccgccgg gtcaaagaga 900agaaggttct cgccaaggtg aagaacagcg ctacccctta agagatctcc ttgcctgggt 960gggaggagcg aaagtggggg tgtcctgggg agaccaggaa cctgccaagc ccaggctggg 1020gccaaggact ctgctgagag gcccctagag acaacaccct tcccaggcca ctggctgctg 1080gactgttcct caggagcggc ctgggtaccc agtatgtgca gggagacgga accccatgtg 1140acagcccact ccaccagggt tcccaaagaa cctggcccag tcataatcat tcatcctgac 1200agtggcaata atcacgataa ccagtactag ctgccatgat cgttagcctc atattttcta 1260tctagagctc tgtagagcac tttagaaacc gctttcatga attgagctaa ttatgaataa 1320atttggaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 1356299727DNAHomo sapiens 299attcccccgg cctgggtggg gagagcgagc tgggtgcccc ctagattccc cgcccccgca 60cctcatgagc cgaccctcgg ctccatggag cccggcaatt atgccacctt ggatggagcc 120aaggatatcg aaggcttgct gggagcggga ggggggcgga atctggtcgc ccactcccct 180ctgaccagcc acccagcggc gcctacgctg atgcctgctg tcaactatgc ccccttggat 240ctgccaggct cggcggagcc gccaaagcaa tgccacccat gccctggggt gccccagggg 300acgtccccag ctcccgtgcc ttatggttac tttggaggcg ggtactactc ctgccgagtg 360tcccggagct cgctgaaacc ctgtgcccag gcagccaccc tggccgcgta ccccgcggag 420actcccacgg ccggggaaga gtaccccagc cgccccactg agtttgcctt ctatccggga 480tatccgggaa cctaccagcc tatggccagt tacctggacg tgtctgtggt gcagactctg 540ggtgctcctg gagaaccgcg acatgactcc ctgttgcctg tggacagtta ccagtcctgg 600gctctcgctg gtggctggaa cagccagatg tgttgccagg gagaacagaa cccaccaggt 660ccccttttgg aaggcagcat ttgcagactc cagcgggcag caccctcctg acgcctgcgc 720ctttcgt 727300793DNAHomo sapiens 300gcaggcgact tgcgagctgg gagcgattta aaacgctttg gattcccccg gcctgggtgg 60ggagagcgag ctgggtgccc cctagattcc ccgcccccgc acctcatgag ccgaccctcg 120gctccatgga gcccggcaat tatgccacct tggatggagc caaggatatc gaaggcttgc 180tgggagcggg aggggggcgg aatctggtcg cccactcccc tctgaccagc cacccagcgg 240cgcctacgct gatgcctgct gtcaactatc cccccttgga tctgccaggc tcggcggagc 300cgccaaagca atgccaccca tgccctgggg tgccccaggg gacgtcccca gctcccgtgc 360cttatggtta ctttggaggc gggtactact cctgccgagt gtcccggagc tcgctgaaac 420cctgtgccca ggcagccacc ctggccgcgt accccgcgga gactcccacg gccggggaag 480agtaccccag ccgccccact gagtttgcct tctatccggg atatccggga acctaccagc 540ctatggccag ttacctggac gtgtctgtgg tgcagactct gggtgctcct ggagaaccgc 600gacatgactc cctgttgcct gtggacagtt accagtcttg ggctctcgct ggtggctgga 660acagccagat gtgttgccag ggagaacaga agccaccagg tcccttttgg aaggcagcat 720ctgcagactc cagcgggcag gacctcctga cgcctgcggc ctttcgtcgc gagcgcaaga 780aacgcattcc gta 793301759DNAHomo sapiens 301ggatttaaaa cgctttggat tcccccggcc tgggtgggga gagcgagctg ggtgccccct 60agattccccg cccccgcacc tcatgagccg accctcggct ccatggagcc cggcaattat 120gccaccttgg atggagccaa ggatatcgaa ggcttgctgg gagcgggagg ggggcggaat 180ctggtcgccc actcccctct gaccagccac ccagcggcgc ctacgctgat gcctgctgtc 240aactatgccc ccttggatct gccaggctcg gcggagccgc caaagcaatg ccacccatgc 300cctggggtgc cccagggacg tccccagctc ccgtgcctta tggttacttt ggaggcgggt 360actactcctg ccgagtgtcc cggagctcgc tgaaaccctg tgcccaggca gccaccctgg 420ccgcgtaccc cgcggagact cccacggccg gggaagagta ccccagccgc cccactgagt 480ttgccttcta tccgggatat ccgggaacct accagcctat ggccagttac ctggacgtgt 540ctgtggtgca gactctgggt gctcctggag aaccgcgaca tgactccctg ttgcctgtgg 600acagttacca gtcttgggct ctcgctggtg ggctggaaca gccagatgtg ttgccagcgc 660agaacagaac ccaccaggtc ccttttggaa ggcagcattt gcagactcca gcgggcagaa 720ccctcctgac gcctgcgcct ttcgttcgcg ggcgaaaaa 759302614DNAHomo sapiens 302aagaaacgca ttccgtacag caaggggcag ttgcgggagc tggagcggga gtatgcggct 60aacaagttca tcaccaagga caagaggcgc aagatctcgg cagccaccag cctctcggag 120cgccagatta ccatctggtt tcagaaccgc cgggtcaaag agaagaaggt tctcgccaag 180gtgaagaaca gcgctacccc ttaagagatc tccttgcctg ggtgggagga gcgaaagtgg 240gggtgtcctg gggagaccag gaacctgcca agcccaggct ggggccaagg actctgctga 300gaggccccta gagacaacac ccttcccagg ccactggctg ctggactgtt cctcaggagc 360ggcctgggta cccagtatgt gcagggagac ggaaccccat gtgacagccc actccaccag 420ggttcccaaa gaacctggcc cagtcataat cattcatcct gacagtggca ataatcacga 480taaccagtac tagctgccat gatcgttagc ctcatatttt ctatctagag ctctgtagag 540cactttagaa accgctttca tgaattgagc taattatgaa taaatttgga aggcgaaaaa 600aaaaacctcg tgcc 614303318DNAHomo sapiens 303attcggcacg aggttttttt tttcgccttc caaatttatt cataattagc tcaattcatg 60aaagcggttt ctaaagtgct ctacagagct ctagatagaa aatatgaggc taacgatcat 120ggcagctagt actggttatc gtgattattg ccactgtcag gatgaatgat tatgactggg 180ccaggttctt tgggaaccct ggtggagtgg gctgtcacat ggggttccgt ctccctgcac 240atactgggta cccaggccgc tcctgaggaa cagtccagca accagtggcc tgggaagggt 300gttgtctcta ggggcctc 3183041483DNAHomo sapiens 304gggtggggag agcgagctgg gtgcccccta gattccccgc ccccgcacct catgagccga 60ccctcggctc catggagccc ggcaattatg ccaccttgga tggagccaag gatatcgaag 120gcttgctggg agcgggaggg gggcggaatc tggtcgccca ctcccctctg accagccacc 180cagcggcgcc tacgctgatg cctgctgtca actatgcccc cttggatctg ccaggctcgg 240cggagccgcc aaagcaatgc cacccatgcc ctggggtgcc ccaggggacg tccccagctc 300ccgtgcctta tggttacttt ggaggcgggt actactcctg ccgagtgtcc cggagctcgc 360tgaaaccctg tgccaggcag ccaccctggc cgcgtaaccc gacggagact ctcacgtgcg 420gggaagagta cccctagcgc cccacatgag tttgccttct atccgggata tccgggaccg 480taccagccta tggcagttac ctggacgtgt ctgtggtgcc gactctgggt gctcctggag 540aaccgcggac atgactcctt gtttgctgtg cgacgctcac cagtctgggc tcctcgtcgg 600tggtcgcact cccacttttt gccgggcgac atcccccggg gccccttccg gaacagcgac 660cttgcgagcc cccggggaca cacccccgta agcggcctat catcgctgat aaacctcatc 720agagggcacc gaaagccgcg actctaaccc ccccactacg actcacgacc gcacaggtac 780tcgaaccgcc caatatctgg ttctaaccca tggcgcatct cagccgctag agagccaacc 840aaacgcgcca cgcgcaacca cactacacca cggcacccct ttcatctcac tcccacgccg 900atcactcttc accctccaga atcattcccc tcgcacatcc tacctatctc atgcctccca 960gttcacccca ttccctcccc taatctcacc cacacattca cgcacgttct cactacgctt 1020cgctccgacc cacatcctca cccccacatt cataccactt caccatcacg acccccccct 1080ctcatcgact cctgtctcat tctcaaccac agtactacca gctccaacac accactcacc 1140ccaagctatc catcacctac acgctttcac ccctcaccgc tcccaagtaa ttcagatcac 1200tcaaacacaa tctgctacat actcatccct cccccactcc cagtacagtc caaccaccga 1260ccaactacct ccgcgccacc cgcgccgccc cacctcaccg gccccaaccg cccgcacagg 1320gcacgcaccc cccggcaacc gcgcgatccg gccgtacaca ctcttgggcg gcacgcagct 1380gaggacattc cgcgggagcg ccccaccgtg ggctacgtgg gtcgcgaccc ggcggggcgc 1440gtgcggcgtc gcccgcccgc ccgccgactg cgacccagtc gag 1483305758DNAHomo sapiensmisc_feature(561)..(561)a or g or c or t/umisc_feature(656)..(656)a or g or c or t/u 305ggggctttgg attcccccgg cctgggtggg gagagcgagc tgggtgcccc ctagattccc 60cgcccccgca cctcatgagc cgaccctcgg ctccatggag cccggcaatt atgccacctt 120ggatggagcc aaggatatcg aaggcttgct gggagcggga ggggggcgga atctggtcgc 180ccactcccct ctgaccagcc acccagcggc gcctacgctg atgcctgctg tcaactatgc 240ccccttggat ctgccaggct

cggcggagcc gccaaagcaa tgccacccat gccctggggt 300gccccagggg acgtccccag ctcccgtgcc ttatggttac tttggaggcg ggtactactc 360ctgccgagtg tcccggagct cgctgaaacc ctgtgcccag gcagccaccc tggccgcgta 420ccccgcggag actcccacgg ccggggaaga gtaccccagc cgccccactg agtttgcctt 480ctatccggga tatccgggaa cctaccagcc tatggccagt tacctggacg tgtctgtggt 540gcagactctg ggtgctcctg nagaaccgcg acatgactcc ctgttgcctg tggacagtta 600ccagtcttgg gctctcgctg gtggcctgga acagcccaga tgtgtttgcc cagggnagaa 660cacgaacccc acccggttcc cccttttggg aaagggcagc cattttggcc agccttccaa 720gcggggccaa ccaccccctc ccctggacag gccctggt 758306476DNAHomo sapiens 306gcggccgcaa gaaacgcatt ccgtacagca aggggcagtt gcgggactgg agcgggagta 60tgcggctaac aagttcatca ccaaggacaa gaggcgcaag atctcggcag ccaccagcct 120ctcggagcgc cagattacca tctggtttca gaaccgccgg gtcaaagaga agaaggttct 180cgccaaggtg aagaacagcg ctacccctta agagatctcc ttgcctgggt gggaggagcg 240aaagtggggg tgtcctgggg agaccaggaa cctgccaagc ccaggctggg gccaaggact 300ctgctgagag gcccctagag acaacaccct tcccaggcca ctggctgctg gactgttcct 360caggagcggc ctgggtaccc agtatgtgca gggagacgga accccatgtg acagcccatt 420ccaccagggt tcccaaagaa cctggcccag tcataatcat tcatcctgac agtggc 476307552DNAHomo sapiens 307agcggccgca agaaacgcat tccgtacagc aaggggcagt tgcgggagct ggagcgggag 60tatgcggcta acaagttcat caccaaggac aagaggcgca agatctcggc agccaccagc 120ctctcggagc gccagattac catctggttt cagaaccgcc gggtcaaaga gaagaaggtt 180ctcgccaagg tgaagaacag cgctacccct taagagatct ccttgcctgg gtgggaggag 240cgaaagtggg ggtgtcctgg ggagaccagg aacctgccaa gcccaggctg gggccaagga 300ctctgctgag aggcccctag agacaacacc cttcccaggc cactggctgc tggactgttc 360ctcaggagcg gcctgggtac ccagtatgtg cagggagacg gaaccccatg tgacagccca 420ctccaccagg gttcccaaag aacctggccc agtcataatc attcatcctg acagtggcaa 480taatcacgat aaccagtact agctgccatg atcgttagcc tcatattttc tatctagagc 540tctgtagagc ac 552308447DNAHomo sapiens 308gcggccgcaa gaaacgcatt ccgtacagca aggggcagtt gcgggactgg agcgtgagta 60tgcggctaac aagttcatca ccaaggacaa gaggcgcaag atctcggcag ccaccagcct 120ctcggagcgc cagattacca tctggtttca gaaccgccgg gtcaaagaga agaaggttct 180cgccaaggtg aagaacagcg ctacccctta agagatctcc ttgcctgggt gggaggagcg 240aaagtggggg tgtcctgggg agaccaggaa cctgccaagc ccaggctggg gccaaggact 300ctgctgagag gcccctagag acaacaccct tcccaggcca ctggctgctg gactgttcct 360caggagcggc ctgggtaccc agtatgtgca gggagacgga accccatgtg acagcccact 420ccaccagggt tcccaaagaa cctggcc 447309418DNAHomo sapiens 309tttttttttt tttttttttc gccttccaaa tttattcata attagctcaa ttcatgaaag 60cggtttctaa agtgctctac aaagctctaa ataaaaaata tgaggctaac gatcatggca 120gctagtactg gttatcggga ttattgccac tgtcaggatg aatgattatg actgggccag 180gttctttggg aaccctggtg gagtgggctg tcacatgggg ttccgtctcc ctgcacatac 240tgggtaccca ggccgttcct gaggaacagt ccaccaccca gtggcctggg aagggtgttg 300tctctagggg cctctcaaca aagtccttgg ccccagcctg ggcttggcag gttcctggtc 360tccccaggac acccccactt tcgctcctcc cacccaggca aggagatctc ttaagggg 418310405DNAHomo sapiensmisc_feature(6)..(6)a or g or c or t/umisc_feature(380)..(380)a or g or c or t/u 310gacgcnaggt atgcggctaa caagttcatc accaaggaca agaggcgcaa gatctcggca 60gccaccagcc tctcggagcg ccagattacc atctggtttc agaaccgccg ggtcaaagag 120aagaaggttc tcgccaaggt gaagaacagc gctacccctt aagagatctc cttgcctggg 180tgggaggagc gaaagtgggg gtgtcctggg gagaccagga acctgccaag cccaggctgg 240ggccaaggac tctgctgaga ggcccctaga gacaacaccc ttcccaggcc actggctgct 300ggactgttcc tcaggagcgg cctgggtacc catgtatgtg cagggagacg gaaccccatg 360tgacagccca ctccaccagn gttcctaaag aaccctggcc agtca 405311369DNAHomo sapiensmisc_feature(301)..(301)a or g or c or t/umisc_feature(318)..(318)a or g or c or t/u 311gcaggcgact tgcgagctgg gagcggttta aaacgctttg gattcccccg gcctgggtgg 60ggagagcgag ctgggtgccc cctagattcc ccgcccccgc acctcatgag ccgaccctcg 120gtccatggac acggcaatta tgccaccttg gatggagcca aggatatcga aggcttgctg 180ggagcgggag gggggcggaa tctggtcgcc cactcccctc tgaccagcca cccagcggcg 240cctacgctga tgcctgctgt caactatgcc cccttggatc tgccaggctc ggcggactct 300naaagcatat gccacccnat gccctggggt gccccagggg aacgtcccca gctcccgtgc 360cttatggtt 369312374DNAHomo sapiens 312gcggccgcaa gaaacgcatt ccgtacagca aggggcagtt gcgggagctg gagcgggagt 60atgcggctaa caagttcatc accaaggaca agaggcgcaa gatctcggca gccaccagcc 120tctcggagcg ccagattacc atctggtttc agaaccgccg ggtcaaagag aagaaggttc 180tcgccaaggt gaagaacagc gctacccctt aagagatctc cttgcctggg tgggaggagc 240gaaagtgggg gtgtcctggg gagaccagga acctgccaag cccaggctgg ggccaaggac 300tctgctgaga ggcccctaga gacaacaccc ttcccaggcc actggctgct ggactgttcc 360tcaggagcgg cctg 374313337DNAHomo sapiens 313gtcgacgaac agcgctaccc cttaagagat ctccttgcct gggtgggagg agcgaaagtg 60ggggtgtcct ggggagaccg ggaactgcca agcccaggct ggggcaagga ctctgctgag 120aggcccctag agacaacacc cttcccaggc cactgctgct ggactgttcc tcaggagcgg 180cctgggtacc cagtatgtgc agggagacgg aaccccatgt gacagcccac tccaccaggg 240ttcccaaaga acctggccca gtcataatca ttcatcctga cagtggcaat aatcacgata 300accagtactc agctgccatg atcgttagcc tcatatt 337314452DNAHomo sapiens 314gcgtcgaccc cttgaagaga tctccttgcc tgggtgggag gagcgaaagt gggggtgtcc 60tggggagacc aggaacctgc caagcccagg ctggggccaa ggactctgct gagaggcccc 120tagagacaac acccttccca ggccactggc tgctggactg ttcctcagga gcggcctggg 180tacccagtat gtgcagggag acggaacccc atgtgacagc ccactccacc agggttccca 240aagaacctgg cccagtcata atcattcatc ctgacagtgg caataatcac gataaccagt 300actagctgcc atgatcgtta gcctcatatt ttctatctag agctctgtag agcacttgta 360gaaaccgctt tcatgaattg agctaattat gaatagattt ggaaggggaa aaaagtggaa 420aaagttttgc ccaaagtggg tcgtttacgt cg 452315358DNAHomo sapiens 315ctccctggca acacatctgg ctgttccagc accagcgaga cccaagactg gtaactgtcc 60acaggcaaca gggagtcatg tcgcggttct ccaggagcac ccagagtctg caccacagac 120acgtccaggt aactggccat agctgagtag gttcccggat atcccggata gaaggcaaac 180tcagtggggc ggctggggta ctcttccccg gccgtggaga gtctccgcgg ggtacggccc 240agggtggctg cctgggcatc agggtttcag cgagctccgg gacactcggc aggagtagta 300cccgcctcca aagtaaccat aaggcacggg agctggggac gtccctgggg caccccag 358316474DNAHomo sapiens 316tttaaaacgc tttggattcc cccggcctgg gtggggagag cgagctgggt gccccctaga 60ttccccgccc ccgcacctca tgagccgacc ctcggtccat ggagccggcg aattatgcca 120ccttggatgg agccaaggat atcgaaggct tgctgggagc gggagggggg cggaatctgg 180tcgcccactc ccctctgacc agccacccag cggcgctacg tgatgcctgc tgtcaactat 240gcccttggat ctgccagctc gcggagccaa agcaatgcca cccatgccct ggggtgcccc 300aggtgacgtc cccagctccc gtgccttatg gttactttgg aggcgggtac tactcctgcc 360gagtgtcccg gagctcgctg aaaccctgtg cccaggcagc caccctggcc gcgtaccccg 420cgatgactcc cacggccggg gaagagtacc ccagccgccc cactgagttt gcct 474317739DNAHomo sapiensmisc_feature(616)..(616)a or g or c or t/umisc_feature(678)..(678)a or g or c or t/umisc_feature(730)..(730)a or g or c or t/u 317caggcgactt gcgagtctgg gagcgattta aaacgctttg gattcccccg gcctgggtgg 60ggagagcgag ctgggtgccc cctagattcc ccgcccccgc acctcatgag ccgaccctcg 120gctccatgga gcccggcaat tatgccacct tggatggagc caaggatatc gaaggcttgc 180tgggagcggg aggggggcgg aatctggtcg cccactcccc tctgaccagc cacccagcgg 240cgcctacgct gatgcctgct gtcaactatg cccccttgga tctgccaggc tcggcggagc 300cgccaaagca atgccaccca tgccctgggg tgccccaggg gacgtcccca gctcccgtgc 360cttatggtta ctttggaggc gggtactact cctgccgagt gtcccggagc tcgctgaaac 420cctgtgccca ggcagccacc ctggccgcgt accccgcgga gactcccacg gccggggaag 480agtaccccag ccgccccact gagtttgcct tctatccggg atatccggga acctaccagc 540ctatggccag ttaccttgga cgtgtctgtg gtgcagactc tgggtgctcc tggagaaccg 600cgacatgact ccctgntgcc tgtggacagt taccagtctt gggctctcgc tggtggctgg 660aacagccaga tgtgttgnca gggagaacag aacccaccag gtcccttttg gaaggcagat 720ttgcagactn cagcgggca 739318924DNAHomo sapiens 318aggcagccac cctggccgcg taccccgcgg agactcccac ggccggggaa gagtacccca 60gccgccccac tgagtttgcc ttctatccgg gatatccggg aacctaccag cctatggcca 120gttacctgga cgtgtctgtg gtgcagactc tgggtgctcc tggagaaccg cgacatgact 180ccctgttgcc tgtggacagt taccagtctt gggctctcgc tggtggctgg aacagccaga 240tgtgttgcca gggagaacag aacccaccag gtcccttttg gaaggcagca tttgcagact 300ccagcgggca gcaccctcct gacgcctgcg cctttcgtcg cggccgcaag aaacgcattc 360cgtacagcaa ggggcagttg cgggagctgg agcgggagta tgcggctaac aagttcatca 420ccaaggacaa gaggcgcaag atctcggcag ccaccagcct ctcggagcgc cagattacca 480tctggtttca gaaccgccgg gtcaaagaga agaaggttct cgccaaggtg aagaacagcg 540ctacccctta agagatctcc ttgcctgggt gggaggagcg aaagtggggg tgtcctgggg 600agaccaggaa cctgccaagc cccaggctgg ggccaaggac tctgctgaga ggcccctaga 660gacaacaccc ttcccaggcc actggctgct ggactgttcc tcaggagcgg cctgagtacc 720ccgtatgtgc aggggagacg gaaccccctg tgaccagccc ccctccaccc gtggtctccc 780agataacctg gcccccactc ataaatcatt tcttcccggg ccgggggcca atcattcccc 840gaactacccc ggtaccttat acaattagat tggacatgaa tcctctcggg ggcattccct 900atggcgctga ggcccctcac acct 924319566DNAHomo sapiensmisc_feature(421)..(421)a or g or c or t/umisc_feature(541)..(541)a or g or c or t/u 319gggtgctgtc ctctggagtc tgcaaatgct gccttccaaa agggacctgg tgggttctgt 60tctccctggc aacacatctg gctgttccag ccaccagcga gagcccaaga ctggtaactg 120tccacaggca acagggagtc atgtcgcggt tctccaggag cacccagagt ctgcaccaca 180gacacgtcca ggtaactggc cataggctgg taggttcccg gatatcccgg atagaaggca 240aactcaatgg ggcggctggg gtactcttcc ccggccgtgg gagtctccgc ggggtacgcg 300gccagggtgg ctgcctgggc acagggtttc agcgagctcc gggacactcg gcaggagtag 360tacccgcctc caaagtaacc ataaggcacg ggagctgggg acgtcccctg gggcacccca 420nggcatgggt ggcattgctt tggcggctcc gccgagcctg gcagatccaa gggggcatag 480ttgacagcag gcatcagcgt aggcgccgct gggtggctgg tcaaaaggga gtggcgacca 540nattccgccc ccctcccgct tcccag 566320526DNAHomo sapiensmisc_feature(472)..(472)a or g or c or t/umisc_feature(501)..(501)a or g or c or t/u 320gggtgctgcc cgctggagtc tgcaaatgct gccttccaaa agggacctgg tgggttctgt 60tctccctggc aacacatctg gctgttccag ccaccagcga gagcccagga ctggtaactg 120tccacaggca acagggagtc atgtcgcggt tctccaggag cacccagagt ctgcaccaca 180gacacgtcca ggtaactggc cataggctgg taggttcccg gatatcccgg atagaaggca 240aactcagtgg ggcggctggg gtactcttcc ccgccgtggg agtctccgcg gggtacgcgg 300ccagggtggc tgcctgggca cagggtttca gcgagctccg ggacactcgg caggagtagt 360acccgcctcc aaagtaacca taaggcacgg gagctgggga cgtcccctgg ggcaccccag 420ggcatgggtg gcattgcttt ggcggctccg ccgagcctgg cagatccaag gnggcatagt 480tgacagcagg catcagcgta ngcgccgctg ggtggctgtc aagagg 526321471DNAHomo sapiens 321tcgacgttac ctggacgtgt ctgtggtgca gactctgggt gctcctggag aaccgcgaca 60tgactccctg ttgcctgtgg acagttacca gtcttgggct ctcgctggtg gctggaacag 120cagatgtgtt gccagggaga acagaaccca ccaggtccct tttggaaggc agcatttgca 180gactccagcg ggcagcaccc tcctgacgcc tgcgcctttc gtcgcggccg caagaaacgc 240attccgtaca gcaaggggca gttgcgggac tggagcggga gtatgcggct aacaagttca 300tcaccaagga caagaggcgc aagatctcgg cagccaccag cctctcggag cgccagatta 360ccatctggtt tcagaaccgc cgggtcaaag agaagaaggt tctcgccaag gtgaagaaca 420gcgctacccc ttaagagatc tccttgcctg ggtgggagga gcgaaagtgt g 471322545DNAHomo sapiensmisc_feature(427)..(427)a or g or c or t/u 322gtcaggaggg tgctgcccgc tggagtctgc aaatgctgcc ttccaaaagg gacctggtgg 60gttctgttct ccctggcaac acatctggct gttccagcca ccagcgagag cccaggactg 120gtaactgtcc acaggcaaca gggagtcatg tcgcggttct ccaggagcac ccagagtctg 180caccacagac acgtccaggt aactggccat aggctggtag gttcccggat atcccggata 240gaaggcaaac tcagtggggc ggctggggta ctcttccccg gccgtgggag tctccgcggg 300gtacgcggcc agggtggctg cctgggcaca gggtttcagc gagctccggg acactcggca 360tgagtagacc cgccttccaa gtaaccataa ggcacgggag ctggtaacgt cccctggggc 420accccanggc catgggtgca ttgctttggc ggctccgccg agccctgcag atccaaggtg 480ggcatattga cagcaggcat tcacgtatgc gccccctggg tggctgtcat attggggatt 540gcgac 545323438DNAHomo sapiensmisc_feature(366)..(366)a or g or c or t/umisc_feature(375)..(375)a or g or c or t/umisc_feature(415)..(415)a or g or c or t/umisc_feature(419)..(420)a or g or c or t/u 323gcaggcgtca ggagggtgct gcccgctgga gtctgcaaat gctgccttcc aaaagggacc 60tggtgggttc tgttctccct ggcaacacat ctggctgttc cagccaccag cgagagccca 120agactggtaa ctgtccacag gcaacaggga gtcatgtcgc ggttctccag gagcacccag 180agtctgcacc acagacacgt ccaggtaact ggccataggc tggtaggttc ccggatatcc 240cggatagaag gcaaactcag tggggcgact ggggtactct tcccggccgt ggggagtctc 300cgcggggtac gcggccaggg gtggctgcct gggcaccagg ggtttcagcg agctccggga 360cactcngcag gaaantagta cccgcctccc aaagtaacca taagcaccgg actgngggnn 420ggacgtcccc tggggcac 438324370DNAHomo sapiens 324gcgaccggac gaaaggaggc gtcaggaggg tgctgcccgc tggagtctgc aaatgctgcc 60ttccaaaagg gacctggtgg gttctgttct ccctggcaac acatctggct gttccagcac 120cagcgagacc caagactggt aactgtccac aggcaacagg gagtcatgtc gcggttctcc 180aggagcaccc agagtctgca ccacagacac gtccaggtaa ctggccatag ctaggtaggt 240tcccggatat cccggataga aggcaaactc agtggggcga ctggggtact cttccccggc 300cgtgggagtc tccgcggggt acgcccatgg gtggctgcct gggcacaggg tttcagcgag 360ctccgggaca 370325495DNAHomo sapiens 325gcaggcgtca ggagggtgct gcccgctgga gtctgcaaat gctgccttcc aaaagggacc 60tggtgggttc tgttctccct ggcaacacat ctggctgttc cagccaccag cgagagccca 120agactggtaa ctgtccacag gcaacaggga gtcatgtcgc ggttctccag gagcacccag 180agtctgcacc acagacacgt ccaggtaact ggccataggc tggtaggttc ccggatatcc 240cggatagaag gcaaactcag tggggcgact ggggtactct tccccggccg tgggagtctc 300cgcggggtac gcggccaggg tggctgcctg ggcacagggt ttcagcgagc tccgggacac 360tcggcaggag tagtacccgc ctccaaagta accataaggc acgggagctg gatgcgtccc 420ctagggcacc ccatggcatg ggtggcattg ctttggcggc tccgccgagc ctggcagatc 480caaggaggca ctgtt 495326408DNAHomo sapiens 326gggtgctgcc cgctggagtc tgcaaatgct gccttccaaa agggacctgg tgggttctgt 60tctccctggc aacacatctg gctgttccag ccaccagcga gacccaagac tggtaactgt 120ccacaggcaa cagggagtca tgtcgcggtt ctccaggagc acccagagtc tgcaccacag 180acacgtccag gtaactggcc ataggctggt aggttcccgg atatcccgga tagaaggcaa 240actcagtggg gcggctgggg tactcttccc cggccgtggg agtctccgcg gggtacgcgt 300ccagggtggc tgcctgggca cagggtttca gcgagctccg ggacactcgg caggagtagt 360acccgcctcc aaagtaacca taaggcacgg gagctgggga cgtccctg 408327344DNAHomo sapiens 327gggtgctgcc cgctggagtc tgcaaatgct gccttccaaa agggacctgg tgggttctgt 60tctccctggc aacacatctg gctgttccag ccaccagcga gacccaagac tggtaactgt 120ccacaggcaa cagggagtca tgtcgcggtt ctccaggagc acccagagtc tgcaccacag 180acacgtccag gtaactggcc ataggtggta ggttcccgga tatcccggat agaaggcaaa 240ctcagtgggg cggctggggt actcttcccc ggccgtggga gtctccgcgg ggtacgcggc 300cagggtggct gcctgggcac agggtttcag cgagctccgg gaca 344328334DNAHomo sapiens 328gggtgctgcc cgctggagtc tgcaaatgct gccttccaaa agggacctgg tgggttctgt 60tctccctggc aacacatctg gctgttcctg ccaccagcga gagcccaaga ctggtaactg 120tccacaggca acagggagtc atgtcgcggt tctccaggag cacccagagt ctgcaccaca 180gacacgtcca ggtaactggc cataggctgg taggttcccg gatatcccgg atagaaggca 240aactcagtgg ggcggctggg gtactcttcc ccggccgtgg gagtctccgc ggggtacgcg 300gccagggtgg ctgcctgggc acagggtttc agcg 334329288DNAHomo sapiensmisc_feature(207)..(207)a or g or c or t/u 329gggtgctgcc cgctggagtc tgcaaatgct gccttccaaa agggacctgg tgggttctgt 60tctccctggc aacacatctg gctgttccag ccaccagcga gacccaagac tggtaactgt 120ccacaggcaa cagggagtca tgtcgcggtt ctccaggagc acccagagtc tgcaccacag 180acacgtccag gtaactggcc ataggtnggt aggttcccgg atatcccgga tagaaggcaa 240actcagtggg gcggctgggg tactcttccc cggccgtggg agtctccg 288330343DNAHomo sapiensmisc_feature(238)..(238)a or g or c or t/umisc_feature(300)..(300)a or g or c or t/umisc_feature(321)..(321)a or g or c or t/u 330ctccctggca acacatctgg ctgttccagc accagcgaga gccaagactg gtaactgtcc 60acaggcaaca gggagtcatg tcgcggttct ccaggagcac ccagagtctg caccacagac 120acgtccaggt aactggccat aggtcggtag gttcccggat atcccggata gaaggcaaac 180tcagtggggc gactggggta ctcttccccg gccgtgggag tctccgcggg gtacggcnac 240agggtggctg cctgggcaca gggtttcagc gagctccggg acactcggca ggagtagtan 300ccgcctcaaa gtaaccataa ngcacgggag ctggggacgt ccc 343331441DNAHomo sapiensmisc_feature(379)..(379)a or g or c or t/u 331acgaaaggcg caggcgtcag gagggtgctg cccgctggag tctgcaaatg ctgccttcca 60aaagggacct ggtgggttct gttctccctg gcaacacatc tggctgttcc agccaccagc 120gagagcccaa gactggtaac tgtccacagg caacagggag tcatgtcgcg gttctccagg 180agcacccaga gtctgcacca cagacacgtc caggtaactg gccataggct ggtaggttcc 240cggatatccc ggatagaagg caaactcagt ggggcgactg gggtactctt ccccggcccg 300gggagtctcc gcggggtacg cggccagggt ggctgcctgg gcacagggtt tcagcgagct 360ccgggacact cggcggagnt agtacccgcc tccaaagtaa ccataaggca cgggagctgg 420ggaaccgtcc cctggggcac c 441332729DNAHomo sapiens 332gagcgagctg ggtgccccct agattccccg cccccgcacc tcatgagccg accctcggct 60ccatggagcc cggcaattat gccaccttgg atggagccaa ggatatcgaa ggcttgctgg 120gagcgggagg ggggcggaat ctggtcgccc actcccctct gaccagccac ccagcggcgc 180ctacgctgat gcctgctgtc aactatgccc ccttggatct gccaggctcg gcggagccgc 240caaagcaatg ccacccatgc cctggggtgc cccagggacg tccccagctc ccgtgcctta 300tggttacttt ggaggcgggt actactcctg ccgagtgtcc cggagctcgc tgaaaccctg 360tgcccaggca gccaccctgg ccgcgtaccc cgcggagact cccacggccg gggaagagta 420ccccagccgc cccactgagt ttgccttcta

tccgggatat ccgggaacct accagcctat 480ggccagttac ctggacgtgt ctgtggtgca gactctgggt gctcctggag aaccgcgaca 540tgactccctg ttgcctgtgg acagttacca gtcttgggct ctcgctggtg gctggaacag 600ccagatgtgt tgccagggag aacagaaccc accaggtccc tttttggaag gcagcatttg 660cagactccag cggcaggacc tcctgaacgc ctgcgccttt cgtcgcggcg tctaaagtaa 720tcctcgagg 729333502DNAHomo sapiensmisc_feature(301)..(301)a or g or c or t/umisc_feature(479)..(479)a or g or c or t/u 333gcggccgcgg cccaccacca actgctcgcc accgacccca ctactcgcca ccgacccgct 60gctcggagct tcggttctgc gggttgtcca gacttcaggc ctgtgcgctc aatcgtggag 120aatgcgccgg caggcccccc acccccagcc taaggtgcag gaaggaccag cacgaacccg 180ctggctttgc tgcgcggcca ggagatgagt cccaccgggc actgagccca ggtacaggac 240atcagagaat gaacacagag gcagaggccc tcatgtccct ctcagagtcc cggctctgca 300nagagcccgt ctgtctccag cttccagaat tccgcactgt gaatctgtct acgtggactg 360ggaaaacagg gttggcacca ctctgccact ccgtttgtgc ctgggaaggg ctaagtatgc 420aaggctacaa acatctactt cactgggatc ccaaatgctc aacaaaccat gacctgctnt 480ggtcagaacc accagaaata tt 502334282DNAHomo sapiens 334gcaggcgact tgcgagctgg gagcacttta aaacgctttg gattcccccg gcctgggtgg 60ggagagcgag ctgggtgccc cctagattcc ccgcccccgc acctcatgag ccgaccctcg 120gctccatgga gcctggcata ttatgccacc ttggtatgga gccaaggata tcgaaggctt 180gctgggagcg ggaggggggc ggaatctggt cgcccactcc cctctgacca gccacccagc 240ggcgcctacg ctgatgcctg ctgtcaacta tgcccccttg ga 282335381DNAHomo sapiensmisc_feature(201)..(201)a or g or c or t/u 335gcccgctgga gtctgcaaat gctgccttcc aaaagggacc tggtgggttc tgttctccct 60ggcaacacat ctggctgttc cagccaccag cgagacgcca agactggtaa ctgtccacag 120gcaacaggga gtcatgtcgc ggttctccag gagcacccag agtctgcacc acagacacgt 180ccaggtaact ggccataggt nggtaggttc ccggatatcc cggatagaag gcaaactcag 240tggggcggct ggggtactct tccccggccg tgggagtctc cgcggggtac gcgcacaggg 300tggctgcctg ggcacagggt ttcagcgagc tccgggacac tcggcaggag tagtacccgc 360ctccaaagta accataaggc a 381336405DNAHomo sapiens 336aactgctcgc caccgacccc actactcgcc accgacccgc tgctcggagc ttcggttctg 60cgggttgtcc agacttcagg cctgtgcgct caatcgtgga gaatgcgccg gcagccccca 120cccccagcct aaggtgcagg aaggaccagc acgaacccgc tggctttgct gcgcggccag 180gagatgagtc ccaccgggca ctgagcccag gtacaggaca tcagagaatg aacacagagg 240cagaggccct catgtccctc tcagagtccc ggctctgcaa agagcccgtc tgtctccagc 300ttccagaatt ccgcactgtg aatctgtcta cgtggactgg gaaaacaggg ttggcaccac 360tctgccactc cgtttgtgcc tgggaagggc taagtatgca aggct 405337328DNAHomo sapiens 337gatccctttg cagggaagct ttctctcaga cccccttcca ttacacctct caccctggta 60acagcaggaa gactgaggag aggggaacgg gcagattcgt tgtgtggctg tgatgtccgt 120ttagcatttt tctcagctga cagctgggta ggtggacaat tgtagaggct gtctcttcct 180ccctccttgt ccaccccata gggtgtaccc actggtcttg gaagcaccca tccttaatac 240gatgattttt ctgtcgtgtg aaaatgaagc cagcaggctg cccctagtca gtccttcctt 300ccagagaaaa agagatttga gaaagtga 328338320DNAHomo sapiens 338tttttttttt tttttttttt ctttttcact ttctcaaatc tctttttctc tggaaggaag 60gactgactag gggcagcctg ctggcttcat tttcacacga caaaaaaatc atcgtattaa 120ggatgggtgc ttccaaaacc agtgggtaca ccctatgggg gggacaagga gggaggaaga 180gacagcctct acaattgtcc acctacccag ctgtcagctg agaaaaatgc taaacggaca 240tcacagccac acaacgaatc tgcccgttcc cctctcctca gtcttcctgc tgttaccagg 300gtgagaggtg taatggaagg 320339321DNAHomo sapiens 339tttttttttt tttttttttt ctttttcact ttcccaaatc tctttttctc tggaaggaag 60gactgactag gggcagcctg ctggcttcat tttcacacga cagaaaaatc atcgtattaa 120ggatgggtgc ttccaagacc agtgggtaca ccctatgggg tggacacagg agggaggaag 180agacagcctc tacaattgtc cacctaccca gctgtcagct gagaaaaatg ctaaacggac 240atcacagcca cacaacgaat ctgcccgttc ccctctcctc agtcttcctg ctgttaccag 300ggtgagaggt gtaatggaag g 321340354DNAHomo sapiens 340gcggccgcgg cccaccacca actgctcgcc accgacccca ctactcgcca ccgacccgct 60gctcggagct tcggttctgc gggttgtcca gacttcaggc ctgtgcgctc aatcttggag 120aatgcgccgg caggcccccc acccccagcc taaggtgcag gaaggaccag cacgaacccg 180ctggctttgc tgcgcggcca ggagatgagt cccaccgggc actgagccca ggtacaggac 240atcagagaat gaacacagag gcagaggccc tcatgtccct ctcagagtcc cggctctgca 300aagagcccgt ctgtctccag cttccagaat tccgcactgt gaatctgtct acgt 354341448DNAHomo sapiens 341cacgcgtcga tcccagtgaa gtagatgttt gtagccttgc atacttagtc cttcccaggc 60acaaacggag tggcagagtg gtgccaaccc tgttttccca gtccacgtag acagattcac 120agtgcggaat tctggaagct ggagacagac gggctctttg cagagccggg actctgagag 180ggacatgagg gcctctgcct ctgtgttcat tctctgatgt cctgtacctg ggctcagtgc 240ccggtgggac tcatctcctg gccgcgcagc aaagccagcg ggttcgtgct ggtccttcct 300gcaccttagg ctgggggtgg ggggcctgcc ggcgcattct ccacgattga gcgcacaggc 360ctgaagtctg gacaacccgc agaaccgaag ctccgagcag cgggtcggtg gcgagtagtg 420gggtcggtgg cgagcagttg gtggtggg 448342223DNAHomo sapiens 342tcgacctcgc caaggtgaag aacaacgcta ccccttaaga gatctccttg cctgggtggg 60aggagcgaaa gtgggggtgt cctggggaga ccaggaacct gccaagccca ggctggggcc 120aaggactctg ctgagaggcc cctagagaca acacccttcc caggccactg gctgctggac 180tgttcctcag gagcggcctg ggtacccagt atgtgcaggg aga 223343157DNAHomo sapiens 343ttttttactg gttatcgtgg ttattgccac tgtcaggatg aatgattatg actgggccag 60gttctttggg aaccctggtg gagtgggctg tcacatgggg ttccgtctcc ctgcacatac 120tgggtaccca ggccgctcct gaggaacagt ccagcag 157344344DNAHomo sapiens 344ggcccaccac caactgctcg ccaccgaccc cactactcgc caccgacccg ctgctcggag 60cttcggttct gcgggttgtc cagacttcag gcctgtgcgc tcaatcgtgg agaatgcgcc 120ggcaggcccc ccacccccag cctaaggtgc aggaaggacc agcacgaacc cgctggcttt 180gctgcgcggc caggagatga gtcccaccgg gcactgagcc caggtacagg acatcagaga 240atgaacacag aggcagaggc cctcatgtcc ctctcagagt cccggctctg caaagagccc 300gtctgtctcc agcttccaga attccgcact gtgaacctcg tgcc 344345344DNAHomo sapiens 345ggcacgaggt tcacagtgcg gaattctgga agctggagac agacgggctc tttgcagagc 60cgggactctg agagggacat gagggcctct gcctctgtgt tcattctctg atgtcctgta 120cctgggctca gtgcccggtg ggactcatct cctggccgcg cagcaaagcc agcgggttcg 180tgctggtcct tcctgcacct taggctgggg gtggggggcc tgccggcgca ttctccacga 240ttgagcgcac aggcctgaag tctggacaac ccgcagaacc gaagctccga gcagcgggtc 300ggtggcgagt agtggggtcg gtggcgagca gttggtggtg ggcc 344346305DNAHomo sapiens 346gctgctcgga gcttcggttc tgcgggttgt ccagacttca ggcctgtgcg ctcaatcgtg 60gagaatgcgc cggcagcccc cacccccagc ctaaggtgca ggaaggacca gcacgaaccc 120gctggctttg ctgcgcggcc aggagatgag tcccaccggc actgagccag gtacaggaca 180tcagagaatg aacacagagg cagaggcctc atgtccctct cagagtcccg gctctgcaaa 240gagccgtact gtctccagct tccagaattc cgcactgtga atctgtctac gtggactggg 300aaaac 305347687DNAHomo sapiens 347cacgaggatt ttctatctag agctctgtag agcactttag aaaccgcttt catgaattga 60gctaattatg aataaatttg gaaggcgatc cctttgcagg gaagctttct ctcagacccc 120cttccattac acctctcacc ctggtaacag caggaagact gaggagaggg gaacgggcag 180attcgttgtg tggctgtgat gtccgtttag catttttctc agctgacagc tgggtaggtg 240gacaattgta gaggctgtct cttcctccct ccttgtccac cccatagggt gtacccactg 300gtcttggaaa cacccatcct taatacgatg atttttctgt cgtgtgaaaa tgaagccagc 360aggctgcccc tagtcagtcc ttccttccag agaaaaagag atttgagaaa gtgcctgggt 420aattcaccat taatttcctc ccccaaactc tctgagtctt cccttaatat ttctggtggt 480tctgaccaaa gcaggtcatg gtttgttgag catttgggat cccagtgaag tagatgtttg 540tagccttgca tacttagccc ttcccaggca caaacggagt ggcagagtgg tgccaaccct 600gttttcccag tccacgtaga cagattcaca gtgcggaatt ctggaagctg gagacagacg 660ggctctttgc agagccggga ctctgag 687348687DNAHomo sapiensmisc_feature(17)..(17)a or g or c or t/umisc_feature(42)..(42)a or g or c or t/u 348cacgaggatt ttctatncta gagctctggt agagcacttt anaaaccgct ttcatgaatt 60gagctaatta tgaataaatt tggaaggcga tccctttgca gggaagcttt ctctcagacc 120cccttccatt acacctctca ccctggtaac agcaggaaga ctgaggagag gggaacgggc 180agattcgttg tgtggctgtg atgtccgttt agcatttttc tcagctgaca gctgggtagg 240tggacaattg tagaggctgt ctcttcctcc ctccttgtcc accccatagg gtgtacccac 300tggtcttgga aacacccatc cttaatacga tgatttttct gtcgtgtgaa aatgaagcca 360gcaggctgcc cctagtcagt ccttccttcc agagaaaaag agattgagaa agtgcctggg 420taattcacca ttaatttcct cccccaaact ctctgagtct tcccttaata tttctggtgg 480ttctgaccaa agcaggtcat ggtttgttga gcatttggga tcccagtgaa gtagatgttt 540gtagccttgc atacttagcc cttcccaggc acaaacggag tggcagagtg gtgccaaccc 600tgttttccca gtccacgtag acagattcac agtgcggaat tctggaagct ggagacagac 660gggctctttg cagagccggg actctga 687349473DNAHomo sapiens 349cacgagggaa gccagcaggc tgcccctagt cagtccttcc ttccagagaa aaagagattt 60gagaaagtgc ctgggtaatt caccattaat ttcctccccc aaactctctg agtcttccct 120taatatttct ggtggttctg accaaagcag gtcatggttt gttgagcatt tgggatccca 180gtgaagtaga tgtttgtagc cttgcatact tagcccttcc caggcacaaa cggagtggca 240gagtggtgcc aaccctgttt tcccagtcca cgtagacaga ttcacagtgc ggaattctgg 300aagctggaga cagacgggct ctttgcagag ccgggactct gagagggaca tgagggcctc 360tgcctctgtg ttcattctct gatgtcctgt acctgggctc agtgcccggt gggactcatc 420tcctgggcgc gcagcaaagc cagcgggttc gtgctggtcc ttcctgcacc tta 473350514DNAHomo sapiens 350cacgaggcct ggtaacagca ggaagactga ggagagggga acgggcagat tcgttgtgtg 60gctgtgatgt ccgtttagca tttttctcag ctgacagctg ggtaggtgga caattgtaga 120ggctgtctct tcctccctcc ttgtccaccc catagggtgt acccactggt cttggaaaca 180cccatcctta atacgatgat ttttctgtcg tgtgaaaatg aagccagcag gctgccccta 240gtcagtcctt ccttccagag aaaaagagat ttgagaaagt gcctgggtaa ttcaccatta 300atttcctccc ccaaactctc tgagtcttcc cttaatattt ctggtggttc tgaccaaagc 360aggtcatggt ttgttgagca tttgggatcc cagtgaagta gatgtttgta gccttgcata 420cttagccctt cccaggcaca aacggagtgg cagagtggtg ccaaccctgt tttcccagtc 480cacgtagaca gattcacagt gcggaattct ggaa 514351477DNAHomo sapiens 351cacgaggtct tcccttaata tttctggtgg ttctgaccaa agcaggtcat ggtttgttga 60gcatttggga tcccagtgaa gtagatgttt gtagccttgc atacttagcc cttcccaggc 120acaaacggag tggcagagtg gtgccaaccc tgttttccca gtccacgtag acagattcac 180agtgcggaat tctggaagct ggagacagac gggctctttg cagagccggg actctgagag 240ggacatgagg gcctctgcct ctgtgttcat tctctgatgt cctgtacctg ggctcagtgc 300ccggtgggac tcatctcctg gccgcgcagc aaagccagcg ggttcgtgct ggtccttcct 360gcaccttagg ctgggggtgg ggggcctgcc ggcgcattct ccacgattga gcgcacaggc 420ctgaagtctg gacaacccgc agaaccgaag ctccgagcag cgggtcggtg gcgagta 477352355DNAHomo sapiens 352cacgaggatt tctggtggtt ctgaccaaag caggtcatgg tttgttgagc atttgggatc 60ccagtgaagt agatgtttgt agccttgcat acttagccct tcccaggcac aaacggagtg 120gcagagtggt gccaaccctg ttttcccagt ccacgtagac agattcacag tgcggaattc 180tggaagctgg agacagacgg gctctttgca gagccgggac tctgagaggg acatgagggc 240ctctgcctct gtgttcattc tctgatgtcc tgtacctggg ctcagtgccc ggtgggactc 300atctcctggc cgcgcagcaa agccagcggg ttcgtgctgg tccttcctgc acctt 355353490DNAHomo sapiens 353cacgaggaag gcgatccctt tgcagggaag ctttctctca gacccccttc cattacacct 60ctcaccctgg taacagcagg aagactgagg agaggggaac gggcagattc gttgtgtggc 120tgtgatgtcc gtttagcatt tttctcagct gacagctggg taggtggaca attgtagagg 180ctgtctcttc ctccctcctt gtccacccca tagggtgtac ccactggtct tggaaacacc 240catccttaat acgatgattt ttctgtcgtg tgaaaatgaa gccagcaggc tgcccctagt 300cagtccttcc ttccagagaa aaagagattt gagaaagtgc ctgggtaatt caccattaat 360ttcctccccc aaactctctg agtcttccct taatatttct ggtggttctg accaaagcag 420gtcatggttt gttgagcatt tgggatccca gtgaagtaga tgtttgtagc cttgcatact 480tagcccttcc 490354403DNAHomo sapiens 354cacgaggtgg attcccccgg cctgggtggg gagagcgagc tgggtgcccc ctagattccc 60cgcccccgca cctcatgagc cgaccctcgg ctccatggag cccggcaatt atgccacctt 120ggatggagcc aaggatatcg aaggcttgct gggagcggga ggggggcgga atctggtcgc 180ccactcccct ctgagcagcc acccagcggc gcctacgctg atgcctgctg tcaactatgc 240ccccttggat ctgccaggct cggcggagcc gccaaagcaa tgccacccat gccctggggt 300gccccagggg acgtccccag ctcccgtgcc ttatggttac tttggaggcg ggtactactc 360ctgccgagtg tcgcggagct cgctgaaacc ctgtgcccag gca 403355562DNAHomo sapiensmisc_feature(533)..(533)a or g or c or t/u 355cacgaggatt ttctatctag agctctgtag agcactttag aaaccgcttt catgaattga 60gctaattatg aataaatttg gaaggcgatc cctttgcagg gaagctttct ctcagacccc 120cttccattac acctctcacc ctggtaacag caggaagact gaggagaggg gaacgggcag 180attcgttgtg tggctgtgat gtccgtttag catttttctc agctgacagc tgggtaggtg 240gacaattgta gaggctgtct cttcctccct ccttgtccac cccatagggt gtacccactg 300gtcttggaaa cacccatcct taatacgatg atttttctgt cgtgtgaaaa tgaagccagc 360aggctgcccc tagtcagtcc ttccttccag agaaaaagag atttgagaaa gtgcctgggt 420aattcaccat taatttcctc ccccaaactc tctgagtctt cccttaatat ttctggtggt 480tctgaccaaa gcaggtcatg gtttgttgag catttgggat cccagtgaag tanatgtttg 540tagccttgca tacttagccc tt 562356463DNAHomo sapiens 356cattttcaca cgactgtaaa atcatcgtat taaggatggg tgcttccaag accagtgggt 60acaccctatg gggtggacaa ggagggagga agagacagcc tctacaattg tccacctacc 120cagctgtcag ctgagaaaaa tgctaaacgg acatcacagc cacacaacga atctgcccgt 180tcccctctcc tcagtcttcc tgctgttacc agggtgagag gtgtaatgga agggggtctg 240agagaaagct tccctgcaaa gggatcgcct tccaaattta ttcataatta gctcaattca 300tgaaagcggt ttctaaagtg ctctacagag ctctagatag aaaatatgag gctaacgatc 360atggcagcta gtactggtta tcgtgattat tgccactgtc aggatgaatg attatgactg 420ggccaggttc tttgggaacc ctggtggagt gggctgtcac atg 463357198DNAHomo sapiens 357tgcagctagt actggttatc gtgattattg ccactgtcag gatgaatgat tatgactggg 60ccaggttctt tgggaaccct ggtggagtgg gctgtcacat ggggttccgt ctccctgcac 120atactgggta cccaggccgc tcctgaggaa cagtccagca cagggtttca gcgagctccg 180ggacactcgg cctcgtgc 198358320DNAHomo sapiens 358tttttttttt tttttttttt ctttttcact ttctcaaatc tctttttctc tggaaggaag 60gactgactag gggcagcctg ctggcttcat tttcacacca caaaaaaatc atcgtattaa 120ggatgggtgc ttccaaaacc agtgggtaca ccctatgggg tggacaagga gggaggaaaa 180aacagcctct acaattgtcc acctacccag ctgtcagctg aaaaaaatgc taaacggaca 240tcacagccac acaacgaatc tgcccgttcc cctctcctca gtcttcctgc tgttaccagg 300gtgaaaggtg taatggaagg 320359421DNAHomo sapiens 359accgacccca ctacttgcca ccgacccgct gctcggagct tcggttctgc gggttgtcca 60gacttcaggc ctgtgcgctc aatcgtggag aatgcgccgg caggcccccc acccccagcc 120taaggtgcag gaaggaccag cacgaacccg ctggctttgc tgcgcggcca ggagatgagt 180cccaccgggc actgagccca ggtacaggac atcagagaat gaacacagag gcagaggccc 240tcatgtccct ctcagagtcc cggctctgca aagagcccgt ctgtctccag cttccagaat 300tccgcactgt gaatctgtct acgtggactg ggaaaacagg gttggcacca ctctgccact 360ccgtttgtgc ctgggaaggg ctaagtatgc aaggctacaa acatctactt cactgggatc 420c 421360272DNAHomo sapiens 360tttttttttt tttttccctg caaagggatc gccttccaaa tttattcata attagctcaa 60ttcatgaaag cggtttctaa agtgctctac agagctctag atagaaaata tgaggctaac 120gatcatggca gctagtactg gttatcgtga ttattgccac tgtcaggatg aatgattatg 180actgggccag gttctttggg aaccctggtg gagtgggctg tcacatgggg ttccgtctcc 240ctgcacatac tgggtaccca ggccgctcct ga 272361679DNAHomo sapiens 361cacgaggcga cttgcgagct gggagcgatt taaaacgctt tggattcccc ggcctgggtg 60gggagagcga gctgggtgcc ccctagattc cccgcccccg cacctcatga gccgaccctc 120ggctccatgg agcccggcaa ttatgccacc ttggatggag ccaaggatat cgaaggcttg 180ctgggagcgg gaggggggcg gaatctggtc gcccactccc ctctgaccag ccacccagcg 240gcgcctacgc tgatgcctgc tgtcaactat gcccccttgg atctgccagg ctcggcggag 300ccgccaaagc aatgccaccc atgccctggg gtgccccagg ggacgtcccc agctcccgtg 360ccttatggtt actttggagg cgggtactac tcctgccgag tgtcccggag ctcgctgaaa 420ccctgtgccc aggcagccac cctggccgcg taccccgcgg agactcccac ggccggggaa 480gagtacccca gccgccccac tgagtttgcc ttctatccgg gatatccggg aacctaccag 540cctatggcca gttacctgga cgtgtctgtg gtgcagactc tgggtgctcc tggagaacgc 600gacatgactc cctgttgcct gtggacagtt accagtcttg ggctctcgct ggtggctgga 660acagccagat gtgttgcca 679362393DNAHomo sapiens 362gcggccgcgg cccaccacca actgctcgcc attcgacccc actactcgcc accgacccgc 60tgctcggagc ttcggttctg cgggttgtcc agacttcagg cctgtgcgct caatcgtgga 120gaatgcgccg gcaggccccc cacccccagc ctaaggtgca ggaaggacca gcacgaaccc 180gctggctttg ctgcgcggcc aggagatgag tcccaccggg cactgagccc aggtacagga 240catcagagaa tgaacacaga ggcagaggcc ctcatgtccc tctcagagtc ccggctctgc 300aaagagcccg tctgtctcca gcttccagaa ttccgcactg tgaatctgtc tacgtggact 360gggaaaacag ggttggcacc actctgccac tcc 393363504DNAHomo sapiensmisc_feature(479)..(479)a or g or c or t/u 363gcggccgcgg cccaccacca actgctcgcc accgacccca ctactcgcca ccgacccgct 60gctcggagct tcggttctgc gggttgtcca gacttcaggc ctgtgcgctc aatcgtggag 120aatgcgccgg caggcccccc acccccagcc taaggtgcag gaaggaccag cacgaacccg 180ctggctttgc tgcgcggcca ggagatgagt cccaccgggc actgagccca ggtacaggac 240atcagagaat gaacacagag gcagaggccc tcatgtccct ctcagagtcc cggctctgca 300aagagcccgt ctgtctccag cttccagaat tccgcactgt gaatctgtct acgtggactg 360ggaaaacagg gttggcacca ctctgccact ccgtttgtgc ctgggaaggg ctaagtatgc 420aaggctacaa acatctactt cactgggatc ccaaatgctc aacaaaccat gacctgctnt 480ggtcagaacc accagaaata ttaa 504364451DNAHomo sapiens 364gcggccgcgg cccaccacca actgctcgcc accgacccca ctactcgcca ccgacccgct 60gctcggagct tcggttctgc gggttgtcca gacttcaggc ctgtgcgctc aatcgtggag 120aatgcgccgg caggcccccc acccccagcc taaggtgcag

gaaggaccag cacgaacccg 180ctggctttgc tgcgcggcca ggagatgagt cccaccgggc actgagccca ggtacaggac 240atcagagaat gaacacagag gcagaggccc tcatgtccct ctcagagtcc cggctctgca 300aagagcccgt ctgtctccag cttccagaat tccgcactgt gaatctgtct acgtggactg 360ggaaaacagg gttggcacca ctctgccact ccgtttgtgc ctgggaaggg ctaagtatgc 420aaggctacaa acatctactt cactgggatc c 451365219DNAHomo sapiens 365tcctccctct aagaaaggcg caagcgtcaa gagggtgctg cccgctggtt tctgcaaatg 60ctgccttcca aaaaggacct ggtgggttct gttctccctg gcaacacatc tggctgttcc 120agccaccagc gagagcccaa gactggtaac tgtccacagg caacagggag tcatgtcgcg 180gttctccagg agcacccaga gtctgcacca cagacacgt 219366699DNAHomo sapiens 366ttaatacgat gatttttctg tcgtgtgaaa atgaagccag caggctgccc ctagtcagtc 60cttccttcca gagaaaaaga gatttgagaa agtgcctggg taattcacca ttaatttcct 120cccccaaact ctctgagtct tcccttaata tttctggtgg ttctgaccaa agcaggtcat 180ggtttgttga gcatttggga tcccagtgaa gtagatgttt gtagccttgc atacttagcc 240cttcccaggc acaaacggag tggcagagtg gtgccaaccc tgttttccca gtccacgtag 300acagattcac agtgcggaat tctggaagct ggagacagac gggctctttg cagagccggg 360actctgagag ggacatgagg gcctctgcct ctgtgttcat tctctgatgt cctgtacctg 420ggctcagtgc ccggtgggac tcatctcctg gccgcgcagc aaagccagcg ggttcgtgct 480ggtccttcct gcaccttagg ctgggggtgg ggggcctgcc ggcgcattct ccacgattga 540gcgcacaggc ctgaagtctg gacaacccgc agaaccgaag ctccgagcag cgggtcggtg 600gcgagtagtg ggggtcggtg gcgaacaagt ggtggtgggc cggggccgca taactcgagg 660actttcctcc cggagcagtc cctaaaaacc cgggggcgc 699367575DNAHomo sapiens 367gacgaggaca attgtagagg ctgtctcttc ctccctcctt gtcaccccat agggtgtacc 60actggtcttg gaagcaccca tccttaatac gatgattttt ctgtcgtgtg aaaatgaagc 120cagcaggctg cccctagtca gtccttcctt ccagagaaaa agagatttga gaaagtgcct 180gggtaattca ccattaattt cctcccccaa actctctgag tcttccctta atatttctgg 240tggttctgac caaagcaggt catggtttgt tgagcatttg ggatcccagt gaagtagatg 300tttgtagcct tgcatactta gcccttccca ggcacaaacg gagtggcaga gtggtgccaa 360ccctgttttc ccagtccacg tagacagatt cacagtgcgg aattctggaa gctggagaca 420gacgggctct ttgcagagcc gggactctga gagggacatg agggcctctg cctctgtgtt 480cattctctga tgtcctgtac ctgggctcag tgcccggtgg gactcatctc ctggccgcgc 540agcaaagcca gcgggttcgt gctggtcctt cctgc 575368684DNAHomo sapiens 368cacgaggcga cttgcgagct gggagcgatt taaaacgctt tggattcccc cggcctgggt 60ggggagagcg agctgggtgc cccctagatt ccccgccccc gcacctcatg agccgaccct 120cggctccatg gagcccggca attatgccac cttggatgga gccaaggata tcgaaggctt 180gctgggagcg ggaggggggc ggaatctggt cgcccactcc cctctgacca gccacccagc 240ggcgcctacg ctgatgcctg ctgtcaacta tgcccccttg gatctgccag gctcggcgga 300gccgccaaag caatgccacc catgccctgg ggtgccccag gggacgtccc cagctcccgt 360gccttatggt tactttggag gcgggtacta ctcctgccga gtgtcccgga gctcgctgaa 420accctgtgcc caggcagcca ccctggccgc gtaccccgcg gagactccca cggccgggga 480agagtaccca gccgccccac tgagtttgcc ttctatccgg gatatccggg aacctaccag 540cctatggcca gttacctgga cgtgtctgtg gtgcagactc tgggtgctcc tggagaacgc 600gacatgactc cctgttgcct gtggacagtt accaatcttg ggctctcgct ggtggctgga 660acagccagat gtgttgccag ggag 684369855DNAHomo sapiens 369atggagcccg gcaattatgc caccttggat ggagccaagg atatcgaagg cttgctggga 60gcgggagggg ggcggaatct ggtcgcccac tcccctctga ccagccaccc agcggcgcct 120acgctgatgc ctgctgtcaa ctatgccccc ttggatctgc caggctcggc ggagccgcca 180aagcaatgcc acccatgccc tggggtgccc caggggacgt ccccagctcc cgtgccttat 240ggttactttg gaggcgggta ctactcctgc cgagtgtccc ggagctcgct gaaaccctgt 300gcccaggcag ccaccctggc cgcgtacccc gcggagactc ccacggccgg ggaagagtac 360cccagccgcc ccactgagtt tgccttctat ccgggatatc cgggaaccta ccagcctatg 420gccagttacc tggacgtgtc tgtggtgcag actctgggtg ctcctggaga accgcgacat 480gactccctgt tgcctgtgga cagttaccag tcttgggctc tcgctggtgg ctggaacagc 540cagatgtgtt gccagggaga acagaaccca ccaggtccct tttggaaggc agcatttgca 600gactccagcg ggcagcaccc tcctgacgcc tgcgcctttc gtcgcggccg caagaaacgc 660attccgtaca gcaaggggca gttgcgggag ctggagcggg agtatgcggc taacaagttc 720atcaccaagg acaagaggcg caagatctcg gcagccacca gcctctcgga gcgccagatt 780accatctggt ttcagaaccg ccgggtcaaa gagaagaagg ttctcgccaa ggtgaagaac 840agcgctaccc cttag 8553701356DNAHomo sapiens 370ggattccccc ggcctgggtg gggagagcga gctgggtgcc ccctagattc cccgcccccg 60cacctcatga gccgaccctc ggctccatgg agcccggcaa ttatgccacc ttggatggag 120ccaaggatat cgaaggcttg ctgggagcgg gaggggggcg gaatctggtc gcccactccc 180ctctgaccag ccacccagcg gcgcctacgc tgatgcctgc tgtcaactat gcccccttgg 240atctgccagg ctcggcggag ccgccaaagc aatgccaccc atgccctggg gtgccccagg 300ggacgtcccc agctcccgtg ccttatggtt actttggagg cgggtactac tcctgccgag 360tgtcccggag ctcgctgaaa ccctgtgccc aggcagccac cctggccgcg taccccgcgg 420agactcccac ggccggggaa gagtacccca gccgccccac tgagtttgcc ttctatccgg 480gatatccggg aacctaccag cctatggcca gttacctgga cgtgtctgtg gtgcagactc 540tgggtgctcc tggagaaccg cgacatgact ccctgttgcc tgtggacagt taccagtctt 600gggctctcgc tggtggctgg aacagccaga tgtgttgcca gggagaacag aacccaccag 660gtcccttttg gaaggcagca tttgcagact ccagcgggca gcaccctcct gacgcctgcg 720cctttcgtcg cggccgcaag aaacgcattc cgtacagcaa ggggcagttg cgggagctgg 780agcgggagta tgcggctaac aagttcatca ccaaggacaa gaggcgcaag atctcggcag 840ccaccagcct ctcggagcgc cagattacca tctggtttca gaaccgccgg gtcaaagaga 900agaaggttct cgccaaggtg aagaacagcg ctacccctta agagatctcc ttgcctgggt 960gggaggagcg aaagtggggg tgtcctgggg agaccaggaa cctgccaagc ccaggctggg 1020gccaaggact ctgctgagag gcccctagag acaacaccct tcccaggcca ctggctgctg 1080gactgttcct caggagcggc ctgggtaccc agtatgtgca gggagacgga accccatgtg 1140acagcccact ccaccagggt tcccaaagaa cctggcccag tcataatcat tcatcctgac 1200agtggcaata atcacgataa ccagtactag ctgccatgat cgttagcctc atattttcta 1260tctagagctc tgtagagcac tttagaaacc gctttcatga attgagctaa ttatgaataa 1320atttggaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 13563711026DNAHomo sapiens 371cgggtgcccc ctagattccc cgcccccgca cctcatgagc cgaccctcgg ctccatggag 60cccggcaatt atgccacctt ggatggagcc aaggatatcg aaggcttgct gggagcggga 120ggggggcgga atctggtcgc ccactcccct ctgaccagcc acccagcggc gcctacgctg 180atgcctgctg tcaactatgc ccccttggat ctgccaggct cggcggagcc gccaaagcaa 240tgccacccat gccctggggt gccccagggg acgtccccag ctcccgtgcc ttatggttac 300tttggaggcg ggtactactc ctgccgagtg tcccggagct cgctgaaacc ctgtgcccag 360gcagccaccc tggccgcgta ccccgcggag actcccacgg ccggggaaga gtaccccagc 420cgccccactg agtttgcctt ctatccggga tatccgggaa cctaccacgc tatggccagt 480tacctggacg tgtctgtggt gcagactctg ggtgctcctg gagaaccgcg acatgactcc 540ctgttgcctg tggacagtta ccagtcttgg gctctcgctg gtggctggaa cagccagatg 600tgttgccagg gagaacagaa cccaccaggt cccttttgga aggcagcatt tgcagactcc 660agcgggcagc accctcctga cgcctccgcc tttcgtcgcg gccgcaagaa acgcattccg 720tacagcaagg ggcagttgcg ggagctggag cgggagtatg cggctaacaa gttcatcacc 780aaggacaaga ggcgcaagat ctcggcagcc accagcctct cggagcgcca gattaccatc 840tggtttcaga accgccgggt caaagagaag aaggttctcg ccaaggtgaa gaacagcgct 900accccttaag agatctcctt gcctgggtgg gaggagcgaa agtgggggtg tcctggggag 960accaggaacc tgccaagccc aggctggggc caaggactct gctgagaggc ccctagagac 1020aacacc 10263721316DNAHomo sapiens 372tcctaatacg actcactata gggctcgagc ggccgcccgg gcaggtcgaa tgcaggcgac 60ttgcgagctg ggagcgattt aaaacgcttt ggattccccc ggcctgggtg gggagagcga 120gctgggtgcc ccctagattc cccgcccccg cacctcatga gccgaccctc ggctccatgg 180agcccggcaa ttatgccacc ttggatggag ccaaggatat cgaaggcttg ctgggagcgg 240gaggggggcg gaatctggtc gcccactccc ctctgaccag ccacccagcg gcgcctacgc 300tgatgcctgc tgtcaactat gcccccttgg atctgccagg ctcggcggag ccgccaaagc 360aatgccaccc atgccctggg gtgccccagg ggacgtcccc agctcccgtg ccttatggtt 420actttggagg cgggtactac tcctgccgag tgtcccggag ctcgctgaaa ccctgtgccc 480aggcagccac cctggccgcg taccccgcgg agactcccac ggccggggaa gagtacccca 540gtcgccccac tgagtttgcc ttctatccgg gatatccggg aacctaccac gctatggcca 600gttacctgga cgtgtctgtg gtgcagactc tgggtgctcc tggagaaccg cgacatgact 660ccctgttgcc tgtggacagt taccagtctt gggctctcgc tggtggctgg aacagccaga 720tgtgttgcca gggagaacag aacccaccag gtcccttttg gaaggcagca tttgcagact 780ccagcgggca gcaccctcct gacgcctgcg cctttcgtcg cggccgcaag aaacgcattc 840cgtacagcaa ggggcagttg cgggagctgg agcgggagta tgcggctaac aagttcatca 900ccaaggacaa gaggcgcaag atctcggcag ccaccagcct ctcggagcgc cagattacca 960tctggtttca gaaccgccgg gtcaaagaga agaaggttct cgccaaggtg aagaacagcg 1020ctacccctta agagatctcc ttgcctgggt gggaggagcg aaagtggggg tgtcctgggg 1080agaccagaaa cctgccaagc ccaggctggg gccaaggact ctgctgagag gcccctagag 1140acaacaccct tcccaggcca ctggctgctg gactgttcct caggagcggc ctgggtaccc 1200agtatgtgca gggagacgga accccatgtg acaggcccac tccaccaggg ttcccaaaga 1260acctggccca gtcataatca ttcatcctca cagtggcaat aatcacgata accagt 1316373506DNAHomo sapiens 373atttttctgt cgtgtgaaaa tgaagccagc aggctgcccc tagtcagtcc ttccttccag 60agaaaaagag atttgagaaa gtgcctgggt aattcaccat taatttcctc ccccaaactc 120tctgagtctt cccttaatat ttctggtggt tctgaccaaa gcaggtcatg gtttgttgag 180catttgggat cccagtgaag tagatgtttg tagccttgca tacttagccc ttcccaggca 240caaacggagt ggcagagtgg tgccaaccct gttttcccag tccacgtaga cagattcaca 300gtgcggaatt ctggaagctg gagacagacg ggctctttgc agagccggga ctctgagagg 360gacatgaggg cctctgcctc tgtgttcatt ctctgatgtc ctgtacctgg gctcagtgcc 420cggtgggact catctcctgg ctgcgcagca aagccagcgg gttcgtgctg gtccttcctg 480caccttaggc tgggggtggg gggcct 506374597DNAHomo sapiens 374atttttctgt cgtgtgaaaa tgaagccagc aggctgcccc tagtcagtcc ttccttccag 60agaaaaagag atttgagaaa gtgcctgggt aattcaccat taatttcctc ccccaaactc 120tctgagtctt cccttaatat ttctggtggt tctgaccaaa gcaggtcatg gtttgttgag 180catttgggat cccagtgaag tagatgtttg tagccttgca tacttagccc ttcccaggca 240caaacggagt ggcagagtgg tgccaaccct gttttcccag tccacgtaga cagattcaca 300gtgcggaatt ctggaagctg gagacagacg ggctctttgc agagccggga ctctgagagg 360gacatgaggg cctctgcctc tgtgttcatt ctctgatgtc ctgtacctgg gctcagtgcc 420cggtgggact catctcctgg ctgcgcagca aagccagcgg gttcgtgctg gtccttcctg 480caccttaggc tgggggtggg gggggcctgc cggcgcattc tccacgattg agcgcacagg 540cctgaagtct ggacaacccg cagaaccgaa gctccgagca gcgggtcggt ggcgagt 597375300DNAHomo sapiens 375atttaaaacg ctttggattc tttcgtcctg cgtggggaga gcgagctggg tgccccctag 60attccccgcc cccgcacctc atgagccgac cctcggctcc atggagcccg gcacttatgc 120caccttggat ggagccaagg atatcgaagg cttgctggga gcgggagggg ggcggaatct 180ggtcgcccac tcccctctga ccagccaccc agcggcgcct acgctgatgc ctgctgtcaa 240ttatgccccc ttgcatctgc caggctcggc ggagccgcca aagcaatgcc acccatgccc 300376508DNAHomo sapiens 376atttttctgt cgtgtgaaaa tgaagccagc aggctgcccc tagtcagtcc ttccttccag 60agaaaaagag atttgagaaa gtgcctgggt aattcaccat taatttcctc ccccaaactc 120tctgagtctt cccttaatat ttctggtggt tctgaccaaa gcaggtcatg gtttgttgag 180catttgggat cccagtgaag tagatgtttg tagccttgca tacttagccc ttcccaggca 240caaacggagt ggcagagtgg tgccaaccct gttttcccag tccacgtaga cagattcaca 300gtgcggaatt ctggaagctg gagacagacg ggctctttgc agagccggga ctctgagagg 360gacatgaggg cctctgcctc tgtgttcatt ctctgatgtc ctgtacctgg gctcagtgcc 420cggtgggact catctcctgg ctgcgcagca aagccagcgg gttcgtgctg gtccttcctg 480caccttaggc tgggggtggg gggcctgc 508377284DNAHomo sapiensmisc_feature(227)..(227)a or g or c or t/umisc_feature(269)..(269)a or g or c or t/u 377aggccgcacc cagtcttaag gtgcagtgaa ggacagcacg aacccgctgt gctttgctgc 60gcggcaggag atgagtccca ccgggcactg agcccaggta caggacatca gagaatgaac 120acagaggcag aggccctcat gtccctctca gagtcccggc tctgcaaaga gcccgtctgt 180ctccagcttc cagaattccg cactgtgaat ctgtctacgt ggactgngaa aacagggttg 240gcaccactct gccactccgt ttgtgcctng gggcgggcag aggg 284378651DNAHomo sapiens 378aaaaacgctt tggattcccc cggcctgggt ggggagagcg agctgggtgc cccctagatt 60ccccgccccc gcacctcatg agccgaccct cggctccatg gagcccggca attatgccac 120cttggatgga gccaaggata tcgaaggctt gctgggagcg ggaggggggc ggaatctggt 180cgcccactcc cctctgacca gccacccagc ggcgcctacg ctgatgcctg ctgtcaacta 240tgcccccttg gatctgccag gctcggcgga gccgccaaag caatgccacc catgccctgg 300ggtgccccag gggacgtccc cagctcccgt gccttatggt tactttggag gcgggtacta 360ctcctgccga gtgtcccgga gctcgctgaa accctgtgcc caggcagcca ccctggccgc 420gtaccccgcg gagactccca cggccgggga agagtacccc agccgcccca ctgagtttgc 480cttctatccg ggatatccgg gaacctacca gcctatggcc agttacctgg acgtgtctgt 540ggtgcagact ctgggtgctc ctggagaacc gcgacatgac tccctgttgc ctgtggacag 600ttaccagtct tgggctctcg ctggtggctg gaacagccag atgtgttgcc a 651379498DNAHomo sapiens 379gcagactctg ggtgctcctg gagaaccgcg acgtgactcc ctgttgcctg tggacagtta 60ccactcttgg gctctcgctg gtggctggaa cagccagatg tgttgccagg gagaacagaa 120cccaccaggt cccttttgga aggcagcatt tgcagactcc agcgggcagc accctcctga 180cgcctgcgcc tttcgtcgcg gccgcaagaa acgcattccg tacagcaagg ggcagttgcg 240ggagctggag cgggagtatg cggctaacaa gttcatcacc aaggacaaga ggcgcaagat 300ctcggcagcc accagcctct cggagcgcca gattaccatc tggtttcaga accgccgggt 360caaagagaag aaggttctcg ccaaggtgaa gaacagcgct accccttaag agatctcctt 420gcctgggtgg gaggatctaa agtgggggtg tcctggggag accaggaacc tgccaagccc 480aggctggggc caaggact 498380233DNAHomo sapiens 380acgctgcact gcgtttcaaa gagaagaagg ttctcgccaa ggtgaagaac agcgctaccc 60cttaagagat ctccttgctt gggtgggagg agcgaaagtg ggggtgtcct ggggagacca 120ggaacctgcc atcaccaggc tgggcccaag gactctgctg agaggcccct agagacaaca 180cccttcccag gccattgctt gctggactgt gcctcaggag cggcctgggt acc 233381539DNAHomo sapiens 381gagttttcca atttccaaag aaaaatttag gtttcctgca gccgtgacat atgtgtgtgc 60actgggatgg gttaatgtgt gtgtgtgtgt gtgtatgcgc atgtattggg agtgggggca 120gaaacgtgtt tccagaattt gcctgtagaa tctaaaagag tggccaagag tctggaaatg 180catgaagact ggacgtatgt gatggtgggc aaaggcctga ctgtgtgtgg tgtgtgggta 240tgtttgcaga ttcgcgggtg tgagagcagt gatgggtgag ggtggccttc aggagccaag 300gctgatcggt ggtgagagaa caagccggaa gccagggtgc tgtcctggta tgctttggag 360gaacaggatt gcacgtgcgc ctgtagggtg acctgtgtgc acctgtgaga tgacttagct 420tggggcttgc aaggcctggg tctgcatggg tgggtatctg accatgcctt ttcctccctc 480cctttcacgc cgcgcagact ccagcgggca gcaccctcct gacgcctgcg cctttcgtc 539382240DNAHomo sapiens 382ccggcctggg tggggagagc gagctgggtg ccccctagat tccccgcccc cgcacctcat 60gagccgaccc tcggctccat ggagcccggc aattatgcca ccttggatgg agccaaggat 120atcgaaggct tgctgggagc gggagggggg cggaatctgg tcgcccactc ccctctgacc 180agccacccag cggcgcctac gcttgatgcc tgcttgtcaa ctatgccccc ttggatctgc 240383469DNAHomo sapiens 383accgcgggtc aaatttattc ataattagct caatcatgaa agcggttcta aagtgctcta 60cagagctcta gatagaaaat atgaggctaa cgatcatggc agctagtact ggttatcgtg 120attatggcca ctgtcaggat gaatgataat gactgggcca ggtcctttgg aaaccctggt 180ggagtgggct gtcacatggg gtcccgtctc cctgcacata ctgggtaccc aggccgctcc 240tgaggaacag tccagcagcc agtggcctgg gaagggtgtg gtctctaggg gcctctcagc 300agagtccttg gccccagcct gggcttggca ggtccctggt ctccccagga cacccccact 360ttcgctcctc ccacccaggc aaggagatct cttaaggggt agcgctgttc ttcaccttgg 420cgagaacctt cttctctttg aaccggcggt gcggcgtggg gtaccgagc 469384472DNAHomo sapiens 384atttttctgt cgtgtgaaaa tgaagccagc aggctgcccc tagtcagtcc ttccttccag 60agaaaaagag atttgagaaa gtgcctgggt aattcaccat taatttcctc ccccaaactc 120tctgagtctt cccttaatat ttctggtggt tctgaccaaa gcaagtcatg gtttgttgag 180catttgggat cccagtgaag tagatgtttg tagccttgca tacttagccc ttcccaggca 240caaacggagt ggcagagtgg tgccaaccct gttttcccag tccacgtaga cagattcaca 300gtgcggaatt ctggaagctg gagacagacg ggctctttgc agagccggga ctctgagagg 360gacatgaagg cctctgcctc tgtgttcatt ctctgatgtc ctgtacctgg gctcagtgcc 420cggtgggact catctcctgg ctgcgcagca aagccagcgg gttcgtgctg gt 472385450DNAHomo sapiens 385ccaacgagaa gaaggttctc gcaaggtgaa gaacagcgct accccttaag agatctcctt 60gcgtgggtgg gaggagcgaa agtgggggtg tcctggggag accaggaacc tgccagccca 120ggctgaggcc aaggactctg ctgagaggcc cctagagaca acacccttcc caggccactg 180gatgctgaac tgtccctcag gagcggcctg ggtacccagt atgtgcaggg agacggaacc 240ccatgtgaca gcccactcca ccagggttcc caaagaacct ggccccagtc ataatcattc 300atcctgacag tggcaataat cacgataacc agtactagct gccatgatcg taagcctcat 360atttgctatc tagagctctg tagagcactt tagaaaccgc tttcatgaat tgagctaatt 420atgactcaat ttgaaccggc gtccggcgtg 450386472DNAHomo sapiens 386acgcgcaccg cggtcaagag aagaaggttc tcgcaaggtg aagaacagcg ctacccctta 60agagatctcc ttgcgtgggt gggaggagcg aaagtggggg tgtcctgggg agaccaggaa 120cctgccaagc ccaggctgtg gccaaggact ctgctgagag gcccctatga gacaacaccc 180ttcccaggcc actggctgct gggactgttc ctcaggagcg gcctgggtac ccgagtaatg 240tgcaggggag acggaacccc atgtgacagc ccactccacc agggttccca aaagaaccct 300ggcccagtca taatcattca tcctgacagt ggcaataatc acgataacca gtactagctg 360ccatgatcgt aagcctcata tttgctatct agagctctgt agagcccttt agaaaccgct 420ttcatgaatg gagctaaatt atgaatacat ttgaaccggc gatccgacgt ga 472387320DNAHomo sapiens 387ctagaggatc ccggaagcaa ctgcaacagg ttcccaaaga accgggccag tcataatcat 60tcatcctgac agggcaataa tcacgataac cagtactagc tgccatgatc gttagcctca 120tattttctat ctagagctct gtagagcact ttagaaaccg ctttcatgaa tggagctaat 180tatgaataaa tttggaaggc gatcccttgg cagggaagct ttctctcaga cccccttcca 240ttacacctct caccctggta acagcaggaa gactgaggag aggggaacgg gcagattcgt 300ggtgttgcag tgtgcttccg 320388459DNAHomo sapiensmisc_feature(393)..(393)a or g or c or t/umisc_feature(439)..(440)a or g or c or t/umisc_feature(443)..(443)a or g or c or t/umisc_feature(459)..(459)a or g or c or t/u 388gagcgaatgc aggcgacttg cgagctggga gcgatttaaa acgctttgga ttcccccggc 60ctgggtgggg agagcgagct gggtgccccc tagattcccc gcccccgcac ctcatgagcc 120gaccctcggc tccatggagc ccggcaatta tgccaccttg gatggagcca aggatatcga 180agacttgctg

ggagcgggag gggggcggaa tctggtcgcc cactcccctc tgaccagcca 240cccagcggcg cctacgctga tgcctgctgt caactatgcc cccttggatc tgccaggctc 300ggcggagccg ccaaagcaat gccacccatg ccctggggtg ccccagggga cgtccccagc 360tcccgtgcct tatggttact ttggaggcgg gtnctactcc tgccgagtgt cccggagctc 420gctgaaaccc tgtgcccann canccaccct ggccgcgtn 459389156DNAHomo sapiens 389ctctgcctct gtgttcattc tctgatgtcc tgtacctgtg ctcagtgccc ggtgggactc 60atctcctggc tgcgcagcaa agccagcggg ttcgtgctgg tccttcctgc accttcggct 120gggggtgggg ggcctgccgg cgcattctcc acgatt 156390467DNAHomo sapiensmisc_feature(233)..(233)a or g or c or t/u 390acgctgcacc gccggtccaa gagaagaagg ttctcgccaa ggtgaagaac agcgctaccc 60ctttaagaga tctccttgct ggggtgggag gagcgaaagt gggggtgtct ggggagacca 120ggaacctgcc agccccaggc tgggcccaag gactctgctg agaggcccct agagacaaca 180cccttcccag gccactgtct gctggactgt tcctcaggag cggcctgggt acncagtatg 240tgcagggaga cggaacccca tgtgacagcc cactccacca gggttcccaa agaacctggc 300ccagtcataa tcattcatcc tgacagtggc aataatcacg ataaccagta ctagctgcca 360tgatcgttag cctcatattt tctatctaga gctctgtaga gcactttaga aaccgctttc 420atgaattgag ctacttatga atcactttga accggcggtg cggcgtg 467391666DNAHomo sapiensmisc_feature(594)..(594)a or g or c or t/u 391gggggagagc gagctgggtg ccccctagat tccccgcccc cgcacctcat gagccgaccc 60tcggctccat ggagcccggc aattatgcca ccttggatgg agccaaggat atcgaaggct 120tgctgggagc gggagggggg cggaatctgg tcgcccactc ccctctgacc agccacccag 180cggcgcctac gctgacgcct gctgtcaact atgccccctt ggatctgcca ggctcggcgg 240agccgccaaa gcaatgccac ccatgccctg gggtgcccca ggggacgtcc ccagctcccg 300tgccttatgg ttactttgga ggcgggtact actcctgccg agtgtcccgg agctcgctga 360aaccctgtgc ccaggcagcc accctggccg cgtaccccgc ggagactccc acggccgggg 420aagagtaccc cagccgcccc actgagtttg ccttctatcc gggatatccg ggaacctacc 480agcctatggc cagttacctg gacgtgtctg tggtgcagac tctgggtgct cctggagaac 540cgcgacatga ctccctgttg cctgtggaca gttaccagtc ttgggctctc gctngtggct 600ggaacagcca gatgtgttgc cagggagaac agaacccacc aggtcccttt tggaaggcag 660catttg 666392664DNAHomo sapiens 392gctgagttct gaagcttctg agttctgcag cctcacctct gagaaaacct cttttccacc 60aataccatga agctctgcgt gactgtcctg tctctcctca tgctagtagc tgccttctgc 120tctctagcgc tctcagcacc aatgggctca gaccctccca ccgcctgctg cttttcttac 180accgcgagga agcttcctcg caactttgtg gtagattact atgagaccag cagcctctgc 240tcccagccag ctgtggtatt ccaaaccaaa agaagcaagc aagtctgtgc tgatcccagt 300gaatcctggg tccaggagta cgtgtatgac ctggaactga actgagctgc tcagagacag 360gaagtcttca gggaaggtca cctgagcccg gatgcttctc catgagacac atctcctcca 420tactcaggac tcctctccgc agttcctgtc ccttctctta atttaatctt ttttatgtgc 480cgtgttattg tattaggtgt catttccatt atttatatta gtttagccaa aggataagtg 540tcccctatgg ggatggtcca ctgtcactgt ttctctgctg ttgcaaatac atggataaca 600catttgattc tgtgtgtttt cataataaaa ctttaaaata aaatgcaaaa aaaaaaaaaa 660aaaa 6643931308DNAHomo sapiens 393gccacgtgct gctgggtctc agtcctccac ttcccgtgtc ctctggaagt tgtcaggagc 60aatgttgcgc ttgtacgtgt tggtaatggg agtttctgcc ttcacccttc agcctgcggc 120acacacaggg gctgccagaa gctgccggtt tcgtgggagg cattacaagc gggagttcag 180gctggaaggg gagcctgtag ccctgaggtg cccccaggtg ccctactggt tgtgggcctc 240tgtcagcccc cgcatcaacc tgacatggca taaaaatgac tctgctagga cggtcccagg 300agaagaagag acacggatgt gggcccagga cggtgctctg tggcttctgc cagccttgca 360ggaggactct ggcacctacg tctgcactac tagaaatgct tcttactgtg acaaaatgtc 420cattgagctc agagtttttg agaatacaga tgctttcctg ccgttcatct catacccgca 480aattttaacc ttgtcaacct ctggggtatt agtatgccct gacctgagtg aattcacccg 540tgacaaaact gacgtgaaga ttcaatggta caaggattct cttcttttgg ataaagacaa 600tgagaaattt ctaagtgtga gggggaccac tcacttactc gtacacgatg tggccctgga 660agatgctggc tattaccgct gtgtcctgac atttgcccat gaaggccagc aatacaacat 720cactaggagt attgagctac gcatcaagaa aaaaaaagaa gagaccattc ctgtgatcat 780ttcccccctc aagaccatat cagcttctct ggggtcaaga ctgacaatcc cgtgtaaggt 840gtttctggga accggcacac ccttaaccac catgctgtgg tggacggcca atgacaccca 900catagagagc gcctacccgg gaggccgcgt gaccgagggg ccacgccagg aatattcaga 960aaataatgag aactacattg aagtgccatt gatttttgat cctgtcacaa gagaggattt 1020gcacatggat tttaaatgtg ttgtccataa taccctgagt tttcagacac tacgcaccac 1080agtcaaggaa gcctcctcca cgttctcctg gggcattgtg ctggccccac tttcactggc 1140cttcttggtt ttggggggaa tatggatgca cagacggtgc aaacacagaa ctggaaaagc 1200agatggtctg actgtgctat ggcctcatca tcaagacttt caatcctatc ccaagtgaaa 1260taaatggaat gaaataattc aaacacaaaa aaaaaaaaaa aaaaaaaa 13083942110DNAHomo sapiens 394ggatccaagc tattgtcctg cccatggctt cccatctcag gacgctctct ggccgctatc 60atcccagcag tggagttcag cccactactc tgaaccagcc gcaggtggct gctatgggac 120tgaagccatg aatggtgccg gccctggccc cgccgcagcc gccccggtcc cagtcccggt 180cccggtcccg gactggcggc agttctgcga gctgcatgcg caggcggccg ccgtggactt 240tgcgcacaag ttctgccgtt tcctgcggga caacccagct tacgacacgc ccgacgccgg 300cgcctccttc tcccgccact tcgccgccaa cttcctggac gtcttcggcg aggaggtgcg 360ccgcgtgctg gtggctgggc cgacgactcg gggcgcggcc gtgagcgcag aggccatgga 420gccggagctc gcggacacct ctgcactcaa ggcggcgtcc tacggccact cgcggagctc 480ggaggacgtg tccacgcacg cggccaccaa ggcccgcgtt cgcaagggct tctcgctgcg 540caacatgagc ctgtgcgtgg tggacggcgt gcgcgacatg tggcaccggc gcgcctcgcc 600cgagcccgac gcggcagctg ccccgcgcac cgccgagccc cgcgacaagt ggacgcggcg 660cctgaggctg tcgcggacgc tggctgccaa ggtggagctg gtggacattc aacgcgaggg 720ggcgctgcgc ttcatggtgg ccgacgacgc ggccgcgggc tccgggggct cggctcagtg 780gcagaagtgc cgcctgctcc tgcgcagggc tgtggccgag gaacgcttcc gcctggagtt 840cttcgtgccg cccaaagcct ccaggcccaa ggtcagcatc ccactgtcag ccatcattga 900ggtccgcacc accatgcccc tggaaatgcc agagaaggat aacacattcg tcctcaaggt 960agagaatgga gccgaataca tcttggagac catcgactct ctgcagaagc actcgtgggt 1020agctgacatc cagggctgcg tggaccccgg tgacagtgag gaagacaccg agctctcctg 1080tacccgagga ggctgtctgg ccagccgcgt ggcctcctgc agctgtgagc tcctgactga 1140tgcagtcgac ctgccccgcc ccccagagac gacagccgtg ggtgcagtgg tgacagcccc 1200ccacagccga ggtcgagatg ccgtcagaga atccctgatc cacgtcccgc tagagacctt 1260tctgcagacc ctggaatccc cgggcggcag cggcagtgac agcaataaca caggggaaca 1320gggtgcagag acggatcccg aggctgaacc cgagctggag ctatccgact acccatggtt 1380ccacgggaca ctgtcccggg tcaaggctgc tcaactggtt ctggcagggg ggccccggaa 1440ccacggcctc ttcgtgatcc gccaaagtga gactcggcct ggggagtacg tgctgacctt 1500caacttccag ggcaaggcca agcacctgcg cctgtccctg aacggccacg gccagtgtca 1560cgtacagcat ctgtggttcc agtctgtgct tgacatgctc cgccacttcc acacacaccc 1620catcccactg gagtcagggg gctcggccga catcaccctt cgcagctatg tgcgggccca 1680ggacccccca ccagagccgg gccccacgcc ccctgccgcg cccgcgtccc cggcctgctg 1740gagcgactcg cccggccagc actacttctc cagcctcgcc gcggccgcct gcccgcctgc 1800ctcgccctcc gacgccgccg gcgcctcctc gtcttccgcc tcgtcgtcct ctgccgcgtc 1860ggggcccgcc cccccgcgcc ccgtcgaggg ccagctcagc gcgcggagcc gcagcaacag 1920cgccgagcgc ctgctggagg ccgtggccgc caccgccgcc gaggagcccc cggaggccgc 1980gcccggccgc gcgcgcgccg tggagaacca gtactccttc tactagcccg cggcgccgcc 2040cgggtgggac acgccaagct cttcagtgaa gacacgatgt tattaaaagc ctgttttagg 2100gactgcaaaa 2110395496DNAHomo sapiens 395gattccagca cgggcttcgc agactgcagg acacagaggc acgcgtgcac atcatgtctt 60ctaaggaatt tgaacactgt tgagaagact gtgtacaaga gagatgtgcc atgtcagcct 120tgcaagggac agcgtgaaaa ctacccatct ccggtcacca agttgcagga ggccaggagc 180caggagggga aaccgctcag tttgcaaaac gtcgcttcca caagcctgat ggctgaaact 240gctcactgta ccctgaaacc agctttacct acagcttctg agataaactg ctgcaactct 300gggacccacg atgcctatca cagtggctca tcaatggaac ctgccggctc ccaacccttc 360ctagggccca tgaactctct gaaaagagga acagaaatat ttctcctttt tgtaaaatct 420ttaaccttcc ctttgttctt catgtacacg ctgaactgca attcttcttc ccaaataaaa 480cattaaattt aaaaaa 496396824DNAHomo sapiens 396ggccccggag ggagagtaac ccggcccatc catccgtcgc ccggttcttg gggaactact 60ttcaggggct tcttgccgtc ccctcatcag ctctgtgcga accctctgtc ggcagccatt 120gaggagaccc tgccccctgg accctgacca catatagatt gaggccgagg agtggctgcc 180ctgtcccttt tatgacagcc cgcagaagcc ccggggtgag gcatggagga ggcaggcgac 240agctgacagg gaccctgttg gcctccagca tgtccagcca gccgggcagg atttctctgc 300ttctggctgg cagccaggaa ctgagtatga caatgttgta ctaaagaaag gcccaaagtg 360acagaggcag cagagggatg gtccaccgcc ccttggcttc tgctggtgac tcctcctggc 420cactgcatca gaagaacctc ctctgcccct tctggagccc gaggcctggc ctgtcttcgt 480tggggctgat aaattgcctc tcccagggcc tgctgggtga gtcaccatcc caaagcagga 540agggtgccct ggagagaacc accctcctcc tactcttttt ccacttcctc ctctttcttt 600ccccagctga ggaggaacct ggggcattta gggcagagga caaaaggatg tcagcaattg 660cttgggctgc ttggctatgc aagcctcctg cctgctgatg gccacttcag ggacagcctg 720ggcccaggca cccaggggga tggcggcagc ttcctgcacc tttcagattt cttggtggca 780ttaaagcatt ttcagaacaa aaaaaaaaaa aaaaaaaaaa aaaa 8243972429DNAHomo sapiens 397ggcgggcctg gacggccgcg tgctgtactg gccacgcggc cgcgtctggg gtggctcctc 60atccctcaat gccatggtct acgtccgtgg gcacgccgag gactacgagc gctggcagcg 120ccagggcgcc cgcggctggg actacgcgca ctgcctgccc tacttccgca aggcgcaggg 180ccacgagctg ggcgccagcc ggtaccgggg cgccgatggc ccgctgcggg tgtcccgggg 240caagaccaac cacccgctgc actgcgcatt cctggaggcc acgcagcagg ccggctaccc 300gctcaccgag gacatgaatg gcttccagca ggagggcttc ggctggatgg acatgaccat 360ccatgaaggc aaacggtgga gcgcggcctg tgcctacctg cacccagcac tgagccgcac 420caacctcaag gccgaggccg agacgcttgt gagcagggtg ctatttgagg gcacccgtgc 480agtgggcgtg gagtatgtta agaatggcca gagccacagg gcttatgcca gcaaggaggt 540gattctgagt ggaggtgcca tcaactctcc acagctgctc atgctctctg gcatcgggaa 600tgctgatgac ctcaagaaac tgggcatccc tgtggtgtgc cacctacctg gggttggcca 660gaacctgcaa gaccacctgg agatctacat tcagcaggca tgcacccgcc ctatcaccct 720ccattcagca cagaagcccc tgcggaaggt ctgcattggt ctggagtggc tctggaaatt 780cacaggggag ggagccactg cccatctgga aacaggtggg ttcatccgca gccagcctgg 840ggtcccccac ccggacatcc agttccattt cctgccatcc caagtgattg accacgggcg 900ggtccccacc cagcaggagg cttaccaggt acatgtgggg cccatgcggg gcacgagtgt 960gggctggctc aaactgagaa gtgccaatcc ccaagaccac cctgtgatcc agcccaacta 1020cttgtcaaca gaaactgata ttgaggattt ccgtctgtgt gtgaagctca ccagagaaat 1080ttttgcacag gaagccctgg ctccgttccg agggaaagag ctccagccag gaagccacat 1140tcagtcagat aaagagatag atgcctttgt gcgggcaaaa gccgacagcg cctaccaccc 1200ctcgtgcacc tgtaagatgg gccagccctc cgatcccact gccgtggtgg atccgcagac 1260aagggtcctc ggggtggaaa acctcagggt cgtcgatgcc tccatcatgc ctagcatggt 1320cagcggcaac ctgaacgccc ccacaatcat gatcgcagag aaggcagctg acattatcaa 1380ggggcagcct gcactctggg acaaagatgt ccctgtctac aagcccagga cgctggccac 1440ccagcgctaa gacagttgct gctggaggat gaccagggaa gccccctgat aagccaagag 1500ggccagcaca gcccttgctc ccaggctcct gcctgaaact atctagcaca ctaggaccca 1560ggtggtaccc tactcagtgg ctgagaattg gataaagtct tkgggaaatg agacaagtac 1620tgggcagtga atccagctcc ttttccccag cctttccctg tgggccattt ggggaaggcc 1680agcattycag cctgagatgt tcctccctgc ctcctggggg ggcaraaggg vtaggwtggt 1740taactcctgc cgcatccttc cctgcctcct ggagggacag aaggggagga tggttaactc 1800ctgccgcatc ctttttcttg tgttcacgtg gcattctcta acccagggca gtggttcctt 1860cccaggccat gcacagaggc tgggtgcctg ccagacccac ggagggttcg cgaaggaagg 1920ggcatcctcc ttcttgagct gcaagcttta gctgaggcag taagtcacac agtagttagt 1980tcagcctggg ctggcacata agtccccagt gtccctgttg agaggggaaa gttgcctgct 2040ggttgaaaaa ctggcttttc ctttctcgct gcctaatttc actctcagag tgaggcaggt 2100aactggggct ccactgggtc actctgagag ggttgtggct ctggttctta ttaaaccagg 2160gccaggtgca gggctcacac ctgtaatccc agcactttgg gaaggtcact tgagctcagg 2220agttcaagac cagcctgggc aacatagtga gaccttgtct ctggaaaaca attagctggg 2280catggtggta cacacctgta gtcccagcta cttgggaggc tgaggcggga ggatggcttt 2340agcccaggag gttgaggctc ctgtgaaccc tgatggcacc actgcactcc agcctgggtg 2400acagggtgag accctgtctc aaaaaaaaa 2429398626DNAHomo sapiensmisc_feature(10)..(10)a or g or c or t/umisc_feature(39)..(39)a or g or c or t/umisc_feature(77)..(77)a or g or c or t/umisc_feature(83)..(83)a or g or c or t/umisc_feature(102)..(102)a or g or c or t/umisc_feature(121)..(121)a or g or c or t/u 398ccgccgttgn caaagggccc agaatatggg ccatggacna tctccatgcc tggggaaatt 60ccctcgggtc ttttggntaa ccnccttata gaaaggtaat gncatggagt ctctacaggg 120ngcacaaggt ggactaattg atacgaagag ccctgtaaat atgtgggcag cggcagattt 180tgaccatttg gaccgaactg tatttgacac agcgcaatat ctggaactgg ttggtcaaaa 240acctgcttgt cttgttaaat ttcctctgtc caaggacatg gaatctctct ctaattttac 300ttcaaatttc cctttccttc atttctctaa aaacgttaaa taagaaagaa gattgtaaag 360ccagcatttg aagcctaagt attgaaagtc tttgacaatt tctgaaatca gacttgacat 420ctttcccccg ccttgcaaat ttcttgaaga aataagaagc tacatgtaag catcatcatg 480tttattaaat tacaatgaga actctcactc aatcttgacc agagcagact cttaacttgg 540aagcagagtc cctctaaagg taactcttgt ggtcactcaa tattgtattg gcatttgcat 600attaaataga catttcagta gcattt 626399691DNAHomo sapiens 399tggcccgcgg tcgcggtggg atcctagccc tgtctcctct cctgggaagg agtgagggtg 60ggacgtgact tagacaccta caaatctatt taccaaagag gagcccggga ctgagggaaa 120aggccaaaga gtgtgagtgc atgcggactg ggggttcagg ggaagaggac gaggaggagg 180aagatgaggt cgatttcctg atttaaaaaa tcgtccaagc cccgtggtcc agcttaaggt 240cctcggttac atgcgccgct cagagcaggt cactttctgc cttccacgtc ctccttcaag 300gaagccccat gtgggtagct ttcaatatcg caggttctta ctcctctgcc tctataagct 360caaacccacc aacgatcggg caagtaaacc ccctccctcg ccgacttcgg aactggcgag 420agttcagcgc agatgggcct gtggggaggg ggcaagatag atgaggggga gcggcatggt 480gcggggtgac cccttggaga gaggaaaaag gccacaagag gggctgccac cgccactaac 540ggagatggcc ctggtagaga cctttggggg tctggaacct ctggactccc catgctctaa 600ctcccacact ctgctatcag aaacttaaac ttgaggattt tctctgtttt tcactcgcaa 660taaattcaga gcaaacaaaa aaaaaaaaaa a 6914001824DNAHomo sapiens 400caataggccg gcttttgaac tgcttcgcag gggacttgga acagctggac cagctcttgc 60ccatcttttc agagcagttc ctggtcctgt ccttaatggt gatcgccgtc ctgttgattg 120tcagtgtgct gtctccatat atcctgttaa tgggagccat aatcatggtt atttgcttca 180tttattatat gatgttcaag aaggccatcg gtgtgttcaa gagactggag aactatagcc 240ggtctccttt attctcccac atcctcaatt ctctgcaagg cctgagctcc atccatgtct 300atggaaaaac tgaagacttc atcagccagt ttaagaggct gactgatgcg cagaataact 360acctgctgtt gtttctatct tccacacgat ggatggcatt gaggctggag atcatgacca 420accttgtgac cttggctgtt gccctgttcg tggcttttgg catttcctcc accccctact 480cctttaaagt catggctgtc aacatcgtgc tgcagctggc gtccagcttc caggccactg 540cccggattgg cttggagaca gaggcacagt tcacggctgt agagaggata ctgcagtaca 600tgaagatgtg tgtctcggaa gctcctttac acatggaagg cacaagttgt ccccaggggt 660ggccacagca tggggaaatc atatttcagg attatcacat gaaatacaga gacaacacac 720ccaccgtgct tcacggcatc aacctgacca tccgcggcca cgaagtggtg ggcatcgtgg 780gaaggacggg ctctgtaggt ttttactgag cacctactat gtgcctggga accgaaaggg 840aagtcctcct tgggcatggc tctcttccgc ctggtggagc ccatggcagg ccggattctc 900attgacggcg tggacatttg cagcatcggc ctggaggact tgcggtccaa gctctcagtg 960atccctcaag atccagtgct gctctcagga accatcagat tcaacctaga tccctttgac 1020cgtcacactg accagcagat ctgggatgcc ttggagagga cattcctgac caaggccatc 1080tcaaagttcc ccaaaaagct gcatacagat gtggtggaaa acggtggaaa cttctctgtg 1140ggggagaggc agctgctctg cattgccagg gctgtgcttc gcaactccaa gatcatcctt 1200atcgatgaag ccacagcctc cattgacatg gagacagaca ccctgatcca gcgcacaatc 1260cgtgaagcct tccagggctg caccgtgctc gtcattgccc accgtgtcac cactgtgctg 1320aactgtgacc acatcctggt tatgggcaat gggaaggtgg tagaatttga tcggccggag 1380gtactgcgga agaagcctgg gtcattgttc gcagccctca tggccacagc cacttcttca 1440ctgagataag gagatgtgga gacttcatgg aggctggcag ctgagctcag aggttcacac 1500aggtgcagct tcgaggccca cagtctgcga ccttcttgtt tggagatgag aacttctcct 1560ggaagcaggg gtaaatgtag ggggggtggg gattgctgga tggaaaccct ggaataggct 1620acttgatggc tctcaagacc ttagaacccc agaaccatct aagacatggg attcagtgat 1680catgtggttc tccttttaac ttacatgctg aataatttta taataaggta aaagcttata 1740gttttctgat ctgtgttaga agtgttgcaa atgctgtact gactttgtaa aatataaaac 1800taaggaaaac tcaaaaaaaa aaaa 18244013621DNAHomo sapiens 401cccacagggg gaccggccct gtgacccctc accggggccg tgggcccgag ccccggactt 60ccctaagccg gcaatgaccg cctgcgcccg ccgagcgggt gggcttccgg accccgggct 120ctgcggtccc gcgtggtggg ctccgtccct gccccgcctc ccccgggccc tgccccggct 180cccgctcctg ctgctcctgc ttctgctgca gccccccgcc ctctccgccg tgttcacggt 240gggggtcctg ggcccctggg cttgcgaccc catcttctct cgggctcgcc cggacctggc 300cgcccgcctg gccgccgccc gcctgaaccg cgaccccggc ctggcaggcg gtccccgctt 360cgaggtagcg ctgctgcccg agccttgccg gacgccgggc tcgctggggg ccgtgtcctc 420cgcgctggcc cgcgtgtcgg gcctcgtggg tccggtgaac cctgcggcct gccggccagc 480cgagctgctc gccgaagaag ccgggatcgc gctggtgccc tggggctgcc cctggacgca 540ggcggagggc accacggccc ctgccgtgac ccccgccgcg gatgccctct acgccctgct 600tcgcgcattc ggctgggcgc gcgtggccct ggtcaccgcc ccccaggacc tgtgggtgga 660ggcgggacgc tcactgtcca cggcactcag ggcccggggg ctgcctgtcg cctccgtgac 720ttccatggag cccttggacc tgtctggagc ccgggaggcc ctgaggaagg ttcgggacgg 780gcccagggtc acagcagtga tcatggtgat gcactcggtg ctgctgggtg gcgaggagca 840gcgctacctc ctggaggccg cagaggagct gggcctgacc gatggctccc tggtcttcct 900gcccttcgac acgatccact acgccttgtc cccaggcccg gaggccttgg ccgcactcgc 960caacagctcc cagcttcgca gggcccacga tgccgtgctc accctcacgc gccactgtcc 1020ctctgaaggc agcgtgctgg acagcctgcg cagggctcaa gagcgccgcg agctgccctc 1080tgacctcaat ctgcagcagg tctccccact ctttggcacc atctatgacg cggtcttctt 1140gctggcaagg ggcgtggcag aagcgcgggc tgccgcaggt ggcagatggg tgtccggagc 1200agctgtggcc cgccacatcc gggatgcgca ggtccctggc ttctgcgggg acctaggagg 1260agacgaggag cccccattcg tgctgctaga cacggacgcg gcgggagacc ggctttttgc 1320cacatacatg ctggatcctg cccggggctc cttcctctcc gccggtaccc ggatgcactt 1380cccgcgtggg ggatcagcac ccggacctga cccctcgtgc tggttcgatc caaacaacat 1440ctgcggtgga ggactggagc cgggcctcgt ctttcttggc

ttcctcctgg tggttgggat 1500ggggctggct ggggccttcc tggcccatta tgtgaggcac cggctacttc acatgcaaat 1560ggtctccggc cccaacaaga tcatcctgac cgtggacgac atcacctttc tccacccaca 1620tgggggcacc tctcgaaagg tggcccaggg gagtcgatca agtctgggtg cccgcagcat 1680gtcagacatt cgcagcggcc ccagccaaca cttggacagc cccaacattg gtgtctatga 1740gggagacagg gtttggctga agaaattccc aggggatcag cacatagcta tccgcccagc 1800aaccaagacg gccttctcca agctccagga gctccggcat gagaacgtgg ccctctacct 1860ggggcttttc ctggctcggg gagcagaagg ccctgcggcc ctctgggagg gcaacctggc 1920tgtggtctca gagcactgca cgcggggctc tcttcaggac ctcctcgctc agagagaaat 1980aaagctggac tggatgttca agtcctccct cctgctggac cttatcaagg gaataaggta 2040tctgcaccat cgaggcgtgg ctcatgggcg gctgaagtca cggaactgca tagtggatgg 2100cagattcgta ctcaagatca ctgaccacgg ccacgggaga ctgctggaag cacagaaggt 2160gctaccggag cctcccagag cggaggacca gctgtggaca gccccggagc tgcttaggga 2220cccagccctg gagcgccggg gaacgctggc cggcgacgtc tttagcttgg ccatcatcat 2280gcaagaagta gtgtgccgca gtgcccctta tgccatgctg gagctcactc ccgaggaagt 2340ggtgcagagg gtgcggagcc cccctccact gtgtcggccc ttggtgtcca tggaccaggc 2400acctgtcgag tgtatcctcc tgatgaagca gtgctgggca gagcagccgg aacttcggcc 2460ctccatggac cacaccttcg acctgttcaa gaacatcaac aagggccgga agacgaacat 2520cattgactcg atgcttcgga tgctggagca gtactctagt aacctggagg atctgatccg 2580ggagcgcacg gaggagctgg agctggaaaa gcagaagaca gaccggctgc ttacacagat 2640gctgcctccg tctgtggctg aggccttgaa gacggggaca ccagtggagc ccgagtactt 2700tgagcaagtg acactgtact ttagtgacat tgtgggcttc accaccatct ctgccatgag 2760tgagcccatt gaggttgtgg acctgctcaa cgatctctac acactctttg atgccatcat 2820tggttcccac gatgtctaca aggtggagac aataggggac gcctatatgg tggcctcggg 2880gctgccccag cggaatgggc agcgacacgc ggcagagatc gccaacatgt cactggacat 2940cctcagtgcc gtgggcactt tccgcatgcg ccatatgcct gaggttcccg tgcgcatccg 3000cataggcctg cactcgggtc catgcgtggc aggcgtggtg ggcctcacca tgccgcggta 3060ctgcctgttt ggggacacgg tcaacaccgc ctcgcgcatg gagtccaccg ggctgcctta 3120ccgcatccac gtgaacttga gcactgtggg gattctccgt gctctggact cgggctacca 3180ggtggagctg cgaggccgca cggagctgaa gggcaagggc gccgaggaca ctttctggct 3240agtgggcaga cgcggcttca acaagcccat ccccaaaccg cctgacctgc aaccggggtc 3300cagcaaccac ggcatcagcc tgcaggagat cccacccgag cggcgacgga agctggagaa 3360ggcgcggccg ggccagttct cttgagaagt gaggcccggc cccggacagg gtctgggccc 3420tgctccctgt cccatctgca gtggacccca ggcacccccc tttgaggagg tggggtgaac 3480tgctccttgg cagggatttg tgacactgca ttgctgggct gtgttcctcg ggctcttctg 3540gaccttgcac cgtggatacc aggccatgtg ccatggtatt tgggtcctgg gagggtgggt 3600gaaataaagg catactgtct t 36214021284DNAHomo sapiens 402ctttcacaga aagaaagtaa caggcataat tcctgttgat gaggctggga ttgtttttaa 60gaggagagat aataacttca tatttttaaa gtgccagtag cctaatatgt gaaacagatc 120agaatctgtt gtgtagtaag tctgctttgt tgaagaattt attatgggag taaagataag 180aaggaaagag atcaccatca gaaacaagtc agccttttca tgcttttttg agcatttttg 240gagatgattc cacttctcaa gttattatca tttgtgcatc tcttcaatgc tattgttaaa 300tgctttagaa ttagaatatt ttgatccttt aattaaagta agccaaacgt ctaggcaaaa 360acagccaatc attaaacttt aatagtaatt caaatataga tttctcatac agttttccat 420gtctgtagaa atcaaagttg taatgttaag cagagggaaa tgcgtgtgat ttactaatac 480acttcaacgt tctacttttg aaaggatact catgtgggtg gggcagagaa catagaaaaa 540gatatgatgg aaaacctgtc cattttctac ctgttaacct tcatcatttt gtgcaggccc 600tggaagcaaa gagaggaagg gaccgactgc atttatcttt gaacacttga gcatcagtag 660tactactgag tggccagggg tcttgtctgt caaagcaaat gataagttca ctcaggccat 720tattgactgc tgaactctct tccttcccaa ctcttccttg aaagagaaaa aaatactttg 780ccttcttgct ctccttatca aatgtttttg tacaaatagt gtaagcctgt ttaagcaaac 840caattaaaat aggcactgat tattttgatc tgtttgtaac aaatgaatgt aagtactatt 900tacatggtgt gcctaggagg agctgaaatc attggcactt taatccatat tgtaaagatc 960agtatcaaaa gcatagtgtt cttcacctct cctcctcagc atccatctct atatacttga 1020ttaaatggaa aagtctcttt tatcacctct atgtaaagtt ttatgggtag ttatcgtcag 1080tgtatttaaa tatatcttct agtatgtttt aaaggctggt cttcaatact gtggagacaa 1140aaaataaaag agcgtatgaa aagtacgtta gacttttgct ggcattcaag tcatggctag 1200tctgtgtatt taataaatgt gtgttattta tgtcgtgttt gtcaatggaa aataaagttg 1260aatattctga aaaaaaaaaa aaaa 1284403547DNAHomo sapiensmisc_feature(5)..(5)a or g or c or t/umisc_feature(10)..(10)a or g or c or t/umisc_feature(42)..(42)a or g or c or t/umisc_feature(49)..(49)a or g or c or t/umisc_feature(80)..(80)a or g or c or t/umisc_feature(115)..(115)a or g or c or t/u 403cctanaagtn ccattttggc aaggataaac tcccatgaca anctcccant actgcatgtg 60aatgaataag aaacaagaan tgaccacacc aaagcctccc tggctggtgt tacangggat 120caggtccaca gtggtgcaga ttcaaccacc acccagggag tgcttgcaga ctctgcatag 180atgttgctgc atgcgtccca tgtgcctgtc agaatggcag tgtttaattc tcttgaaaga 240aagttatttg ctcactatcc ccagcctcaa ggagccaagg aagagtcatt cacatggaag 300gtccgggact ggtcagccac tctgactttt ctaccacatt aaattctcca ttacatctca 360ctattggtaa tggcttaagt gtaaagagcc atgatgtgta tattaagcta tgtgccacat 420atttattttt agactctcca cagcattcat gtcaatatgg gattaatgcc taaactttgt 480aaatattgta cagtttgtaa atcaatgaat aaaggttttg agtgtaaaaa aaaaaaaaaa 540aaaaaaa 547404784DNAHomo sapiens 404ggcacgaggg caaagagtag tcagtccctt cttggctctg ctgacactcg agcccacatt 60ccatcacctg ctcccaatca tgcaggtctc cactgctgcc cttgccgtcc tcctctgcac 120catggctctc tgcaaccagg tcctctctgc accacttgct gctgacacgc cgaccgcctg 180ctgcttcagc tacacctccc ggcagattcc acagaatttc atagctgact actttgagac 240gagcagccag tgctccaagc ccagtgtcat cttcctaacc aagagaggcc ggcaggtctg 300tgctgacccc agtgaggagt gggtccagaa atacgtcagt gacctggagc cgagtgcctg 360aggggtccag aagcttcgag gcccagcgac ctcagtgggc ccagtgggga ggagcaggag 420cctgagcctt gggaacatgc gtgtgacctc cacagctacc tcttctatgg actggttatt 480gccaaacagc cacactgtgg gactcttctt aacttaaatt ttaatttatt tatactattt 540agtttttata atttattttt gatttcacag tgtgtttgtg attgtttgct ctgagagttc 600cccctgtccc ctccaccttc cctcacagtg tgtctggtga caaccgagtg gctgtcatcg 660gcctgtgtag gcagtcatgg caccaaagcc accagactga caaatgtgta tcagatgctt 720ttgttcaggg ctgtgatcgg cctggggaaa taataaagat gttcttttaa acggtaaaaa 780aaaa 7844051216DNAHomo sapiens 405agaaaactat tttctaaata ttaacactga aaatgttttg ttagcttttc cttctttctc 60tccagaagaa acatggatag atgatagctg tttcattgtt tgtttttgtc aagcatattc 120actttcctcc ttgtcctctg attctgagca aagggcctca gactctgaac ttccctcaag 180tgccgttgtt atgtgaactc ttccattcag attccagaga ggttctcatg ctcccccccc 240ctccttattt gtagcaatcg tagcaactaa ttccactaag tacaagggag ttttttacac 300tcctccattt ttatagcatc tgcatttttt ttttttgtta ggtacatgta tacacctgcc 360tgagtataaa tactctctct acctaataat aacatcaacc aacatctttt ccaaattagg 420gccacagaac agcaacattt gtctgacagt agtataaaga ataatgatag ctctatcctt 480aagaagtatt tcctttcctt tttatatagt cccgttaggg tttaaaacca tattgatcaa 540ctagaaagaa aaatatgaaa agagaaaaat attttaattt aaaaattgta atacattgat 600ttataaaatg ccttctctga tacttttgaa acagatgtga aaaacagaaa aagaaaaaat 660tgtctgaaat gtttattttg caaaacagtg caatagaatc tagttatgcc ttcatcactg 720ttgacagtaa atactgacag ccccttgcag tgtgttagtt ttagatcact ctgttttagt 780tgagagaaat gttttatatc atggttttta tatgaataca aattatttct caaagattta 840tagcacacac tattctcagg aattctgtat tacatgaatg ctgcttatat attttcatat 900tctaacttgt cttttcaagc aaataactaa tatatatgtg catgcagtct gccttgacaa 960gttgttccaa gctgaagagc tttcactgta caatgtgtgg aaaatcacca tagatcatgg 1020ctgaaatagt ttgtaattgt ctgagtctgt gcacgtactt ttagataaaa tgctgctgag 1080tgactgcatg atgagataca acttctgaat gctgcacatt cttccaaaat gatccttagc 1140acaatctatt gtatgatgga atgaatagaa aactttttca ctcaataaat tattatttga 1200tatggtaaaa aaaaaa 1216406993DNAHomo sapiens 406cccaaggttg ttatatcttc atgtcctcat ttcttaggga ggtaccttca gaaccaatag 60tgacccctaa cttctctggt ggtcggttcc atgaaaggca aaggagtgtg agagaggagt 120ggatggtcaa cctcccactg ccatggtaac atgggtgctg gctgatggga gcagaaaata 180atttagtgaa agtctgtggg ggcagtcaca agatgtctga gaaaactggc gagccagctg 240ctgaaaacag ggacaaggaa gcctccgtgg ctggagccca aatcacactg cagacccaga 300caccgtgacc accaccatgg actccagaga gagcagctta tagtactcaa tcagctgcca 360ctaccaccat ccagaacacc agatgttgta gccatggctg cagcaggaat ggatgtccca 420ctgtccctgc tcctcggtgt gacttgctcc caagttcagg gcaggtccat ctgattggct 480gagtctggaa tgtctgcctg tgcctcagct gtgagggagg cagggaaagt aagccttttc 540agcttctgtc gtgggaggtg ggctctgcct cctaccaaga atcaaagggt ggaggatctt 600caaacacagg aaaagaaccc ggatcctggc acccccaaat tttcagagtc catttcagag 660cataagaaat tgagggtcca agatcattca tgtaagaagt ttagaggggg aagaaaagaa 720tgataaacga aaagaacagc aatagtaaag gatcttttct ttgtttcagt aagatgaaga 780ggcctgagca gtttcgtgga ggggaagaaa caggaaaacc tcttcaaaag acaaaaagct 840ggcactgcat tctctctctg tagcaggaca gaactgtcta aagacaagac ccctttggcc 900aaaataaagg aacctgaaac attaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 960aaaaaaaaaa aaaaaaaaaa aaaaaacctc ggg 9934072214DNAHomo sapiens 407aaatttaatt aattataaac tcagtctctt ggttgcacca gccacatttc agatgctcaa 60tagccacatg tggctagtgg gtaccatatt ggacagggca gctatagaat atttccatca 120ttgcagaaag ttctattgga tagtaccata atctttttat agtaacttgg aaatactatt 180tgatattaga tgttagacca caaaaagaag aaaaatgtta ggactatttc agatataaaa 240aggaactgaa ttgtgacata attagcatct tacattccat acagttgaat accttatgct 300gtgacaacca tagttaatca tttcagtgct gttcaacata catacctatc agcagtgtgt 360ttagaccagg ggtctgcaaa ctttctgtga atggacaaag agtaaatact ttagtaaatg 420tcttaggctt tgtggcctac atgatctttg ttgcaagtac tcaactctgc cattatagag 480ttaaagcagc catacacaat atataaacaa aatgggcata gttgtatttc agtaaaactt 540tatttacaaa gacaggcggt aggccagatt tggcttgcat gctgtagagc tgtggtctaa 600attttattca tagactttct ttgcaaatac agtgtgagta ttgttccatt tacagtatta 660ttatttttta gatacctggt ttttagattc ttgcctggta actttttact gaaaatacaa 720gaatttcgta ctgcatttgc atctccgaga ttagggagca cctgtcagga tatgttgttc 780tatcagggtt acttctgttg actacctctt agattttgat acagttatat tgttgagttt 840cattttcata tattcttgta gtgtctgctt gcctgtgact tctggtaaaa taaaataagc 900ctttgaaaat attttagcat ggtatttaac attttctaaa tattatggca ttttgacata 960ttttagtcag cgaagacatc tgcccctttg gtgtttctac ttgcttatga ttgagatttt 1020acaagccctt caaactccgt tttaaaggaa tttattgtaa aacattaact ttaataaatt 1080agtgttttca cagatcagat cattatactt ggaacttcta aatcatgcaa tttctgaata 1140aggacataag gctagattca tttttcttaa tagagaaaaa ggaaatttct gatttatcac 1200ttttctagtt gataagtagg attcaaaacg tttgatatgt aagtatttat ataagactaa 1260tgtaatttaa agttctgtat tattgtgatt aatcatacag aaattcagga actgatcaga 1320agtgagattc ttttccacat ctggttaatg tagtgagttg acaccctgtg ggtggtaaag 1380cattataaac atttcatctt gaaccatgat ttatacacat ctgtgttata agggaggctt 1440gagtacatat accaatgaag agatattcag catttgtcta tttgataagg aattaaatgt 1500cctagtgatt ataaagtaaa accacagacc aatttgcaaa tgatcttcaa tgttaagcac 1560ttgctctaag attaaaattc cttttctttt taaggttaag ggtgtgtacg tatggcagtg 1620atgtctatgt tgagattaac ttatgtattg aggaaaattt gaagtttatt ttttcgatga 1680ataaggctgt caaatgattt agtatagatt aatgacatct tttttagaaa tattaaagtg 1740agtattcctc attatgtcat catttctgat aattagagtg ctaatttgaa tgttagataa 1800tgtttccaca tctataccta tttctttcta gggcacttct gaccctgggg cttggggatg 1860gcctttaggc cacagtagtg tctgtgttaa gttcactaaa tgtgtattta atgagaaaca 1920ttcctatgta aaaatgtgtg tatgtgaacg tatgcataca tttttattgt gcacctgtac 1980attgtgaaga agtagtttgg aaatttgtaa agcacaaacc ataaaagagt gtggagttat 2040taaatgatgt agcacaaatg taatgtttag cttataaaag gtcctttcta ttttctatgg 2100caaagacttt gacacttgaa aaataaaacc aatatttgat ttatttttgt aagtatttag 2160gatattattt taaataaatg attgtccatt atcaataaaa aaaaaaaaaa aaaa 22144081182DNAHomo sapiens 408gtcctgagca gccaacacac cagcccagac agctgcaagt caccatggac gctgaaggcc 60tggcgctgct gctgccgccc gtcaccctgg cagccctggt ggacagctgg ctccgagagg 120actgcccagg gctcaactac gcagccttgg tcagcggggc aggcccctcg caggcggcgc 180tgtgggccaa atcccctggg gtactggcag ggcagccttt cttcgatgcc atatttaccc 240aactcaactg ccaagtctcc tggttcctcc ccgagggatc gaagctggtg ccggtggcca 300gagtggccga ggtccggggc cctgcccact gcctgctgct gggggaacgg gtggccctca 360acacgctggc ccgctgcagt ggcattgcca gtgctgccgc cgctgcagtg gaggccgcca 420ggggggccgg ctggactggg cacgtggcag gcacgaggaa gaccacgcca ggcttccggc 480tggtggagaa gtatgggctc ctggtgggcg gggccgcctc gcaccgctac gacctgggag 540ggctggtgat gttgaaggat aaccatgtgg tgccccccgg tggcgtggag aaggcggtgc 600gggcggccag acaggcggct gacttcgctc tgaaggtgga agtggaatgc agcagcctgc 660aggaggtcgt ccaggcagct gaggctggcg ccgaccttgt cctgctggac aacttcaagc 720cagaggagct gcaccccacg gccaccgcgc tgaaggccca gttcccgagt gtggctgtgg 780aagccagtgg gggcatcacc ctggacaacc tcccccagtt ctgcgggccg cacatagacg 840tcatctccat ggggatgctg acccaggcgg tcccagccct tgatttctcc ctcaagctgt 900ttgccaaaga ggtggctcca gtgcccaaaa tccactagtc ctaaaccgga agaggatgac 960accggccatg ggttaacgtg gctcctcagg accctctggg tcacacatct ttagggtcag 1020tgaacaatgg ggcacatttg gcactagctt gagcccaact ctggctctgc cacctgctgc 1080tcctgtgacc tgtcagggct gacttcacct ctgctcatct cagtttccta atctgtaaaa 1140tgggtctaat aaaggatcaa ccaaaaaaaa aaaaaaaaaa aa 11824092630DNAHomo sapiens 409cggggcatgc tgcttccctt caccttccac catgattgta agtttcctga ggcctcccca 60ggtgtgcttc tgtacagcct gtggaatgtt accaaagacg ttggaagagg tggctatggg 120acatcacctg ggagaagtgg aagcaaatgg acactgttca gaagtccata tacagaaaca 180tacttggaaa aatatagaaa cctggttttg ctagatggga agcttgcagc tggggccaag 240acatcaagag tagagcagca ggacatttca aaagaagatt aactcaaaga ttagagatgg 300aagaacttgc aaagagaaag tctgtaccgg aagaaatctg gaaatctaga ggccagttta 360agaatcagca gctaaacaag gagaataatc tagggcaaga gatagctacc tgcacaaaaa 420ttcctaccag aaaaagagac atagaatcta atgaatttgt gaaaaatttt actgtaagat 480caatacttgt tgcagaacag atagatccta tggaagagaa ttgtcataaa tatggtacat 540gttgaaagat gctcaaacaa aactcagatt taattataca aagaaagtat gatggaaaaa 600aaaaaacctt gtaaatatag tgaatgtggg agaaccttca gaggccacat cactcttgtt 660cagcatcaaa taactcattg tggagagaga ccctgtaaat gtactgagtg tagaaaggga 720tttaatcaga gttcccactt aagaaataat cagagaaaaa ctctttcagg agaaaagccc 780tacaaatgca gtgagtgtgg gaaggccttc agttattgct tagttcttaa tcaacaccag 840agaattcaca gtggagagaa accttatgag ggtactgaat gtggcaagac attcattcag 900tcgtacatac cttactcagc atcaaagaat tcacacactg gtgagaagcc ctatacatgt 960cttgaatgtg gaaggctttt tagtcagaac acacatctta ctctacatca gagaatccat 1020actggagaga aaccttatga atgcaatgaa tgtggtaggt cctttagtca gactgcacat 1080cttactcaac atcaaagaat gtatacagga gaaaaactct atgaatgtaa tgaatgtgag 1140aaagccttcc atgatcactc agctcttatt caacatcata ttgtccatac tgcagagaaa 1200ccctatgata tcatgactgg gaaaactttc agttactgtt cagacctcat tcaacatcag 1260agaatgcaca ctggagagaa accatacaaa tgcaatgaat gtgggaatgc ctttagtgat 1320tgttcatccc ttattcagca tcaaagaact cacactggag aagagcctta tgaatgtaag 1380caatgtggaa aagcctttag cagaagcaca taccttactc aacatcagag aagtcacgca 1440ggagagaaac agtataaatg caatgaatgt gagaaaactt tcagcctgag ttcattcctt 1500acacagcata tgagggttca gactggagaa aaaccctaca aatataatga atatggaaaa 1560gcttttagtg actgctcagg acattttcag agaactcaca ctggagagaa gccctgtgaa 1620tgtaatgact gtgggaaacc tttcagtttc tgttcagccc taattcaaca taagagaatt 1680cataccagaa agaagccctg actgtacctt cataccagta aatgcactga ctgtggaaaa 1740gccttcagtg attggttagc acttgttcaa catcagataa ctcaacactg gagaaaaacc 1800gtataaatgt actgaatgtg gaaaagcctt cagttggagt acagacctca aaaatcacca 1860gaaaactcat actagtgaaa aatcctataa atgtaatgaa tgtagaaagg cctttagtta 1920ctgctctggt cttattcaat gtcaggtcat tcatactata gaaaaacctt atgaatacgg 1980taaatgtggc aaagccttta ggcagaggac agaccttaaa aaacatcaga aaatgcatac 2040cgaagagaaa ccctatgaat gtaatgaatg tgggaaagcc tttagccaga gcacatatct 2100tacaaaacac caaaaaattc atagtgaaga gaaatcaaat atacatactg agtgtgggga 2160aaccattaga caaaactctt ctttttacaa caataaaacc tcacactgga gagttctctg 2220aatgccttaa gaatttggtt aatatggaga cccttcccag ggaaacagaa ggaggatcgt 2280gaaaaccgtt gactacttga atgatcacat ggtttagtgg agagagcatg attctgggtt 2340ttaaaagtca tggatctcaa tctcagctcc tattactaac tagatctttt actttggggt 2400aagtcacttc atatctttag gccttaattt cctcatctga aaactggaag gcctgacttg 2460acttgttgag cttaagatcc tcaattatta tatttactag gaattcaagt ttctatagat 2520gtggttcaga attgtgactt atttattgta catcaggtgt gattcacaag tgagcttgta 2580gtagttatta aggagtcaat aaagatatga tataaaaaaa aaaaaaaaaa 2630410551DNAHomo sapiens 410catttcatct tcattggata gtgttacata gtaatatatt tatgttttct tttaatcatt 60tcataacttg gaaaatacta acatagtcaa aactctaggg taggtgatac atgagtttct 120gtagtaatct ggttggagac atgttgtaat tctgtatata tatgtacatt tatcccatgc 180atgttatgcc taaactaaga cggatacccc tgaattaaga ggtgctgtta tacattgacc 240aggcttaaga atatctcttt aaagtgtgtc gacatttaat tgacctttgg aagttcattc 300tgttaatcat actcaaagtg ctaaagctat ggttgactgc tctggtgttt ttatattcat 360tcgtgcttta gcatataaat tcttcagcat aattgctact tatttagcaa gagtttcctt 420tatttgaaaa tgtgagttgt gcttgtattt ttgtgtcttt ctttctttct ttcttttttt 480aaactttgct tcaggctggg tagtggtaga ggtttgaatt aaaatgtttt cctgtcagta 540aaaaaaaaaa a 5514112390DNAHomo sapiens 411gagcgagccc agcagcttgc ccttgacagg tgggggctgg ctggggcctt aatgtgaaaa 60gacagtggca ggcagctgga gtagagcgag cccagcagcc ctaaaaggct gccttcatgg 120ccatctagcc ccagttcagg gcagcatcca tagcccacaa gccagcgtgg gtggggcggg 180ggtggtccca cagctgggtt ccacctgaag agcctccgtg cctcggagca ggagaggcag 240gctatggctg tcaccctccc tcctgcctgt gtcccagtga gaactgacct gagtcccctt 300ccaaacccag acccacctcc tgccccaggc ccactgaagc atgttccatt tctaaaaagc 360ccagagttca gtgtgtccca aggaaaaccc aaagtggagg tgctcaggtc caggggagtc 420cagtgggcag gacccttggc aggcaagccc ctcccttcac tcccaggacc taccttctgc 480tagtaaagga ctggcttcat tctaattatg gcccacagac tgccccggag acctggagga 540cagcagtgct ggcacttggg tgtccatggg cccgtctgcc ggctctgcct gtgctgcaag 600tgttggccgt gggtccagcc aacaactccc tacgtcctgt gtggggccct gcccaagtgg 660atgaggcatt ccttgaggag tatcattttc cctgacaatc cccatcacct

ttaggggttc 720cctgcttggc tcctttccag ctgaaaaact agacctgtgc cattggggaa gctggacaaa 780gtctaggggg cccgcctggt agagggtccc gggaagctgg atctgtcagc ctcggccctg 840aggcccctgt taactcaaga ctgtgagctg cctctaggtg gtcacgtctg ggagctagct 900tgtatggctt ctgaccagta tcaggatttc tgttctgaga gcagcgtggg cagcaaggca 960gggcagccca gaggtggcag cggcaggcaa tctggtcact aggtctttgt gatgccaaaa 1020ataaaagagg gtggggtggg tgctttctgt tcctctgatt ggatggagtc cgccagcagg 1080catggggcta cattccagtg cctgactata gggaggcact cctgattcca tggagcagcc 1140cggactttga gaatgggctc tggtttgcgg ggggcaggcg taccagactg caagaccccc 1200cagtacctca ccgtgccaaa taggaagagg tggccttggt gtagccaaat ggatcttttt 1260aacagtgtgc ctttggggag ggacccatgt ccatggcttc gttgagggcc atccatatgc 1320cagctggggg ccagcccaca gtggccatat tggctgcagc aggaatggtg cccacctcgg 1380cgaattgaag ggctaagagt cccagatagc taggccagag ctggaagcag acagtaaggg 1440gaagagctgc tcccacagga gagggagaga ttccagctca ctgcgcagcc tgggaggagg 1500cgtggatcct ggcacgctga gcctcaggca ccagcctccc tgtgctcgac agcaaagtct 1560tgactccttc ctgctgagca ctgtgctacc ttcactgctc caaagccaga ctaacagctc 1620tccaagccct tggggtgact cggcttccag gagctgttgg agaaatgagg atgtctgtcc 1680ctgtctgcct gggcaggcca gattcctccc cagcagccgg gtctctccag accctgattc 1740ggtgcctttc tgtttaccag ctacttcaat cccaaagttt gaatctgcag ataccttact 1800cccagccact ttgccttctt actgtgttgt gtgtttttcc tggtgcttca agagcgtgtg 1860cagggcaagt gccgtcactg ggaactgcac cagatgctca gacttggttg tcttatgttt 1920accaataaat aaaagtagac tttttctatt tttatttgct gctatttgtg tgtgtgtttg 1980tgtttgtgta gctaggtatc tggcacttct gacgatgcat tgttgctttt ttcccgaagg 2040tcccgcagga actgtggcaa tggtgtgtgt gtgaaatggt gtgttaaccg cgttttgttt 2100gctcctgtat tgaataggaa gcagtggcca gtctgtcttc cttagagatg ttagcatatt 2160tttatatgta tatattttgt accaaaaaag agtgttcctt gttttggtta cactcgaaat 2220tctgacctag ctggagaggg ctctgggccg agagctttca ctaaggggag acttcagggg 2280aggatcaagc tttgaaccaa agccaatcac tggcttgatt tgtgtttttt aattaaaaaa 2340aaaatcattc atgtatgcca cttctaaaaa aaaaaaaaaa aaaaaaaaaa 23904121303DNAHomo sapiens 412ggcacgaggc tgagaccggt gcgccgcgcg ctagtggccg ctcttccgcg ggctagcggg 60cggtgggggc gccagcagcg cggaaggcgg gcacgcgggc catggctccc tgggcggagg 120ccgagcactc ggcgctgaac ccgctgcgcg cggtgtggct cacgctgacc gccgccttcc 180tgctgaccct actgctgcag ctcctgccgc ccggcctgct cccgggctgc gcgatcttcc 240aggacctgat ccgctatggg aaaaccaagt gtggggagcc gtcgcgcccc gccgcctgcc 300gagcctttga tgtccccaag agatattttt cccactttta tatcatctca gtgctgtgga 360atggcttcct gctttggtgc cttactcaat ctctgttcct gggagcacct tttccaagct 420ggcttcatgg tttgctcaga attctcgggg cggcacagtt ccagggaggg gagctggcac 480tgtctgcatt cttagtgcta gtatttctgt ggctgcacag cttacgaaga ctcttcgagt 540gcctctacgt cagtgtcttc tccaatgtca tgattcacgt cgtgcagtac tgttttggac 600ttgtctatta tgtccttgtt ggcctaactg tgctgagcca agtgccaatg gatggcagga 660atgcctacat aacagggaaa aatctattga tgcaagcacg gtggttccat attcttggga 720tgatgatgtt catctggtca tctgcccatc agtataagtg ccatgttatt ctcggcaatc 780tcaggaaaaa taaagcagga gtggtcattc actgtaacca caggatccca tttggagact 840ggtttgaata tgtttcttcc cctaactact tagcagagct gatgatctac gtttccatgg 900ccgtcacctt tgggttccac aacttaactt ggtggctagt ggtgacaaat gtcttcttta 960atcaggccct gtctgccttt ctcagccacc aattctacaa aagcaaattt gtctcttacc 1020cgaagcatag gaaagctttc ctaccatttt tgttttaagt taacctcagt catgaagaat 1080gcaaaccagg tgatggtttc aatgcctaag gacagtgaag tctggagccc aaagtacagt 1140ttcagcaaag ctgtttgaaa ctctccattc catttctata ccccacaagt tttcactgaa 1200tgagcatggc agtgccactc aagaaaatga atctccaaag tatcttcaaa gaataaatac 1260taatggcaga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 1303

科学支持的最完整的草药数据库

  • 支持55种语言
  • 科学支持的草药疗法
  • 通过图像识别草药
  • 交互式GPS地图-在位置标记草药(即将推出)
  • 阅读与您的搜索相关的科学出版物
  • 通过药效搜索药草
  • 组织您的兴趣并及时了解新闻研究,临床试验和专利

输入症状或疾病,并阅读可能有用的草药,输入草药并查看所使用的疾病和症状。
*所有信息均基于已发表的科学研究

Google Play badgeApp Store badge