заявка
№ US 20050142144
МПК A61K48/00

Epitope sequences

Авторы:
John Simard
Номер заявки
11067064
Дата подачи заявки
25.02.2005
Опубликовано
30.06.2005
Страна
US
Как управлять
интеллектуальной собственностью
Реферат

[0000]

Disclosed herein are polypeptides, including epitopes, clusters, and antigens. Also disclosed are compositions that include said polypeptides and methods for their use.

Формула изобретения

1. An isolated nucleic acid comprising a reading frame comprising a first sequence, wherein said first sequence encodes one or more segments of tumor-associated antigen PSMA (SEQ ID NO: 4), wherein the first sequence does not encode a complete PSMA antigen, and wherein each segment comprises an epitope cluster, said cluster comprising or encoding at least two amino acid sequences having a known or predicted affinity for a same MHC receptor peptide binding cleft.

2. The nucleic acid of claim 1, wherein said epitope cluster is chosen from the group consisting of amino acids 3-12, 3-45, 13-45, 20-43, 217-227, 247-268, 278-297, 345-381, 385-405, 415-435, 440-450, 454-481, 547-562, 568-591, 603-614, 660-681, 663-676, 700-715, 726-749 and 731-749 of PSMA.

3. The nucleic acid of claim 1, wherein said one or more segments consist of said epitope cluster.

4. The nucleic acid of claim 1, wherein said first sequence encodes a fragment of PSMA.

5. The nucleic acid of claim 4, wherein said encoded fragment consists of a polypeptide having a length, wherein the length of the polypeptide is less than about 90% of the length of PSMA.

6. The nucleic acid of claim 4, wherein said encoded fragment consists of a polypeptide having a length, wherein the length of the polypeptide is less than about 80% of the length of PSMA.

7. The nucleic acid of claim 4, wherein said encoded fragment consists of a polypeptide having a length, wherein the length of the polypeptide is less than about 60% of the length of PSMA.

8. The nucleic acid of claim 4, wherein said encoded fragment consists of a polypeptide having a length, wherein the length of the polypeptide is less than about 50% of the length of PSMA.

9. The nucleic acid of claim 4, wherein said encoded fragment consists of a polypeptide having a length, wherein the length of the polypeptide is less than about 25% of the length of PSMA.

10. The nucleic acid of claim 4, wherein said encoded fragment consists of a polypeptide having a length, wherein the length of the polypeptide is less than about 10% of the length of PSMA.

11. The nucleic acid of claim 4, wherein said encoded fragment consists essentially of an amino acid sequence beginning at one of amino acids selected from the group consisting of 3, 13, 20, 217, 247, 278, 345, 385, 415, 440, 454, 547, 568, 603, 660, 663, 700, 726, and 731 of PSMA, and ending at one of the amino acids selected from the group consisting of amino acid 12, 43, 45, 227, 268, 297, 381, 405, 435, 450, 481, 562, 591, 614, 681, 676, 715, and 749 of PSMA.

12. The nucleic acid of claim 11, wherein said encoded fragment consists essentially of amino acids 3-12, 3-43, 3-45, 3-227, 3-268, 3-297, 3-381, 3-405, 3-435, 3-450, 3-481, 3-562, 3-591, 3-614, 3-676, 3-681, 3-715, 3-749, 13-43, 13-45, 13-227, 13-268, 13-297, 13-381, 13-405, 13-435, 13-450, 13-481, 13-562, 13-591, 13-614, 13-676, 13-681, 13-715, 13-749, 20-43, 20-45, 20-227, 20-268, 20-297, 20-381, 20-405, 20-435, 20-450, 20-481, 20-562, 20-591, 20-614, 20-676, 20-681, 20-715, 20-749, 217-227, 217-268, 217-297, 217-381, 217-405, 217-435, 217-450, 217-481, 217-562, 217-591, 217-614, 217-676, 217-681, 217-715, 217-749, 247-268, 247-297, 247-381, 247-405, 247-435, 247-450, 247-481, 247-562, 247-591, 247-614, 247-676, 247-681, 247-715, 247-749, 278-297, 278-381, 278-405, 278-435, 278-450, 278-481, 278-562, 278-591, 278-614, 278-676, 278-681, 278-715, 278-749, 345-381, 345-405, 345-435, 345-450, 345-481, 345-562, 345-591, 345-614, 345-676, 345-681, 345-715, 345-749, 385-405, 385-435, 385-450, 385-481, 385-562, 385-591, 385-614, 385-676, 385-681, 385-715, 385-749, 415-435, 415-450, 415-481, 415-562, 415-591, 415-614, 415-676, 415-681, 415-715, 415-749, 440-450, 440-481, 440-562, 440-591, 440-614, 440-676, 440-681, 440-715, 440-749, 454-481, 454-562, 454-591, 454-614, 454-676, 454-681, 454-715, 454-749, 547-562, 547-591, 547-614, 547-676, 547-681, 547-715, 547-749, 568-591, 568-614, 568-676, 568-681, 568-715, 568-749, 603-614, 603-676, 603-681, 603-715, 603-749, 660-676, 660-681, 660-715, 660-749, 663-681, 663-715, 663-749, 700-715, 700-749, 726-749, or 731-749 of PSMA.

13. The nucleic acid of claim 1, further comprising a second sequence, wherein the second sequence encodes essentially a housekeeping epitope.

14. The nucleic acid of claim 1, wherein said reading frame is operably linked to a promoter.

15. The nucleic acid of claim 13 wherein said first and second sequences constitute a single reading frame.

16. The nucleic acid of claim 15 wherein said reading frame is operably linked to a promoter.

17. An isolated polypeptide comprising the amino acid sequence encoded in said reading frame of claim 15.

18. An immunogenic composition comprising the nucleic acid of claim 16.

19. An immunogenic composition comprising the polypeptide of claim 18.

20. The nucleic acid of claim 1, wherein said reading frame further comprises a second sequence encoding a polypeptide sequence consisting essentially of an epitope or epitope array.

21. An expression vector comprising a promoter operably linked to means for encoding an amino acid sequence comprising at least one PSMA epitope cluster, wherein said means do not encode the complete PSMA antigen.

Описание

CROSS REFERENCE

[0001]

This application is a continuation of U.S. patent application Ser. No. 10/117,937, filed Apr. 4, 2002, which claims priority under 35 U.S.C. § 119(e) to U.S. Provisional Patent Application Ser. No. 60/282,211, filed on Apr. 6, 2001; U.S. Provisional Patent Application Ser. No. 60/337,017, filed on Nov. 7, 2001; and U.S. Provisional Patent Application Ser. No. 60/363,210, filed on Mar. 7, 2002; all entitled “EPITOPE SEQUENCES,” and all of which are hereby incorporated by reference in their entirety.

BACKGROUND OF THE INVENTION

[0002]

1. Field of the Invention

[0003]

The present invention generally relates to peptides, and nucleic acids encoding peptides, that are useful epitopes of target-associated antigens. More specifically, the invention relates to epitopes that have a high affinity for MHC class I and that are produced by target-specific proteasomes. The invention disclosed herein further relates to the identification of epitope cluster regions that are used to generate pharmaceutical compositions capable of inducing an immune response from a subject to whom the compositions have been administered.

[0004]

2. Description of the Related Art

[0005]

Neoplasia and the Immune System

[0006]

The neoplastic disease state commonly known as cancer is thought to result generally from a single cell growing out of control. The uncontrolled growth state typically results from a multi-step process in which a series of cellular systems fail, resulting in the genesis of a neoplastic cell. The resulting neoplastic cell rapidly reproduces itself, forms one or more tumors, and eventually may cause the death of the host.

[0007]

Because the progenitor of the neoplastic cell shares the host's genetic material, neoplastic cells are largely unassailed by the host's immune system. During immune surveillance, the process in which the host's immune system surveys and localizes foreign materials, a neoplastic cell will appear to the host's immune surveillance machinery as a “self” cell.

[0008]

Viruses and the Immune System

[0009]

In contrast to cancer cells, virus infection involves the expression of clearly non-self antigens. As a result, many virus infections are successfully dealt with by the immune system with minimal clinical sequela. Moreover, it has been possible to develop effective vaccines for many of those infections that do cause serious disease. A variety of vaccine approaches have been used successfully to combat various diseases. These approaches include subunit vaccines consisting of individual proteins produced through recombinant DNA technology. Notwithstanding these advances, the selection and effective administration of minimal epitopes for use as viral vaccines has remained problematic.

[0010]

In addition to the difficulties involved in epitope selection stands the problem of viruses that have evolved the capability of evading a host's immune system. Many viruses, especially viruses that establish persistent infections, such as members of the herpes and pox virus families, produce immunomodulatory molecules that permit the virus to evade the host's immune system. The effects of these immunomodulatory molecules on antigen presentation may be overcome by the targeting of select epitopes for administration as immunogenic compositions. To better understand the interaction of neoplastic cells and virally infected cells with the host's immune system, a discussion of the system's components follows below.

[0011]

The immune system functions to discriminate molecules endogenous to an organism (“self” molecules) from material exogenous or foreign to the organism (“non-self” molecules). The immune system has two types of adaptive responses to foreign bodies based on the components that mediate the response: a humoral response and a cell-mediated response. The humoral response is mediated by antibodies, while the cell-mediated response involves cells classified as lymphocytes. Recent anticancer and antiviral strategies have focused on mobilizing the host immune system as a means of anticancer or antiviral treatment or therapy.

[0012]

The immune system functions in three phases to protect the host from foreign bodies: the cognitive phase, the activation phase, and the effector phase. In the cognitive phase, the immune system recognizes and signals the presence of a foreign antigen or invader in the body. The foreign antigen can be, for example, a cell surface marker from a neoplastic cell or a viral protein. Once the system is aware of an invading body, antigen specific cells of the immune system proliferate and differentiate in response to the invader-triggered signals. The last stage is the effector stage in which the effector cells of the immune system respond to and neutralize the detected invader.

[0013]

An array of effector cells implements an immune response to an invader. One type of effector cell, the B cell, generates antibodies targeted against foreign antigens encountered by the host. In combination with the complement system, antibodies direct the destruction of cells or organisms bearing the targeted antigen. Another type of effector cell is the natural killer cell (NK cell), a type of lymphocyte having the capacity to spontaneously recognize and destroy a variety of virus infected cells as well as malignant cell types. The method used by NK cells to recognize target cells is poorly understood.

[0014]

Another type of effector cell, the T cell, has members classified into three subcategories, each playing a different role in the immune response. Helper T cells secrete cytokines which stimulate the proliferation of other cells necessary for mounting an effective immune response, while suppressor T cells down-regulate the immune response. A third category of T cell, the cytotoxic T cell (CTL), is capable of directly lysing a targeted cell presenting a foreign antigen on its surface.

[0015]

The Major Histocompatibility Complex and T Cell Target Recognition

[0016]

T cells are antigen-specific immune cells that function in response to specific antigen signals. B lymphocytes and the antibodies they produce are also antigen-specific entities. However, unlike B lymphocytes, T cells do not respond to antigens in a free or soluble form. For a T cell to respond to an antigen, it requires the antigen to be processed to peptides which are then bound to a presenting structure encoded in the major histocompatibility complex (MHC). This requirement is called “MHC restriction” and it is the mechanism by which T cells differentiate “self” from “non-self” cells. If an antigen is not displayed by a recognizable MHC molecule, the T cell will not recognize and act on the antigen signal. T cells specific for a peptide bound to a recognizable MHC molecule bind to these MHC-peptide complexes and proceed to the next stages of the immune response.

[0017]

There are two types of MHC, class I MHC and class II MHC. T Helper cells (CD4+) predominately interact with class II MHC proteins while cytolytic T cells (CD8+) predominately interact with class I MHC proteins. Both classes of MHC protein are transmembrane proteins with a majority of their structure on the external surface of the cell. Additionally, both classes of MHC proteins have a peptide binding cleft on their external portions. It is in this cleft that small fragments of proteins, endogenous or foreign, are bound and presented to the extracellular environment.

[0018]

Cells called “professional antigen presenting cells” (pAPCs) display antigens to T cells using the MHC proteins but additionally express various co-stimulatory molecules depending on the particular state of differentiation/activation of the pAPC. When T cells, specific for the peptide bound to a recognizable MHC protein, bind to these MHC-peptide complexes on pAPCs, the specific co-stimulatory molecules that act upon the T cell direct the path of differentiation/activation taken by the T cell. That is, the co-stimulation molecules affect how the T cell will act on antigenic signals in future encounters as it proceeds to the next stages of the immune response.

[0019]

As discussed above, neoplastic cells are largely ignored by the immune system. A great deal of effort is now being expended in an attempt to harness a host's immune system to aid in combating the presence of neoplastic cells in a host. One such area of research involves the formulation of anticancer vaccines.

[0000]

Anticancer Vaccines

[0020]

Among the various weapons available to an oncologist in the battle against cancer is the immune system of the patient. Work has been done in various attempts to cause the immune system to combat cancer or neoplastic diseases. Unfortunately, the results to date have been largely disappointing. One area of particular interest involves the generation and use of anticancer vaccines.

[0021]

To generate a vaccine or other immunogenic composition, it is necessary to introduce to a subject an antigen or epitope against which an immune response may be mounted. Although neoplastic cells are derived from and therefore are substantially identical to normal cells on a genetic level, many neoplastic cells are known to present tumor-associated antigens (TuAAs). In theory, these antigens could be used by a subject's immune system to recognize these antigens and attack the neoplastic cells. In reality, however, neoplastic cells generally appear to be ignored by the host's immune system.

[0022]

A number of different strategies have been developed in an attempt to generate vaccines with activity against neoplastic cells. These strategies include the use of tumor-associated antigens as immunogens. For example, U.S. Pat. No. 5,993,828, describes a method for producing an immune response against a particular subunit of the Urinary Tumor Associated Antigen by administering to a subject an effective dose of a composition comprising inactivated tumor cells having the Urinary Tumor Associated Antigen on the cell surface and at least one tumor associated antigen selected from the group consisting of GM-2, GD-2, Fetal Antigen and Melanoma Associated Antigen. Accordingly, this patent describes using whole, inactivated tumor cells as the immunogen in an anticancer vaccine.

[0023]

Another strategy used with anticancer vaccines involves administering a composition containing isolated tumor antigens. In one approach, MAGE-A1 antigenic peptides were used as an immunogen. (See Chaux, P., et al., “Identification of Five MAGE-A1 Epitopes Recognized by Cytolytic T Lymphocytes Obtained by In Vitro Stimulation with Dendritic Cells Transduced with MAGE-A1,” J. Immunol., 163(5):2928-2936 (1999)). There have been several therapeutic trials using MAGE-A1 peptides for vaccination, although the effectiveness of the vaccination regimes was limited. The results of some of these trials are discussed in Vose, J. M., “Tumor Antigens Recognized by T Lymphocytes,” 10th European Cancer Conference, Day 2, Sep. 14, 1999.

[0024]

In another example of tumor associated antigens used as vaccines, Scheinberg, et al. treated 12 chronic myelogenous leukemia (CML) patients already receiving interferon (IFN) or hydroxyurea with 5 injections of class I-associated bcr-abl peptides with a helper peptide plus the adjuvant QS-21. Scheinberg, D. A., et al., “BCR-ABL Breakpoint Derived Oncogene Fusion Peptide Vaccines Generate Specific Immune Responses in Patients with Chronic Myelogenous Leukemia (CML) [Abstract 1665], American Society of Clinical Oncology 35th Annual Meeting, Atlanta (1999). Proliferative and delayed type hypersensitivity (DTH) T cell responses indicative of T-helper activity were elicited, but no cytolytic killer T cell activity was observed within the fresh blood samples.

[0025]

Additional examples of attempts to identify TuAAs for use as vaccines are seen in the recent work of Cebon, et al. and Scheibenbogen, et al. Cebon, et al. immunized patients with metastatic melanoma using intradermallly administered MART-126-35 peptide with IL-12 in increasing doses given either subcutaneously or intravenously. Of the first 15 patients, 1 complete remission, 1 partial remission, and 1 mixed response were noted. Immune assays for T cell generation included DTH, which was seen in patients with or without IL-12. Positive CTL assays were seen in patients with evidence of clinical benefit, but not in patients without tumor regression. Cebon, et al., “Phase I Studies of Immunization with Melan-A and IL-12 in HLA A2+ Positive Patients with Stage III and IV Malignant Melanoma,” [Abstract 1671], American Society of Clinical Oncology 35th Annual Meeting, Atlanta (1999).

[0026]

Scheibenbogen, et al. immunized 18 patients with 4 HLA class I restricted tyrosinase peptides, 16 with metastatic melanoma and 2 adjuvant patients. Scheibenbogen, et al., “Vaccination with Tyrosinase peptides and GM-CSF in Metastatic Melanoma: a Phase II Trial,” [Abstract 1680], American Society of Clinical Oncology 35th Annual Meeting, Atlanta (1999). Increased CTL activity was observed in 4/15 patients, 2 adjuvant patients, and 2 patients with evidence of tumor regression. As in the trial by Cebon, et al., patients with progressive disease did not show boosted immunity. In spite of the various efforts expended to date to generate efficacious anticancer vaccines, no such composition has yet been developed.

[0000]

Antiviral Vaccines

[0027]

Vaccine strategies to protect against viral diseases have had many successes. Perhaps the most notable of these is the progress that has been made against the disease small pox, which has been driven to extinction. The success of the polio vaccine is of a similar magnitude.

[0028]

Viral vaccines can be grouped into three classifications: live attenuated virus vaccines, such as vaccinia for small pox, the Sabin poliovirus vaccine, and measles mumps and rubella; whole killed or inactivated virus vaccines, such as the Salk poliovirus vaccine, hepatitis A virus vaccine and the typical influenza virus vaccines; and subunit vaccines, such as hepatitis B. Due to their lack of a complete viral genome, subunit vaccines offer a greater degree of safety than those based on whole viruses.

[0029]

The paradigm of a successful subunit vaccine is the recombinant hepatitis B vaccine based on the viruses envelope protein. Despite much academic interest in pushing the reductionist subunit concept beyond single proteins to individual epitopes, the efforts have yet to bear much fruit. Viral vaccine research has also concentrated on the induction of an antibody response although cellular responses also occur. However, many of the subunit formulations are particularly poor at generating a CTL response.

SUMMARY OF THE INVENTION

[0030]

Previous methods of priming professional antigen presenting cells (pAPCs) to display target cell epitopes have relied simply on causing the pAPCs to express target-associated antigens (TAAs), or epitopes of those antigens which are thought to have a high affinity for MHC I molecules. However, the proteasomal processing of such antigens results in presentation of epitopes on the pAPC that do not correspond to the epitopes present on the target cells.

[0031]

Using the knowledge that an effective cellular immune response requires that pAPCs present the same epitope that is presented by the target cells, the present invention provides epitopes that have a high affinity for MHC I, and that correspond to the processing specificity of the housekeeping proteasome, which is active in peripheral cells. These epitopes thus correspond to those presented on target cells. The use of such epitopes in vaccines can activate the cellular immune response to recognize the correctly processed TAA and can result in removal of target cells that present such epitopes. In some embodiments, the housekeeping epitopes provided herein can be used in combination with immune epitopes, generating a cellular immune response that is competent to attack target cells both before and after interferon induction. In other embodiments the epitopes are useful in the diagnosis and monitoring of the target-associated disease and in the generation of immunological reagents for such purposes.

[0032]

The invention disclosed herein relates to the identification of epitope cluster regions that are used to generate pharmaceutical compositions capable of inducing an immune response from a subject to whom the compositions have been administered. One embodiment of the disclosed invention relates to an epitope cluster, the cluster being derived from an antigen associated with a target, the cluster including or encoding at least two sequences having a known or predicted affinity for an MHC receptor peptide binding cleft, wherein the cluster is an incomplete fragment of the antigen.

[0033]

In one aspect of the invention, the target is a neoplastic cell.

[0034]

In another aspect of the invention, the MHC receptor may be a class I HLA receptor.

[0035]

In yet another aspect of the invention, the cluster includes or encodes a polypeptide having a length, wherein the length is at least 10 amino acids. Advantageously, the length of the polypeptide may be less than about 75 amino acids.

[0036]

In still another aspect of the invention, there is provided an antigen having a length, wherein the cluster consists of or encodes a polypeptide having a length, wherein the length of the polypeptide is less than about 80% of the length of the antigen. Preferably, the length of the polypeptide is less than about 50% of the length of the antigen. Most preferably, the length of the polypeptide is less than about 20% of the length of the antigen.

[0037]

Embodiments of the invention particularly relate to epitope clusters identified in the tumor-associated antigen PSMA (SEQ ID NO: 4). One embodiment of the invention relates to an isolated nucleic acid containing a reading frame with a first sequence encoding one or more segments of PSMA, wherein the whole antigen is not encoded, wherein each segment contains an epitope cluster, and wherein each cluster contains at least two amino acid sequences with a known or predicted affinity for a same MHC receptor peptide binding cleft. In various aspects of the invention the epitope cluster can be amino acids 3-12, 3-45, 13-45, 20-43, 217-227, 247-268, 278-297, 345-381, 385-405, 415-435, 440-450, 454-481, 547-562, 568-591, 603-614, 660-681, 663-676, 700-715, 726-749 or 731-749 of PSMA.

[0038]

In other aspects the segments can consist of an epitope cluster; the first sequence can be a fragment of PSMA; the fragment can consists of a polypeptide having a length, wherein the length of the polypeptide is less than about 90%, 80%, 60%, 50%, 25%, or 10% of the length of PSMA; the fragment can consist essentially of an amino acid sequence beginning at amino acid 3, 13, 20, 217, 247, 278, 345, 385, 415, 440, 454, 547, 568, 603, 660, 663, 700, 726, or 731 of PSMA and ending at amino acid 12, 43, 45, 227, 268, 297, 381, 405, 435, 450, 481, 562, 591, 614, 676, 681, 715, or 749 of PSMA; or the fragment consists of amino acids 3-45 or 217-297 of PSMA. In some embodiments, the encoded fragment consists essentially of amino acids 3-12, 3-43, 3-45, 3-227, 3-268, 3-297, 3-381, 3-405, 3-435, 3-450, 3-481, 3-562, 3-591, 3-614, 3-676, 3-681, 3-715, 3-749, 13-43, 13-45, 13-227, 13-268, 13-297, 13-381, 13-405, 13-435, 13-450, 13-481, 13-562, 13-591, 13-614, 13-676, 13-681, 13-715, 13-749, 20-43, 20-45, 20-227, 20-268, 20-297, 20-381, 20-405, 20-435, 20-450, 20-481, 20-562, 20-591, 20-614, 20-676, 20-681, 20-715, 20-749, 217-227, 217-268, 217-297, 217-381, 217-405, 217-435, 217-450, 217-481, 217-562, 217-591, 217-614, 217-676, 217-681, 217-715, 217-749, 247-268, 247-297, 247-381, 247-405, 247-435, 247-450, 247-481, 247-562, 247-591, 247-614, 247-676, 247-681, 247-715, 247-749, 278-297, 278-381, 278-405, 278-435, 278-450, 278-481, 278-562, 278-591, 278-614, 278-676, 278-681, 278-715, 278-749, 345-381, 345-405, 345-435, 345-450, 345-481, 345-562, 345-591, 345-614, 345-676, 345-681, 345-715, 345-749, 385-405, 385-435, 385-450, 385-481, 385-562, 385-591, 385-614, 385-676, 385-681, 385-715, 385-749, 415-435, 415-450, 415-481, 415-562, 415-591, 415-614, 415-676, 415-681, 415-715, 415-749, 440-450, 440-481, 440-562, 440-591, 440-614, 440-676, 440-681, 440-715, 440-749, 454-481, 454-562, 454-591, 454-614, 454-676, 454-681, 454-715, 454-749, 547-562, 547-591, 547-614, 547-676, 547-681, 547-715, 547-749, 568-591, 568-614, 568-676, 568-681, 568-715, 568-749, 603-614, 603-676, 603-681, 603-715, 603-749, 660-676, 660-681, 660-715, 660-749, 663-681, 663-715, 663-749, 700-715, 700-749, 726-749, or 731-749 of PSMA

[0039]

In other aspects, the segments can consist of an epitope cluster; the first sequence can be a fragment of SSX-2; the fragment can consists of a polypeptide having a length, wherein the length of the polypeptide is less than about 90%, 80%, 60%, 50%, 25%, or 10% of the length of SSX-2.

[0040]

Other embodiments of the invention include a second sequence encoding essentially a housekeeping epitope. In one aspect of this embodiment the first and second sequences constitute a single reading frame. In some aspects of the invention the reading frame is operably linked to a promoter. Other embodiments of the invention include the polypeptides encoded by the nucleic acid embodiments of the invention and immunogenic compositions containing the nucleic acids or polypeptides of the invention.

[0041]

Other embodiments of the invention relate to isolated epitopes, and antigens or polypeptides that comprise the epitopes. Preferred embodiments include an epitope or antigen having the sequence as disclosed in Table 1. Other embodiments can include an epitope cluster comprising a polypeptide from Table 1. Further, embodiments include a polypeptide having substantial similarity to the already mentioned epitopes, polypeptides, antigens, or clusters. Other preferred embodiments include a polypeptide having functional similarity to any of the above. Still further embodiments relate to a nucleic acid encoding the polypeptide of any of the epitopes, clusters, antigens, and polypeptides from Table 1 and mentioned herein. For purposes of the following summary, discussions of other embodiments of the invention, when making reference to “the epitope,” or “the epitopes” may refer without limitation to all of the foregoing forms of the epitope.

[0042]

The epitope can be immunologically active. The polypeptide comprising the epitope can be less than about 30 amino acids in length, more preferably, the polypeptide is 8 to 10 amino acids in length, for example. Substantial or functional similarity can include addition of at least one amino acid, for example, and the at least one additional amino acid can be at an N-terminus of the polypeptide. The substantial or functional similarity can include a substitution of at least one amino acid.

[0043]

The epitope, cluster, or polypeptide comprising the same can have affinity to an HLA-A2 molecule. The affinity can be determined by an assay of binding, by an assay of restriction of epitope recognition, by a prediction algorithm, and the like. The epitope, cluster, or polypeptide comprising the same can have affinity to an HLA-B7, HLA-B51 molecule, and the like.

[0044]

In preferred embodiments the polypeptide can be a housekeeping epitope. The epitope or polypeptide can correspond to an epitope displayed on a tumor cell, to an epitope displayed on a neovasculature cell, and the like. The epitope or polypeptide can be an immune epitope. The epitope, cluster and/or polypeptide can be a nucleic acid.

[0045]

Other embodiments relate to pharmaceutical compositions comprising the polypeptides, including an epitope from Table 1, a cluster, or a polypeptide comprising the same, and a pharmaceutically acceptable adjuvant, carrier, diluent, excipient, and the like. The adjuvant can be a polynucleotide. The polynucleotide can include a dinucleotide, which can be CpG, for example. The adjuvant can be encoded by a polynucleotide. The adjuvant can be a cytokine and the cytokine can be, for example, GM-CSF.

[0046]

The pharmaceutical compositions can further include a professional antigen-presenting cell (pAPC). The pAPC can be a dendritic cell, for example. The pharmaceutical composition can further include a second epitope. The second epitope can be a polypeptide, a nucleic acid, a housekeeping epitope, an immune epitope, and the like.

[0047]

Still further embodiments relate to pharmaceutical compositions that include any of the nucleic acids discussed herein, including those that encode polypeptides that comprise epitopes or antigens from Table 1. Such compositions can include a pharmaceutically acceptable adjuvant, carrier, diluent, excipient, and the like.

[0048]

Other embodiments relate to recombinant constructs that include such a nucleic acid as described herein, including those that encode polypeptides that comprise epitopes or antigens from Table 1. The constructs can further include a plasmid, a viral vector, an artificial chromosome, and the like. The construct can further include a sequence encoding at least one feature, such as for example, a second epitope, an IRES, an ISS, an NIS, a ubiquitin, and the like.

[0049]

Further embodiments relate to purified antibodies that specifically bind to at least one of the epitopes in Table 1. Other embodiments relate to purified antibodies that specifically bind to a peptide-MHC protein complex comprising an epitope disclosed in Table 1 or any other suitable epitope. The antibody from any embodiment can be a monoclonal antibody or a polyclonal antibody.

[0050]

Still other embodiments relate to multimeric MHC-peptide complexes that include an epitope, such as, for example, an epitope disclosed in Table 1. Also, contemplated are antibodies specific for the complexes.

[0051]

Embodiments relate to isolated T cells expressing a T cell receptor specific for an MHC-peptide complex. The complex can include an epitope, such as, for example, an epitope disclosed in Table 1. The T cell can be produced by an in vitro immunization and can be isolated from an immunized animal. Embodiments relate to T cell clones, including cloned T cells, such as those discussed above. Embodiments also relate to polyclonal population of T cells. Such populations can include a T cell, as described above, for example.

[0052]

Still further embodiments relate to pharmaceutical compositions that include a T cell, such as those described above, for example, and a pharmaceutically acceptable adjuvant, carrier, diluent, excipient, and the like.

[0053]

Embodiments of the invention relate to isolated protein molecules comprising the binding domain of a T cell receptor specific for an MHC-peptide complex. The complex can include an epitope as disclosed in Table 1. The protein can be multivalent. Other embodiments relate to isolated nucleic acids encoding such proteins. Still further embodiments relate to recombinant constructs that include such nucleic acids.

[0054]

Other embodiments of the invention relate to host cells expressing a recombinant construct as described herein, including constructs encoding an epitope, cluster or polypeptide comprising the same, disclosed in Table 1, for example. The host cell can be a dendritic cell, macrophage, tumor cell, tumor-derived cell, a bacterium, fungus, protozoan, and the like. Embodiments also relate to pharmaceutical compositions that include a host cell, such as those discussed herein, and a pharmaceutically acceptable adjuvant, carrier, diluent, excipient, and the like.

[0055]

Still other embodiments relate to vaccines or immunotherapeutic compositions that include at least one component, such as, for example, an epitope disclosed in Table 1 or otherwise described herein; a cluster that includes such an epitope, an antigen or polypeptide that includes such an epitope; a composition as described above and herein; a construct as described above and herein, a T cell, or a host cell as described above and herein.

[0056]

Further embodiments relate to methods of treating an animal. The methods can include administering to an animal a pharmaceutical composition, such as, a vaccine or immunotherapeutic composition, including those disclosed above and herein. The administering step can include a mode of delivery, such as, for example, transdermal, intranodal, perinodal, oral, intravenous, intradermal, intramuscular, intraperitoneal, mucosal, aerosol inhalation, instillation, and the like. The method can further include a step of assaying to determine a characteristic indicative of a state of a target cell or target cells. The method can include a first assaying step and a second assaying step, wherein the first assaying step precedes the administering step, and wherein the second assaying step follows the administering step. The method can further include a step of comparing the characteristic determined in the first assaying step with the characteristic determined in the second assaying step to obtain a result. The result can be for example, evidence of an immune response, a diminution in number of target cells, a loss of mass or size of a tumor comprising target cells, a decrease in number or concentration of an intracellular parasite infecting target cells, and the like.

[0057]

Embodiments relate to methods of evaluating immunogenicity of a vaccine or immunotherapeutic composition. The methods can include administering to an animal a vaccine or immunotherapeutic, such as those described above and elsewhere herein, and evaluating immunogenicity based on a characteristic of the animal. The animal can be HLA-transgenic.

[0058]

Other embodiments relate to methods of evaluating immunogenicity that include in vitro stimulation of a T cell with the vaccine or immunotherapeutic composition, such as those described above and elsewhere herein, and evaluating immunogenicity based on a characteristic of the T cell. The stimulation can be a primary stimulation.

[0059]

Still further embodiments relate to methods of making a passive/adoptive immunotherapeutic. The methods can include combining a T cell or a host cell, such as those described above and elsewhere herein, with a pharmaceutically acceptable adjuvant, carrier, diluent, excipient, and the like.

[0060]

Other embodiments relate to methods of determining specific T cell frequency, and can include the step of contacting T cells with a MHC-peptide complex comprising an epitope disclosed in Table 1, or a complex comprising a cluster or antigen comprising such an epitope. The contacting step can include at least one feature, such as, for example, immunization, restimulation, detection, enumeration, and the like. The method can further include ELISPOT analysis, limiting dilution analysis, flow cytometry, in situ hybridization, the polymerase chain reaction, any combination thereof, and the like.

[0061]

Embodiments relate to methods of evaluating immunologic response. The methods can include the above-described methods of determining specific T cell frequency carried out prior to and subsequent to an immunization step.

[0062]

Other embodiments relate to methods of evaluating immunologic response. The methods can include determining frequency, cytokine production, or cytolytic activity of T cells, prior to and subsequent to a step of stimulation with MHC-peptide complexes comprising an epitope, such as, for example an epitope from Table 1, a cluster or a polypeptide comprising such an epitope.

[0063]

Further embodiments relate to methods of diagnosing a disease. The methods can include contacting a subject tissue with at least one component, including, for example, a T cell, a host cell, an antibody, a protein, including those described above and elsewhere herein; and diagnosing the disease based on a characteristic of the tissue or of the component. The contacting step can take place in vivo or in vitro, for example.

[0064]

Still other embodiments relate to methods of making a vaccine. The methods can include combining at least one component, an epitope, a composition, a construct, a T cell, a host cell; including any of those described above and elsewhere herein, with a pharmaceutically acceptable adjuvant, carrier, diluent, excipient, and the like.

[0065]

Embodiments relate to computer readable media having recorded thereon the sequence of any one of SEQ ID NOS: 1-602, in a machine having a hardware or software that calculates the physical, biochemical, immunologic, molecular genetic properties of a molecule embodying said sequence, and the like.

[0066]

Still other embodiments relate to methods of treating an animal. The methods can include combining the method of treating an animal that includes administering to the animal a vaccine or immunotherapeutic composition, such as described above and elsewhere herein, combined with at least one mode of treatment, including, for example, radiation therapy, chemotherapy, biochemotherapy, surgery, and the like.

[0067]

Further embodiments relate to isolated polypeptides that include an epitope cluster. In preferred embodiments the cluster can be from a target-associated antigen having the sequence as disclosed in any one of Tables 25-44, wherein the amino acid sequence includes not more than about 80% of the amino acid sequence of the antigen.

[0068]

Other embodiments relate to vaccines or immunotherapeutic products that include an isolated peptide as described above and elsewhere herein. Still other embodiments relate to isolated polynucleotides encoding a polypeptide as described above and elsewhere herein. Other embodiments relate vaccines or immunotherapeutic products that include these polynucleotides. The polynucleotide can be DNA, RNA, and the like.

[0069]

Still further embodiments relate to kits comprising a delivery device and any of the embodiments mentioned above and elsewhere herein. The delivery device can be a catheter, a syringe, an internal or external pump, a reservoir, an inhaler, microinjector, a patch, and any other like device suitable for any route of delivery. As mentioned, the kit, in addition to the delivery device also includes any of the embodiments disclosed herein. For example, without limitations, the kit can include an isolated epitope, a polypeptide, a cluster, a nucleic acid, an antigen, a pharmaceutical composition that includes any of the foregoing, an antibody, a T cell, a T cell receptor, an epitope-MHC complex, a vaccine, an immunotherapeutic, and the like. The kit can also include items such as detailed instructions for use and any other like item.

BRIEF DESCRIPTION OF THE DRAWINGS

[0070]

FIG. 1 is a sequence alignment of NY-ESO-1 and several similar protein sequences.

[0071]

FIG. 2 graphically represents a plasmid vaccine backbone useful for delivering nucleic acid-encoded epitopes.

[0072]

FIGS. 3A and 3B are FACS profiles showing results of HLA-A2 binding assays for tyrosinase207-215 and tyrosinase208-216.

[0073]

FIG. 3C shows cytolytic activity against a tyrosinase epitope by human CTL induced by in vitro immunization.

[0074]

FIG. 4 is a T=120 min. time point mass spectrum of the fragments produced by proteasomal cleavage of SSX-231-68.

[0075]

FIG. 5 shows a binding curve for HLA-A2:SSX-241-49 with controls.

[0076]

FIG. 6 shows specific lysis of SSX-241-49-pulsed targets by CTL from SSX-241-49-immunized HLA-A2 transgenic mice.

[0077]

FIGS. 7A, B, and C show results of N-terminal pool sequencing of a T=60 min. time point aliquot of the PSMA163-192 proteasomal digest.

[0078]

FIG. 8 shows binding curves for HLA-A2:PSMA168-177 and HLA-A2:PSMA288-297 with controls.

[0079]

FIG. 9 shows results of N-terminal pool sequencing of a T=60 min. time point aliquot of the PSMA281-310 proteasomal digest.

[0080]

FIG. 10 shows binding curves for HLA-A2:PSMA461-469, HLA-A2:PSMA460-469, and HLA-A2:PSMA663-671, with controls.

[0081]

FIG. 11 shows the results of a γ-IFN-based ELISPOT assay detecting PSMA463-471-reactive HLA-A1+ CD8+ T cells.

[0082]

FIG. 12 shows blocking of reactivity of the T cells used in FIG. 10 by anti-HLA-A1 mAb, demonstrating HLA-A1-restricted recognition.

[0083]

FIG. 13 shows a binding curve for HLA-A2:PSMA663-671, with controls.

[0084]

FIG. 14 shows a binding curve for HLA-A2:PSMA662-671, with controls.

[0085]

FIG. 15. Comparison of anti-peptide CTL responses following immunization with various doses of DNA by different routes of injection.

[0086]

FIG. 16. Growth of transplanted gp33 expressing tumor in mice immunized by i.ln. injection of gp33 epitope-expressing, or control, plasmid.

[0087]

FIG. 17. Amount of plasmid DNA detected by real-time PCR in injected or draining lymph nodes at various times after i.ln. of i.m. injection, respectively.

[0088]

FIG. 18 depicts the sequence of Melan-A, showing clustering of class I HLA epitopes.

[0089]

FIG. 19 depicts the sequence of SSX-2, showing clustering of class I HLA epitopes.

[0090]

FIG. 20 depicts the sequence of NY-ESO, showing clustering of class I HLA epitopes.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

[0000]

Definitions

[0091]

Unless otherwise clear from the context of the use of a term herein, the following listed terms shall generally have the indicated meanings for purposes of this description.

[0092]

PROFESSIONAL ANTIGEN-PRESENTING CELL (pAPC)—a cell that possesses T cell costimulatory molecules and is able to induce a T cell response. Well characterized pAPCs include dendritic cells, B cells, and macrophages.

[0093]

PERIPHERAL CELL—a cell that is not a pAPC.

[0094]

HOUSEKEEPING PROTEASOME—a proteasome normally active in peripheral cells, and generally not present or not strongly active in pAPCs.

[0095]

IMMUNE PROTEASOME—a proteasome normally active in pAPCs; the immune proteasome is also active in some peripheral cells in infected tissues.

[0096]

EPITOPE—a molecule or substance capable of stimulating an immune response. In preferred embodiments, epitopes according to this definition include but are not necessarily limited to a polypeptide and a nucleic acid encoding a polypeptide, wherein the polypeptide is capable of stimulating an immune response. In other preferred embodiments, epitopes according to this definition include but are not necessarily limited to peptides presented on the surface of cells, the peptides being non-covalently bound to the binding cleft of class I MHC, such that they can interact with T cell receptors.

[0097]

MHC EPITOPE—a polypeptide having a known or predicted binding affinity for a mammalian class I or class II major histocompatibility complex (MHC) molecule.

[0098]

HOUSEKEEPING EPITOPE—In a preferred embodiment, a housekeeping epitope is defined as a polypeptide fragment that is an MHC epitope, and that is displayed on a cell in which housekeeping proteasomes are predominantly active. In another preferred embodiment, a housekeeping epitope is defined as a polypeptide containing a housekeeping epitope according to the foregoing definition, that is flanked by one to several additional amino acids. In another preferred embodiment, a housekeeping epitope is defined as a nucleic acid that encodes a housekeeping epitope according to the foregoing definitions.

[0099]

IMMUNE EPITOPE—In a preferred embodiment, an immune epitope is defined as a polypeptide fragment that is an MHC epitope, and that is displayed on a cell in which immune proteasomes are predominantly active. In another preferred embodiment, an immune epitope is defined as a polypeptide containing an immune epitope according to the foregoing definition, that is flanked by one to several additional amino acids. In another preferred embodiment, an immune epitope is defined as a polypeptide including an epitope cluster sequence, having at least two polypeptide sequences having a known or predicted affinity for a class I MHC. In yet another preferred embodiment, an immune epitope is defined as a nucleic acid that encodes an immune epitope according to any of the foregoing definitions.

[0100]

TARGET CELL—a cell to be targeted by the vaccines and methods of the invention. Examples of target cells according to this definition include but are not necessarily limited to: a neoplastic cell and a cell harboring an intracellular parasite, such as, for example, a virus, a bacterium, or a protozoan.

[0101]

TARGET-ASSOCIATED ANTIGEN (TAA)—a protein or polypeptide present in a target cell.

[0102]

TUMOR-ASSOCIATED ANTIGENS (TuAA)—a TAA, wherein the target cell is a neoplastic cell.

[0103]

HLA EPITOPE—a polypeptide having a known or predicted binding affinity for a human class I or class II HLA complex molecule.

[0104]

ANTIBODY—a natural immunoglobulin (Ig), poly- or monoclonal, or any molecule composed in whole or in part of an Ig binding domain, whether derived biochemically or by use of recombinant DNA. Examples include inter alia, F(ab), single chain Fv, and Ig variable region-phage coat protein fusions.

[0105]

ENCODE—an open-ended term such that a nucleic acid encoding a particular amino acid sequence can consist of codons specifying that (poly)peptide, but can also comprise additional sequences either translatable, or for the control of transcription, translation, or replication, or to facilitate manipulation of some host nucleic acid construct.

[0106]

SUBSTANTIAL SIMILARITY—this term is used to refer to sequences that differ from a reference sequence in an inconsequential way as judged by examination of the sequence. Nucleic acid sequences encoding the same amino acid sequence are substantially similar despite differences in degenerate positions or modest differences in length or composition of any non-coding regions. Amino acid sequences differing only by conservative substitution or minor length variations are substantially similar. Additionally, amino acid sequences comprising housekeeping epitopes that differ in the number of N-terminal flanking residues, or immune epitopes and epitope clusters that differ in the number of flanking residues at either terminus, are substantially similar. Nucleic acids that encode substantially similar amino acid sequences are themselves also substantially similar.

[0107]

FUNCTIONAL SIMILARITY—this term is used to refer to sequences that differ from a reference sequence in an inconsequential way as judged by examination of a biological or biochemical property, although the sequences may not be substantially similar. For example, two nucleic acids can be useful as hybridization probes for the same sequence but encode differing amino acid sequences. Two peptides that induce cross-reactive CTL responses are functionally similar even if they differ by non-conservative amino acid substitutions (and thus do not meet the substantial similarity definition). Pairs of antibodies, or TCRs, that recognize the same epitope can be functionally similar to each other despite whatever structural differences exist. In testing for functional similarity of immunogenicity one would generally immunize with the “altered” antigen and test the ability of the elicited response (Ab, CTL, cytokine production, etc.) to recognize the target antigen. Accordingly, two sequences may be designed to differ in certain respects while retaining the same function. Such designed sequence variants are among the embodiments of the present invention.

[0108]

Epitope Clusters

[0109]

Embodiments of the invention disclosed herein provide epitope cluster regions (ECRs) for use in vaccines and in vaccine design and epitope discovery. Specifically, embodiments of the invention relate to identifying epitope clusters for use in generating immunologically active compositions directed against target cell populations, and for use in the discovery of discrete housekeeping epitopes and immune epitopes. In many cases, numerous putative class I MHC epitopes may exist in a single target-associated antigen (TAA). Such putative epitopes are often found in clusters (ECRs), MHC epitopes distributed at a relatively high density within certain regions in the amino acid sequence of the parent TAA. Since these ECRs include multiple putative epitopes with potential useful biological activity in inducing an immune response, they represent an excellent material for in vitro or in vivo analysis to identify particularly useful epitopes for vaccine design. And, since the epitope clusters can themselves be processed inside a cell to produce active MHC epitopes, the clusters can be used directly in vaccines, with one or more putative epitopes in the cluster actually being processed into an active MHC epitope.

[0110]

The use of ECRs in vaccines offers important technological advances in the manufacture of recombinant vaccines, and further offers crucial advantages in safety over existing nucleic acid vaccines that encode whole protein sequences. Recombinant vaccines generally rely on expensive and technically challenging production of whole proteins in microbial fermentors. ECRs offer the option of using chemically synthesized polypeptides, greatly simplifying development and manufacture, and obviating a variety of safety concerns. Similarly, the ability to use nucleic acid sequences encoding ECRs, which are typically relatively short regions of an entire sequence, allows the use of synthetic oligonucleotide chemistry processes in the development and manipulation of nucleic acid based vaccines, rather than the more expensive, time consuming, and potentially difficult molecular biology procedures involved with using whole gene sequences.

[0111]

Since an ECR is encoded by a nucleic acid sequence that is relatively short compared to that which encodes the whole protein from which the ECR is found, this can greatly improve the safety of nucleic acid vaccines. An important issue in the field of nucleic acid vaccines is the fact that the extent of sequence homology of the vaccine with sequences in the animal to which it is administered determines the probability of integration of the vaccine sequence into the genome of the animal. A fundamental safety concern of nucleic acid vaccines is their potential to integrate into genomic sequences, which can cause deregulation of gene expression and tumor transformation. The Food and Drug Administration has advised that nucleic acid and recombinant vaccines should contain as little sequence homology with human sequences as possible. In the case of vaccines delivering tumor-associated antigens, it is inevitable that the vaccines contain nucleic acid sequences that are homologous to those which encode proteins that are expressed in the tumor cells of patients. It is, however, highly desirable to limit the extent of those sequences to that which is minimally essential to facilitate the expression of epitopes for inducing therapeutic immune responses. The use of ECRs thus offers the dual benefit of providing a minimal region of homology, while incorporating multiple epitopes that have potential therapeutic value.

[0000]

ECRs are Processed into MHC-Binding Epitopes in pAPCs

[0112]

The immune system constantly surveys the body for the presence of foreign antigens, in part through the activity of pAPCs. The pAPCs endocytose matter found in the extracellular milieu, process that matter from a polypeptide form into shorter oligopeptides of about 3 to 23 amino acids in length, and display some of the resulting peptides to T cells via the MHC complex of the pAPCs. For example, a tumor cell upon lysis releases its cellular contents, including various proteins, into the extracellular milieu. Those released proteins can be endocytosed by pAPCs and processed into discrete peptides that are then displayed on the surface of the pAPCs via the MHC. By this mechanism, it is not the entire target protein that is presented on the surface of the pAPCs, but rather only one or more discrete fragments of that protein that are presented as MHC-binding epitopes. If a presented epitope is recognized by a T cell, that T cell is activated and an immune response results.

[0113]

Similarly, the scavenger receptors on pAPC can take-up naked nucleic acid sequences or recombinant organisms containing target nucleic acid sequences. Uptake of the nucleic acid sequences into the pAPC subsequently results in the expression of the encoded products. As above, when an ECR can be processed into one or more useful epitopes, these products can be presented as MHC epitopes for recognition by T cells.

[0114]

MHC-binding epitopes are often distributed unevenly throughout a protein sequence in clusters. Embodiments of the invention are directed to identifying epitope cluster regions (ECRs) in a particular region of a target protein. Candidate ECRs are likely to be natural substrates for various proteolytic enzymes and are likely to be processed into one or more epitopes for MHC display on the surface of an pAPC. In contrast to more traditional vaccines that deliver whole proteins or biological agents, ECRs can be administered as vaccines, resulting in a high probability that at least one epitope will be presented on MHC without requiring the use of a full length sequence.

[0000]

The Use of ECRs in Identifying Discrete MHC-Binding Epitopes

[0115]

Identifying putative MHC epitopes for use in vaccines often includes the use of available predictive algorithms that analyze the sequences of proteins or genes to predict binding affinity of peptide fragments for MHC. These algorithms rank putative epitopes according to predicted affinity or other characteristics associated with MHC binding. Exemplary algorithms for this kind of analysis include the Rammensee and NIH (Parker) algorithms. However, identifying epitopes that are naturally present on the surface of cells from among putative epitopes predicted using these algorithms has proven to be a difficult and laborious process. The use of ECRs in an epitope identification process can enormously simplify the task of identifying discrete MHC binding epitopes.

[0116]

In a preferred embodiment, ECR polypeptides are synthesized on an automated peptide synthesizer and these ECRs are then subjected to in vitro digests using proteolytic enzymes involved in processing proteins for presentation of the epitopes. Mass spectrometry and/or analytical HPLC are then used to identify the digest products and in vitro MHC binding studies are used to assess the ability of these products to actually bind to MHC. Once epitopes contained in ECRs have been shown to bind MHC, they can be incorporated into vaccines or used as diagnostics, either as discrete epitopes or in the context of ECRs.

[0117]

The use of an ECR (which because of its relatively short sequence can be produced through chemical synthesis) in this preferred embodiment is a significant improvement over what otherwise would require the use of whole protein. This is because whole proteins have to be produced using recombinant expression vector systems and/or complex purification procedures. The simplicity of using chemically synthesized ECRs enables the analysis and identification of large numbers of epitopes, while greatly reducing the time and expense of the process as compared to other currently used methods. The use of a defined ECR also greatly simplifies mass spectrum analysis of the digest, since the products of an ECR digest are a small fraction of the digest products of a whole protein.

[0118]

In another embodiment, nucleic acid sequences encoding ECRs are used to express the polypeptides in cells or cell lines to assess which epitopes are presented on the surface. A variety of means can be used to detect the epitope on the surface. Preferred embodiments involve the lysis of the cells and affinity purification of the MHC, and subsequent elution and analysis of peptides from the MHC; or elution of epitopes from intact cells; (Falk, K. et al. Nature 351:290, 1991, and U.S. Pat. No. 5,989,565, respectively, both of which references are incorporated herein by reference in their entirety). A sensitive method for analyzing peptides eluted in this way from the MHC employs capillary or nanocapillary HPLC ESI mass spectrometry and on-line sequencing.

[0000]

Target-Associated Antigens that Contain ECRs

[0119]

TAAs from which ECRs may be defined include those from TuAAs, including oncofetal, cancer-testis, deregulated genes, fusion genes from errant translocations, differentiation antigens, embryonic antigens, cell cycle proteins, mutated tumor suppressor genes, and overexpressed gene products, including oncogenes. In addition, ECRs may be derived from virus gene products, particularly those associated with viruses that cause chronic diseases or are oncogenic, such as the herpes viruses, human papilloma viruses, human immunodeficiency virus, and human T cell leukemia virus. Also ECRs may be derived from gene products of parasitic organisms, such as Trypanosoma, Leishmania, and other intracellular or parasitic organisms.

[0120]

Some of these TuAA include α-fetoprotein, carcinoembryonic antigen (CEA), esophageal cancer derived NY-ESO-1, and SSX genes, SCP-1, PRAME, MART-1/MelanA (MART-1), gp100 (Pmel 17), tyrosinase, TRP-1, TRP-2, MAGE-1, MAGE-2, MAGE-3, BAGE, GAGE-1, GAGE-2, p15; overexpressed oncogenes and mutated tumor-suppressor genes such as p53, Ras, HER-2/neu; unique tumor antigens resulting from chromosomal translocations such as BCR-ABL, E2A-PRL, H4-RET, IGH-IGK, MYL-RAR1 and viral antigens, EBNAI, EBNA2, HPV-E6, -E7; prostate specific antigen (PSA), prostate stem cell antigen (PSCA), MAAT-1, GP-100, TSP-180, MAGE-4, MAGE-5, MAGE-6, RAGE, p185erbB-2, p185erbB-3, c-met, nm-23H1, TAG-72, CA 19-9, CA 72-4, CAM 17.1, NuMa, K-ras, β-Catenin, CDK4, Mum-1, p15, and p16.

[0121]

Numerous other TAAs are also contemplated for both pathogens and tumors. In terms of TuAAs, a variety of methods are available and well known in the art to identify genes and gene products that are differentially expressed in neoplastic cells as compared to normal cells. Examples of these techniques include differential hybridization, including the use of microarrays; subtractive hybridization cloning; differential display, either at the level of mRNA or protein expression; EST sequencing; and SAGE (sequential analysis of gene expression). These nucleic acid techniques have been reviewed by Carulli, J. P. et al., J. Cellular Biochem Suppl. 30/31:286-296, 1998 (hereby incorporated by reference). Differential display of proteins involves, for example, comparison of two-dimensional poly-acrylamide gel electrophoresis of cell lysates from tumor and normal tissue, location of protein spots unique or overexpressed in the tumor, recovery of the protein from the gel, and identification of the protein using traditional biochemical- or mass spectrometry-based sequencing. An additional technique for identification of TAAs is the Serex technique, discussed in Türeci, Ö, Sahin, U., and Pfreundschuh, M., “Serological analysis of human tumor antigens: molecular definition and implications”, Molecular Medicine Today, 3:342, 1997, and hereby incorporated by reference.

[0122]

Use of these and other methods provides one of skill in the art the techniques necessary to identify genes and gene products contained within a target cell that may be used as potential candidate proteins for generating the epitopes of the invention disclosed. However, it is not necessary, in practicing the invention, to identify a novel TuAA or TAA. Rather, embodiments of the invention make it possible to identify ECRs from any relevant protein sequence, whether the sequence is already known or is new.

[0000]

Protein Sequence Analysis to Identify Epitope Clusters

[0123]

In preferred embodiments of the invention, identification of ECRs involves two main steps: (1) identifying good putative epitopes; and (2) defining the limits of any clusters in which these putative epitopes are located. There are various preferred embodiments of each of these two steps, and a selected embodiment for the first step can be freely combined with a selected embodiment for the second step. The methods and embodiments that are disclosed herein for each of these steps are merely exemplary, and are not intended to limit the scope of the invention in any way. Persons of skill in the art will appreciate the specific tools that can be applied to the analysis of a specific TAA, and such analysis can be conducted in numerous ways in accordance with the invention.

[0124]

Preferred embodiments for identifying good putative epitopes include the use of any available predictive algorithm that analyzes the sequences of proteins or genes to predict binding affinity of peptide fragments for MHC, or to rank putative epitopes according to predicted affinity or other characteristics associated with MHC binding. As described above, available exemplary algorithms for this kind of analysis include the Rammensee and NIH (Parker) algorithms. Likewise, good putative epitopes can be identified by direct or indirect assays of MHC binding. To choose “good” putative epitopes, it is necessary to set a cutoff point in terms of the score reported by the prediction software or in terms of the assayed binding affinity. In some embodiments, such a cutoff is absolute. For example, the cutoff can be based on the measured or predicted half time of dissociation between an epitope and a selected MHC allele. In such cases, embodiments of the cutoff can be any half time of dissociation longer than, for example, 0.5 minutes; in a preferred embodiment longer than 2.5 minutes; in a more preferred embodiment longer than 5 minutes; and in a highly stringent embodiment can be longer than 10, or 20, or 25 minutes. In these embodiments, the good putative epitopes are those that are predicted or identified to have good MHC binding characteristics, defined as being on the desirable side of the designated cutoff point. Likewise, the cutoff can be based on the measured or predicted.binding affinity between an epitope and a selected MHC allele. Additionally, the absolute cutoff can be simply a selected number of putative epitopes.

[0125]

In other embodiments, the cutoff is relative. For example, a selected percentage of the total number of putative epitopes can be used to establish the cutoff for defining a candidate sequence as a good putative epitope. Again the properties for ranking the epitopes are derived from measured or predicted MHC binding; the property used for such a determination can be any that is relevant to or indicative of binding. In preferred embodiments, identification of good putative epitopes can combine multiple methods of ranking candidate sequences. In such embodiments, the good epitopes are typically those that either represent a consensus of the good epitopes based on different methods and parameters, or that are particularly highly ranked by at least one of the methods.

[0126]

When several good putative epitopes have been identified, their positions relative to each other can be analyzed to determine the optimal clusters for use in vaccines or in vaccine design. This analysis is based on the density of a selected epitope characteristic within the sequence of the TAA. The regions with the highest density of the characteristic, or with a density above a certain selected cutoff, are designated as ECRs. Various embodiments of the invention employ different characteristics for the density analysis. For example, one preferred characteristic is simply the presence of any good putative epitope (as defined by any appropriate method). In this embodiment, all putative epitopes above the cutoff are treated equally in the density analysis, and the best clusters are those with the highest density of good putative epitopes per amino acid residue. In another embodiment, the preferred characteristic is based on the parameter(s) previously used to score or rank the putative epitopes. In this embodiment, a putative epitope with a score that is twice as high as another putative epitope is doublv weighted in the density analysis, relative to the other putative epitope. Still other embodiments take the score or rank into account, but on a diminished scale, such as, for example, by using the log or the square root of the score to give more weight to some putative epitopes than to others in the density analysis.

[0127]

Depending on the length of the TAA to be analyzed, the number of possible candidate epitopes, the number of good putative epitopes, the variability of the scoring of the good putative epitopes, and other factors that become evident in any given analysis, the various embodiments of the invention can be used alone or in combination to identify those ECRs that are most useful for a given application. Iterative or parallel analyses employing multiple approaches can be beneficial in many cases. ECRs are tools for increased efficiency of identifying true MHC epitopes, and for efficient “packaging” of MHC epitopes into vaccines. Accordingly, any of the embodiments described herein, or other embodiments that are evident to those of skill in the art based on this disclosure, are useful in enhancing the efficiency of these efforts by using ECRs instead of using complete TAAs in vaccines and vaccine design.

[0128]

Since many or most TAAs have regions with low density of predicted MHC epitopes, using ECRs provides a valuable methodology that avoids the inefficiencies of including regions of low epitope density in vaccines and in epitope identification protocols. Thus, useful ECRs can also be defined as any portion of a TAA that is not the whole TAA, wherein the portion has a higher density of putative epitopes than the whole TAA, or than any regions of the TAA that have a particularly low density of putative epitopes. In this aspect of the invention, therefore, an ECR can be any fragment of a TAA with elevated epitope density. In some embodiments, an ECR can include a region up to about 80% of the length of the TAA. In a preferred embodiment, an ECR can include a region up to about 50% of the length of the TAA. In a more preferred embodiment, an ECR can include a region up to about 30% of the length of the TAA. And in a most preferred embodiment, an ECR can include a region of between 5 and 15% of the length of the TAA.

[0129]

In another aspect of the invention, the ECR can be defined in terms of its absolute length. Accordingly, by this definition, the minimal cluster for 9-mer epitopes includes 10 amino acid residues and has two overlapping 9-mers with 8 amino acids in common. In a preferred embodiment, the cluster is between about 15 and 75 amino acids in length. In a more preferred embodiment, the cluster is between about 20 and 60 amino acids in length. In a most preferred embodiment, the cluster is between about 30 and 40 amino acids in length.

[0130]

In practice, as described above, ECR identification can employ a simple density function such as the number of epitopes divided by the number of amino acids spanned by the those epitopes. It is not necessarily required that the epitopes overlap, but the value for a single epitope is not significant. If only a single value for a percentage cutoff is used and an absolute cutoff in the epitope prediction is not used, it is possible to set a single threshold at this step to define a cluster. However, using both an absolute cutoff and carrying out the first step using different percentage cutoffs, can produce variations in the global density of candidate epitopes. Such variations can require further accounting or manipulation. For example, an overlap of 2 epitopes is more significant if only 3 candidate epitopes were considered, than if 30 candidates were considered for any particular length protein. To take this feature into consideration, the weight given to a particular cluster can further be divided by the fraction of possible peptides actually being considered, in order to increase the significance of the calculation. This scales the result to the average density of predicted epitopes in the parent protein.

[0131]

Similarly, some embodiments base the scoring of good putative epitopes on the average number of peptides considered per amino acid in the protein. The resulting ratio represents the factor by which the density of predicted epitopes in the putative cluster differs from the average density in the protein. Accordingly, an ECR is defined in one embodiment as any region containing two or more predicted epitopes for which this ratio exceeds 2, that is, any region with twice the average density of epitopes. In other embodiments, the region is defined as an ECR if the ratio exceeds 1.5, 3, 4, or 5, or more.

[0132]

Considering the average number of peptides per amino acid in a target protein to calculate the presence of an ECR highlights densely populated ECRs without regard to the score/affinity of the individual constituents. This is most appropriate for use of score-based cutoffs. However, an ECR with only a small number of highly ranked candidates can be of more biological significance than a cluster with several densely packed but lower ranking candidates, particularly if only a small percentage of the total number of candidate peptides were designated as good putative epitopes. Thus in some embodiments it is appropriate to take into consideration the scores of the individual peptides. This is most readily accomplished by substituting the sum of the scores of the peptides in the putative cluster for the number of peptides in the putative cluster in the calculation described above.

[0133]

This sum of scores method is more sensitive to sparsely populated clusters containing high scoring epitopes. Because the wide range of scores (i.e. half times of dissociation) produced by the BIMAS-NIH/Parker algorithm can lead to a single high scoring peptide dwarfing the contribution of other potential epitopes, the log of the score rather than the score itself is preferably used in this procedure.

[0134]

Various other calculations can be devised under one or another condition. Generally speaking, the epitope density function is constructed so that it is proportional to the number of predicted epitopes, their scores, their ranks, and the like, within the putative cluster, and inversely proportional to the number of amino acids or fraction of protein contained within that putative cluster. Alternatively, the function can be evaluated for a window of a selected number of contiguous amino acids. In either case the function is also evaluated for all predicted epitopes in the whole protein. If the ratio of values for the putative cluster (or window) and the whole protein is greater than, for example, 1.5, 2, 3, 4, 5, or more, an ECR is defined.

[0000]

Analysis of Target Gene Products for MHC Binding

[0135]

Once a TAA has been identified, the protein sequence can be used to identify putative epitopes with known or predicted affinity to the MHC peptide binding cleft. Tests of peptide fragments can be conducted in vitro, or using the sequence can be computer analyzed to determine MHC receptor binding of the peptide fragments. In one embodiment of the invention, peptide fragments based on the amino acid sequence of the target protein are analyzed for their predicted ability to bind to the MHC peptide binding cleft. Examples of suitable computer algorithms for this purpose include that found at the world wide web page of Hans-Georg Rammensee, Jutta Bachmann, Niels Emmerich, Stefan Stevanovic: SYFPEITHI: An Internet Database for MHC Ligands and Peptide Motifs (access via: http://134.2.96.221/scripts/hlaserver.dll/EpPredict.htm). Results obtained from this method are discussed in Rammensee, et al., “MHC Ligands and Peptide Motifs,” Landes Bioscience Austin, TX, 224-227, 1997, which is hereby incorporated by reference in its entirety. Another site of interest is http://bimas.dcrt.nih.gov/molbio/hla_bind, which also contains a suitable algorithm. The methods of this web site are discussed in Parker, et al., “Scheme for ranking potential HLA-A2 binding peptides based on independent binding of individual peptide side-chains,” J. Immunol. 152:163-175, which is hereby incorporated by reference in its entirety.

[0136]

As an alternative to predictive algorithms, a number of standard in vitro receptor binding affinity assays are available to identify peptides having an affinity for a particular allele of MHC. Accordingly, by the method of this aspect of the invention, the initial population of peptide fragments can be narrowed to include only putative epitopes having an actual or predicted affinity for the selected allele of MHC. Selected common alleles of MHC I, and their approximate frequencies, are reported in the tables below.

Estimated gene frequencies of HLA-A antigens
CAUAFRASILATNAT
AntigenGfaSEbGfSEGfSEGfSEGfSE
A115.18430.04895.72560.07714.48180.08467.40070.097812.03160.2533
A228.65350.061918.88490.131724.63520.179428.11980.170029.34080.3585
A313.38900.04638.44060.09252.64540.06558.07890.101911.02930.2437
A284.46520.02809.92690.09971.76570.05378.94460.10675.38560.1750
A360.02210.00201.88360.04480.01480.00490.15840.01480.15450.0303
A231.82870.018110.20860.10100.32560.02312.92690.06281.99030.1080
A249.32510.03952.96680.056022.03910.172213.26100.127112.66130.2590
A9 unsplit0.08090.00380.03670.00630.08580.01190.05370.00860.03560.0145
A9 total11.23470.042913.21210.112822.45050.173316.24160.138214.68720.2756
A252.11570.01950.43290.02160.09900.01281.19370.04041.45200.0924
A263.87950.02622.82840.05474.66280.08623.26120.06622.42920.1191
A340.15080.00523.52280.06101.35290.04700.49280.02600.31500.0432
A430.00180.00060.03340.00600.02310.00620.00550.00280.00590.0059
A660.01730.00180.22330.01550.04780.00890.03990.00740.05340.0178
A10 unsplit0.07900.00380.09390.01010.12550.01440.06470.00940.02980.0133
A10 total6.24410.03287.13480.08506.31110.09935.05780.08164.28530.1565
A293.57960.02523.20710.05821.12330.04294.51560.07743.43450.1410
A302.50670.021213.09690.11292.20250.05984.48730.07722.53140.1215
A312.73860.02211.65560.04203.60050.07614.83280.08006.08810.1855
A323.69560.02561.53840.04051.03310.04112.70640.06042.55210.1220
A331.20800.01486.56070.08229.27010.11912.65930.05991.07540.0796
A740.02770.00221.99490.04610.05610.00960.20270.01670.10680.0252
A19 unsplit0.05670.00320.20570.01490.09900.01280.12110.01290.04750.0168
A19 total13.81290.046828.25930.150417.38460.155519.52520.148115.83580.2832
AX0.82040.02974.95060.09632.99160.11771.63320.08781.84540.1925

aGene frequency.

bStandard error.

[0137]

Estimated gene frequencies for HLA-B antigens
CAUAFRASILATNAT
AntigenGfaSEbGfSEGfSEGfSEGfSE
B712.17820.044510.59600.10244.26910.08276.44770.091810.98450.2432
B89.40770.03973.83150.06341.33220.04673.82250.0715 8.57890.2176
B132.30610.02030.81030.02954.92220.08861.26990.0416 1.74950.1013
B144.34810.02773.03310.05660.50040.02875.41660.0846 2.98230.1316
B184.79800.02903.20570.05821.12460.04294.23490.0752 3.34220.1391
B274.38310.02781.29180.03722.23550.06032.37240.0567 5.19700.1721
B359.66140.04028.51720.09278.12030.112214.65160.132910.11980.2345
B371.40320.01590.59160.02521.23270.04490.78070.0327 0.97550.0759
B410.92110.01290.81830.02960.13030.01471.28180.0418 0.47660.0531
B420.06080.00335.69910.07680.08410.01180.58660.0284 0.28560.0411
B460.00990.00130.01510.00404.92920.08860.02340.0057 0.02380.0119
B470.20690.00610.13050.01190.09560.01260.18320.0159 0.21390.0356
B480.08650.00400.13160.01192.02760.05751.59150.0466 1.02670.0778
B530.46200.009210.95290.10390.43150.02661.69820.0481 1.08040.0798
B590.00200.00060.00320.00190.42770.02650.00550.0028 0c
B670.00400.00090.00860.00300.22760.01940.00550.0028 0.00590.0059
B700.32700.00777.35710.08660.89010.03821.92660.0512 0.69010.0639
B730.01080.00140.00320.00190.01320.00470.02610.0060 0c
B515.42150.03072.59800.05257.47510.10806.81470.0943 6.90770.1968
B520.96580.01321.37120.03833.51210.07522.24470.0552 0.69600.0641
B5 unsplit0.15650.00530.15220.01280.12880.01460.15460.0146 0.13070.0278
B5 total6.54380.04354.12140.074711.11600.15049.21410.1324 7.73440.2784
B4413.48380.04657.01370.08475.68070.09489.92530.112111.80240.2511
B450.57710.01024.80690.07080.18160.01731.88120.0506 0.76030.0670
B12 unsplit0.07880.00380.02800.00550.00490.00290.01930.0051 0.06540.0197
B12 total14.14400.047411.84860.10725.86730.096311.82580.121012.62810.2584
B625.91170.03201.52670.04049.22490.11904.18250.0747 6.94210.1973
B630.43020.00881.88650.04480.44380.02700.80830.0333 0.37380.0471
B750.01040.00140.02260.00491.96730.05660.11010.0123 0.035600.0145
B760.00260.00070.00650.00260.08740.01200.00550.0028 0c
B770.00570.00100.01190.00360.05770.00980.00830.0034 0.00590.0059
B15 unsplit0.13050.00490.06910.00860.43010.02660.18200.0158 0.07150.0206
B15 total6.49100.03343.52320.060812.21120.13445.29670.0835 7.42900.2035
B382.44130.02090.33230.01893.28180.07281.96520.0517 1.10170.0806
B391.96140.01881.28930.03712.03520.05766.30400.0909 4.55270.1615
B16 unsplit0.06380.00340.02370.00510.06440.01030.12260.0130 0.05930.0188
B16 total4.46670.02801.64530.04195.38140.09218.39170.1036 5.71370.1797
B573.59550.02525.67460.07662.57820.06472.18000.0544 2.72650.1260
B580.71520.01145.95460.07844.01890.08031.24810.0413 0.93980.0745
B17 unsplit0.28450.00720.32480.01870.37510.02480.14460.0141 0.26740.0398
B17 total4.59520.028411.95400.10766.97220.10413.57270.0691 3.93380.1503
B491.64520.01722.62860.05280.24400.02002.33530.0562 1.54620.0953
B501.05800.01380.86360.03040.44210.02701.88830.0507 0.78620.0681
B21 unsplit0.07020.00360.02700.00540.01320.00470.07710.0103 0.03560.0145
B21 total2.77330.02223.51920.06080.69930.03394.30070.0755 2.36800.1174
B540.01240.00150.01830.00442.68730.06600.02890.0063 0.05340.0178
B551.90460.01850.48950.02292.24440.06040.95150.0361 1.40540.0909
B560.55270.01000.26860.01700.82600.03680.35960.0222 0.33870.0448
B22 unsplit0.16820.00550.04960.00730.27300.02120.03720.0071 0.12460.0272
B22 total2.08520.02170.82610.02976.03070.09711.37710.0433 1.92210.1060
B605.22220.03021.52990.04048.32540.11352.25380.0553 5.72180.1801
B611.19160.01470.47090.02256.20720.09894.66910.0788 2.60230.1231
B40 unsplit0.26960.00700.03880.00650.32050.02300.24730.0184 0.22710.0367
B40 total6.68340.03382.03960.046514.85310.14627.17020.0963 8.55120.2168
BX1.09220.02523.52580.08023.87490.09882.52660.0807 1.98670.1634

aGene frequency.

bStandard error.

cThe observed gene count was zero.

[0138]

Estimated gene frequencies of HLA-DR antigens
CAUAFRASILATNAT
AntigenGfaSEbGfSEGfSEGfSEGfSE
DR110.22790.04136.82000.08323.46280.07477.98590.10138.25120.2139
DR215.24080.049116.23730.122218.61620.160811.23890.118215.39320.2818
DR310.87080.042413.30800.11244.72230.08677.89980.100810.25490.2361
DR416.75890.05115.70840.076515.46230.149020.53730.152019.82640.3123
DR614.39370.047918.61170.129113.44710.140417.02650.141114.80210.2772
DR713.28070.046310.13170.09976.92700.104010.67260.115510.42190.2378
DR82.88200.02276.26730.08006.54130.10139.77310.11106.00590.1844
DR91.06160.01392.96460.05599.75270.12181.07120.03832.86620.1291
DR101.47900.01632.03970.04652.23040.06021.80440.04951.08960.0801
DR119.31800.039610.61510.10184.73750.08697.04110.09555.31520.1740
DR121.90700.01854.11520.065510.13650.12391.72440.04842.01320.1086
DR5 unsplit1.21990.01492.29570.04931.41180.04801.82250.04981.67690.0992
DR5 total12.44490.004517.02600.124316.28580.151610.58800.11489.00520.2218
DRX1.35980.03420.88530.07602.55210.10891.40230.09302.08340.2037

aGene frequency.

bStandard error.

[0139]

It has been observed that predicted epitopes often cluster at one or more particular regions within the amino acid sequence of a TAA. The identification of such ECRs offers a simple and practicable solution to the problem of designing effective vaccines for stimulating cellular immunity. For vaccines in which immune epitopes are desired, an ECR is directly useful as a vaccine. This is because the immune proteasomes of the pAPCs can correctly process the cluster, liberating one or more of the contained MHC-binding peptides, in the same way a cell having immune proteasomes activity processes and presents peptides derived from the complete TAA. The cluster is also a useful a starting material for identification of housekeeping epitopes produced by the housekeeping proteasomes active in peripheral cells.

[0140]

Identification of housekeeping epitopes using ECRs as a starting material is described in copending U.S. patent application Ser. No. 09/561,074 entitled “METHOD OF EPITOPE DISCOVERY,” filed Apr. 28, 2000, which is incorporated herein by reference in its entirety. Epitope synchronization technology and vaccines for use in connection with this invention are disclosed in copending U.S. patent application Ser. No. 09/560,465 entitled “EPITOPE SYNCHRONIZATION IN ANTIGEN PRESENTING CELLS,” filed Apr. 28, 2000, which is incorporated herein by reference in its entirety. Nucleic acid constructs useful as vaccines in accordance with the present invention are disclosed in copending U.S. patent application Ser. No. 09/561,572 entitled “EXPRESSION VECTORS ENCODING EPITOPES OF TARGET-ASSOCIATED ANTIGENS,” filed Apr. 28, 2000, which is incorporated herein by reference in its entirety.

SEQ ID NOS.* including epitopes in
Examples 1-7, 13.
SEQ ID NOIDENTITYSEQUENCE
1Tyr 207-216LPWHRLFLL
2TyrosinaseAccession number**:
proteinP14679
3SSX-2 proteinAccession number:
NP_003138
4PSMA proteinAccession number:
NP_004467
5TyrosinaseAccession number:
cDNANM_000372
6SSX-2 cDNAAccession number:
NM_003147
7PSMA cDNAAceession number:
NM_004476
8Tyr 207-215FLPWHRLFL
9Tyr 208-216LPWHRLFLL
10SSX-2 31-68YFSKEEWEKMKASEKIIFYVYMKRK
YEAMTKLGFKATLP
11SSX-2 32-40FSKEEWEKM
12SSX-2 39-47KMKASEKIF
13SSX-2 40-48MKASEKIFY
14SSX-2 39-48KMKASEKIFY
15SSX-2 41-49KASEKIFYV
16SSX-2 40-49MKASEKIFYV
17SSX-2 41-50KASEKIFYVY
18SSX-2 42-49ASEKIFYVY
19SSX-2 53-61RKYEAMTKL
20SSX-2 52-61KRKYEAMTKL
21SSX-2 54-63KYEAMTKLGF
22SSX-2 55-63YEAMTKLGF
23SSX-2 56-63EAMTKLGF
24HBV18-27FLPSDYFPSV
25HLA-B44 binderAEMGKYSFY
26SSX-1 41-49KYSEKISYV
27SSX-3 41-49KVSEKIVYV
28SSX-4 41-49KSSEKIVYV
29SSX-5 41-49KASEKIIYV
30PSMA163-192AFSPQGMPEGDLVYVNYARTEDFFKL
ERDM
31PSMA 168-190GMPEGDLVYVNYARTEDFFKLER
32PSMA 169-177MPEGDLVYV
33PSMA 168-177GMPEGDLVYV
34PSMA 168-176GMPEGDLVY
35PSMA 167-176QGMPEGDLVY
36PSMA 169-176MPEGDLVY
37PSMA 171-179EGDLVYVNY
38PSMA 170-179EGDLVYVNY
39PSMA 174-183LVYVNYARTE
40PSMA 177-185VNYARTEDF
41PSMA 176-185YVNYARTEDF
42PSMA 178-186NYARTEDEF
43PSMA 179-186YARTEDFF
44PSMA 181-189RTEDFFKLE
45PSMA 281-310RGIAEAVGLPSIPVHPIGYYDAQKLL
EKMG
46PSMA 283-307IAEAVGLPSIPVHPIGYYDAQKLLE
47PSMA 289-297LPSIPVHPI
48PSMA 288-297GLPSIPVHPI
49PSMA 297-305IGYYDAQKL
50PSMA 296-305PIGYYDAQKL
51PSMA 291-299SIPVHPIGY
52PSMA 290-299PSIPVHPIGY
53PSMA 292-299IPVHPIGY
54PSMA 299-307YYDAQKLLE
55PSMA454-481SSIEGNYTLRVDCTPLMYSLVHLTK
EL
56PSMA 456-464IEGNYTLRV
57PSMA 455-464SIEGNYTLRV
58PSMA 457-464EGNYTLRV
59PSMA 461-469TLRVDCTPL
60PSMA 460-469TLRVDCTPL
61PSMA 462-470YTLRVDCTPLM
62PSMA 463-471LRVDCTPLMY
63PSMA 462-471LRVDCTPLMY
64PSMA653-687FDKSNPIVLRMMNDQLMFLERAFID
PLGLPDRPFY
65PSMA 660-681VLRMMNDQLMFLERAFIDPLGL
66PSMA 663-671MMNDQLMFL
67PSMA 662-671RMMNDQLMFL
68PSMA 662-670RMMNDQLMF
69Tyr 1-17MLLAVLYCLLWSFQTSA

[0141]

SEQ ID NOS.* including epitopes in Examples
14 and 15.
SEQ ID NOIDENTITYSEQUENCE
70GP100 protein2**Accession number:
P40967
71MAGE-1 proteinAccession number: P43355
72MAGE-2 proteinAccession number: P43356
73MAGE-3 proteinAccession number: P43357
74NY-ESO-1 roteinAccession number: P78358
75LAGE-1a proteinAccession number:
CAA11116
76LAGE-1b proteinAccession number:
CAA11117
77PRAME proteinAccession number:
NP 006106
78PSA proteinAccession number: P07288
79PSCA proteinAccession number: O43653
80GP100 cdsAccession number: U20093
81MAGE-1 cdsAccession number: M77481
82MAGE-2 cdsAccession number: L18920
83MAGE-3 cdsAccession number: U03735
84NY-ESO-1 cDNAAccession number: U87459
85PRAME cDNAAccession number:
NM 006115
86PSA cDNAAccession number:
NM 001648
87PSCA cDNAAccession number:
AF043498
88GP100 630-638LPHSSSHWL
89GP100 629-638QLPHSSSHWL
90GP100 614-622LIYRRRLMK
91GP100 613-622SLIYRRRLMK
92GP100 615-622IYRRRLMK
93GP100 630-638LPHSSSHWL
94GP100 629-638QLPHSSSHWL
95MAGE-1 95-102ESLFRAVI
96MAGE-1 93-102ILESLFRAVI
97MAGE-1 93-101ILESLFRAV
98MAGE-1 92-101CILESLFRAV
99MAGE-1 92-100CILESLFRA
100MAGE-1 263-271EFLWGPRAL
101MAGE-1 264-271FLWGPRAL
102MAGE-1 264-273FLWGPRALAE
103MAGE-1 265-274LWGPRALAET
104MAGE-1 268-276PRALAETSY
105MAGE-1 267-276GPRALAETSY
106MAGE-1 269-277RALAETSYV
107MAGE-1 271-279LAETSYVKV
108MAGE-1 270-279ALAETSYVKV
109MAGE-1 272-280AETSYVKVL
110MAGE-1 271-280LAETSYVKVL
111MAGE-1 274-282TSYVKVLEY
112MAGE-1 273-282ETSYVKVLEY
113MAGE-1 278-286KVLEYVIKV
114MAGE-1 168-177SYVLVTCLGL
115MAGE-1 169-177YVLVTCLGL
116MAGE-1 170-177VLVTCLGL
117MAGE-1 240-248TQDLVQEKY
118MAGE-1 239-248LTQDLVQEKY
119MAGE-1 232-240YGEPRKLLT
120MAGE-1 243-251LVQEKYLEY
121MAGE-1 242-251DLVQEKYLEY
122MAGE-1 230-238SAYGEPRKL
123MAGE-1 278-286KVLEYVIKV
124MAGE-1 277-286VKVLEYVIKV
125MAGE-1 276-284YVKVLEYVI
126MAGE-1 274-282TSYVKVLEY
127MAGE-1 273-282ETSYVKVLEY
128MAGE-1 283-291VIKVSARVR
129MAGE-1 282-291YVIKVSARVR
130MAGE-2 115-122ELVHFLLL
131MAGE-2 113-122MVELVHFLLL
132MAGE-2 109-116ISRKMVEL
133MAGE-2 108-116AISRKMVEL
134MAGE-2 107-116AAISRKMVEL
135MAGE-2 112-120KMVELVHFL
136MAGE-2 109-117ISRKMVELV
137MAGE-2 108-117AISRKMVELV
138MAGE-2 116-124LVHFLLLKY
139MAGE-2 115-124ELVHFLLLKY
140MAGE-2 111-119RKMVELVHF
141MAGE-2 158-166LQLVFGIEV
142MAGE-2 157-166YLQLVFGIEV
143MAGE-2 159-167QLVFGIEVV
144MAGE-2 158-167LQLVFGIEVV
145MAGE-2 164-172IEVVEVVPI
146MAGE-2 163-172GIEVVEVVPI
147MAGE-2 162-170FGIEVVEVV
148MAGE-2 154-162ASEYLQLVF
149MAGE-2 153-162KASEYLQLVF
150MAGE-2 218-225EEKIWEEL
151MAGE-2 216-225APEEKIWEEL
152MAGE-2 216-223APEEKIWE
153MAGE-2 220-228KIEELSML
154MAGE-2 219-228EKIWEELSML
155MAGE-2 271-278FLWGPRAL
156MAGE-2 271-279FLWGPRALJ
157MAGE-2 278-286LIETSYVKV
158MAGE-2 277-286ALIETSYVKV
159MAGE-2 276-284RALIETSYV
160MAGE-2 279-287IETSYVKVL
161MAGE-2 278-287LIETSYVKVL
162MAGE-3 271-278FLWGPRAL
163MAGE-3 270-278EFLWGPRAL
164MAGE-3 271-279FLWGPRALV
165MAGE-3 276-284RALVETSYV
166MAGE-3 272-280LWGPRALVE
167MAGE-3 271-280FLWGPRALVE
168MAGE-3 272-281LWGPRALVET
169NY-ESO-1 82-90GPESRLLEF
170NY-ESO-1 83-91PESRLLEFY
171NY-ESO-1 82-91GPESRLLEFY
172NY-ESO-1 84-92ESRLLEFYL
173NY-ESO-1 86-94RLLEFYLAM
174NY-ESO-1 88-96LEFYLAMPF
175NY-ESO-1 87-96LLEFYLAMPF
176NY-ESO-1 93-102AMPFATPMEA
177NY-ESO-1 94-102MPFATPMEA
178NY-ESO-1 115-123PLPVPGVLL
179NY-ESO-1 114-123PPLPVPGVLL
180NY-ESO-1 116-123LPVPGVLL
181NY-ESO-1 103-112ELARRSLAQD
182NY-ESO-1 118-126VPGVLLKEF
183NY-ESO-1 117-126PVPGVLLKEF
184NY-ESO-1 116-123LPVPGVLL
185NY-ESO-1 127-135YVSGNILTI
186NY-ESO-1 126-135TVSGNILTI
187NY-ESO-1 120-128GVLLKEFTV
188NY-ESO-1 121-130VLLKEFTVSG
189NY-ESO-1 122-130LLKEFTVSG
190NY-ESO-1 118-126VPGVLLKEF
191NY-ESO-1 117-126PVPGVLLKEF
192NY-ESO-1 139-147AADHRQLQL
193NY-ESO-1 148-156SISSCLQQL
194NY-ESO-1 147-156LSISSCLQQL
195NY-ESO-1 138-147TAADHRQLQL
196NY-ESO-1 161-169WITQCFLPV
197NY-ESO-1 157-165SLLMWITQC
198NY-ESO-1 150-158SSCLQQLSL
199NY-ESO-1 154-162QQLSLLMWI
200NY-ESO-1 151-159SCLQQLSLL
201NY-ESO-1 150-159SSCLQQLSLL
202NY-ESO-1 163-171TQCFLPVFL
203NY-ESO-1 162-171ITQCFLPVFL
204PRAME 219-227PMQDIKMIL
205PRAME 218-227MPMQDIIKMIL
206PRAME 428-436QHLIGLSNL
207PRAME 427-436LQHLIGLSNL
208PRAME 429-436HLIGLSNL
209PRAME 431-439IGLSNLTHV
210PRAME 430-439LIGLSNLTHV
211PSA 53-61VLVHPQWVL
212PSA 52-61GVLVHPQWVL
213PSA 52-60GVLVHPQWV
214PSA 59-67WVLTAAHCI
215PSA 54-63LVHPQWVLTA
216PSA 53-62VLVHPQWVLT
217PSA 54-62LVHPQWVLT
218PSA 66-73CIRNKSVI
219PSA 65-73HCIRNKSVI
220PSA 56-64HPQWVLTAA
221PSA 63-72AAHCIRNKSV
222PSCA 116-123LLWGPGQL
223PSCA 115-123LLLWGPGQL
224PSCA 114-123GLLLWGPGQL
225PSCA 99-107ALQPAAAIL
226PSCA 98-107HALQPAAAIL
227Tyr 128-137APEKDKFFAY
228Tyr 129-137PEKDKFFAY
229Tyr 130-138EKDKFFAYL
230Tyr 131-138KDKFFAYL
231Tyr 205-213PAFLPWHRL
232Tyr 204-213APAFLPWHRL
233Tyr 214-223FLLRWEQEIQ
234Tyr 212-220RLFLLRWEQ
235Tyr 191-200GSEIWRDIDF
236Tyr 192-200SEIWRDIDF
237Tyr 473-481RIWSWLLGA
238Tyr 476-484SWLLGAAMV
239Tyr 477-486WLLGAAMVGA
240Tyr 478-486LLGAAMVGA
241PSMA 4-12LLHETDSAV
242PSMA 13-21ATARRPRWL
243PSMA 53-61TPKHNMKAF
244PSMA 64-73ELKAENIKKF
245PSMA 69-77NIKKFLH1NF
246PSMA 68-77ENIKKFLH1NF
247PSMA 220-228AGAKGVILY
248PSMA 468-477PLMYSLVHNL
249PSMA 469-477LMYSLVHNL
250PSMA 463-471RVDCTPLMY
251PSMA 465-473DCTPLMYSL
252PSMA 507-515SGMPRISKL
253PSMA 506-515FSGMPRISKL
254NY-ESO-1 136-163RLTAADHRQLQLSISSCLQQLS
LLMWIT
255NY-ESO-1 150-177SSCLQQLSLLMWITQCFLPVFL
AQPPSG

1This H was reported as Y in the SWISSPROT database.

2The amino acid at position 274 may be Pro or Leu depending upon the database. The particular analysis presented herein used the Pro.

[0142]

SEQ ID NOS.* including epitopes in Example 14.
SEQ ID NO.IDENTITYSEQUENCE
256Mage-1 125-132KAEMLESV
257Mage-1 124-132TKAEMLESV
258Mage-1 123-132VTKAEMLESV
259Mage-1 128-136MLESVIKNY
260Mage-1 127-136EMLESVIKNY
261Mage-1 125-133KAEMLESVI
262Mage-1 146-153KASESLQL
263Mage-1 145-153GKASESLQL
264Mage-1 147-155ASESLQLVF
265Mage-1 153-161LVFGIDVKE
266Mage-1 114-121LLKYRARE
267Mage-1 106-113VADLVGFL
268Mage-1 105-113KVADLVGFL
269Mage-1 107-115ADLVGFLLL
270Mage-1 106-115VADLVGFLLL
271Mage-1 114-123LLKYRAREPV
272Mage-3 278-286LVETSYVKV
273Mage-3 277-286ALVETSYVKV
274Mage-3 285-293KVLHHMVKI
275Mage-3 283-291YVKVLHHMV
276Mage-3 275-283PRALVETSY
277Mage-3 274-283GPRALVETSY
278Mage-3 278-287LVETSYVKVL
279ED-B 4'-5TIIPEVPQL
280ED-B 5'-5DTIIPEVPQL
281ED-B 1-10EVPQLTDLSF
282ED-B 23-30TPLNSSTI
283ED-B 18-25IGLRWTPL
284ED-B 17-25SIGLRWTPL
285ED-B 25-33LNSSTIIGY
286ED-B 24-33PLNSSTIIGY
287ED-B 23-31TPLNSSTII
288ED-B 31-38IGYRITVV
289ED-B 30-38IIGYRITVV
290ED-B 29-38TIIGYRITVV
291ED-B 31-39IGYRITVVA
292ED-B 30-39IIGYRJTVVA
293CEA 184-191SLPVSPRL
294CEA 183-191QSLPVSPRL
295CEA 186-193PVSPRLQL
296CEA 185-193LPVSPRLQL
297CEA 184-193SLPVSPRLQL
298CEA 185-192LPVSPRLQ
299CEA 192-200QLSNGNRTL
300CEA 191-200LQLSNGNRTL
301CEA 179-187WVNNQSLPV
302CEA 186-194PVSPRLQLS
303CEA 362-369SLPVSPRL
304CEA 361-369QSLPVSPRL
305CEA 364-371PVSPRLQL
306CEA 363-371LPVSPRLQL
307CEA 362-371SLPVSPRLQL
308CEA 363-370LPVSPRLQ
309CEA 370-378QLSNDNRTL
310CEA 369-378LQLSNDNRTL
311CEA 357-365WVNNQSLPV
312CEA 360-368NQSLPVSPR
313CEA 540-547SLPVSPRL
314CEA 539-547QSLPVSPRL
315CEA 542-549PVSPRLQL
316CEA 541-549LPVSPRLQL
317CEA 540-549SLPVSPRLQL
318CEA 541-548LPVSPRLQ
319CEA 548-556QLSNGNRTL
320CEA 547-556LQLSNGNRTL
321CEA 535-543WVNGQSLPV
322CEA 533-541LWWVNGQSL
323CEA 532-541YLWWVNGQSL
324CEA 538-546GQSLPVSPR
325Her-2 30-37DMKLRLPA
326Her-2 28-37GTDMKLRLPA
327Her-2 42-49HLDMLRHL
328Her-2 41-49THLDMLRHL
329Her-2 40-49ETHLDMLRHL
330Her-2 36-43PASPETHL
331Her-2 35-43LPASPETHL
332Her-2 34-43RLPASPETHL
333Her-2 38-46SPETHLDML
334Her-2 37-46ASPETHLDML
335Her-2 42-50HLDMLRHLY
336Her-2 41-50THLDMLRHLY
337Her-2 719-726ELRKVKVL
338Her-2 718-726TELRKVKVL
339Her-2 717-726ETELRKVKVL
340Her-2 715-723LKETELRKV
341Her-2 714-723ILKETELRKV
342Her-2 712-720MRILKETEL
343Her-2 711-720QMRILKETEL
344Her-2 717-725ETELRKVKV
345Her-2 716-725KETELRKVKV
346Her-2 706-714MPNQAQMRI
347Her-2 705-714AMPNQAQMRI
348Her-2 706-715MPNQAQMRIL
349HER-2 966-973RPRFRELV
350HER-2 965-973CRPRFRELV
351HER-2 968-976RFRELVSEF
352HER-2 967-976PRFRELVSEF
353HER-2 964-972ECRPRFREL
354NY-ESO-1 67-75GAASGLNGC
355NY-ESO-1 52-60RASGPGGGA
356NY-ESO-1 64-72PHGGAASGL
357NY-ESO-1 63-72GPHGGAASGL
358NY-ESO-1 60-69APRGPHGGAA
359PRAME 112-119VRPRRWKL
360PRAME 111-119EVRPRRWKL
361PRAME 113-121RPRRWKLQV
362PRAME 114-122PRRWKLQVL
363PRAME 113-122RPRRWKLQVL
364PRAME 116-124RWKLQVLDL
365PRAME 115-124RRWKiLQVLDL
366PRAME 174-182PVEVLVDLF
367PRAME 199-206VKRKKNVL
368PRAME 198-206KVKRKKNVL
369PRAME 197-206EKVKRKKNVL
370PRAME 198-205KVKRKKNV
371PRAME 201-208RKKNVLRL
372PRAME 200-208KRKKNVLRL
373PRAME 199-208VKRKKNVLRL
374PRAME 189-196DELFSYLI
375PRAME 205-213VLRLCCKKL
376PRAME 204-213NVLRLCCKKL
377PRAME 194-202YLIEKVKRK
378PRAME 74-81QAWPFTCL
379PRAME 73-81VQAWPFTCL
380PRAME 72-81MVQAWPFTCL
381PRAME 81-88LPLGVLMK
382PRAME 80-88CLPLGVLMK
383PRAME 79-88TCLPLGVLMK
384PRAME 84-92GVLMKGQHL
385PRAME 81-89LPLGVLMKG
386PRAME 80-89CLPLGVLMKG
387PRAME 76-85WPFTCLPLGV
388PRAME 51-59ELFPPLFMA
389PRAME 49-57PRELFPPLF
390PRAME 48-57LPRELFPPLF
391PRAME 50-58RELFPPLFM
392PRAME 49-58PRELFPPLFM
393PSA 239-246RPSLYTKV
394PSA 238-246ERPSLYTKV
395PSA 236-243LPERPSLY
396PSA 235-243ALPERPSLY
397PSA 241-249SLYTKVVHY
398PSA 240-249PSLYTKVVHY
399PSA 239-247RPSLYTKVV
400PSMA 211-218GNKVKNAQ
401PSMA 202-209IARYGKVF
402PSMA 217-225AQLAGAKGV
403PSMA 207-215KVFRGNKVK
404PSMA 211-219GNKVKNAQL
405PSMA 269-277TPGYPANEY
406PSMA 268-277LTPGYPANEY
407PSMA 271-279GYPANEYAY
408PSMA 270-279PGYPANEYAY
409PSMA 266-274DPLTPGYPA
410PSMA 492-500SLYESWTKK
411PSMA 491-500KSLYESWTKK
412PSMA 486-494EGFEGKSLY
413PSMA 485-494DEGFEGKSLY
414PSMA 498-506TKiKSPSPEF
415PSMA 497-506WTKKSPSPEF
416PSMA 492-501SLYESWTKKS
417PSMA 725-732WGEVKRQI
418PSMA 724-732AWGEVKRQI
419PSMA 723-732KAWGEVKRQI
420PSMA 723-730KAWGEVKR
421PSMA 722-730SKAWGEVKR
422PSMA 731-739QIYVAAFTV
423PSMA 733-741YVAAFTVQA
424PSMA 725-733WGEVKRQIY
425PSMA 727-735EVKRQIYVA
426PSMA 738-746TVQAAAETL
427PSMA 737-746FTVQAAAETL
428PSMA 729-737KRQIYVAAF
429PSMA 721-729PSKAWGEVK
430PSMA 723-731KAWGEVKRQ
431PSMA 100-108WKEFGLDSV
432PSMA 99-108QWKEFGLDSV
433PSMA 102-111EFGLDSVELA
434SCP-1 126-134ELRQKESKL
435SCP-1 125-134AELRQKESKL
436SCP-1 133-141KLQENRKII
437SCP-1 298-305QLEEKTKL
438SCP-1 297-305NQLEEKTKL
439SCP-1 288-296LLEESRDKV
440SCP-1 287-296FLLEESRDKV
441SCP-1 291-299ESRDKVNQL
442SCP-1 290-299EESRDKVNQL
443SCP-1 475-483EKEVHDLEY
444SCP-1 474-483REKEVHDLEY
445SCP-1 480-488DLEYSYCHY
446SCP-1 477-485EVHDLEYSY
447SCP-1 477-486EVHDLEYSYC
448SCP-1 502-509KLSSKREL
449SCP-1 508-515ELKNTEYF
450SCP-1 507-515RELKNTEYF
451SCP-1 496-503KRGQRPKL
452SCP-1 494-503LPKRGQRPKL
453SCP-1 509-517LKNTEYFTL
454SCP-1 508-517ELKNTEYFTL
455SCP-1 506-514KRELKNTEY
456SCP-1 502-510KLSSKRELK
457SCP-1 498-506GQRPKLSSK
458SCP-1 497-506RGQRPKLSSK
459SCP-1 500-508RPKLSSKRE
460SCP-1 573-580LEYVREEL
461SCP-1 572-580ELEYVREEL
462SCP-1 571-580NELEYVREEL
463SCP-1 579-587ELKQKREDEV
464SCP-1 575-583YVREELKQK
465SCP-1 632-640QLNVYEIKV
466SCP-1 630-638SKQLNVYEI
467SCP-1 628-636AESKQLNVY
468SCP-1 627-636TAESKQLNVY
469SCP-1 638-645IKVNKLEL
470SCP-1 637-645EIKVNKLEL
471SCP-1 636-645YEIKVNKLEL
472SCP-1 642-650KLELELESA
473SCP-1 635-643VYEIKVNKL
474SCP-1 634-643NVYEIKVNKL
475SCP-1 646-654ELESAKQKF
476SCP-1 642-650KLELELESA
477SCP-1 646-654ELESAKQKF
478SCP-1 771-778KEKLKREA
479SCP-1 777-785EAKENTATL
480SCP-1 776-785REAKENTATL
481SCP-1 773-782KLKREAKENT
482SCP-1 112-119EAEKIKKW
483SCP-1 101-109GLSRVYSKL
484SCP-1 100-109EGLSRVYSKL
485SCP-1 108-116KLYKEAEKI
486SCP-1 98-106NSEGLSRVY
487SCP-1 97-106ENSEGLSRVY
488SCP-1 102-110LSRVYSKLY
489SCP-1 101-110GLSRVYSKLY
490SCP-1 96-105LENSEGLSRV
491SCP-1 108-117KLYKEAEKIK
492SCP-1 949-956REDRWAVI
493SCP-1 948-956MREDRWAVI
494SCP-1 947-956KMREDRWAVI
495SCP-1 947-955KMREDRWAV
496SCP-1 934-942TTPGSTLKF
497SCP-1 933-942LTTPGSTLKF
498SCP-1 937-945GSTLKGAI
499SCP-1 945-953IRKMREDRW
500SCP-1 236-243RLEMHFKL
501SCP-1 235-243SRLEMHFKL
502SCP-1 242-250KLKEDYEKI
503SCP-1 249-257KJQHLEQEY
504SCP-1 248-257EKIQHLEQEY
505SCP-1 233-242ENSRLEMHF
506SCP-1 236-245RLEMHFKLKE
507SCP-1 324-331LEDIKVSL
508SCP-1 323-331ELEDIKVSL
509SCP-1 322-331KELEDIKVSL
510SCP-1 320-327LTKELEDI
511SCP-1 319-327HLTKELEDI
512SCP-1 330-338SLQRSVSTQ
513SCP-1 321-329TKELEDIKV
514SCP-1 320-329LTKELEDIKV
515SCP-1 326-335DIKVSLQRSV
516SCP-1 281-288KMKDLTFL
517SCP-1 280-288NKMKDLTFL
518SCP-1 279-288ENKMKDLTFL
519SCP-1 288-296LLEESRDKV
520SCP-1 287-296FLLEESRDKV
521SCP-1 291-299ESRDKVNQL
522SCP-1 290-299EESRDKVNQL
523SCP-1 277-285EKENKMKDL
524SCP-1 276-285TEKENKMKDL
525SCP-1 279-287ENKMKDLTF
526SCP-1 218-225IEKMITAF
527SCP-1 217-225NIEKMITAF
528SCP-1 216-225SNIEKIMITAF
529SCP-1 223-230TAFEELRV
530SCP-1 222-230ITAFEELRV
531SCP-1 221-230MITAFEELRV
532SCP-1 220-228KIMITAFEEL
533SCP-1 219-228EKMITAFEEL
534SCP-1 227-235ELRVQAENS
535SCP-1 213-222DLNSNIEKMI
536SCP-1 837-844WTSAKNTL
537SCP-1 846-854TPLPKAYTV
538SCP-1 845-854STPLPKAYTV
539SCP-1 844-852LSTPLPKAY
540SCP-1 843-852TLSTPLPKAY
541SCP-1 842-850NTLSTPLPK
542SCP-1 841-850KNTLSTPLPK
543SCP-1 828-835ISKDKRDY
544SCP-1 826-835HGISKDKRDY
545SCP-1 832-840KRDYLWTSA
546SCP-1 829-838SKDKRDYLWT
547SCP-1 279-286ENKMKDLT
548SCP-1 260-268EINDKEKQV
549SCP-1 274-282QITEKENKM
550SCP-1 269-277SLLLIQITE
551SCP-1 453-460FEKIAEEL
552SCP-1 452-460QFEKIABEL
553SCP-1 451-460KQFEKIAEEL
554SCP-1 449-456DNKQFEKI
555SCP-1 448-456YDNKQFEKJ
556SCP-1 447-456LYDNKQFEKI
557SCP-1 440-447LGEKETLL
558SCP-1 439-447VLGEKETLL
559SCP-1 438-447KVLGEKETLL
560SCP-1 390-398LLRTEQQRL
561SCP-1 389-398ELLRTEQQRL
562SCP-1 393-401TEQQRLENY
563SCP-1 392-401RTEQQRLENY
564SCP-1 402-410EDQLIILTM
565SCP-1 397-406RLENYEDQLI
566SCP-1 368-375KARAAHSF
567SCP-1 376-384VVTEFETTV
568SCP-1 375-384FVVTEFETTV
569SCP-1 377-385VTEFETTVC
570SCP-1 376-385VVTEFETTVC
571SCP-1 344-352DLQIATNTI
572SCP-1 347-355IATNTICQL
573SCP-1 346-355QIATNTICQL
574SSX4 57-65VMTKLGFKY
575SSX4 53-61LNYEVMTKL
576SSX4 52-61KLNYEVMTKL
577SSX4 66-74TLPPFMRSK
578SSX4 110-118KIMPKIKPAE
579SSX4 103-112SLQRIFPKIM
580Tyr 463-471 YIKSYLEQA
581Tyr 459-467 SFQDYJKSY
582Tyr 458-467DSFQDYIKSY
583Tyr 507-514LPEEKQPL
584Tyr 506-514QLPEEKQPL
585Tyr 505-514KQLPEEKQPL
586Tyr 507-515LPEEKQPLL
587Tyr 506-515QLPEEKQPLL
588Tyr 497-505SLLCRHKRK
589ED-B domain ofEVPQLTDLSFVDITDSSIGLRWT
FibronectinPLNSSTIIGYRITVVAAGEGIPI
FEDFVDSSVGYYTVTGLEPGIDY
DISVITLINGGESAPTTLTQQT
590ED-B domain ofCTFDNLSPGLEYNVSVYTVKDDK
FibronectinESVPISDTIIPEVPQLTDLSFVD
with flankingITDSSIGLRWTPLNSSTIIGYRI
sequence fromTVVAAGEGIPIFEDFVDSSVGYY
FribronectinTVTGLEPGIDYDISVITLINGGE
SAPTTLTQQTAVPPPTDLRFTNI
GPDTMRVTW
591ED-B domain ofAccession number:
FibronectinX07717
cds
592CEA proteinAccession number:
P06731
593CEA cDNAAccession number:
NM 004363
594Her2/NeuAccession number:
proteinP04626
595Her2/Neu cDNAAccession number:
M11730
596SCP-1 proteinAccession number:
Q15431
597SCP-1 cDNAAccession number:
X95654
598SSX-4 proteinAccession number:
O60224
599SSX-4 cDNAAccession number:
NM 005636

*Any of SEQ ID NOS. 1, 8, 9, 11-23, 26-29, 32-44, 47-54, 56-63, 66-68 88-253, and 256-588 can be useful as epitopes in any of the various embodiments of the invention. Any of SEQ ID NOS. 10, 30, 31, 45, 46, 55, 64, 65, 69, 254, and 255 can be useful as sequences containing epitopes or epitope clusters, as described in various embodiments of the invention.

**All accession numbers used here and throughout can be accessed through the NCBI databases, for example, through the Entrez seek and retrieval system on the world wide web.

[0143]

Note that the following discussion sets forth the inventors' understanding if the operation of the invention. However, it is not intended that this discussion limit the patent to any particular theory of operation not set forth in the claims.

[0144]

In pursuing the development of epitope vaccines others have generated lists of predicted epitopes based on MHC binding motifs. Such peptides can be immunogenic, but may not correspond to any naturally produced antigenic fragment. Therefore, whole antigen will not elicit a similar response or sensitize a target cell to cytolysis by CTL. Therefore such lists do not differentiate between those sequences that can be useful as vaccines and those that cannot. Efforts to determine which of these predicted epitopes are in fact naturally produced have often relied on screening their reactivity with tumor infiltrating lymphocytes (TIL). However, TIL are strongly biased to recognize immune epitopes whereas tumors (and chronically infected cells) will generally present housekeeping epitopes. Thus, unless the epitope is produced by both the housekeeping and immunoproteasomes, the target cell will generally not be recognized by CTL induced with TIL-identified epitopes. The epitopes of the present invention, in contrast, are generated by the action of a specified proteasome, indicating that they can be naturally produced, and enabling their appropriate use. The importance of the distinction between housekeeping and immune epitopes to vaccine design is more fully set forth in PCT publication WO 01/82963A2, which is hereby incorporated by reference in its entirety.

[0145]

The epitopes of the invention include or encode polypeptide fragments of TAAs that are precursors or products of proteasomal cleavage by a housekeeping or immune proteasome, and that contain or consist of a sequence having a known or predicted affinity for at least one allele of MHC I. In some embodiments, the epitopes include or encode a polypeptide of about 6 to 25 amino acids in length, preferably about 7 to 20 amino acids in length, more preferably about 8 to 15 amino acids in length, and still more preferably 9 or 10 amino acids in length. However, it is understood that the polypeptides can be larger as long as N-terminal trimming can produce the MHC epitope or that they do not contain sequences that cause the polypeptides to be directed away from the proteasome or to be destroyed by the proteasome. For immune epitopes, if the larger peptides do not contain such sequences, they can be processed in the pAPC by the immune proteasome. Housekeeping epitopes may also be embedded in longer sequences provided that the sequence is adapted to facilitate liberation of the epitope's C-terminus by action of the immunoproteasome. The foregoing discussion has assumed that processing of longer epitopes proceeds through action of the immunoproteasome of the pAPC. However, processing can also be accomplished through the contrivance of some other mechanism, such as providing an exogenous protease activity and a sequence adapted so that action of the protease liberates the MHC epitope. The sequences of these epitopes can be subjected to computer analysis in order to calculate physical, biochemical, immunologic, or molecular genetic properties such as mass, isoelectric point, predicted mobility in electrophoresis, predicted binding to other MHC molecules, melting temperature of nucleic acid probes, reverse translations, similarity or homology to other sequences, and the like.

[0146]

In constructing the polynucleotides encoding the polypeptide epitopes of the invention, the gene sequence of the associated TAA can be used, or the polynucleotide can be assembled from any of the corresponding codons. For a 10 amino acid epitope this can constitute on the order of 106 different sequences, depending on the particular amino acid composition. While large, this is a distinct and readily definable set representing a miniscule fraction of the >1018 possible polynucleotides of this length, and thus in some embodiments, equivalents of a particular sequence disclosed herein encompass such distinct and readily definable variations on the listed sequence. In choosing a particular one of these sequences to use in a vaccine, considerations such as codon usage, self-complementarity, restriction sites, chemical stability, etc. can be used as will be apparent to one skilled in the art.

[0147]

The invention contemplates producing peptide epitopes. Specifically these epitopes are derived from the sequence of a TAA, and have known or predicted affinity for at least one allele of MHC I. Such epitopes are typically identical to those produced on target cells or pAPCs.

[0000]

Compositions Containing Active Epitopes

[0148]

Embodiments of the present invention provide polypeptide compositions, including vaccines, therapeutics, diagnostics, pharmacological and pharmaceutical compositions. The various compositions include newly identified epitopes of TAAs, as well as variants of these epitopes. Other embodiments of the invention provide polynucleotides encoding the polypeptide epitopes of the invention. The invention further provides vectors for expression of the polypeptide epitopes for purification. In addition, the invention provides vectors for the expression of the polypeptide epitopes in an APC for use as an anti-tumor vaccine. Any of the epitopes or antigens, or nucleic acids encoding the same, from Table 1 can be used. Other embodiments relate to methods of making and using the various compositions.

[0149]

A general architecture for a class I MHC-binding epitope can be described, and has been reviewed more extensively in Madden, D. R. Annu. Rev. Immunol. 13:587-622, 1995, which is hereby incorporated by reference in its entirety. Much of the binding energy arises from main chain contacts between conserved residues in the MHC molecule and the N- and C-termini of the peptide. Additional main chain contacts are made but vary among MHC alleles. Sequence specificity is conferred by side chain contacts of so-called anchor residues with pockets that, again, vary among MHC alleles. Anchor residues can be divided into primary and secondary. Primary anchor positions exhibit strong preferences for relatively well-defined sets of amino acid residues. Secondary positions show weaker and/or less well-defined preferences that can often be better described in terms of less favored, rather than more favored, residues. Additionally, residues in some secondary anchor positions are not always positioned to contact the pocket on the MHC molecule at all. Thus, a subset of peptides exists that bind to a particular MHC molecule and have a side chain-pocket contact at the position in question and another subset exists that show binding to the same MHC molecule that does not depend on the conformation the peptide assumes in the peptide-binding groove of the MHC molecule. The C-terminal residue (PΩ; omega) is preferably a primary anchor residue. For many of the better studied HLA molecules (e.g. A2, A68, B27, B7, B35, and B53) the second position (P2) is also an anchor residue. However, central anchor residues have also been observed including P3 and P5 in HLA-B8, as well as P5 and PΩ(omega)-3 in the murine MHC molecules H-2Db and H-2Kb, respectively. Since more stable binding will generally improve immunogenicity, anchor residues are preferably conserved or optimized in the design of variants, regardless of their position.

[0150]

Because the anchor residues are generally located near the ends of the epitope, the peptide can buckle upward out of the peptide-binding groove allowing some variation in length. Epitopes ranging from 8-11 amino acids have been found for HLA-A68, and up to 13 amino acids for HLA-A2. In addition to length variation between the anchor positions, single residue truncations and extensions have been reported and the N- and C-termini, respectively. Of the non-anchor residues, some point up out of the groove, making no contact with the MHC molecule but being available to contact the TCR, very often P1, P4, and PΩ(omega)-1 for HLA-A2. Others of the non-anchor residues can become interposed between the upper edges of the peptide-binding groove and the TCR, contacting both. The exact positioning of these side chain residues, and thus their effects on binding, MHC fine conformation, and ultimately immunogenicity, are highly sequence dependent. For an epitope to be highly immunogenic it must not only promote stable enough TCR binding for activation to occur, but the TCR must also have a high enough off-rate that multiple TCR molecules can interact sequentially with the same peptide-MHC complex (Kalergis, A. M. et al., Nature Immunol. 2:229-234, 2001, which is hereby incorporated by reference in its entirety). Thus, without further information about the ternary complex, both conservative and non-conservative substitutions at these positions merit consideration when designing variants.

[0151]

The polypeptide epitope variants can be made, for example, using any of the techniques and guidelines for conservative and non-conservative mutations. Variants can be derived from substitution, deletion or insertion of one or more amino acids as compared with the native sequence. Amino acid substitutions can be the result of replacing one amino acid with another amino acid having similar structural and/or chemical properties, such as the replacement of a threonine with a serine, for example. Such replacements are referred to as conservative amino acid replacements, and all appropriate conservative amino acid replacements are considered to be embodiments of one invention. Insertions or deletions can optionally be in the range of about 1 to 4, preferably 1 to 2, amino acids. It is generally preferable to maintain the “anchor positions” of the peptide which are responsible for binding to the MHC molecule in question. Indeed, immunogenicity of peptides can be improved in many cases by substituting more preferred residues at the anchor positions (Franco, et al., Nature Immunology, 1(2):145-150, 2000, which is hereby incorporated by reference in its entirety). Immunogenicity of a peptide can also often be improved by substituting bulkier amino acids for small amino acids found in non-anchor positions while maintaining sufficient cross-reactivity with the original epitope to constitute a useful vaccine. The variation allowed can be determined by routine insertions, deletions or substitutions of amino acids in the sequence and testing the resulting variants for activity exhibited by the polypeptide epitope. Because the polypeptide epitope is often 9 amino acids, the substitutions preferably are made to the shortest active epitope, for example, an epitope of 9 amino acids.

[0152]

Variants can also be made by adding any sequence onto the N-terminus of the polypeptide epitope variant. Such N-terminal additions can be from 1 amino acid up to at least 25 amino acids. Because peptide epitopes are often trimmed by N-terminal exopeptidases active in the pAPC, it is understood that variations in the added sequence can have no effect on the activity of the epitope. In preferred embodiments, the amino acid residues between the last upstream proteasomal cleavage site and the N-terminus of the MHC epitope do not include a proline residue. Serwold, T. at al., Nature Immunol. 2:644-651, 2001, which is hereby incorporated by reference in its entirety. Accordingly, effective epitopes can be generated from precursors larger than the preferred 9-mer class I motif.

[0153]

Generally, peptides are useful to the extent that they correspond to epitopes actually displayed by MHC I on the surface of a target cell or a pACP. A single peptide can have varying affinities for different MHC molecules, binding some well, others adequately, and still others not appreciably (Table 2). MHC alleles have traditionally been grouped according to serologic reactivity which does not reflect the structure of the peptide-binding groove, which can differ among different alleles of the same type. Similarly, binding properties can be shared across types; groups based on shared binding properties have been termed supertypes. There are numerous alleles of MHC I in the human population; epitopes specific to certain alleles can be selected based on the genotype of the patient.

Predicted Binding of Tyrosinase207-216 (SEQ ID NO. 1) to Various
MHC types
*Half time of
MHC I typedissociation (min)
A10.05
A*02011311.
A*020550.4
A32.7
A*1101 (part of the A3 supertype)0.012
A246.0
B74.0
B88.0
B14 (part of the B27 supertype)60.0
B*27020.9
B*270530.0
B*3501 (part of the B7 supertype)2.0
B*44030.1
B*5101 (part of the B7 supertype)26.0
B*510255.0
B*58010.20
B600.40
B622.0

*HLA Peptide Binding Predictions (world wide web hypertext transfer protocol “access at bimas.dcrt.nih.gov/molbio/hla_bin”).

[0154]

In further embodiments of the invention, the epitope, as peptide or encoding polynucleotide, can be administered as a pharmaceutical composition, such as, for example, a vaccine or an immunogenic composition, alone or in combination with various adjuvants, carriers, or excipients. It should be noted that although the term vaccine may be used throughout the discussion herein, the concepts can be applied and used with any other pharmaceutical composition, including those mentioned herein. Particularly advantageous adjuvants include various cytokines and oligonucleotides containing immunostimulatory sequences (as set forth in greater detail in the co-pending applications referenced herein). Additionally the polynucleotide encoded epitope can be contained in a virus (e.g. vaccinia or adenovirus) or in a microbial host cell (e.g. Salmonella or Listeria which is then used as a vector for the polynucleotide (Dietrich, G. et al. Nat. Biotech. 16:181-185, 1998, which is hereby incorporated by reference in its entirety). Alternatively a pAPC can be transformed, ex vivo, to express the epitope, or pulsed with peptide epitope, to be itself administered as a vaccine. To increase efficiency of these processes, the encoded epitope can be carried by a viral or bacterial vector, or complexed with a ligand of a receptor found on pAPC. Similarly the peptide epitope can be complexed with or conjugated to a pAPC ligand. A vaccine can be composed of more than a single epitope.

[0155]

Particularly advantageous strategies for incorporating epitopes and/or epitope clusters, into a vaccine or pharmaceutical composition are disclosed in U.S. patent application Ser. No. 09/560,465 entitled “EPITOPE SYNCHRONIZATION IN ANTIGEN PRESENTING CELLS,” filed on Apr. 28, 2000, which is hereby incorporated by reference in its entirety. Epitope clusters for use in connection with this invention are disclosed in U.S. patent application Ser. No. 09/561,571 entitled “EPITOPE CLUSTERS,” filed on Apr. 28, 2000, which is hereby incorporated by reference in its entirety.

[0156]

Preferred embodiments of the present invention are directed to vaccines and methods for causing a pAPC or population of pAPCs to present housekeeping epitopes that correspond to the epitopes displayed on a particular target cell. Any of the epitopes or antigens in Table 1, can be used for example. In one embodiment, the housekeeping epitope is a TuAA epitope processed by the housekeeping proteasome of a particular tumor type. In another embodiment, the housekeeping epitope is a virus-associated epitope processed by the housekeeping proteasome of a cell infected with a virus. This facilitates a specific T cell response to the target cells. Concurrent expression by the pAPCs of multiple epitopes, corresponding to different induction states (pre- and post-attack), can drive a CTL response effective against target cells as they display either housekeeping epitopes or immune epitopes.

[0157]

By having both housekeeping and immune epitopes present on the pAPC, this embodiment can optimize the cytotoxic T cell response to a target cell. With dual epitope expression, the pAPCs can continue to sustain a CTL response to the immune-type epitope when the tumor cell switches from the housekeeping proteasome to the immune proteasome with induction by IFN, which, for example, may be produced by tumor-infiltrating CTLs.

[0158]

In a preferred embodiment, immunization of a patient is with a vaccine that includes a housekeeping epitope. Many preferred TAAs are associated exclusively with a target cell, particularly in the case of infected cells. In another embodiment, many preferred TAAs are the result of deregulated gene expression in transformed cells, but are found also in tissues of the testis, ovaries and fetus. In another embodiment, useful TAAs are expressed at higher levels in the target cell than in other cells. In still other embodiments, TAAs are not differentially expressed in the target cell compare to other cells, but are still useful since they are involved in a particular function of the cell and differentiate the target cell from most other peripheral cells; in such embodiments, healthy cells also displaying the TAA may be collaterally attacked by the induced T cell response, but such collateral damage is considered to be far preferable to the condition caused by the target cell.

[0159]

The vaccine contains a housekeeping epitope in a concentration effective to cause a pAPC or populations of pAPCs to display housekeeping epitopes. Advantageously, the vaccine can include a plurality of housekeeping epitopes or one or more housekeeping epitopes optionally in combination with one or more immune epitopes. Formulations of the vaccine contain peptides and/or nucleic acids in a concentration sufficient to cause pAPCs to present the epitopes. The formulations preferably contain epitopes in a total concentration of about 1 μg-1 mg/100 μl of vaccine preparation. Conventional dosages and dosing for peptide vaccines and/or nucleic acid vaccines can be used with the present invention, and such dosing regimens are well understood in the art. In one embodiment, a single dosage for an adult human may advantageously be from about 1 to about 5000 μl of such a composition, administered one time or multiple times, e.g., in 2, 3, 4 or more dosages separated by 1 week, 2 weeks, 1 month, or more. insulin pump delivers 1 ul per hour (lowest frequency) ref intranodal method patent.

[0160]

The compositions and methods of the invention disclosed herein further contemplate incorporating adjuvants into the formulations in order to enhance the performance of the vaccines. Specifically, the addition of adjuvants to the formulations is designed to enhance the delivery or uptake of the epitopes by the pAPCs. The adjuvants contemplated by the present invention are known by those of skill in the art and include, for example, GMCSF, GCSF, IL-2, IL-12, BCG, tetanus toxoid, osteopontin, and ETA-1.

[0161]

In some embodiments of the invention, the vaccines can include a recombinant organism, such as a virus, bacterium or parasite, genetically engineered to express an epitope in a host. For example, Listeria monocytogenes, a gram-positive, facultative intracellular bacterium, is a potent vector for targeting TuAAs to the immune system. In a preferred embodiment, this vector can be engineered to express a housekeeping epitope to induce therapeutic responses. The normal route of infection of this organism is through the gut and can be delivered orally. In another embodiment, an adenovirus (Ad) vector encoding a housekeeping epitope for a TuAA can be used to induce anti-virus or anti-tumor responses. Bone marrow-derived dendritic cells can be transduced with the virus construct and then injected, or the virus can be delivered directly via subcutaneous injection into an animal to induce potent T-cell responses. Another embodiment employs a recombinant vaccinia virus engineered to encode amino acid sequences corresponding to a housekeeping epitope for a TAA. Vaccinia viruses carrying constructs with the appropriate nucleotide substitutions in the form of a minigene construct can direct the expression of a housekeeping epitope, leading to a therapeutic T cell response against the epitope.

[0162]

The immunization with DNA requires that APCs take up the DNA and express the encoded proteins or peptides. It is possible to encode a discrete class I peptide on the DNA. By immunizing with this construct, APCs can be caused to express a housekeeping epitope, which is then displayed on class I MHC on the surface of the cell for stimulating an appropriate CTL response. Constructs generally relying on termination of translation or non-proteasomal proteases for generation of proper termini of housekeeping epitopes have been described in U.S. patent application Ser. No. 09/561,572 entitled EXPRESSION VECTORS ENCODING EPITOPES OF TARGET-ASSOCIATED ANTIGENS, filed on Apr. 28, 2000.

[0163]

As mentioned, it can be desirable to express housekeeping peptides in the context of a larger protein. Processing can be detected even when a small number of amino acids are present beyond the terminus of an epitope. Small peptide hormones are usually proteolytically processed from longer translation products, often in the size range of approximately 60-120 amino acids. This fact has led some to assume that this is the minimum size that can be efficiently translated. In some embodiments, the housekeeping peptide can be embedded in a translation product of at least about 60 amino acids. In other embodiments the housekeeping peptide can be embedded in a translation product of at least about 50, 30, or 15 amino acids.

[0164]

Due to differential proteasomal processing, the immune proteasome of the pAPC produces peptides that are different from those produced by the housekeeping proteasome in peripheral body cells. Thus, in expressing a housekeeping peptide in the context of a larger protein, it is preferably expressed in the APC in a context other than its full length native sequence, because, as a housekeeping epitope, it is generally only efficiently processed from the native protein by the housekeeping proteasome, which is not active in the APC. In order to encode the housekeeping epitope in a DNA sequence encoding a larger protein, it is useful to find flanking areas on either side of the sequence encoding the epitope that permit appropriate cleavage by the immune proteasome in order to liberate that housekeeping epitope. Altering flanking amino acid residues at the N-terminus and C-terminus of the desired housekeeping epitope can facilitate appropriate cleavage and generation of the housekeeping epitope in the APC. Sequences embedding housekeeping epitopes can be designed de novo and screened to determine which can be successfully processed by immune proteasomes to liberate housekeeping epitopes.

[0165]

Alternatively, another strategy is very effective for identifying sequences allowing production of housekeeping epitopes in APC. A contiguous sequence of amino acids can be generated from head to tail arrangement of one or more housekeeping epitopes. A construct expressing this sequence is used to immunize an animal, and the resulting T cell response is evaluated to determine its specificity to one or more of the epitopes in the array. By definition, these immune responses indicate housekeeping epitopes that are processed in the pAPC effectively. The necessary flanking areas around this epitope are thereby defined. The use of flanking regions of about 4-6 amino acids on either side of the desired peptide can provide the necessary information to facilitate proteasome processing of the housekeeping epitope by the immune proteasome. Therefore, a sequence ensuring epitope synchronization of approximately 16-22 amino acids can be inserted into, or fused to, any protein sequence effectively to result in that housekeeping epitope being produced in an APC. In alternate embodiments the whole head-to-tail array of epitopes, or just the epitopes immediately adjacent to the correctly processed housekeeping epitope can be similarly transferred from a test construct to a vaccine vector.

[0166]

In a preferred embodiment, the housekeeping epitopes can be embedded between known immune epitopes, or segments of such, thereby providing an appropriate context for processing. The abutment of housekeeping and immune epitopes can generate the necessary context to enable the immune proteasome to liberate the housekeeping epitope, or a larger fragment, preferably including a correct C-terminus. It can be useful to screen constructs to verify that the desired epitope is produced. The abutment of housekeeping epitopes can generate a site cleavable by the immune proteasome. Some embodiments of the invention employ known epitopes to flank housekeeping epitopes in test substrates; in others, screening as described below are used whether the flanking regions are arbitrary sequences or mutants of the natural flanking sequence, and whether or not knowledge of proteasomal cleavage preferences are used in designing the substrates.

[0167]

Cleavage at the mature N-terminus of the epitope, while advantageous, is not required, since a variety of N-terminal trimming activities exist in the cell that can generate the mature N-terminus of the epitope subsequent to proteasomal processing. It is preferred that such N-terminal extension be less than about 25 amino acids in length and it is further preferred that the extension have few or no proline residues. Preferably, in screening, consideration is given not only to cleavage at the ends of the epitope (or at least at its C-terminus), but consideration also can be given to ensure limited cleavage within the epitope.

[0168]

Shotgun approaches can be used in designing test substrates and can increase the efficiency of screening. In one embodiment multiple epitopes can be assembled one after the other, with individual epitopes possibly appearing more than once. The substrate can be screened to determine which epitopes can be produced. In the case where a particular epitope is of concern a substrate can be designed in which it appears in multiple different contexts. When a single epitope appearing in more than one context is liberated from the substrate additional secondary test substrates, in which individual instances of the epitope are removed, disabled, or are unique, can be used to determine which are being liberated and truly constitute sequences ensuring epitope synchronization.

[0169]

Several readily practicable screens exist. A preferred in vitro screen utilizes proteasomal digestion analysis, using purified immune proteasomes, to determine if the desired housekeeping epitope can be liberated from a synthetic peptide embodying the sequence in question. The position of the cleavages obtained can be determined by techniques such as mass spectrometry, HPLC, and N-terminal pool sequencing; as described in greater detail in U.S. patent applications entitled METHOD OF EPITOPE DISCOVERY, EPITOPE SYNCHRONIZATION IN ANTIGEN PRESENTING CELLS, two Provisional U.S. patent applications entitled EPITOPE SEQUENCES, which are all cited and incorporated by reference above.

[0170]

Alternatively, in vivo screens such as immunization or target sensitization can be employed. For immunization a nucleic acid construct capable of expressing the sequence in question is used. Harvested CTL can be tested for their ability to recognize target cells presenting the housekeeping epitope in question. Such targets cells are most readily obtained by pulsing cells expressing the appropriate MHC molecule with synthetic peptide embodying the mature housekeeping epitope. Alternatively, cells known to express housekeeping proteasome and the antigen from which the housekeeping epitope is derived, either endogenously or through genetic engineering, can be used. To use target sensitization as a screen, CTL, or preferably a CTL clone, that recognizes the housekeeping epitope can be used. In this case it is the target cell that expresses the embedded housekeeping epitope (instead of the pAPC during immunization) and it must express immune proteasome. Generally, the target cell can be transformed with an appropriate nucleic acid construct to confer expression of the embedded housekeeping epitope. Loading with a synthetic peptide embodying the embedded epitope using peptide loaded liposomes or a protein transfer reagent such as BIOPORTER™ (Gene Therapy Systems, San Diego, Calif.) represents an alternative.

[0171]

Additional guidance on nucleic acid constructs useful as vaccines in accordance with the present invention are disclosed in U.S. patent application Ser. No. 09/561,572 entitled “EXPRESSION VECTORS ENCODING EPITOPES OF TARGET-ASSOCIATED ANTIGENS,” filed on Apr. 28, 2000. Further, expression vectors and methods for their design, which are useful in accordance with the present invention are disclosed in U.S. Patent Application Ser. No. 60/336,968 (attorney docket number CTLIMM.022PR) entitled “EXPRESSION VECTORS ENCODING EPITOPES OF TARGET-ASSOCIATED ANTIGENS AND METHODS FOR THEIR DESIGN,” filed on Nov. 7, 2001, which is incorporated by reference in its entirety.

[0172]

A preferred embodiment of the present invention includes a method of administering a vaccine including an epitope (or epitopes) to induce a therapeutic immune response. The vaccine is administered to a patient in a manner consistent with the standard vaccine delivery protocols that are known in the art. Methods of administering epitopes of TAAs including, without limitation, transdermal, intranodal, perinodal, oral, intravenous, intradermal, intramuscular, intraperitoneal, and mucosal administration, including delivery by injection, instillation or inhalation. A particularly useful method of vaccine delivery to elicit a CTL response is disclosed in Australian Patent No. 739189 issued Jan. 17, 2002; U.S. patent application Ser. No. 09/380,534, filed on Sep. 1, 1999; and a Continuation-in-Part thereof U.S. patent application Ser. No. 09/776,232 both entitled “A METHOD OF INDUCING A CTL RESPONSE,” filed on Feb. 2, 2001.

[0000]

Reagents Recognizing Epitopes

[0173]

In another aspect of the invention, proteins with binding specificity for the epitope and/or the epitope-MHC molecule complex are contemplated, as well as the isolated cells by which they can be expressed. In one set of embodiments these reagents take the form of immunoglobulins: polyclonal sera or monoclonal antibodies (mAb), methods for the generation of which are well know in the art. Generation of mAb with specificity for peptide-MHC molecule complexes is known in the art. See, for example, Aharoni et al. Nature 351:147-150, 1991; Andersen et al. Proc. Natl. Acad. Sci. USA 93:1820-1824, 1996; Dadaglio et al. Immunity 6:727-738, 1997; Duc et al. Int. Immunol. 5:427-431,1993; Eastman et al. Eur. J. Immunol. 26:385-393, 1996; Engberg et al. Immunotechnology 4:273-278, 1999; Porgdor et al. Immunity 6:715-726, 1997; Puri et al. J. Immunol. 158:2471-2476, 1997; and Polakova, K., et al. J. Immunol. 165 342-348, 2000; all of which are hereby incorporated by reference in their entirety.

[0174]

In other embodiments the compositions can be used to induce and generate, in vivo and in vitro, T-cells specific for the any of the epitopes and/or epitope-MHC complexes. In preferred embodiments the epitope can be any one or more of those listed in TABLE 1, for example. Thus, embodiments also relate to and include isolated T cells, T cell clones, T cell hybridomas, or a protein containing the T cell receptor (TCR) binding domain derived from the cloned gene, as well as a recombinant cell expressing such a protein. Such TCR derived proteins can be simply the extra-cellular domains of the TCR, or a fusion with portions of another protein to confer a desired property or function. One example of such a fusion is the attachment of TCR binding domains to the constant regions of an antibody molecule so as to create a divalent molecule. The construction and activity of molecules following this general pattern have been reported, for example, Plaksin, D. et al. J. Immunol. 158:2218-2227, 1997 and Lebowitz, M. S. et al. Cell Immunol. 192:175-184, 1999, which are hereby incorporated by reference in their entirety. The more general construction and use of such molecules is also treated in U.S. Pat. No. 5,830,755 entitled T CELL RECEPTORS AND THEIR USE IN THERAPEUTIC AND DIAGNOSTIC METHODS, which is hereby incorporated by reference in its entirety.

[0175]

The generation of such T cells can be readily accomplished by standard immunization of laboratory animals, and reactivity to human target cells can be obtained by. immunizing with human target cells or by immunizing HLA-transgenic animals with the antigen/epitope. For some therapeutic approaches T cells derived from the same species are desirable. While such a cell can be created by cloning, for example, a murine TCR into a human T cell as contemplated above, in vitro immunization of human cells offers a potentially faster option. Techniques for in vitro immunization, even using naive donors, are know in the field, for example, Stauss et al., Proc. Natl. Acad. Sci. USA 89:7871-7875, 1992; Salgaller et al. Cancer Res. 55:4972-4979, 1995; Tsai et al., J. Immunol. 158:1796-1802, 1997; and Chung et al., J. Immunother. 22:279-287, 1999; which are hereby incorporated by reference in their entirety.

[0176]

Any of these molecules can be conjugated to enzymes, radiochemicals, fluorescent tags, and toxins, so as to be used in the diagnosis (imaging or other detection), monitoring, and treatment of the pathogenic condition associated with the epitope. Thus a toxin conjugate can be administered to kill tumor cells, radiolabeling can facilitate imaging of epitope positive tumor, an enzyme conjugate can be used in an ELISA-like assay to diagnose cancer and confirm epitope expression in biopsied tissue. In a further embodiment, such T cells as set forth above, following expansion accomplished through stimulation with the epitope and/or cytokines, can be administered to a patient as an adoptive immunotherapy.

[0000]

Reagents Comprising Epitopes

[0177]

A further aspect of the invention provides isolated epitope-MHC complexes. In a particularly advantageous embodiment of this aspect of the invention, the complexes can be soluble, multimeric proteins such as those described in U.S. Pat. No. 5,635,363 (tetramers) or U.S. Pat. No. 6,015,884 (Ig-dimers), both of which are hereby incorporated by reference in their entirety. Such reagents are useful in detecting and monitoring specific T cell responses, and in purifying such T cells.

[0178]

Isolated MHC molecules complexed with epitopic peptides can also be incorporated into planar lipid bilayers or liposomes. Such compositions can be used to stimulate T cells in vitro or, in the case of liposomes, in vivo. Co-stimulatory molecules (e.g. B7, CD40, LFA-3) can be incorporated into the same compositions or, especially for in vitro work, co-stimulation can be provided by anti-co-receptor antibodies (e.g. anti-CD28, anti-CD154, anti-CD2) or cytokines (e.g. IL-2, IL-12). Such stimulation of T cells can constitute vaccination, drive expansion of T cells in vitro for subsequent infusion in an immuotherapy, or constitute a step in an assay of T cell function.

[0179]

The epitope, or more directly its complex with an MHC molecule, can be an important constituent of functional assays of antigen-specific T cells at either an activation or readout step or both. Of the many assays of T cell function current in the art (detailed procedures can be found in standard immunological references such as Current Protocols in Immunology 1999 John Wiley & Sons Inc., New York, which is hereby incorporated by reference in its entirety) two broad classes can be defined, those that measure the response of a pool of cells and those that measure the response of individual cells. Whereas the former conveys a global measure of the strength of a response, the latter allows determination of the relative frequency of responding cells. Examples of assays measuring global response are cytotoxicity assays, ELISA, and proliferation assays detecting cytokine secretion. Assays measuring the responses of individual cells (or small clones derived from them) include limiting dilution analysis (LDA), ELISPOT, flow cytometric detection of unsecreted cytokine (described in U.S. Pat. No. 5,445,939, entitled “METHOD FOR ASSESSMENT OF THE MONONUCLEAR LEUKOCYTE IMMUNE SYSTEM” and U.S. Pat. Nos. 5,656,446; and 5,843,689, both entitled “METHOD FOR THE ASSESSMENT OF THE MONONUCLEAR LEUKOCYTE IMMUNE SYSTEM,” reagents for which are sold by Becton, Dickinson & Company under the tradename ‘FASTIMMUNE’, which patents are hereby incorporated by reference in their entirety) and detection of specific TCR with tetramers or Ig-dimers as stated and referenced above. The comparative virtues of these techniques have been reviewed in Yee, C. et al. Current Opinion in Immunology, 13:141-146, 2001, which is hereby incorporated by reference in its entirety. Additionally detection of a specific TCR rearrangement or expression can be accomplished through a variety of established nucleic acid based techniques, particularly in situ and single-cell PCR techniques, as will be apparent to one of skill in the art.

[0180]

These functional assays are used to assess endogenous levels of immunity, response to an immunologic stimulus (e.g. a vaccine), and to monitor immune status through the course of a disease and treatment. Except when measuring endogenous levels of immunity, any of these assays presume a preliminary step of immunization, whether in vivo or in vitro depending on the nature of the issue being addressed. Such immunization can be carried out with the various embodiments of the invention described above or with other forms of immunogen (e.g., pAPC-tumor cell fusions) that can provoke similar immunity. With the exception of PCR and tetramer/Ig-dimer type analyses which can detect expression of the cognate TCR, these assays generally benefit from a step of in vitro antigenic stimulation which can advantageously use various embodiments of the invention as described above in order to detect the particular functional activity (highly cytolytic responses can sometimes be detected directly). Finally, detection of cytolytic activity requires epitope-displaying target cells, which can be generated using various embodiments of the invention. The particular embodiment chosen for any particular step depends on the question to be addressed, ease of use, cost, and the like, but the advantages of one embodiment over another for any particular set of circumstances will be apparent to one of skill in the art.

[0181]

The peptide MHC complexes described in this section have traditionally been understood to be non-covalent associations. However it is possible, and can be advantageous, to create a covalent linkages, for example by encoding the epitope and MHC heavy chain or the epitope, β2-microglobulin, and MHC heavy chain as a single protein (Yu, Y. L. Y., et al., J. Immunol. 168:3145-3149, 2002; Mottez, E., et at., J. Exp. Med. 181:493,1995; Dela Cruz, C. S., et al., Int. Immunol. 12:1293, 2000; Mage, M. G., et al., Proc. Natl. Acad. Sci. USA 89:10658,1992; Toshitani, K., et al., Proc. Natl. Acad. Sci. USA 93:236,1996; Lee, L., et al., Eur. J. Immunol. 24:2633,1994; Chung, D. H., et al., J. Immunol. 163:3699,1999; Uger, R. A. and B. H. Barber, J. Immunol. 160:1598, 1998; Uger, R. A., et al., J. Immunol. 162:6024,1999; and White, J., et al., J. Immunol. 162:2671, 1999; which are incorporated herein by reference in their entirety). Such constructs can have superior stability and overcome roadblocks in the processing-presentation pathway. They can be used in the already described vaccines, reagents, and assays in similar fashion.

[0000]

Tumor Associated Antigens

[0182]

Epitopes of the present invention are derived from the TuAAs tyrosinase (SEQ ID NO. 2), SSX-2, (SEQ ID NO. 3), PSMA (prostate-specific membrane antigen) (SEQ ID NO. 4), GP100, (SEQ ID NO. 70), MAGE-1, (SEQ ID NO. 71), MAGE-2, (SEQ ID NO. 72), MAGE-3, (SEQ ID NO. 73), NY-ESO-1, (SEQ ID NO. 74), PRAME, (SEQ ID NO. 77), PSA, (SEQ ID NO. 78), PSCA, (SEQ ID NO. 79), the ED-B domain of fibronectin (SEQ ID NOS 589 and 590), CEA (carcinoembryonic antigen) (SEQ ID NO. 592), Her2/Neu (SEQ ID NO. 594), SCP-1 (SEQ ID NO. 596) and SSX-4 (SEQ ID NO. 598). The natural coding sequences for these eleven proteins, or any segments within them, can be determined from their cDNA or complete coding (cds) sequences, SEQ ID NOS. 5-7, 80-87, 591, 593, 595, 597, and 599, respectively.

[0183]

Tyrosinase is a melanin biosynthetic enzyme that is considered one of the most specific markers of melanocytic differentiation. Tyrosinase is expressed in few cell types, primarily in melanocytes, and high levels are often found in melanomas. The usefulness of tyrosinase as a TuAA is taught in U.S. Pat. No. 5,747,271 entitled “METHOD FOR IDENTIFYING INDIVIDUALS SUFFERING FROM A CELLULAR ABNORMALITY SOME OF WHOSE ABNORMAL CELLS PRESENT COMPLEXES OF HLA-A2/TYROSINASE DERIVED PEPTIDES, AND METHODS FOR TREATING SAID INDIVIDUALS” which is hereby incorporated by reference in its entirety.

[0184]

GP100, also known as PMel17, also is a melanin biosynthetic protein expressed at high levels in melanomas. GP100 as a TuAA is disclosed in U.S. Pat. No. 5,844,075 entitled “MELANOMA ANTIGENS AND THEIR USE IN DIAGNOSTIC AND THERAPEUTIC METHODS,” which is hereby incorporated by reference in its entirety.

[0185]

SSX-2, also know as Hom-Mel-40, is a member of a family of highly conserved cancer-testis antigens (Gure, A. O. et al. Int. J. Cancer 72:965-971, 1997, which is hereby incorporated by reference in its entirety). Its identification as a TuAA is taught in U.S. Pat. No. 6,025,191 entitled “ISOLATED NUCLEIC ACID MOLECULES WHICH ENCODE A MELANOMA SPECIFIC ANTIGEN AND USES THEREOF,” which is hereby incorporated by reference in its entirety. Cancer-testis antigens are found in a variety of tumors, but are generally absent from normal adult tissues except testis. Expression of different members of the SSX family have been found variously in tumor cell lines. Due to the high degree of sequence identity among SSX family members, similar epitopes from more than one member of the family will be generated and able to bind to an MHC molecule, so that some vaccines directed against one member of this family can cross-react and be effective against other members of this family (see example 3 below).

[0186]

MAGE-1, MAGE-2, and MAGE-3 are members of another family of cancer-testis antigens originally discovered in melanoma (MAGE is a contraction of melanoma-associated antigen) but found in a variety of tumors. The identification of MAGE proteins as TuAAs is taught in U.S. Pat. No. 5,342,774 entitled NUCLEOTIDE SEQUENCE ENCODING THE TUMOR REJECTION ANTIGEN PRECURSOR, MAGE-1, which is hereby incorporated by reference in its entirety, and in numerous subsequent patents. Currently there are 17 entries for (human) MAGE in the SWISS Protein database. There is extensive similarity among these proteins so in many cases, an epitope from one can induce a cross-reactive response to other members of the family. A few of these have not been observed in tumors, most notably MAGE-H1 and MAGE-D1, which are expressed in testes and brain, and bone marrow stromal cells, respectively. The possibility of cross-reactivity on normal tissue is ameliorated by the fact that they are among the least similar to the other MAGE proteins.

[0187]

NY-ESO-1, is a cancer-testis antigen found in a wide variety of tumors, also known as CTAG-1 (Cancer-Testis Antigen-1) and CAG-3 (Cancer Antigen-3). NY-ESO-1 as a TuAA is disclosed in U.S. Pat. No. 5,804,381 entitled ISOLATED NUCLEIC ACID MOLECULE ENCODING AN ESOPHAGEAL CANCER ASSOCIATED ANTIGEN, THE ANTIGEN ITSELF, AND USES THEREOF which is hereby incorporated by reference in its entirety. A paralogous locus encoding antigens with extensive sequence identity, LAGE-1a/s (SEQ ID NO. 75) and LAGE-1b/L (SEQ ID NO. 76), have been disclosed in publicly available assemblies of the human genome, and have been concluded to arise through alternate splicing. Additionally, CT-2 (or CTAG-2, Cancer-Testis Antigen-2) appears to be either an allele, a mutant, or a sequencing discrepancy of LAGE-1b/L. Due to the extensive sequence identity, many epitopes from NY-ESO-1 can also induce immunity to tumors expressing these other antigens. See FIG. 1. The proteins are virtually identical through amino acid 70. From 71-134 the longest run of identities between NY-ESO-1 and LAGE is 6 residues, but potentially cross-reactive sequences are present. And from 135-180 NY-ESO and LAGE-1a/s are identical except for a single residue, but LAGE-1b/L is unrelated due to the alternate splice. The CAMEL and LAGE-2 antigens appear to derive from the LAGE-1 mRNA, but from alternate reading frames, thus giving rise to unrelated protein sequences. More recently, GenBank Accession AF277315.5, Homo sapiens chromosome X clone RP5-865E18, RP5-1087L19, complete sequence, reports three independent loci in this region which are labeled as LAGE1 (corresponding to CTAG-2 in the genome assemblies), plus LAGE2-A and LAGE2-B (both corresponding to CTAG-1 in the genome assemblies).

[0188]

PSMA (prostate-specific membranes antigen), a TuAA described in U.S. Pat. No. 5,538,866 entitled “PROSTATE-SPECIFIC MEMBRANES ANTIGEN” which is hereby incorporated by reference in its entirety, is expressed by normal prostate epithelium and, at a higher level, in prostatic cancer. It has also been found in the neovasculature of non-prostatic tumors. PSMA can thus form the basis for vaccines directed to both prostate cancer and to the neovasculature of other tumors. This later concept is more fully described in a provisional U.S. Patent application Ser. No. 60/274,063 entitled ANTI-NEOVASCULAR VACCINES FOR CANCER, filed Mar. 7, 2001, and U.S. application Ser. No. 10/094,699, attorney docket number CTLIMM.015A, filed on Mar. 7, 2002, entitled “ANTI-NEOVASCULAR PREPARATIONS FOR CANCER,” both of which are hereby incorporated by reference in their entirety. Briefly, as tumors grow they recruit ingrowth of new blood vessels. This is understood to be necessary to sustain growth as the centers of unvascularized tumors are generally necrotic and angiogenesis inhibitors have been reported to cause tumor regression. Such new blood vessels, or neovasculature, express antigens not found in established vessels, and thus can be specifically targeted. By inducing CTL against neovascular antigens the vessels can be disrupted, interrupting the flow of nutrients to (and removal of wastes from) tumors, leading to regression.

[0189]

Alternate splicing of the PSMA mRNA also leads to a protein with an apparent start at Met58, thereby deleting the putative membrane anchor region of PSMA as described in U.S. Pat. No. 5,935,818 entitled “ISOLATED NUCLEIC ACID MOLECULE ENCODING ALTERNATIVELY SPLICED PROSTATE-SPECIFIC MEMBRANES ANTIGEN AND USES THEREOF” which is hereby incorporated by reference in its entirety. A protein termed PSMA-like protein, Genbank accession number AF261715, is nearly identical to amino acids 309-750 of PSMA and has a different expression profile. Thus the most preferred epitopes are those with an N-terminus located from amino acid 58 to 308.

[0190]

PRAME, also know as MAPE, DAGE, and OIP4, was originally observed as a melanoma antigen. Subsequently, it has been recognized as a CT antigen, but unlike many CT antigens (e.g., MAGE, GAGE, and BAGE) it is expressed in acute myeloid leukemias. PRAME is a member of the MAPE family which consists largely of hypothetical proteins with which it shares limited sequence similarity. The usefulness of PRAME as a TuAA is taught in U.S. Pat. No. 5,830,753 entitled “ISOLATED NUCLEIC ACID MOLECULES CODING FOR TUMOR REJECTION ANTIGEN PRECURSOR DAGE AND USES THEREOF” which is hereby incorporated by reference in its entirety.

[0191]

PSA, prostate specific antigen, is a peptidase of the kallikrein family and a differentiation antigen of the prostate. Expression in breast tissue has also been reported. Alternate names include gamma-seminoprotein, kallikrein 3, seminogelase, seminin, and P-30 antigen. PSA has a high degree of sequence identity with the various alternate splicing products prostatic/glandular kallikrein-1 and -2, as well as kalikrein 4, which is also expressed in prostate and breast tissue. Other kallikreins generally share less sequence identity and have different expression profiles. Nonetheless, cross-reactivity that might be provoked by any particular epitope, along with the likelihood that that epitope would be liberated by processing in non-target tissues (most generally by the housekeeping proteasome), should be considered in designing a vaccine.

[0192]

PSCA, prostate stem cell antigen, and also known as SCAH-2, is a differentiation antigen preferentially expressed in prostate epithelial cells, and overexpresssed in prostate cancers. Lower level expression is seen in some normal tissues including neuroendocrine cells of the digestive tract and collecting ducts of the kidney. PSCA is described in U.S. Pat. No. 5,856,136 entitled “HUMAN STEM CELL ANTIGENS” which is hereby incorporated by reference in its entirety.

[0193]

Synaptonemal complex protein 1 (SCP-1), also known as HOM-TES-14, is a meiosis-associated protein and also a cancer-testis antigen (Tureci, O., et al. Proc. Natl. Acad. Sci. USA 95:5211-5216, 1998). As a cancer antigen its expression is not cell-cycle regulated and it is found frequently in gliomas, breast, renal cell, and ovarian carcinomas. It has some similarity to myosins, but with few enough identities that cross-reactive epitopes are not an immediate prospect.

[0194]

The ED-B domain of fibronectin is also a potential target. Fibronectin is subject to developmentally regulated alternative splicing, with the ED-B domain being encoded by a single exon that is used primarily in oncofetal tissues (Matsuura, H. and S. Hakomori Proc. Natl. Acad. Sci. USA 82:6517-6521, 1985; Carnemolla, B. et al. J. Cell Biol. 108:1139-1148, 1989; Loridon-Rosa, B. et al. Cancer Res.50:1608-1612, 1990; Nicolo, G. et al. Cell Differ. Dev. 32:401-408, 1990; Borsi, L. et al. Exp. Cell Res. 199:98-105, 1992; Oyama, F. et al. Cancer Res. 53:2005-2011, 1993; Mandel, U. et al. APMIS 102:695-702, 1994; Farnoud, M. R. et al. Int. J. Cancer 61:27-34, 1995; Pujuguet, P. et al. Am. J. Pathol. 148:579-592, 1996; Gabler, U. et al. Heart 75:358-362, 1996;Chevalier, X. Br. J. Rheumatol. 35:407-415, 1996; Midulla, M. Cancer Res. 60:164-169, 2000).

[0195]

The ED-B domain is also expressed in fibronectin of the neovasculature (Kaczmarek, J. et al. Int. J. Cancer 59:11-16, 1994; Castellani, P. et al. Int. J. Cancer 59:612-618, 1994; Neri, D. et al. Nat. Biotech. 15:1271-1275, 1997; Karelina, T. V. and A. Z. Eisen Cancer Detect. Prev. 22:438-444, 1998; Tarli, L. et al. Blood 94:192-198, 1999; Castellani, P. et al. Acta Neurochir. (Wien) 142:277-282, 2000). As an oncofetal domain, the ED-B domain is commonly found in the fibronectin expressed by neoplastic cells in addition to being expressed by the neovasculature. Thus, CTL-inducing vaccines targeting the ED-B domain can exhibit two mechanisms of action: direct lysis of tumor cells, and disruption of the tumor's blood supply through destruction of the tumor-associated neovasculature. As CTL activity can decay rapidly after withdrawal of vaccine, interference with normal angiogenesis can be minimal. The design and testing of vaccines targeted to neovasculature is described in Provisional U.S. Patent Application Ser. No. 60/274,063 entitled “ANTI-NEOVASCULATURE VACCINES FOR CANCER” and in U.S. patent application Ser. No. 10/094,699, attorney docket number CTLIMM.0.15A, entitled “ANTI-NEOVASCULATURE PREPARATIONS FOR CANCER, filed on date even with this application (Mar. 7, 2002). A tumor cell line is disclosed in Provisional U.S. Application Ser. No. 60/363,131, filed on Mar. 7, 2002, attorney docket number CTLIMM.028PR, entitled “HLA-TRANSGENIC MURINE TUMOR CELL LINE,” which is hereby incorporated by reference in its entirety.

[0196]

Carcinoembryonic antigen (CEA) is a paradigmatic oncofetal protein first described in 1965 (Gold and Freedman, J. Exp. Med. 121: 439-462, 1965. Fuller references can be found in the Online Medelian Inheritance in Man; record *114890). It has officially been renamed carcinoembryonic antigen-related cell adhesion molecule 5 (CEACAM5). Its expression is most strongly associated with adenocarcinomas of the epithelial lining of the digestive tract and in fetal colon. CEA is a member of the immunoglobulin supergene family and the defining member of the CEA subfamily.

[0197]

HER2/NEU is an oncogene related to the epidermal growth factor receptor (van de Vijver, et al., New Eng. J. Med. 319:1239-1245, 1988), and apparently identical to the c-ERBB2 oncogene (Di Fiore, et al., Science 237: 178-182, 1987). The over-expression of ERBB2 has been implicated in the neoplastic transformation of prostate cancer. As HER2 it is amplified and over-expressed in 25-30% of breast cancers among other tumors where expression level is correlated with the aggressiveness of the tumor (Slamon, et al., New Eng. J. Med. 344:783-792, 2001). A more detailed description is available in the Online Medelian Inheritance in Man; record *164870.

[0198]

All references mentioned herein are hereby incorporated by reference in their entirety. Further, incorporated by reference in its entirety is U.S. patent application Ser. No. 10/005,905 (attorney docket number CTLIMM.021CP1) entitled “EPITOPE SYNCHRONIZATION IN ANTIGEN PRESENTING CELLS,” filed on Nov. 7, 2001 and a continuation thereof, U.S. application Ser. No. 10/026066, filed on Dec. 7, 2001, attorney docket number MANNK.021CP1C, also entitled “EPITOPE SYNCHRONIZATION IN ANTIGEN PRESENTING CELLS.”

[0199]

Useful epitopes were identified and tested as described in the following examples. However, these examples are intended for illustration purposes only, and should not be construed as limiting the scope of the invention in any way.

EXAMPLES

[0000]

Sequences of Specific Preferred Epitopes

Example 1

Manufacture of Epitopes

[0000]

A. Synthetic Production of Epitopes

[0200]

Peptides having an amino acid sequence of any of SEQ ID NO: 1, 8, 9, 11-23, 26-29, 32-44, 47-54, 56-63, 66-68 88-253, or 256-588 are synthesized using either FMOC or tBOC solid phase synthesis methodologies. After synthesis, the peptides are cleaved from their supports with either trifluoroacetic acid or hydrogen fluoride, respectively, in the presence of appropriate protective scavengers. After removing the acid by evaporation, the peptides are extracted with ether to remove the scavengers and the crude, precipitated peptide is then lyophilized. Purity of the crude peptides is determined by HPLC, sequence analysis, amino acid analysis, counterion content analysis and other suitable means. If the crude peptides are pure enough (greater than or equal to about 90% pure), they can be used as is. If purification is required to meet drug substance specifications, the peptides are purified using one or a combination of the following: re-precipitation; reverse-phase, ion exchange, size exclusion or hydrophobic interaction chromatography; or counter-current distribution.

[0000]

Drug Product Formulation

[0201]

GMP-grade peptides are formulated in a parenterally acceptable aqueous, organic, or aqueous-organic buffer or solvent system in which they remain both physically and chemically stable and biologically potent. Generally, buffers or combinations of buffers or combinations of buffers and organic solvents are appropriate. The pH range is typically between 6 and 9. Organic modifiers or other excipients can be added to help solubilize and stabilize the peptides. These include detergents, lipids, co-solvents, antioxidants, chelators and reducing agents. In the case of a lyophilized product, sucrose or mannitol or other lyophilization aids can be added. Peptide solutions are sterilized by membrane filtration into their final container-closure system and either lyophilized for dissolution in the clinic, or stored until use.

[0000]

B. Construction of Expression Vectors for Use as Nucleic Acid Vaccines

[0202]

The construction of three generic epitope expression vectors is presented below. The particular advantages of these designs are set forth in U.S. patent application Ser. No. 09/561,572 entitled “EXPRESSION VECTORS ENCODING EPITOPES OF TARGET-ASSOCIATED ANTIGENS,” which has been incorporated by reference in its entirety above.

[0203]

A suitable E. coli strain was then transfected with the plasmid and plated out onto a selective medium. Several colonies were grown up in suspension culture and positive clones were identified by restriction mapping. The positive clone was then grown up and aliquotted into storage vials and stored at −70° C.

[0204]

A mini-prep (QIAprep Spin Mini-prep: Qiagen, Valencia, Calif.) of the plasmid was then made from a sample of these cells and automated fluorescent dideoxy sequence analysis was used to confirm that the construct had the desired sequence.

[0205]

B.1 Construction ofpVAX-EPI-IRES-EP2

[0000]

Overview:

[0206]

The starting plasmid for this construct is pVAX1 purchased from Invitrogen (Carlsbad, Calif.). Epitopes EP1 and EP2 were synthesized by GIBCO BRL (Rockville, Md.). The IRES was excised from pIRES purchased from Clontech (Palo Alto, Calif.).

[0000]

Procedure:

[0000]

    • 1 pIRES was digested with EcoRI and NotI. The digested fragments were separated by agarose gel electrophoresis, and the IRES fragment was purified from the excised band.
    • 2 pVAX1 was digested with EcoRI and NotI, and the pVAX1 fragment was gel-purified.
    • 3 The purified pVAX1 and IRES fragments were then ligated together.
    • 4 Competent E. coli of strain DH5α were transformed with the ligation mixture.
    • 5 Minipreps were made from 4 of the resultant colonies.
    • 6 Restriction enzyme digestion analysis was performed on the miniprep DNA. One recombinant colony having the IRES insert was used for further insertion of EP1 and EP2. This intermediate construct was called pVAX-IRES.
    • 7 Oligonucleotides encoding EP1 and EP2 were synthesized.
    • 8 EP1 was subcloned into pVAX-IRES between AflII and EcoRI sites, to make pVAX-EP1-IRES;
    • 9 EP2 was subcloned into pVAX-EP1-IRES between SalI and NotI sites, to make the final construct pVAX-EP1-IRES-EP2.
    • 10 The sequence of the EP1-IRES-EP2 insert was confirmed by DNA sequencing.

[0217]

B2. Construction of pVAX-EP1-IRES-EP2-ISS-NIS

[0000]

Overview:

[0218]

The starting plasmid for this construct was pVAX-EP1-IRES-EP2 Example 1). The ISS (immunostimulatory sequence) introduced into this construct is AACGTT, and the NIS (standing for nuclear import sequence) used is the SV40 72 bp repeat sequence. ISS-NIS was synthesized by GIBCO BRL. See FIG. 2.

[0000]

Procedure:

[0000]

    • 1 pVAX-EP1-IRES-EP2 was digested with NruI; the linearized plasmid was gel-purified.
    • 2 ISS-NIS oligonucleotide was synthesized.
    • 3 The purified linearized pVAX-EP1-IRES-EP2 and synthesized ISS-NIS were ligated together.
    • 4 Competent E. coli of strain DH5α were transformed with the ligation product.
    • 5 Minipreps were made from resultant colonies.
    • 6 Restriction enzyme digestions of the minipreps were carried out.
    • 7 The plasmid with the insert was sequenced.

[0226]

B3. Construction of pVAX-EP2-UB-EP1

[0000]

Overview:

[0227]

The starting plasmid for this construct was pVAX1 (Invitrogen). EP2 and EP1 were synthesized by GIBCO BRL. Wild type Ubiquitin cDNA encoding the 76 amino acids in the construct was cloned from yeast.

[0000]

Procedure:

[0000]

    • 1 RT-PCR was performed using yeast mRNA. Primers were designed to amplify the complete coding sequence of yeast Ubiquitin.
    • 2 The RT-PCR products were analyzed using agarose gel electrophoresis. A band with the predicted size was gel-purified.
    • 3 The purified DNA band. was subcloned into pZERO1 at EcoRV site. The resulting clone was named pZERO-UB.
    • 4 Several clones of pZERO-UB were sequenced to confirm the Ubiquitin sequence before further manipulations.
    • 5 EP1 and EP2 were synthesized.
    • 6 EP2, Ubiquitin and EP1 were ligated and the insert cloned into pVAX1 between BamHI and EcoRI, putting it under control of the CMV promoter.
    • 7 The sequence of the insert EP2-UB-EP1 was confirmed by DNA sequencing.

Example 2

Identification of Useful Epitope Variants

[0235]

The 10-mer FLPWHRLFLL (SEQ ID NO. 1) is identified as a useful epitope. Based on this sequence, numerous variants are made. Variants exhibiting activity in HLA binding assays (see Example 3, section 6) are identified as useful, and are subsequently incorporated into vaccines.

[0236]

The HLA-A2 binding of length variants of FLPWHRLFLL have been evaluated. Proteasomal digestion analysis indicates that the C-terminus of the 9-mer FLPWHRLFL (SEQ ID NO. 8) is also produced. Additionally the 9-mer LPWHRLFLL (SEQ ID NO. 9) can result from N-terminal trimming of the 10-mer. Both are predicted to bind to the HLA-A*0201 molecule, however of these two 9-mers, FLPWHRLFL displayed more significant binding and is preferred (see FIGS. 3A and B).

[0237]

In vitro proteasome digestion and N-terminal pool sequencing indicates that tyrosinase207-216 (SEQ ID NO. 1) is produced more commonly than tyrosinase207-215 (SEQ ID NO. 8), however the latter peptide displays superior immunogenicity, a potential concern in arriving at an optimal vaccine design. FLPWHRLFL, tyrosinase207-215 (SEQ ID NO. 8) was used in an in vitro immunization of HLA-A2+ blood to generate CTL (see CTL Induction Cultures below). Using peptide pulsed T2 cells as targets in a standard chromium release assay it was found that the CTL induced by tyrosinase207-215 (SEQ ID NO. 8) recognize tyrosinase207-216 (SEQ ID NO. 1) targets equally well (see FIG. 3C). These CTL also recognize the HLA-A2+, tyrosinase+ tumor cell lines 624.38 and HTB64, but not 624.28 an HLA-A2 derivative of 624.38 (FIG. 3C). Thus the relative amounts of these two epitopes produced in vivo, does not become a concern in vaccine design.

[0238]

CTL Induction Cultures

[0239]

PBMCs from normal donors were purified by centrifugation in Ficoll-Hypaque from buffy coats. All cultures were carried out using the autologous plasma (AP) to avoid exposure to potential xenogeneic pathogens and recognition of FBS peptides. To favor the in vitro generation of peptide-specific CTL, we employed autologous dendritic cells (DC) as APCs. DC were generated and CTL were induced with DC and peptide from PBMCs as described (Keogh et al., 2001). Briefly, monocyte-enriched cell fractions were cultured for 5 days with GM-CSF and IL-4 and were cultured for 2 additional days in culture media with 2 μg/ml CD40 ligand to induce maturation. 2×106 CD8+-enriched T lymphocytes/well and 2×105 peptide-pulsed DC/well were co-cultured in 24-well plates in 2 ml RPMI supplemented with 10% AP, 10 ng/ml IL-7 and 20 IU/ml IL-2. Cultures were restimulated on days 7 and 14 with autologous irradiated peptide-pulsed DC.

[0240]

Sequence variants of FLPWHRLFL are constructed as follow. Consistent with the binding coefficient table (see Table 3) from the NIH/BIMAS MHC binding prediction program (see reference in example 3 below), binding can be improved by changing the L at position 9, an anchor position, to V. Binding can also be altered, though generally to a lesser extent, by changes at non-anchor positions. Referring generally to Table 3, binding can be increased by employing residues with relatively larger coefficients. Changes in sequence can also alter immunogenicity independently of their effect on binding to MHC. Thus binding and/or immunogenicity can be improved as follows:

[0241]

By substituting F,L,M,W, or Y for P at position 3; these are all bulkier residues that can also improve immunogenicity independent of the effect on binding. The amine and hydroxyl-bearing residues, Q and N; and S and T; respectively, can also provoke a stronger, cross-reactive response.

[0242]

By substituting D or E for W at position 4 to improve binding; this addition of a negative charge can also make the epitope more immunogenic, while in some cases reducing cross-reactivity with the natural epitope. Alternatively the conservative substitutions of F or Y can provoke a cross-reactive response.

[0243]

By substituting F for H at position 5 to improve binding. H can be viewed as partially charged, thus in some cases the loss of charge can hinder cross-reactivity. Substitution of the fully charged residues R or K at this position can enhance immunogenicity without disrupting charge-dependent cross-reactivity.

[0244]

By substituting I, L, M, V, F, W, or Y for R at position 6. The same caveats and alternatives apply here as at position 5.

[0245]

By substituting W or F for L at position 7 to improve binding. Substitution of V, I, S, T, Q, or N at this position are not generally predicted to reduce binding affinity by this model (the NIH algorithm), yet can be advantageous as discussed above.

[0246]

Y and W, which are equally preferred as the Fs at positions 1 and 8, can provoke a useful cross-reactivity. Finally, while substitutions in the direction of bulkiness are generally favored to improve immunogenicity, the substitution of smaller residues such as A, S, and C, at positions 3-7 can be useful according to the theory that contrast in size, rather than bulkiness per se, is an important factor in immunogenicity. The reactivity of the thiol group in C can introduce other properties as discussed in Chen, J.-L., et al. J. Immunol. 165:948-955, 2000.

9-mer Coefficient Table for HLA-A*0201*
HLA Coefficient table for file “A_0201_standard”
Amino Acid
Type1st2nd3rd4th5th6th7th8th9th
A1.0001.0001.0001.0001.0001.0001.0001.0001.000
C1.0000.4701.0001.0001.0001.0001.0001.0001.000
D0.0750.1000.4004.1001.0001.0000.4901.0000.003
E0.0751.4000.0644.1001.0001.0000.4901.0000.003
F4.6000.0503.7001.0003.8001.9005.8005.5000.015
G1.0000.4701.0001.0001.0001.0000.1301.0000.015
H0.0340.0501.0001.0001.0001.0001.0001.0000.015
I1.7009.9001.0001.0001.0002.3001.0000.4102.100
K3.5000.1000.0351.0001.0001.0001.0001.0000.003
L1.70072.0003.7001.0001.0002.3001.0001.0004.300
M1.70052.0003.7001.0001.0002.3001.0001.0001.000
N1.0000.4701.0001.0001.0001.0001.0001.0000.015
P0.0220.4701.0001.0001.0001.0001.0001.0000.003
Q1.0007.3001.0001.0001.0001.0001.0001.0000.003
R1.0000.0100.0761.0001.0001.0000.2001.0000.003
S1.0000.4701.0001.0001.0001.0001.0001.0000.015
T1.0001.0001.0001.0001.0001.0001.0001.0001.500
V1.7006.3001.0001.0001.0002.3001.0000.41014.000
W4.6000.0108.3001.0001.0001.7007.5005.5000.015
Y4.6000.0103.2001.0001.0001.5001.0005.5000.015

*This table and other comparable data that are publicly available are useful in designing epitope variants and in determining whether a particular variant is substantially similar, or is functionally similar.

Example 3

Cluster Analysis (SSX-231-68)

[0000]

1. Epitope Cluster Region Prediction:

[0247]

The computer algorithms: SYFPEITHI (internet http:// access at syfpeithi.bmi-heidelberg.com/Scripts/MHCServer.dll/EpPredict.htm), based on the book “MHC Ligands and Peptide Motifs” by H. G. Rammensee, J. Bachmann and S. Stevanovic; and HLA Peptide Binding Predictions (NIH) (internet http:// access at bimas.dcrt.nih.gov/molbio/hla_bin), described in Parker, K. C., et al., J. Immunol. 152:163, 1994; were used to analyze the protein sequence of SSX-2 (GI:10337583). Epitope clusters (regions with higher than average density of peptide fragments with high predicted MHC affinity) were defined as described fully in U.S. patent application Ser. No. 09/561,571 entitled “EPITOPE CLUSTERS,” filed on Apr. 28, 2000. Using a epitope density ratio cutoff of 2, five and two clusters were defined using the SYFPETHI and NIH algorithms, respectively, and peptides score cutoffs of 16 (SYFPETHI) and 5 (NIH). The highest scoring peptide with the NIH algorithm, SSX-241-49, with an estimated halftime of dissociation of >1000 min., does not overlap any other predicted epitope but does cluster with SSX-257-65 in the NIH analysis.

[0000]

2. Peptide Synthesis and Characterization:

[0248]

SSX-231-68, YFSKEEWEKMKASEKIFYVYMKRKYEAMTKLGFKATLP (SEQ ID NO. 10) was synthesized by MPS (Multiple Peptide Systems, San Diego, Calif. 92121) using standard solid phase chemistry. According to the provided ‘Certificate of Analysis’, the purity of this peptide was 95%.

[0000]

3. Proteasome Digestion:

[0249]

Proteasome was isolated from human red blood cells using the proteasome isolation protocol described in U.S. patent application Ser. No. 09/561,074 entitled “METHOD OF EPITOPE DISCOVERY,” filed on Apr. 28, 2000. SDS-PAGE, western-blotting, and ELISA were used as quality control assays. The final concentration of proteasome was 4 mg/ml, which was determined by non-interfering protein assay (Geno Technologies Inc.). Proteasomes were stored at −70° C. in 25 μl aliquots.

[0250]

SSX-231-68 was dissolved in Milli-Q water, and a 2 mM stock solution prepared and 20 μL aliquots stored at −20° C.

[0251]

1 tube of proteasome (25 μL) was removed from storage at −70° C. and thawed on ice. It was then mixed thoroughly with 12.5 μL of 2 mM peptide by repipetting (samples were kept on ice). A 5 μL sample was immediately removed after mixing and transferred to a tube containing 1.25 μL 10% TFA (final concentration of TFA was 2%); the T=0 min sample. The proteasome digestion reaction was then started and carried out at 37° C. in a programmable thermal controller. Additional 5 μL samples were taken out at 15, 30, 60, 120, 180 and 240 min respectively, the reaction was stopped by adding the sample to 1.25 μL 10% TFA as before. Samples were kept on ice or frozen until being analyzed by MALDI-MS. All samples were saved and stored at −20° C. for HPLC analysis and N-terminal sequencing. Peptide alone (without proteasome) was used as a blank control: 2 μL peptide+4 μL Tris buffer (20 mM, pH 7.6)+1.5 μL TFA.

[0000]

4. MALDI-TOF MS Measurements:

[0252]

For each time point 0.3 μL of matrix solution (10 mg/ml α-cyano-4-hydroxycinnamic acid in AcCN/H2O (70:30)) was first applied on a sample slide, and then an equal volume of digested sample was mixed gently with matrix solution on the slide. The slide was allowed to dry at ambient air for 3-5 min. before acquiring the mass spectra. MS was performed on a Lasermat 2000 MALDI-TOF mass spectrometer that was calibrated with peptide/protein standards. To improve the accuracy of measurement, the molecular ion weight (MH+) of the peptide substrate was used as an internal calibration standard. The mass spectrum of the T=120 min. digested sample is shown in FIG. 4.

[0000]

5. MS Data Analysis and Epitope Identification:

[0253]

To assign the measured mass peaks, the computer program MS-Product, a tool from the UCSF Mass Spectrometry Facility (http:// accessible at prospector.ucsf.edu/ucsfhtml3.4/msprod.htm), was used to generate all possible fragments (N- and C-terminal ions, and internal fragments) and their corresponding molecular weights. Due to the sensitivity of the mass spectrometer, average molecular weight was used. The mass peaks observed over the course of the digestion were identified as summarized in Table 4.

[0254]

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction steps of the procedure can be usefully practiced in any order. Although the substrate peptide used in proteasomal digest described here was specifically designed to include predicted HLA-A2.1 binding sequences, the actual products of digestion can be checked after the fact for actual or predicted binding to other MHC molecules. Selected results are shown in Table 5.

SSX-231-68 Mass Peak Identification.
MS PEAKCALCULATED
(measured)PEPTIDESEQUENCEMASS (MH+)
988.2331-37YFSKEEW989.08
1377.68 ± 2.3831-40YFSKEEWEKM1377.68
1662.45 ± 1.3031-43YFSKEEWEKMKAS1663.90
2181.72 ± 0.8531-47YFSKEEWEKMKCASEKIF2181.52
2346.631-48YFSKEEWEKMKASEFIFY2344.71
1472.16 ± 1.5438-49       EKMKASEKIFYV1473.77
2445.78 ± 1.1831-49*YFSKEEWEKMKASEKIFYV2443.84
2607.31-50YFSKEEWEKMKASEKIFYVY2607.02
1563.350-61                   YMKRKYEAMTKL1562.93
3989.931-61YFSKEEWEKMKASEKIFYVYMKRKYEAMTKL3987.77
1603.74 ± 1.5351-63                    MKRKYEAMTKLGF1603.98
1766.45 ± 1.550-63                   YMKRKYEAMTKLGF1767.16
1866.32 ± 1.2249-63                  VYMKRKYEAMTKLGF1866.29
4192.631-63YFSKEEWEKMKASEKIFYVYMKRKYEAMTKLGF4192.00
4392.131-65YFSKEEWEKMKASEKIFYVYMKRKYEAMTKLGFKA4391.25

Boldface sequence correspond to peptides predicted to bind to MHC.

*On the basis of mass alone this peak could also have been assigned to the peptide 32-50, however proteasomal removal of just the N-terminal amino acid is unlikely. N-terminal sequencing (below) verifies the assignment to 31-49.

**On the basis of mass this fragment might also represent 33-68. N-terminal sequencing below is consistent with the assignment to 31-65.

[0255]

Predicted HLA binding by proteasomally
generated fragments
SEQ ID NO.PEPTIDEHLASYFPEITHINIH
11FSKEEWEKMB*3501NP†90
12KMKASEKIFB*0817<5
13 & (14)(K) MKASEKIFYA119 (19)<5
15 & (16)(M) KASEKIFYVA*020122 (16)1017
B*0817<5
B*510122 (13)60
B*5102NP133
B*5103NP121
17 & (18)(K) ASEKTFYVYA134 (19)14
19 & (20)(K) RKYEAMTKLA*020115<5
A2615NP
B14NP45 (60)
B*27052115
B*270916NP
B*510115<5
21KYEAMTKLGFA116<5
A24NP300
22YEAMTKLGFB*4403NP80
23EAMTKLGFB*0822<5

†No prediction

[0256]

As seen in Table 5, N-terminal addition of authentic sequence to epitopes can generate epitopes for the same or different MHC restriction elements. Note in particular the pairing of (K)RKYEAMTKL (SEQ ID NOS 19 and (20)) with HLA-B14, where the 10-mer has a longer predicted halftime of dissociation than the co-C-terminal 9-mer. Also note the case of the 10-mer KYEAMTKLGF (SEQ ID NO. 21) which can be used as a vaccine useful with several MHC types by relying on N-terminal trimming to create the epitopes for HLA-B*4403 and -B*08.

[0000]

6. HLA-A0201 Binding Assay:

[0257]

Binding of the candidate epitope KASEKIFYV, SSX-241-49, (SEQ ID NO. 15) to HLA-A2.1 was assayed using a modification of the method of Stauss et al., (Proc Natl Acad Sci USA 89(17):7871-5 (1992)). Specifically, T2 cells, which express empty or unstable MHC molecules on their surface, were washed twice with Iscove's modified Dulbecco's medium (IMDM) and cultured overnight in serum-free AIM-V medium (Life Technologies, Inc., Rockville, Md.) supplemented with human β2-microglobulin at 3 μg/ml (Sigma, St. Louis, Mo.) and added peptide, at 800, 400, 200, 100, 50, 25, 12.5, and 6.25 μg/ml. in a 96-well flat-bottom plate at 3×105 cells/200 μl/well. Peptide was mixed with the cells by repipeting before distributing to the plate (alternatively peptide can be added to individual wells), and the plate was rocked gently for 2 minutes. Incubation was in a 5% CO2 incubator at 37° C. The next day the unbound peptide was removed by washing twice with serum free RPMI medium and a saturating amount of anti-class I HLA monoclonal antibody, fluorescein isothiocyanate (FITC)-conjugated anti-HLA A2, A28 (One Lambda, Canoga Park, Calif.) was added. After incubation for 30 minutes at 4° C., cells were washed 3 times with PBS supplemented with 0.5% BSA, 0.05% (w/v) sodium azide, pH 7.4-7.6 (staining buffer). (Alternatively W6/32 (Sigma) can be used as the anti-class I HLA monoclonal antibody the cells washed with staining buffer and then incubated with fluorescein isothiocyanate (FITC)-conjugated goat F(ab′) antimouse-IgG (Sigma) for 30 min at 4° C. and washed 3 times as before.) The cells were resuspended in 0.5 ml staining buffer. The analysis of surface HLA-A2.1 molecules stabilized by peptide binding was performed by flow cytometry using a FACScan (Becton Dickinson, San Jose, Calif.). If flow cytometry is not to be performed immediately the cells can be fixed by adding a quarter volume of 2% paraformaldehyde and storing in the dark at 4° C.

[0258]

The results of the experiment are shown in FIG. 5. SSX-241-49 (SEQ ID NO. 15) was found to bind HLA-A2.1 to a similar extent as the known A2.1 binder FLPSDYFPSV (HBV18-27; SEQ ID NO: 24) used as a positive control. An HLA-B44 binding peptide, AEMGKYSFY (SEQ ID NO: 25), was used as a negative control. The fluoresence obtained from the negative control was similar to the signal obtained when no peptide was used in the assay. Positive and negative control peptides were chosen from Table 18.3.1 in Current Protocols in Immunology p. 18.3.2, John Wiley and Sons, New York, 1998.

[0000]

7. Immunogenicity:

[0259]

A. In Vivo Immunization of Mice.

[0260]

HHD1 transgenic A*0201 mice (Pascolo, S., et al. J. Exp. Med. 185:2043-2051, 1997) were anesthetized and injected subcutaneously at the base of the tail, avoiding lateral tail veins, using 100 lI containing 100 nmol of SSX-241-49 (SEQ ID NO. 15) and 20 μg of HTL epitope peptide in PBS emulsified with 50 μl of IFA (incomplete Freund's adjuvant).

[0261]

B. Preparation of Stimulating Cells (LPS Blasts).

[0262]

Using spleens from 2 naive mice for each group of immunized mice, un-immunized mice were sacrificed and the carcasses were placed in alcohol. Using sterile instruments, the top dermal layer of skin on the mouse's left side (lower mid-section) was cut through, exposing the peritoneum. The peritoneum was saturated with alcohol, and the spleen was aseptically extracted. The spleen was placed in a petri dish with serum-free media. Splenocytes were isolated by using sterile plungers from 3 ml syringes to mash the spleens. Cells were collected in a 50 ml conical tubes in serum-free media, rinsing dish well. Cells were centrifuged (12000 rpm, 7 min) and washed one time with RPMI. Fresh spleen cells were resuspended to a concentration of 1×106 cells per ml in RPMI-10% FCS (fetal calf serum). 25 g/ml lipopolysaccharide and 7 μg/ml Dextran Sulfate were added. Cell were incubated for 3 days in T-75 flasks at 37° C., with 5% CO2. Splenic blasts were collected in 50 ml tubes pelleted (12000 rpm, 7 min) and resuspended to 3×107/ml in RPMI. The blasts were pulsed with the priming peptide at 50 μg/ml, RT 4 hr. mitomycin C-treated at 25 μg/ml, 37° C., 20 min and washed three times with DMEM.

[0263]

C. In Vitro Stimulation.

[0264]

3 days after LPS stimulation of the blast cells and the same day as peptide loading, the primed mice were sacrificed (at 14 days post immunization) to remove spleens as above. 3×106 splenocytes were co-cultured with 1×106 LPS blasts/well in 24-well plates at 37° C., with 5% CO2 in DMEM media supplemented with 10% FCS, 5×10−5 M β-mercaptoethanol, 100 μg/ml streptomycin and 100 IU/ml penicillin. Cultures were fed 5% (vol/vol) ConA supernatant on day 3 and assayed for cytolytic activity on day 7 in a51Cr-release assay.

[0265]

D. Chromium-Release Assay Measuring CTL Activity.

[0266]

To assess peptide specific lysis, 2×106 T2 cells were incubated with 100 μCi sodium chromate together with 50 μg/ml peptide at 37° C. for 1 hour. During incubation they were gently shaken every 15 minutes. After labeling and loading, cells were washed three times with 10 ml of DMEM-10% FCS, wiping each tube with a fresh Kimwipe after pouring off the supernatant. Target cells were resuspended in DMEM-10% FBS 1×105/ml. Effector cells were adjusted to 107/ml in DMEM-10% FCS and 100 μl serial 3-fold dilutions of effectors were prepared in U-bottom 96-well plates. 100 μl of target cells were added per well. In order to determine spontaneous release and maximum release, six additional wells containing 100 μl of target cells were prepared for each target. Spontaneous release was revealed by incubating the target cells with 100 μl medium; maximum release was revealed by incubating the target cells with 100 μl of 2% SDS. Plates were then centrifuged for 5 min at 600 rpm and incubated for 4 hours at 37° C. in 5% CO2 and 80% humidity. After the incubation, plates were then centrifuged for 5 min at 1200 rpm. Supernatants were harvested and counted using a gamma counter. Specific lysis was determined as follows: % specific release=[(experimental release−spontaneous mum release−spontaneous release)]×100.

[0267]

Results of the chromium release assay demonstrating specific lysis of target cells are shown in FIG. 6.

[0000]

Cross-Reactivity with Other SSX Proteins:

[0268]

SSX-241-49 (SEQ ID NO. 15) shares a high degree of sequence identity with the same region of the other SSX proteins. The surrounding regions have also been generally well conserved. Thus the housekeeping proteasome can cleave following V49 in all five sequences. Moreover, SSX41-49 is predicted to bind HLA-A*0201 (see Table 6). CTL generated by immunization with SSX-241-49 cross-react with tumor cells expressing other SSX proteins.

SSX41-49 - A*0201 Predicted Binding
SEQ IDFamilySYFPEITHINIH
NO.MemberSequenceScoreScore
15SSX-2KASEKIFYV221017
26SSX-1KYSEKISYV181.7
27SSX-3KVSEKIVYV241105
28SSX-4KSSEKIVYV2082
29SSX-5KASEKIIYV22175

Example 4

Cluster Analysis (PSMA163-192)

[0269]

A peptide, AFSPQGMPEGDLVYVNYARTEDFFKLERDM, PSMA163-192, (SEQ ID NO. 30), containing an A1 epitope cluster from prostate specific membrane antigen, PSMA168-190 (SEQ ID NO. 31) was synthesized using standard solid-phase F-moc chemistry on a 433A ABI Peptide synthesizer. After side chain deprotection and cleavage from the resin, peptide first dissolved in formic acid and then diluted into 30% Acetic acid, was run on a reverse-phase preparative HPLC C4 column at following conditions: linear AB gradient (5% B/min) at a flow rate of 4 ml/min, where eluent A is 0.1% aqueous TFA and eluent B is 0.1% TFA in acetonitrile. A fraction at time 16.642 min containing the expected peptide, as judged by mass spectrometry, was pooled and lyophilized. The peptide was then subjected to proteasome digestion and mass spectrum analysis essentially as described above. Prominent peaks from the mass spectra are summarized in Table 7.

PSMA163-192 Mass Peak Identification.
CALCULATED
PEPTIDESEQUENCEMASS (MH+)
163-177AFSPQGMPEGDLVYV1610.0
178-189               NYARTEDFFKLE1533.68
170-189       PEGDLVYVNYARTEDFFKLE2406.66
178-191NYARTEDFFKLERD1804.95
170-191PEGDLVYVNYARTEDFFKLERD2677.93
178-192NYARTEDFFKLERDM1936.17
163-176AFSPQGMPEGDLVY1511.70
177-192VNYARTEDFFKLERDM2035.30
163-179AFSPQGMPEGDLVYVNY1888.12
180-192ARTEDFTKLERDM1658.89
163-183AFSPQGMPEGDLVYVNYARTE2345.61
184-192DFFKLERDM1201.40
176-192YVNYARTEDFTKLERDM2198.48
167-185    QGMPEGDLVYVNYARTEDF2205.41
178-186NYARTEDFF1163.22

Boldface sequences correspond to peptides predicted to bind to MHC, see Table 8.

N-terminal Pool Sequence Analysis

[0270]

One aliquot at one hour of the proteasomal digestion (see Example 3 part 3 above) was subjected to N-terminal amino acid sequence analysis by an ABI 473A Protein Sequencer (Applied Biosystems, Foster City, Calif.). Determination of the sites and efficiencies of cleavage was based on consideration of the sequence cycle, the repetitive yield of the protein sequencer, and the relative yields of amino acids unique in the analyzed sequence. That is if the unique (in the analyzed sequence) residue X appears only in the nth cycle a cleavage site exists n−1 residues before it in the N-terminal direction. In addition to helping resolve any ambiguity in the assignment of mass to sequences, these data also provide a more reliable indication of the relative yield of the various fragments than does mass spectrometry.

[0271]

For PSMA163-192 (SEQ ID NO. 30) this pool sequencing supports a single major cleavage site after V177 and several minor cleavage sites, particularly one after Y179. Reviewing the results presented in FIGS. 7A-C reveals the following:

    • S at the 3rd cycle indicating presence of the N-terminus of the substrate.
    • Q at the 5th cycle indicating presence of the N-terminus of the substrate.
    • N at the 1st cycle indicating cleavage after V177.
    • N at the 3rd cycle indicating cleavage after V175. Note the fragment 176-192 in Table 7.
    • T at the 5th cycle indicating cleavage after V177.
    • T at the 1st-3rd cycles, indicating increasingly common cleavages after R181, A180 and Y179. Only the last of these correspond to peaks detected by mass spectrometry; 163-179 and 180-192, see Table 7. The absence of the others can indicate that they are on fragments smaller than were examined in the mass spectrum.
    • K at the 4th, 8th, and 10th cycles indicating cleavages after E183, Y179, and V177, respectively, all of which correspond to fragments observed by mass spectroscopy. See Table 7.
    • A at the 1st and 3rd cycles indicating presence of the N-terminus of the substrate and cleavage after V177, respectively.
    • P at the 4th and 8th cycles indicating presence of the N-terminus of the substrate.
    • G at the 6th and 10th cycles indicating presence of the N-terminus of the substrate.
    • M at the 7th cycle indicating presence of the N-terminus of the substrate and/or cleavage after F185.
    • M at the 15th cycle indicating cleavage after V177.
    • The 1st cycle can indicate cleavage after D191, see Table 7.
    • R at the 4th and 13th cycle indicating cleavage after V177.
    • R at the 2nd and 11th cycle indicating cleavage after Y179.
    • V at the 2nd, 6th, and 13th cycle indicating cleavage after V175, M169 and presence of the N-terminus of the substrate, respectively. Note fragments beginning at 176 and 170 in Table 7.
    • Y at the 1st, 2nd, and 14th cycles indicating cleavage after V175, V177, and presence of the N-terminus of the substrate, respectively.
    • L at the 11th and 12th cycles indicating cleavage after V177, and presence of the N-terminus of the substrate, respectively, is the interpretation most consistent with the other data. Comparing to the mass spectrometry results we see that L at the 2nd, 5th, and 9th cycles is consistent with cleavage after F186, E183 or M169, and Y179, respectively. See Table 7.

Epitope Identification

[0290]

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by the SYFPEITHI or NIH algorithms were chosen for further analysis. The digestion and prediction steps of the procedure can be usefully practiced in any order. Although the substrate peptide used in proteasomal digest described here was specifically designed to include a predicted HLA-A1 binding sequence, the actual products of digestion can be checked after the fact for actual or predicted binding to other MHC molecules. Selected results are shown in Table 8.

Predicted HLA binding by proteasomally
generated fragments
SEQ ID NOPEPTIDEHLASYFPEITHINIH
32 & (33)(G) MPEGDLVYVA*020117 (27)(2605)
B*070220<5
B*510122314
34 & (35)(Q) GMPEGDLVYA124 (26)<5
A316 (18)36
B*27051725
36MPEGDLVYB*510115NP†
37 & (38)(P) EGDLVYVNYA127 (15)12
A2623 (17)NP
39LVYVNYARTEA321<5
40 & (41)(Y) VNYARTEDFA26(20)NP
B*0815<5
B*27051250
42NYARTEDFFA24NP†100
Cw*0401NP120
43YARTEDFFB*0816<5
44RTEDFFKLEA121<5
A2615NP

†No prediction

HLA-A*0201 Binding Assay:

[0291]

HLA-A*0201 binding studies were preformed with PSMA168-177, GMPEGDLVYV, (SEQ ID NO. 33) essentially as described in Example 3 above. As seen in FIG. 8, this epitope exhibits significant binding at even lower concentrations than the positive control peptides. The Melan-A peptide used as a control in this assay (and throughout this disclosure), ELAGIGILTV, is actually a variant of the natural sequence (EAAGIGILTV) and exhibits a high affinity in this assay.

Example 5

Cluster Analysis (PSMA281-310)

[0292]

Another peptide, RGIAEAVGLPSIPVHPIGYYDAQKLLEKMG, PSMA281-310, (SEQ ID NO. 45), containing an A1 epitope cluster from prostate specific membrane antigen, PSMA283-307 (SEQ ID NO. 46), was synthesized using standard solid-phase F-moc chemistry on a 433A ABI Peptide synthesizer. After side chain deprotection and cleavage from the resin, peptide in ddH2O was run on a reverse-phase preparative HPLC C18 column at following conditions: linear AB gradient (5% B/min) at a flow rate of 4 ml/min, where eluent A is 0.1% aqueous TFA and eluent B is 0.1% TFA in acetonitrile. A fraction at time 17.061 min containing the expected peptide as judged by mass spectrometry, was pooled and lyophilized. The peptide was then subjected to proteasome digestion and mass spectrum analysis essentially as described above. Prominent peaks from the mass spectra are summarized in Table 9.

PSMA281-310 Mass Peak Identification.
CALCULATED
PEPTIDESEQUENCEMASS (MH+)
281-297RGIAEAVGLPSIPVHPI*1727.07
286-297     AVGLPSIPVHPI**1200.46
287-297      VGLPSIPVHPI1129.38
288-297GLPSIPVHPI1030.25
298-310GYYDAQKLLEKMG‡1516.5
298-305                 GYYDAQKL958.05
281-305RGIAEAVGLPSIPVHPIGYYDAQKL2666.12
281-307RGIAEAVGLPSIPVHPIGYYDAQKLLE2908.39
286-307     AVGLPSIPVHPIGYYDAQKLLE¶2381.78
287-307      VGLPSIPVHPTGYYDAQKLLE2310.70
288-307       GLPSIPVHPIGYYDAQKLLE#2211.57
281-299RGIAEAVGLPSIPVHPIGY1947
286-299     AVGLPSIPVHPIGY1420.69
287-299      VGLPSIPVHPIGY1349.61
288-299       GLPSIPVHPIGY1250.48
287-310VGLPSIPVHPIGYYDAQKLLEKMG2627.14
288-310GLPSTPVHPIGYYDAQKLLEKMG2528.01

Boldface sequences correspond to peptides predicted to bind to MHC, see Table 10.

*By mass alone this peak could also have been 296-310 or 288-303.

**By mass alone this peak could also have been 298-307. Combination of HPLC and mass spectrometry show that at some later time points this peak is a mixture of both species.

†By mass alone this peak could also have been 289-298.

≠ By mass alone this peak could also have been 281-295 or 294-306.

§By mass alone this peak could also have been 297-303.

¶By mass alone this peak could also have been 285-306.

#By mass alone this peak could also have been 288-303.

N-terminal Pool Sequence Analysis

[0293]

One aliquot at one hour of the proteasomal digestion (see Example 3 part 3 above) was subjected to N-terminal amino acid sequence analysis by an ABI 473A Protein Sequencer (Applied Biosystems, Foster City, Calif.). Determination of the sites and efficiencies of cleavage was based on consideration of the sequence cycle, the repetitive yield of the protein sequencer, and the relative yields of amino acids unique in the analyzed sequence. That is if the unique (in the analyzed sequence) residue X appears only in the nth cycle a cleavage site exists n−1 residues before it in the N-terminal direction. In addition to helping resolve any ambiguity in the assignment of mass to sequences, these data also provide a more reliable indication of the relative yield of the various fragments than does mass spectrometry.

[0294]

For PSMA281-310 (SEQ ID NO. 45) this pool sequencing supports two major cleavage sites after V287 and I297 among other minor cleavage sites. Reviewing the results presented in FIG. 9 reveals the following:

    • S at the 4th and 11th cycles indicating cleavage after V287 and presence of the N-terminus of the substrate, respectively.
    • H at the 8th cycle indicating cleavage after V287. The lack of decay in peak height at positions 9 and 10 versus the drop in height present going from 10 to 11 can suggest cleavage after A286 and E285 as well, rather than the peaks representing latency in the sequencing reaction.
    • D at the 2nd, 4th, and 7th cycles indicating cleavages after Y299, I297, and V294, respectively. This last cleavage is not observed in any of the fragments in Table 10 or in the alternate assignments in the notes below.
    • Q at the 6th cycle indicating cleavage after I297.
    • M at the 10th and 12th cycle indicating cleavages after Y299 and I297, respectively.

Epitope Identification

[0300]

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction steps of the procedure can be usefully practiced in any order. Although the substrate peptide used in proteasomal digest described here was specifically designed to include a predicted HLA-A1 binding sequence, the actual products of digestion can be checked after the fact for actual or predicted binding to other MHC molecules. Selected results are shown in Table 10.

Predicted HLA binding by proteasomally
generated fragments: PSMA281-310
SEQ ID NO.PEPTIDEHLASYFPEITHINIH
47 & (48)(G) LPSIPVHPTA*020116 (24)(24)
B*0702/B72312
B*510124572
Cw*0401NP†20
49 & (50)(P) IGYYDAQKLA*0201(16)<5
A26(20)NP
B*27051625
B*270915NP
B*51012157
Cw*0301NP24
51 & (52)(P) SIPVHPIGYA121 (27)<5
A2622NP
A316<5
53TPVHPTGYB*510116NP
54YYDAQKLLEA122<5

†No prediction

[0301]

As seen in Table 10, N-terminal addition of authentic sequence to epitopes can often generate still useful, even better epitopes, for the same or different MHC restriction elements. Note for example the pairing of (G)LPSIPVHPI with HLA-A*0201, where the 10-mer can be used as a vaccine useful with several MHC types by relying on N-terminal trimming to create the epitopes for HLA-B7, -B*5101, and Cw*0401.

[0000]

HLA-A*0201 Binding Assay:

[0302]

HLA-A*0201 binding studies were preformed with PSMA288-297, GLPSIPVHPI, (SEQ ID NO. 48) essentially as described in Examples 3 and 4 above. As seen in FIG. 8, this epitope exhibits significant binding at even lower concentrations than the positive control peptides.

Example 6

Cluster Analysis (PSMA454-481)

[0303]

Another peptide, SSIEGNYTLRVDCTPLMYSLVHLTKEL, PSMA454-481, (SEQ ID NO. 55) containing an epitope cluster from prostate specific membrane antigen, was synthesized by MPS (purity >95%) and subjected to proteasome digestion and mass spectrum analysis as described above. Prominent peaks from the mass spectra are summarized in Table 11.

PSMA454-481 Mass Peak Identification.
MS PEAKCALCULATED
(measured)PEPTIDESEQUENCEMASS (MH+)
1238.5454-464SSIEGNYTLRV1239.78
1768.38 ± 0.60454-469SSIEGNYTLRVDCTPL1768.99
1899.8454-470SSIEGNYTLRVDCTPLM1900.19
1097.63 ± 0.91463-471RVDCTPLMY1098.32
2062.87 ± 0.68454-471*SSIEGNYTLRVDCTPLMY2063.36
1153472-481**                 SLVHNLTKEL1154.36
1449.93 ± 1.79470-481               MYSLVHNLTKEL1448.73

Boldface sequence correspond to peptides predicted to bind to MHC, see Table 12.

*On the basis of mass alone this peak could equally well be assigned to the peptide 455-472 however proteasomal removal of just the N-terminal amino acid is considered unlikely. If the issue were important it could be resolved by N-terminal sequencing.

**On the basis of mass this fragment might also represent 455-464.

Epitope Identification

[0304]

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction steps of the procedure can be usefully practiced in any order. Although the substrate peptide used in proteasomal digest described here was specifically designed to include predicted HLA-A2.1 binding sequences, the actual products of digestion can be checked after the fact for actual or predicted binding to other MHC molecules. Selected results are shown in Table 12.

Predicted HLA binding by proteasomally
generated fragments
SEQ ID NOPEPTIDEHLASYFPEITHINIH
56 & (57)(S) IEGNYTLRVA1(19)<5
58EGNYTLRV A*020116 (22)<5
B*510115NP†
59 & (60)(Y) TLRVDCTPLA*020120 (18)(5)
A2616 (18)NP
B71440
B823<5
B*27051230
Cw*0301NP(30)
61LRVDCTPLMB*270520600
B*270920NP
62 & (63)(L) RVDCTPLMYA132 (22)125
(13.5)
A325<5
A2622NP
B*2702NP(200)
B*270513 (NP)(1000)

†No prediction

[0305]

As seen in Table 12, N-terminal addition of authentic sequence to epitopes can often generate still useful, even better epitopes, for the same or different MHC restriction elements. Note for example the pairing of (L)RVDCTPLMY (SEQ ID NOS 62 and (63)) with HLA-B*2702/5, where the 10-mer has substantial predicted halftimes of dissociation and the co-C-terminal 9-mer does not. Also note the case of SIEGNYTLRV (SEQ ID NO 57) a predicted HLA-A*0201 epitope which can be used as a vaccine useful with HLA-B*5101 by relying on N-terminal trimming to create the epitope.

[0000]

HLA-A*0201 Binding Assay

[0306]

HLA-A*0201 binding studies were preformed, essentially as described in Example 3 above, with PSMA460-469, TLRVDCTPL, (SEQ ID NO. 60). As seen in FIG. 10, this epitope was found to bind HLA-A2.1 to a similar extent as the known A2.1 binder FLPSDYFPSV (HBV18-27; SEQ ID NO: 24) used as a positive control. Additionally, PSMA461-469, (SEQ ID NO. 59) binds nearly as well.

[0000]

ELISPOT Analysis: PSMA463-471 (SEQ ID NO. 62)

[0307]

The wells of a nitrocellulose-backed microtiter plate were coated with capture antibody by incubating overnight at 4° C. using 50 μl/well of 4 μg/ml murine anti-human γ-IFN monoclonal antibody in coating buffer (35 mM sodium bicarbonate, 15 mM sodium carbonate, pH 9.5). Unbound antibody was removed by washing 4 times 5 min. with PBS. Unbound sites on the membrane then were blocked by adding 200 μl/well of RPMI medium with 10% serum and incubating 1 hr. at room temperature. Antigen stimulated CD8+ T cells, in 1:3 serial dilutions, were seeded into the wells of the microtiter plate using 100 μl/well, starting at 2×105 cells/well. (Prior antigen stimulation was essentially as described in Scheibenbogen, C. et al. Int. J. Cancer 71:932-936, 1997. PSMA462-471 (SEQ ID NO. 62) was added to a final concentration of 10 μg/ml and IL-2 to 100 U/ml and the cells cultured at 37° C. in a 5% CO2, water-saturated atmosphere for 40 hrs. Following this incubation the plates were washed with 6 times 200 μl/well of PBS containing 0.05% Tween-20 (PBS-Tween). Detection antibody, 50 μl/well of 2 g/ml biotinylated murine anti-human γ-IFN monoclonal antibody in PBS+10% fetal calf serum, was added and the plate incubated at room temperature for 2 hrs. Unbound detection antibody was removed by washing with 4 times 200 μl of PBS-Tween. 100 μl of avidin-conjugated horseradish peroxidase (Pharmingen, San Diego, Calif.) was added to each well and incubated at room temperature for 1 hr. Unbound enzyme was removed by washing with 6 times 200 μl of PBS-Tween. Substrate was prepared by dissolving a 20 mg tablet of 3-amino 9-ethylcoarbasole in 2.5 ml of N,N-dimethylformamide and adding that solution to 47.5 ml of 0.05 M phosphate-citrate buffer (pH 5.0). 25 μl of 30% H2O2 was added to the substrate solution immediately before distributing substrate at 100 μl/well and incubating the plate at room temperature. After color development (generally 15-30 min.), the reaction was stopped by washing the plate with water. The plate was air dried and the spots counted using a stereomicroscope.

[0308]

FIG. 11 shows the detection of PSMA463-471 (SEQ ID NO. 62)-reactive HLA-A1+ CD8+ T cells previously generated in cultures of HLA-A1+ CD8+ T cells with autologous dendritic cells plus the peptide. No reactivity is detected from cultures without peptide (data not shown). In this case it can be seen that the peptide reactive T cells are present in the culture at a frequency between 1 in 2.2×104 and 1 in 6.7×104. That this is truly an HLA-A1-restricted response is demonstrated by the ability of anti-HLA-A1 monoclonal antibody to block γ-IFN production; see FIG. 12.

Example 7

Cluster Analysis (PSMA653-687)

[0309]

Another peptide, FDKSNPIVLRMMNDQLMFLERAFIDPLGLPDRPFY PSMA653-687, (SEQ ID NO. 64) containing an A2 epitope cluster from prostate specific membrane antigen, PSMA660-681 (SEQ ID NO 65), was synthesized by MPS (purity >95%) and subjected to proteasome digestion and mass spectrum analysis as described above. Prominent peaks from the mass spectra are summarized in Table 13.

PSMA653-687 Mass Peak Identification.
MS PEAKCALCULATED
measuredPEPTIDESEQUENCEMASS (MH+)
 906.17 ± 0.65681-687**LPDRPFY908.05
1287.73 ± 0.76677-687**DPLGLPDRPFY1290.47
 1400.3 ± 1.79676-687IDPLGLPDRPFY1403.63
 1548.0 ± 1.37675-687FIDPLGLPDRPFY1550.80
 1619.5 ± 1.51674-687**AFIDPLGLPDRPFY1621.88
1775.48 ± 1.32673-687*RAFIDPLGLPDRPFY1778.07
2440.2 ± 1.3653-672FDKSNPIVLRMMNDQLMFLE2442.932313.82
1904.63 ± 1.56672-687*ERAFIDPLGLPDRPFY1907.19
2310.6 ± 2.5653-671FDKSNPIVLRMMNDQLMFL2313.82
 2017.4 ± 1.94671-687LERAFIDPLGLPDRPFY2020.35
2197.43 ± 1.78653-670FDKSNPIVLRMMNDQLMF2200.66

Boldface sequence correspond to peptides predicted to bind to MHC, see Table 13.

*On the basis of mass alone this peak could equally well be assigned to a peptide beginning at 654, however proteasomal removal of just the N-terminal amino acid is considered unlikely. If the issue were important it could be resolved by N-terminal sequencing.

**On the basis of mass alone these peaks could have been assigned to internal fragments, but given the overall pattern of digestion it was considered unlikely.

Epitope Identification

[0310]

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction steps of the procedure can be usefully practiced in any order. Although the substrate peptide used in proteasomal digest described here was specifically designed to include predicted HLA-A2.1 binding sequences, the actual products of digestion can. be checked after the fact for actual or predicted binding to other MHC molecules. Selected results are shown in Table 14.

Predicted HLA binding by proteasomally
generated fragments
SEQ ID NOPEPTIDEHLASYFPEITHINIH
66 & (67)(R) MMNDQLMFLA*020124 (23)1360 (722)
A*0205NP†71 (42)
A2615NP
B*27051250
68RMMNDQLMFB*27051775

†No prediction

[0311]

As seen in Table 14, N-terminal addition of authentic sequence to epitopes can generate still useful, even better epitopes, for the same or different MHC restriction elements. Note for example the pairing of (R)MMNDQLMFL (SEQ ID NOS. 66 and (67)) with HLA-A*02, where the 10-mer retains substantial predicted binding potential.

[0000]

HLA-A*0201 Binding Assay

[0312]

HLA-A*0201 binding studies were preformed, essentially as described in Example 3 above, with PSMA663-671, (SEQ ID NO. 66) and PSMA662-671, RMMNDQLMFL (SEQ NO. 67). As seen in FIGS. 10, 13 and 14, this epitope exhibits significant binding at even lower concentrations than the positive control peptide (FLPSDYFPSV (HBV18-27); SEQ ID NO: 24). Though not run in parallel, comparison to the controls suggests that PSMA662-671 (which approaches the Melan A peptide in affinity) has the superior binding activity of these two PSMA peptides.

Example 8

Vaccinating with Epitope Vaccines

[0000]

1. Vaccination with Peptide Vaccines:

[0313]

A. Intranodal Delivery

[0314]

A formulation containing peptide in aqueous buffer with an antimicrobial agent, an antioxidant, and an immunomodulating cytokine, was injected continuously over several days into the inguinal lymph node using a miniature pumping system developed for insulin delivery (MiniMed; Northridge, Calif.). This infusion cycle was selected in order to mimic the kinetics of antigen presentation during a natural infection.

[0315]

B. Controlled Release

[0316]

A peptide formulation is delivered using controlled PLGA microspheres as is known in the art, which alter the pharmacokinetics of the peptide and improve immunogenicity. This formulation is injected or taken orally.

[0317]

C. Gene Gun Delivery

[0318]

A peptide formulation is prepared wherein the peptide is adhered to gold microparticles as is known in the art. The particles are delivered in a gene gun, being accelerated at high speed so as to penetrate the skin, carrying the particles into dermal tissues that contain pAPCs.

[0319]

D. Aerosol Delivery

[0320]

A peptide formulation is inhaled as an aerosol as is known in the art, for uptake into appropriate vascular or lymphatic tissue in the lungs.

[0000]

2. Vaccination with Nucleic Acid Vaccines:

[0321]

A nucleic acid vaccine is injected into a lymph node using a miniature pumping system, such as the MiniMed insulin pump. A nucleic acid construct formulated in an aqueous buffered solution containing an antimicrobial agent, an antioxidant, and an immunomodulating cytokine, is delivered over a several day infusion cycle in order to mimic the kinetics of antigen presentation during a natural infection.

[0322]

Optionally, the nucleic acid construct is delivered using controlled release substances, such as PLGA microspheres or other biodegradable substances. These substances are injected or taken orally. Nucleic acid vaccines are given using oral delivery, priming the immune response through uptake into GALT tissues. Alternatively, the nucleic acid vaccines are delivered using a gene gun, wherein the nucleic acid vaccine is adhered to minute gold particles. Nucleic acid constructs can also be inhaled as an aerosol, for uptake into appropriate vascular or lymphatic tissue in the lungs.

Example 9

Assays for the Effectiveness of Epitope Vaccines.

[0000]

1. Tetramer Analysis:

[0323]

Class I tetramer analysis is used to determine T cell frequency in an animal before and after administration of a housekeeping epitope. Clonal expansion of T cells in response to an epitope indicates that the epitope is presented to T cells by pAPCs. The specific T cell frequency is measured against the housekeeping epitope before and after administration of the epitope to an animal, to determine if the epitope is present on pAPCs. An increase in frequency of T cells specific to the epitope after administration indicates that the epitope was presented on pAPC.

[0000]

2. Proliferation Assay:

[0324]

Approximately 24 hours after vaccination of an animal with housekeeping epitope, pAPCs are harvested from PBMCs, splenocytes, or lymph node cells, using monoclonal antibodies against specific markers present on pAPCs, fixed to magnetic beads for affinity purification. Crude blood or splenoctye preparation is enriched for pAPCs using this technique. The enriched pAPCs are then used in a proliferation assay against a T cell clone that has been generated and is specific for the housekeeping epitope of interest. The pAPCs are coincubated with the T cell clone and the T cells are monitored for proliferation activity by measuring the incorporation of radiolabeled thymidine by T cells. Proliferation indicates that T cells specific for the housekeeping epitope are being stimulated by that epitope on the pAPCs.

[0000]

3. Chromium Release Assay:

[0325]

A human patient, or non-human animal genetically engineered to express human class I MHC, is immunized using a housekeeping epitope. T cells from the immunized subject are used in a standard chromium release assay using human tumor targets or targets engineered to express the same class I MHC. T cell killing of the targets indicates that stimulation of T cells in a patient would be effective at killing a tumor expressing a similar TuAA.

Example 10

Induction of CTL Response with Naked DNA is Efficient by Intra-Lymph Node Immunization

[0326]

In order to quantitatively compare the CD8+ CTL responses induced by different routes of immunization a plasmid DNA vaccine (pEGFPL33A) containing a well-characterized immunodominant CTL epitope from the LCMV-glycoprotein (G) (gp33; amino acids 33-41) (Oehen, S., et al. Immunology 99, 163-169 2000) was used, as this system allows a comprehensive assessment of antiviral CTL responses. Groups of 2 C57BL/6 mice were immunized once with titrated doses (200-0.02 μg) of pEGFPL33A DNA or of control plasmid pEGFP-N3, administered i.m. (intramuscular), i.d. (intradermal), i.spl. (intrasplenic), or i.ln. (intra-lymph node). Positive control mice received 500 pfu LCMV i.v. (intravenous). Ten days after immunization spleen cells were isolated and gp33-specific CTL activity was determined after secondary in vitro restimulation. As shown in FIG. 15, i.m. or i.d. immunization induced weakly detectable CTL responses when high doses of pEFGPL33A DNA (200 μg) were administered. In contrast, potent gp33-specific CTL responses were elicited by immunization with only 2 μg pEFGPL33A DNA i.spl. and with as little as 0.2 μg pEFGPL33A DNA given i.ln. (FIG. 15; symbols represent individual mice and one of three similar experiments is shown). Immunization with the control pEGFP-N3 DNA did not elicit any detectable gp33-specific CTL responses (data not shown).

Example 11

Intra-Lymph Node DNA Immunization Elicits Anti-Tumor Immunity

[0327]

To examine whether the potent CTL responses elicited following i.ln. immunization were able to confer protection against peripheral tumors, groups of 6 C57BL/6mice were immunized three times at 6-day intervals with 10 μg of pEFGPL33A DNA or control pEGFP-N3 DNA. Five days after the last immunization small pieces of solid tumors expressing the gp33 epitope (EL4-33) were transplanted s.c. into both flanks and tumor growth was measured every 3-4 d. Although the EL4-33 tumors grew well in mice that had been repetitively immunized with control pEGFP-N3 DNA (FIG. 16), mice which were immunized with pEFGPL33A DNA i.ln. rapidly eradicated the peripheral EL4-33 tumors (FIG. 16).

Example 12

Differences in Lymph Node DNA Content Mirrors Differences in CTL Response Following Intra-Lymph Node and Intramuscular Injection

[0328]

pEFGPL33A DNA was injected i.ln. or i.m. and plasmid content of the injected or draining lymph node was assessed by real time PCR after 6, 12, 24, 48 hours, and 4 and 30 days. At 6, 12, and 24 hours the plasmid DNA content of the injected lymph nodes was approximately three orders of magnitude greater than that of the draining lymph nodes following i.m. injection. No plasmid DNA was detectable in the draining lymph node at subsequent time points (FIG. 17). This is consonant with the three orders of magnitude greater dose needed using i.m. as compared to i.ln. injections to achieve a similar levels of CTL activity. CD8−/− knockout mice, which do not develop a CTL response to this epitope, were also injected i.ln. showing clearance of DNA from the lymph node is not due to CD8+ CTL killing of cells in the lymph node. This observation also supports the conclusion that i.ln. administration will not provoke immunopathological damage to the lymph node.

Example 13

Administration of a DNA Plasmid Formulation of a Therapeutic Vaccine for Melanoma to Humans

[0329]

SYNCHROTOPE TA2M, a melanoma vaccine, encoding the HLA-A2-restricted tyrosinase epitope SEQ ID NO. 1 and epitope cluster SEQ ID NO. 69, was formulated in 1% Benzyl alcohol, 1% ethyl alcohol, 0.5 mM EDTA, citrate-phosphate, pH 7.6. Aliquots of 80, 160, and 320 μg DNA/ml were prepared for loading into MINIMED 407C infusion pumps. The catheter of a SILHOUETTE infusion set was placed into an inguinal lymph node visualized by ultrasound imaging. The assembly of pump and infusion set was originally designed for the delivery of insulin to diabetics and the usual 17mm catheter was substituted with a 31 mm catheter for this application. The infusion set was kept patent for 4 days (approximately 96 hours) with an infusion rate of about 25 μl/hour resulting in a total infused volume of approximately 2.4 ml. Thus the total administered dose per infusion was approximately 200, and 400 μg; and can be 800 μg, respectively, for the three concentrations described above. Following an infusion subjects were given a 10 day rest period before starting a subsequent infusion. Given the continued residency of plasmid DNA in the lymph node after administration (as in example 12) and the usual kinetics of CTL response following disappearance of antigen, this schedule will be sufficient to maintain the immunologic CTL response.

Example 14

Additional Epitopes

[0330]

The methodologies described above, and in particular in examples 3-7, have been applied to additional synthetic peptide substrates, leading to the identification of further epitopes as set for the in tables 15-36 below. The substrates used here were designed to identify products of housekeeping proteasomal processing that give rise to HLA-A*0201 binding epitopes, but additional MHC-binding reactivities can be predicted, as discussed above. Many such reactivities are disclosed, however, these listings are meant to be exemplary, not exhaustive or limiting. As also discussed above, individual components of the analyses can be used in varying combinations and orders. The digests of the NY-ESO-1 substrates 136-163 and 150-177 (SEQ ID NOS. 254 and 255, respectively) yielded fragments that did not fly well in MALDI-TOF mass spectrometry. However, they were quite amenable to N-terminal peptide pool sequencing, thereby allowing identification of cleavage sites. Not all of the substrates necessarily meet the formal definition of an epitope cluster as referenced in example 3. Some clusters are so large, e.g. NY-ESO-186-171, that it was more convenient to use substrates spanning only a portion of this cluster. In other cases, substrates were extended beyond clusters meeting the formal definition to include neighboring predicted epitopes. In some instances, actual binding activity may have dictated what substrate was made, as with for example the MAGE epitopes reported here, where HLA binding activity was determined for a selection of peptides with predicted affinity, before synthetic substrates were designed.

GP100: Preferred Epitopes Revealed by Housekeeping Proteasome Digestion
HLA Binding
SEQPredictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceID NOA*0201A1A3B7B8Comments
609-644630-638*LPHSSSHWL8820/8016/<5*The digestion of
629-638*QLPHSSSHWL8921/117609-644 and 622-
614-622LIYRRRLMK9032/20650 have
613-622SLIYRRRLMK9114/<529/60generated the
615-622IYRRRLMK9215/<5same epitopes.
622-650630-638*LPHSSSHWL9320/8016/<5
629-638*QLPHSSSHWL9421/117

†Scores are given from the two binding prediction programs referenced above (see example 3).

[0331]

MAGE-1: Preferred Epitopes Revealed by Housekeeping Proteasome Digestion
SEQ IDHLA Binding Predictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceNOA*0201A1A3B7B8Other
 86-109 95-102ESLFRAVI9516/<5
 93-102ILESLFRAVI9621/<520/<5
 93-101ILESLFRAV9723/<5
 92-101CILESLFRAV9823/55
 92-100CILESLFRA9920/138
263-292263-271EFLWGPRAL100A26 (R 21),
A24 (NIH 30)
264-271FLWGPRAL10117/<5
264-273FLWGPRALAE10216/<519/<5
265-274LWGPRALAET10316/<5
268-276PRALAETSY10415/<5
267-276GPRALAETSY10515/<5<15/<5B4403 (NIH 7);
B3501 (NIH 120)
269-277RALAETSYV10618/20
271-279LAETSYVKV10719/<5
270-279ALAETSYVKV10830/42719/<5<5
272-280AETSYVKVL10915/<5B4403 (NIH 36)
271-280LAETSYVKVL11018/<5<15/<5
274-282TSYVKVLEY11126/<5B4403 (NIH 14)
273-282ETSYVKVLEY11228/6A26 (R 31),
B4403 (NIH 14)
278-286KVLEYVIKV11326/74316/<5

[0332]

MAGE-1: Preferred Epitopes Revealed by Housekeeping Proteasome Digestion
SEQ IDHLA Binding Predictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceNOA*0201A1A3B7B8Other
168-193168-177SYVLVTCLGL114A24 (NIH 300)
169-177YVLVTCLGL11520/3215/<5<15/20
170-177VLVTCLGL11617/<5
229-258240-248TQDLVQEKY11729/<5
239-248LTQDLVQEKY11823/<5A26 (R 22)
232-240YGEPRKLLT11924/11
243-251LVQEKYLEY12021/<521/<5A26 (R 28)
242-251DLVEKYLEY12122/<519/<5A26 (R 30)
230-238SAYGEPRKL12221/<5B5101 (25/121)
272-297278-286KVLEYVIIKV12326/74316/<5
277-286VKVLEYVIKV12417/<5
276-284YVKVLEYVI12515/<515/<517/<5
274-282TSYVKVLEY12626/<5
273-282ETSYVKVLEY12728/6
283-291VIKVSARVR12820/<5
282-291YVIKVSARVR12924/<5

†Scores are given from the two binding prediction programs referenced above (see example 3). R indicates a SYFPEITHI score.

[0333]

MAGE-2: Preferred Epitopes Revealed by Housekeeping Proteasome Digestion
SEQ IDHLA Binding Predictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceNOA*0201A1A3B7B8Other
107-126115-122ELVHFLLL13018/<5
113-122MVELVHFLLL13121/<5A26 (R 22)
109-116ISRKMVEL13217/<5
108-116AISRKMVEL13325/719/<516/1226/<5
107-116AAISRKMVEL13422/<514/36n.p./16
112-120KMVELVHFL13527/2800
109-117ISRKMVELV13616/<5
108-117AISRKMVELV13724/11
116-124LVHFLLLKY13823/<519/<5A26 (R 26)
115-124ELVHFLLLKY13924/<519/5A26 (R 29)
111-119RKMVELVHF140
145-175158-166LQLVFGIEV14117/168
157-166YLQLVFGJEV14224/1215
159-167QLVFGTEVV14325/3218/<5
158-167LQLVFGTEVV14418/20
164-172IEVVEVVPI14516/<5
163-172GIEVVEVVPI14622/<5
162-170FGIEVVEVV14719/<5B5101 (24/69.212)
154-162ASEYLQLVF14822/68
153-162KASEYLQLVF14915/<5

†Scores are given from the two binding prediction programs referenced above (see example 3). R indicates a SYFPEITHI score.

[0334]

MAGE-2: Preferred Epitopes Revealed by Housekeeping Proteasome Digestion
HLA Binding Predictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceA*0201A1A3B7B8Other
213-233218-225EEKIWEEL15022/<5
216-225APEEKIWEEL15115/<522/72
216-223APEEKIWE15218/<5
220-228KIWEELSML15326/80416/<516/<5A26 (R 26)
219-228EKIWEELSML154A26 (R 22)
271-291271-278FLWGPRAL15517/<5
271-279FLWGPRALI15625/39816/7
278-286LIETSYVKV15723/<5
277-286ALIETSYVKV15830/42721/<5
276-284RALIETSYV15918/19B5101 (20/55)
279-287IETSYVKVL16015/<5
278-287LIETSYVKVL16122/<5A26 (R 22)

†Scores are given from the two binding prediction programs referenced above (see example 3). R indicates a SYFPEITHI score.

[0335]

MAGE-3: Preferred Epitopes Revealed by Housekeeping Proteasome Digestion
SEQ IDHLA Binding Predictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceNOA*0201A1A3B7B8Other
267-286271-278FLWGPRAL16217/<5
270-278EFLWGPRAL163A26 (R 21);
A24 (NIH 30)
271-279FLWGPRALV16427/265516/<5
276-284RALVETSYV16518/19B5101 20/55
272-280LWGPRALVE16615/<5
271-280FLWGPRALVE16715/<522/<5
272-281LWGPRALVET16816/<5

†Scores are given from the two binding prediction programs referenced above (see example 3). R indicates a SYFPEITHI score.

[0336]

NY-ESO-1: Preferred Epitopes Revealed by Housekeeping Proteasome Digestion
SEQ IDHLA Binding Predictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceNOA*0201A1A3B7B8Other
 81-11382-90GPESRLLEF169 16/1118/<522/<5
83-91PESRLLEFY170 15/<5B4403 (NIH 18)
82-91GPESRLLEFY171 25/11
84-92ESRLLEFYL17219/8
86-94RLLEFYLAM17321/43021/<5
88-96LEFYLAMPF174B4403 (NIH 60)
87-96LLEFYLAMPF175<15/4518/<5
 93-102AMPFATPMEA17615/<5
 94-102MPFATPMEA17717/<5
101-133115-123PLPVPGVLL17820/<517/<516/<518/<5
114-123PPLPVPGVLL17923/12
 116-123*LPVPGVLL18016/<5Comment
103-112ELARRSLAQD18115/<520/<5*Evidence of the
118-126*VPGVLLKEF18217/<516/<5same epitope
 117-126*PVPGVLLKEF18316/<5obtained from
116-145 116-123*LPVPGVLL18416/<5two digests.
127-135TVSGNILTI18521/<519/<5
126-135FTVSGNILLTI18620/<5
120-128GVLLKEFTV18720/13018/<5
121-130VLLKEFTVSG18817/<518/<5
122-130LLKEFTVSG18920/<518/<5
 118-126*VPGVLLKEF19017/<516/<5
 117-126*PVPGVLLKIEF19116/<5

\Scores are given from the two binding prediction programs referenced above (see example 3).

[0337]

NY-ESO-1: Preferred Epitopes Revealed by
Housekeeping Proteasome Digestion
SEQ IDHLA Binding Predictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceNOA*0201A1A3B7B8Other
136-163139-147AADHRQLQL192 17/<517/<522/<5
(SEQ ID148-156SISSCLQQL193 24/7A26 (R 25)
NO 254)147-156LSISSCLQQL194 18/<5
138-147TAADHRQLQL195 18/<5
150-177161-169WITQCFLPV196 18/84
(SEQ ID157-165SLLMWITQC197 18/4217/<5
NO 255)150-158SSCLQQLSL198 15/<5
154-162QQLSLLMWI199 15/50
151-159SCLQQLSLL200 18/<5
150-159SSCLQQLSLL201 16/<5
163-171TQCFLPVFL202<15/12
162-171ITQCFLPVFL203 18/<5A26 (R 19)

†Scores are given from the two binding prediction programs referenced above (see example 3). R indicates a SYFPEITHI score

[0338]

PRAME: Preferred Epitopes Revealed by Housekeeping Proteasome Digestion
SEQ IDHLA Binding Predictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceNOA*0201A1A3B7B8Other
211-245219-227PMQDIKMIL20416/<516/n.d.A26 (R 20)
218-227MPMQDIKMTL205<15/240
411-446428-436QHLJGLSNL20618/<5
427-436LQHLIGLSNL20716/8
429-436HLIGLSNL20817/<5B15 (R 21)
431-439IGLSNLTHV20918/7B*5101 (R 22)
430-439LIGLSNLTHV21024/37

†Scores are given from the two binding prediction programs referenced above (see example 3). R indicates a SYFPEITHI score.

[0339]

PSA: Preferred Epitopes Revealed by Housekeeping Proteasome Digestion
SEQ IDHLA Binding Predictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceNOA*0201A1A3B7B8Other
42-7753-61VLVHPQWVL21122/112<15/6 17/<5
52-61GVLVHPQWVL21217/2116/<5<15/30A26 (R 18)
52-60GVLVHPQWV21317/124
59-67WVLTAAHCI21415/16
54-63LVHIPQWVLTA21519/<520/<5A26 (R 16)
53-62VLVHPQWVLT21617/22
54-62LVHPQWVLT21717/n.d.
55-9566-73CIIRNKSVI218 26/20
65-73HCIRNKSVI219<15/16
56-64HLPQWVLTAA220 18/<5
63-72AAHCIRNKSV22117/<5

†Scores are given from the two binding prediction programs referenced above (see example 3). R indicates a SYFPEITHI score.

[0340]

PSCA: Preferred Epitopes Revealed by Housekeeping Proteasome Digestion
SEQ IDHLA Binding Predictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceNOA*0201A1A3B7B8Other
93-123*116-123LLWGPGQL22216/<5
115-123LLLWGPGQL223<15/18
114-123GLLLWGPGQL224<15/10
 99-107ALQPAAAIL225 26/922/<5<15/1216/<5A26 (R 19)
 98-107HALQPAAAIL226 18/<5<15/12

*L123 is the C-terminus of the natural protein.

†Scores are given from the two binding prediction programs referenced above (see example 3).

[0341]

Tyrosinase: Preferred Epitopes Revealed by Housekeeping Proteasome Digestion
SEQ IDHLA Binding Predictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceNOA*0201A1A3B7B8Other
128-157128-137APEKDKFFAY22729/6 15/<5B4403 (NIH 14)
129-137PEKDKFFAY22818/<5 21/<5
130-138EKDKFFAYL229 15/<5
131-138KDKFFAYL230 20/<5
197-228205-213PAFLPWHRL231 15/<5
204-213APAFLPWHRL232 23/360
207-216FLPWHRLFLL125/1310<15/8
208-216LPWHRLFLL917/26 20/80 24/16
214-223FLLRWEQEIQ23315/<5
212-220RLFLLRWEQ23416/<5
191-211191-200GSEIWRDIDF23518/68
192-200SEIWRDIDF236 16/<5B4403 (NIH 400)
207-230207-215FLWHRLFL822/540<15/6 17/<5
466-484473-481RIWSWLLGA23719/1315/<5
476-497476-484SWLLGAAMV23818/<5
477-486WLLGAAMVGA23921/19418/<5
478-486LLGAAMVGA24019/1916/<5

†Scores are given from the two binding prediction programs referenced above (see example 3).

[0342]

PSMA: Preferred Epitopes Revealed by Housekeeping Proteasome Digestion
SEQ IDHLA Binding Predictions (SYFPEITHI/NIH)†
SubstrateEpitopeSequenceNOA*0201A1A3B7B8Other
 1-30 4-12LLHETDSAV24125/485 15/<5
13-21ATARRPRWL24218/<518/<5A26 (R 19)
53-8053-61TPKHNMKAF24324/<5
64-73ELKAENIKKF244 17/<5A26 (R 30)
69-77NIKKFLH1NF245A26 (R 27)
68-77ENIKKFLH1NF246A26 (R 24)
215-244220-228AGAKGVTLY24725/<5
457-489468-477PLMYSLVHNL24822/<5
469-477LMYSLVHNL24927/193<15/9
463-471RVDCTPLMY25032/125 25/<5A26 (R 22)
465-473DCTPLMYSL251A26 (R 22)
 503-533507-515SGMPRISKL25221/<521<5
506-515FSGMIPRISKL25317/<5

1This H was reported as Y in the SWISSPROT database.

†Scores are given from the two binding prediction programs referenced above (see example 3).

[0343]

MAGE-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA TypeSYFPEITHINIH
Mage-1125-132KAEMLESV256B510119n.a.
119-146124-132TKAEMLESV257A020120<5
123-132VTKAEMLESV258A020120<5
128-136MLESVIKNY259A12845
A2624n.a.
A3175
127-136EMLESVIKNY260A115<1.0
A2623<1.0
125-133KAEMLESVI261B510123100
A24N.A.4
Mage-1146-153KASESLQL262B0816<1.0
143-170B510117N.A.
145-153GKASESLQL263B2705171
B270916N.A.
147-155ASESLQLVF264A12268
A2616N.A.
153-161LVFGIDVKE265A316<1.0

[0344]

MAGE-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
Mage-1114-121LLKYRARE266B825<1.0
10 99-125106-113VADLVGFL267B816<1.0
B510121N.A.
105-113KVADLVGFL268A02012344
A2625N.A.
A316<5
B07021420
B27051430
107-115ADLVGFLLL269A020117<5
B070215<5
B2705161
106-115VADLVGFLLL270A020116<5
A1223
114-123LLKYRAREPV271A0201202

[0345]

MAGE-3: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA TypeSYFPEITHIINIH
Mage-3271-278FLWGPRAL162B0817<5
267-295270-278EFLWGPRAL163A2621N.A.
A24N.A.30
B151016N.A.
271-279FLWGPRALV164A0201272655
A3162
278-286LVETSYVKV272A020119<1.0
A2617N.A.
277-286ALVETSYVKV273A020128428
A2616<5
A318<5
285-293KVLHHMVKI274A02011927
A319<5
276-284RALVETSYV165A02011820
283-291YVKVLHHMV275A020117<1.0
275-283PRALVETSY276A117<1.0
274-283GPRALVETSY277A115<1.0
278-287LVETSYVKVL278A020118<1.0
272-281LWGPRALVET168A020116<1.0
271-280FLWGPRALVE167A322<5

[0346]

Fibronectin ED-B: Preferred Epitopes Revealed by
Housekeeping Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
ED-B4′-5**TIIPEVPQL279A0201277
14′-21*A2628N.A.
A317<5
B815<5
B151015N.A.
B27051710
B270915N.A.
A020120<5
5′-5**DTIIPEVPQL280A2632N.A.
 1-10 EVPQLTDLSF281A2629N.A.

*This substrate contains the 14 amino acids from fibronectin flanking ED-B to the N-terminal side.

**These peptides span the junction between the N-terminus of the ED-B domain and the rest of fibronectin.

The italicized lettering indicates sequence outside the ED-B domain.

[0347]

Fibronectin ED-B: Preferred Epitopes Revealed by
Housekeeping Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
ED-B 8-3523-30TPLNSSTI282B510122N.A.
18-25IGLRWTPL283B510118N.A.
17-25SIGLRWTPL284A0201205
A2618N.A.
B0825<5
25-33LNSSTIIGY285A119<5
A2616<5
24-33PLNSSTIIGY286A120<5
A2624N.A.
A316<5
23-31TPLNSSTII287B0702178
B510125440

[0348]

Fibronectin ED-B: Preferred Epitopes Revealed by
Housekeeping Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
ED-B31-38IGYRITVV288B510125N.A.
20-4930-38IIGYRITVV289A02012315
A317<1.0
B0815<1.0
B5101153
29-38TIIGYRITVV290A0201269
A2618N.A.
A318<5
23-30TPLNSSTI282B510122N.A.
25-33LNSSTIIGY285A119<5
A2616N.A.
24-33PLNSSTIIGY286A2624N.A.
A316<5
31-39IGYRITVVA291A317<5
30-39IIGYRITVVA292A020115<5
A318<5
23-31TPLNSSTII287B0702178
B510125440

[0349]

CEA: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
CEA 176-202184-191SLPVSPRL293B0819<5
183-191QSLPVSPRL294A020115<5
B151015
B27051810
B270915
186-193PVSPRLQL295B0818<5
185-193LPVSPRLQL296B070226180
B0816<5
B510119130
184-193SLPVSPRLQL297A02012321
A2618N.A.
A318<5
185-192LPVSPRLQ298B510117N.A.
192-200QLSNGNRTL299A0201214
A2616N.A.
A319<5
B0817<5
B151015
191-200LQLSNGNRTL300A0201163
179-187WVNNQSLPV301A02011628
186-194PVSPRLQLS302A2617N.A.
A315<5

[0350]

CEA: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
CEA 354-380362-369SLPVSPRL303B0819<1.0
361-369QSLPVSPRL304A020115<1.0
B27051810
B270915
364-371PVSPRLQL305B0818<1.0
363-371LPVSPRLQL306B070226180
B0816<1.0
B510119130
362-371SLPVSPRLQL307A02012321
A2618N.A.
A24N.A.6
A318<5
363-370LPVSPRLQ308B510117N.A.
370-378QLSNDNRTL309A0201224
A2616N.A.
A317<1.0
B0817<1.0
369-378LQLSNDNRTL310A0201163
357-365WVNNQSLPV311A02011628
360-368NQSLPVSPR312B270514100

[0351]

CEA: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
CEA 532-558540-547SLPVSPRL313B0819<5
539-547QSLPVSPRL314A020115<5
B151015<5
B27051810
B270915
542-549PVSPRLQL315B0818<5
541-549LPVSPRLQL316B070226180
B0816<1.0
B510119130
540-549SLPVSPRLQL317A02012321
A2618N.A.
A318<5
541-548LPVSPRLQ318B510117N.A.
548-556QLSNGNRTL319A0201244
A2616N.A.
A319<1.0
B0817<1.0
B151015
547-556LQLSNGNRTL320A0201163
535-543WVNGQSLPV321A02011828
A315<1.0
533-541LWWVNGQSL322A020115<5

[0352]

CEA: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID. No.HLA typeSYFPEITHINIH
CEA 532-558532-541YLWWVNGQSL323A020125816
(continued)A2618N.A.
538-546GQSLPVSPR324B270517100

[0353]

HER2/NEU: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
Her-230-37DMKLRLPA325B08198
25-5228-37GTDMKLRLPA326A1236
42-49HLDMLRHIL327B0817<5
41-49THLDMILRHL328A020117<5
B151024N.A.
40-49ETHLDMLRHL329A2629N.A.
36-43PASPETHL330B510117N.A.
35-43LPASPETHL331A020115<5
B510120130
B5102N.A.100
34-43RLPASPETHL332A02012021
38-46SPETHLDML333A020115<5
B07022024
B0818<5
B510118110
37-46ASPETHLDML334A020118<5
42-50HLDMLRHLY335A12925
A2620N.A.
A3174
41-50THLDMLRHLY336A118<1.0

[0354]

HER2/NEU: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
Her-2 705-732719-726ELRKVKVL337B082416
718-726TELRKVKVL338A0201161
B0822<5
B510116<5
717-726ETELRKVKVL339A1182
A26286
715-723LKETELRKV340A020117<5
B510115<5
714-723ILKETELRKV341A0201298
712-720MRILKETEL342A020115<5
B0822<5
B2705272000
B270921N.A.
711-720QMRTLKETEL343A0201202
B07021340
717-725ETELRKVKV344A1185
A2618N.A.
716-725KIETELRKVKV345A02011619
706-714MPNQAQMRJ346B0702168
B510122629
705-714AMPNQAQMIRI347A0201188
706-715MPNQAQMRIL348B07022080

[0355]

HER2/NEU: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
Her-2966-973RPRFRELV349B082024
954-982B510118N.A.
965-973CRPRFRELV350B270918
968-976RFRELVSEF351A2625N.A.
A24N.A.32
A315<5
B0816<5
B270519
967-976PRFRELVSEF352A2618N.A.
964-972ECRPRFREL353B070221N.A.
A24N.A.6
B07021540
B827640
B151016<5

[0356]

NY-ESO-1: Preferred Epitopes Revealed by
Housekeeping Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
NY-ESO-167-75GAASGLNGC354A020115<5
51-7752-60RASGPGGGA355B070215<5
64-72PHGGAASGL356B151021N.A.
63-72GPHGGAASGL357B07022280
60-69APRGPHGGAA358B07022360

[0357]

PRAME: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
PRAME112-119VRPRRWKL359B0819
103-135111-119EVRPRRWKL360A2627N.A.
A24N.A.5
A319N.A.
B070215(B7) 300.00
B0826160
113-121RPRRWKLQV361B070221(B7) 40.00
B510119110
114-122PRRWKLQVL362B0826<5
B270523200
113-122RPRRWKLQVL363B070224(B7) 800.00
B8N.A.160
B5101N.A.61
B5102N.A.61
A24N.A.10
116-124RWKLQVLDL364B0822<5
B2705173
115-124RRWKLQVLDL365A020116<5
PRAME174-182PVEVLVDLF366A2625N.A.
161-187

[0358]

PRAME: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
PRAME199-206VKRKKNVL367B08278
185-215198-206KVKRKKNVL368A020116<1.0
A2620N.A.
A322<1.0
B083040
B270516
197-206EKVKRKKNVL369A2615N.A.
198-205KVKRKKNV370B08206
201-208RKKNVLRL371B0820<5
200-208KRKKNVLRL372A020115<1.0
A2615N.A.
B070215<1.0
B0821<1.0
B270528
B270925
199-208VKRKKNVLRL373A020116<1.0
B0702164
189-196DELFSYLI374B510115N.A.
205-213VLRLCCKKL375A0201223
A2617N.A.
B08258
204-213NVLRLCCKKL376A0201177
A2619N.A.

[0359]

PRAME: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
PRAME 185-215194-202YLIEKVKRK377A020120<1.0
(continued)A2618NA.
A32568
B0820<1.0
B270517
PRAME 71-9874-81QAWPFTCL378B510117n.a.
73-81VQAWPFTCL379A0201147
A24n.a.5
B0702166
72-81MVQAWPFTCL380A2622n.a.
A24n.a.7
B07021330
81-88LPLGVLMK381B510118n.a.
A020117<1.0
80-88CLPLGVLMK382A327120
79-88TCLPLGVLMK383A11210
A3193
84-92GVLMKGQHL384A0201187
A2621n.a.
B08214
81-89LPLGVLMKG385B5101202
80-89CLPLGVLMKG386A020116<1.0
76-85WPFTCLPLGV387B0702184

[0360]

PRAME: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
PRAME 39-6551-59ELFPPLFMA388A02011918
A2623N.A.
49-57PRELFPPLF389B270522
B270919
48-57LPRELFPPLF390B0702194
50-58RELFPPLFM391B270516
B270515
49-58PRELFPPLFM392A116<1.0

[0361]

PSA: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
PSA 232-258239-246RPSLYTKV393B510121N.A.
238-246ERPSLYTKV394B27051560
236-243LPERPSLY395B510118N.A.
235-243ALPERPSLY396A119<1.0
A2622N.A.
A3266
B0816<1.0
B27051115
B270919N.A.
241-249SLYTKVVHY397A020120<1.0
A119<1.0
A2625N.A.
A32660
B0820<1.0
B27051375
240-249PSLYTKVVHY398A120<1.0
A2616N.A.
239-247RPSLYTKVV399B0702214
B510123110

[0362]

PSMA: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
PSMA 202-228211-218GNKVKNAQ400B0822<5
202-209LARYGKVF401B0818<5
217-225AQLAGAKGV402A02011626
207-215KVFRGNKVK403A33215
211-219GNKVKJNAQL404B83380
B27051720
PSMA 255-282269-277TPGYPANEY405A116<5
268-277LTPGYPANEY406A1211
A2624N.A.
271-279GYPANEYAY407A115<5
270-279PGYPANEYAY408A119<5
266-274DPLTPGYPA409B0702213
B51011720
PSMA 483-509492-500SLYESWTKK410A020117<5
A327150
B270518150
491-500KSLYESWTKK411A316<5
486-494EGFEGKSLY412A119’15
A2621N.A.
B270516<5
485-494DEGFEGKSLY413A117<5
A2617N.A.
498-506TKIKSPSPEF414B0817<5

[0363]

PSMA: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
PSMA 483-509497-506WTKKSPSPEF415A2624N.A.
(continued)492-501SLYESWTKKS416A020116<5
A316<5
PSMA 721-749725-732WGEVKRQI417B0817<5
B510117N.A.
724-732AWGEVKRQJ418B5101156
723-732KAWGEVKRQI419A020116<1.0
723-730KAWGEVKR420B510115N.A.
722-730SKAWGEVKR421B270515<5
731-739QIYVAAFTV422A020121177
A321<1.0
B5101155
733-741YVAAFTVQA423A0201176
A320<1.0
725-733WGEVKRQIY424A12611
727-735EVKRQJYVA425A2622N.A.
A318<1.0
738-746TVQAAAETL426A2618N.A.
A319<1.0
737-746FTVQAAAETL427A020117<1.0
A2619N.A.

[0364]

PSMA: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
PSMA 721-749729-737KRQIYVAAF428A2616N.A.
(continued)B2705243000
B270921N.A.
721-729PSKAWGEVK429A320<1.0
723-731KAWGEVKRQ430B510116<1.0
PSMA 95-122100-108WKEFGLDSV431A020116<5
 99-108QWKEFGLDSV432A020117<5
102-111EFGLDSVELA433A2616N.A.

[0365]

SCP-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SCP-1126-134ELRQKESKL434A020120<5
117-143A2626N.A.
A317<5
B070213(B7) 40.00
B834320
125-134AELRQKESKL435A020116<5
133-141KLQENRKII436A02012061
SCP-1298-305QLEEKTKL437B08282
281-308297-305NQLEEKTKL438A02011633
B270519200
288-296LLEESRDKV439A02012515
B5101153
287-296FLLEESRDKV440A0201272378
291-299ESRDKVNQL441A2621N.A.
B0829240
290-299EESRDKVNQL442A2619N.A.
SCP-1475-483EKEVHDLEY443A13111
471-498A2617N.A.
474-483REKEVHDLEY444A121<1.0
480-488DLEYSYCIIY445A12645
A2630N.A.
A316<5
477-485EVHDLEYSY446A1151

[0366]

SCP-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SCP-1 471-498477-485EVHDLEYSYA2629NA.
(continued)A319<1.0
477-486EVHDLEYSYC447A2622N.A.
SCP-1 493-520502-509KLSSKREL448B08264
508-515ELKNTEYF449B0824<1.0
507-515RELKNTEYF450B27051845
B4403N.A.120
496-503KRGQRPKL451B0818<1.0
494-503LPKRGQRPKL452B070222120
B8N.A.16
B5101N.A.130
B3501N.A.60
509-517LKINTEYFTL453A020115<5
508-517ELKNTEYFTL454A020118<1.0
A2627N.A.
A316<1.0
506-514KRELKNTEY455A1262
B2705263000
502-510KLSSKLRELK456A32560
498-506GQRPKLSSK457A3224
B270518200
497-506RGQRPKLSSK458A322<1.0
500-508RPKLSSKRE459B0818<1.0

[0367]

SCP-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SCP-1573-580LEYVREEL460B0819<5
570-596572-580ELEYVREEL461A020117<1.0
A2623N.A.
A24N.A.9
B0820N.A.
571-580N ELEYVREEL462A0201164
579-587ELKQKRDEV463A020119<1.0
A2618N.A.
B082948
575-583YVREELKQK464A2617N.A.
A3272
SCP-1632-640QLNVYEIKV465A02012470
618-645630-638SKQLNVYEI466A020117<5
628-636AESKQLNVY467A119<5
A2616N.A.
627-636TAESKQLNVY468A12645
A2615N.A.

[0368]

SCP-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SCP-1638-645IKVNKLEL469B0821<1.0
633-660637-645EIKVNKLEL470A020117<1.0
A2626N.A.
B08288
B151015N.A.
636-645YEIKVNKLEL471A0201172
642-650KLELELESA472A0201201
A316<1.0
635-643VYEIKVNKL473A020118<1.0
A24N.A.396
B0822<1.0
634-643NVYEIKVNKL474A02012456
A2625N.A.
A24N.A.6
A315<5
B070211(B7) 20
B08N.A.6
646-654ELESAKQKF475A2627N.A.
SCP-1642-650KLELELESA476A0201201
640-668A316<1.0
646-654ELESAKQKF477A2627N.A.

[0369]

SCP-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SCP-1771-778KEKLKREA478B0821<5
768-796777-785EAKENTATL479A020118<5
A2618N.A.
A24N.A.5
B07021312
B082848
B510120121
776-785REAKENTATL480A020116<5
773-782KLKREAKENT481A317<5
SCP-1112-119EAEKIKKW482B510117N.A.
92-125101-109GLSRVYSKL483A02012332
A2622N.A
A24N.A.6
A3173
B0817<1.0
100-109EGLSRVYSKL484A2621N.A.
A24N.A.9
108-116KILYKEAEKI485A02012257
A3209
B5101185
 98-106NSEGLSRVY486A13168
 97-106ENSEGLSRVY487A2618N.A.
102-110LSRVYSKLY488A122<1.0

[0370]

SCP-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SCP-1101-110GLSRVYSKLY489A118<1.0
92-125A2618N.A.
(continued)A31918
 96-105LENSEGLSRV490A0201175
108-117KLYKEAEKJK491A327150
SCP-1949-956REDRWAVI492B510115N.A.
931-958948-956MREDRWAVI493B270518600
B270918N.A.
B5101151
947-956KMREDRWAVI494A0201216
B08N.A.15
947-955KMREDRWAV495A020122411
934-942TTPGSTLKF496A2625N.A.
933-942LTTPGSTLKF497A2623N.A.
937-945GSTLKFGAI498B08191
945-953IRKMREDRW499B0819<5
SCP-1236-243RLEMHEKL500B0816<5
232-259235-243SRLEMHFKL501A020118<5
B2705252000
B270922
242-250KLKEDYEKI502A0201224

[0371]

SCP-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SCP-1A2616N.A.
232-259A3153
(continued)B0824<5
B5101142
249-257KIQHLEQEY503A115<5
A2623N.A.
A317<5
248-257EKIQHLEQEY504A115<5
A2621N.A.
233-242ENSRLEMHF505A2619N.A.
236-245RLEMHFKLKE506A119<5
SCP-1324-331LEDIKVSL507A317<5
310-340323-331ELEDIKVSL508B0820<1.0
A020121<1.0
A2625N.A.
A24N.A.10
A317<1.0
B0819<1.0
B151016N.A.
322-331KELEDIKVSL509A02011922
320-327LTKELEDI500B0818<5
319-327HLTKELEDI511A020121<1.0
330-338SLQRSVSTQ512A020118<1.0

[0372]

SCP-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SCP-1321-329TKELEDIKV513A116<1.0
310-340320-329LTKELEDIKV514A020119<1.0
(continued)326-335DIKVSLQRSV515A2618N.A.
SCP-1281-288KMKDLTFL516B08203
272-305280-288NKMKDLTFL517A0201151
279-288ENKMKDLTFL518A2619N.A.
288-296LLEESRDKV519A02012515
B5101153
287-296FLLEESRDKV520A0201272378
291-299ESRDKVNQL521A2621N.A.
B0829240
290-299EESRDKVNQL522A2619N.A.
277-285EKENKMKDL523A2619N.A.
B0823<1.0
276-285TEKENKMKLDL524A2615N.A.
279-287ENKMKDLTF525A2618N.A.
B08284
SCP-1218-225IEKMITAF526B0817<5
211-239217-225NIEKMITAF527A2626N.A.
216-225SNIEKMITAF528A2619N.A.
223-230TAFEELRV529B510123N.A.
222-230ITAFEELRV530A0201182
221-230MITAFEELRV531A02011816

[0373]

SCP-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SCP-1220-228KMITAFEEL532A02012350
211-239A2615N.A.
(continued)A24N.A.16
219-228EKMITAFEEL533A2619N.A.
227-235ELRVQAENS534A316<1.0
B0815<1.0
213-222DLNSNIEKMI535A020117<1.0
A2616N.A.
SCP-1837-844WTSAKNTL536B08204
836-863846-854TPLPKAYTV537A0201182
B0702174
B08162
B510125220
845-854STPLPKAYTV538A020119<5
844-852LSTPLPKAY539A1238
843-852TLSTPLPKAY540A116<1.0
A2619N.A.
A3182
842-850NTLSTPLPK541A3163
841-850KNTLSTPLPK542A318<1.0

[0374]

SCP-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SCP-1828-835ISKDKRDY543B08213
819-845A2621N.A.
826-835HGISKDKRDY544A115<5
832-840KRDYLWTSA545B270516600
829-838SKDKRDYLWT546A118<5
SCP-1279-286ENKMKDLT547B08228
260-288260-268EINDKEKQV548A0201173
A2619N.A.
B0817<5
274-282QITEKENKM549A0201173
A2622N.A.
B0816<5
269-277SLLLIQITE550A020116<1.0
A318<1.0
SCP-1453-460FEKIAEEL551B0821<1.0
437-464452-460QFEKIAEEL552B270515
451-460KQFEKIAEEL553A02011656
B08162
449-456DNKQFEKI554B510116N.A.
448-456YDNKQFEKI555B5101161
447-456LYDNKQFEKI556A115<1.0

[0375]

SCP-1: Preferred Epitopes Revealed by Housekeeping
Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SCP-1440-447LGEKETLL557B510116N.A.
437-464439-447VLGEKETLL558A020124149
(continued)A2619N.A.
B082912
438-447KVLGEKETLL559A02011924
A2620N.A.
A24N.A.12
A318<1.0
B07021420
SCP-1390-398LLRTEQQRL560A0201223
383-412A2618N.A.
B08221.6
B27051530
389-398ELLRTEQQRL561A0201196
A2624N.A.
A315<1.0
393-401TEQQRLENY562A115<5
A2616N.A.
392-401RTEQQRLENY563A131113
A2626N.A.
402-410EDQLIILTM564A2618N.A.
397-406RLENYEDQLI565A020117<1.0
A315<1.0

[0376]

SCP-1: Preferred Epitopes Revealed by
Housekeeping Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SCP-1 366-394368-375KARAAHSF566B0816<1.0
376-384VVTEFETTV567A020119161
A316<1.0
375-384FVVTEFETTV568A020117106
377-385VTEFETTVC569A1182
376-385VVTEFETTVC570A316<5
SCP-1 331-357344-352DLQIATNTI571A020122<5
A315<1.0
B51011711
347-355IATNTICQL572A0201191
B0816<1.0
B51012079
346-355QIATNTICQL573A0201247
A2624N.A.

[0377]

SSX-4: Preferred Epitopes Revealed by
Housekeeping Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
SSX4 45-7657-65VMTKLGFKV574A020121495
53-61LNYEVMTKL575A0201177
52-61KLNYEVMTKL576A020123172
A2621N.A.
A24N.A.18
A3144
B7N.A.4
66-74TLPPFMRSK577A2616N.A.
A32514
 SSX4 98-124110-118KIMPKKPAE578A020115<5
A2615N.A.
A316<5
103-112SLQRIFPKIM579A0201158
A2616N.A.
A315<5

[0378]

Tyrosinase: Preferred Epitopes Revealed by
Housekeeping Proteasome Digestion
Binding Prediction
SubstrateEpitopeSequenceSeq. ID No.HLA typeSYFPEITHINIH
Tyr 463-474463-471YIKSYLEQA580A020118<5
A2617N.A.
459-467SFQDYIKSY581A118<5
A2622N.A.
458-467DSFQDYIKSY582A119<5
A2624N.A.
Tyr 490-518507-514LPEEKQPL583B08285
B510118N.A.
506-514QLPEEKQPL584A02012288
A2620N.A.
A24N.A.9
B0818<5
505-514KQLPEEKQPL585A02011528
A24N.A.17
507-515LPEEKQPLL586A020115<5
B07022124
B08285
B510121157
506-515QLPEEKQPLL587A02012388
A2620N.A.
A24N.A.7
497-505SLLCRHKRK588A32515

Example 15

Evaluating Likelihood of Epitope Cross-Reactivity on Non-Target Tissues

[0379]

As noted above PSA is a member of the kallikrein family of proteases, which is itself a subset of the serine protease family. While the members of this family sharing the greatest degree of sequence identity with PSA also share similar expression profiles, it remains possible that individual epitope sequences might be shared with proteins having distinctly different expression profiles. A first step in evaluating the likelihood of undesirable cross-reactivity is the identification of shared sequences. One way to accomplish this is to conduct a BLAST search of an epitope sequence against the SWISSPROT or Entrez non-redundant peptide sequence databases using the “Search for short nearly exact matches” option; hypertext transfer protocol accessible on the world wide web (http://www) at “ncbi.nlm.nih.gov/blast/index.html”. Thus searching SEQ ID NO. 214, WVLTAAHCI, against SWISSPROT (limited to entries for homo sapiens) one finds four exact matches, including PSA. The other three are from kallikrein 1 (tissue kallikrein), and elastase 2A and 2B. While these nine amino acid segments are identical, the flanking sequences are quite distinct, particularly on the C-terminal side, suggesting that processing may proceed differently and that thus the same epitope may not be liberated from these other proteins. (Please note that kallikrein naming is confused. Thus the kallikrein 1 [accession number P06870] is a different protein than the one [accession number AAD13817] mentioned in the paragraph on PSA above in the section on tumor-associated antigens).

[0380]

It is possible to test this possibility in several ways. Synthetic peptides containing the epitope sequence embedded in the context of each of these proteins can be subjected to in vitro proteasomal digestion and analysis as described above. Alternatively, cells expressing these other proteins, whether by natural or recombinant expression, can be used as targets in a cytotoxicity (or similar) assay using CD8+ T cells that recognize the epitope, in order to determine if the epitope is processed and presented.

Epitope Clusters

[0381]

Known and predicted epitopes are generally not evenly distributed across the sequences of protein antigens. As referred to above, we have defined segments of sequence containing a higher than average density of (known or predicted) epitopes as epitope clusters. Among the uses of epitope clusters is the incorporation of their sequence into substrate peptides used in proteasomal digestion analysis as described herein. Epitope clusters can also be useful as vaccine components. A fuller discussion of the definition and uses of epitope clusters is found in U.S. patent application Ser. No. 09/561,571 entitled corporated by reference in its entirety.

Example 16

Metal-A/MART-1

[0382]

This melanoma tumor-associated antigen (TAA) is 118 amino acids in length. Of the 110 possible 9-mers, 16 are given a score ≧16 by the SYFPEITHI/Rammensee algorithm. (See Table 37). These represent 14.5% of the possible peptides and an average epitope density on the protein of 0.136 per amino acid. Twelve of these overlap, covering amino acids 22-49 resulting in an epitope density for the cluster of 0.428, giving a ratio, as described above, of 3.15. Another two predicted epitopes overlap amino acids 56-69, giving an epitope density for the cluster of 0.143, which is not appreciably different than the average, with a ratio of just 1.05. See FIG. 18.

SYFPEITHI (Rammensee algorithm) Results for Melan-A/MART-1
RankStartScore
13127
25626
33526
43225
52725
62924
73423
86120
93319
102219
119918
123618
132818
148717
154117
164016

[0383]

Restricting the analysis to the 9-mers predicted to have a half time of dissociation of ≧5 minutes by the BIMAS-NIH/Parker algorithm leaves only 5. (See Table 38). The average density of epitopes in the protein is now only 0.042 per amino acid. Three overlapping peptides cover amino acids 31-48 and the other two cover 56-69, as before, giving ratios of 3.93 and 3.40, respectively. (See Table 39).

BIMAS-NIH/Parker algorithm Results for Melan-A/MART-1
RankStartScoreLog(Score)
1401289.013.11
2561055.1043.02
33181.3851.91
43520.7531.32
5614.9680.70

[0384]

Predicted Epitope Clusters for Melan-A/MART-1
Calculations(Epitopes/AAs)
ClusterAAPeptidesClusterWhole proteinRatio
131-483, 4, 10.170.0423.93
256-692, 50.140.0423.40

Example 17

SSX-2/HOM-MEL-40

[0385]

This melanoma tumor-associated antigen (TAA) is 188 amino acids in length. Of the 180 possible 9-mers, 11 are given a score ≧16 by the SYFPEITHI/Rammensee algorithm. These represent 6.1% of the possible peptides and an average epitope density on the protein of 0.059 per amino acid. Three of these overlap, covering amino acids 99-114 resulting in an epitope density for the cluster of 0.188, giving a ratio, as described above, of 3.18. There are also overlapping pairs of predicted epitopes at amino acids 16-28, 57-67, and 167-183, giving ratios of 2.63, 3.11, and 2.01, respectively. There is an additional predicted epitope covering amino acids 5-28. Evaluating the region 5-28 containing three epitopes gives an epitope density of 0.125 and a ratio 2.14.

[0386]

Restricting the analysis to the 9-mers predicted to have a half time of dissociation of ≧5 minutes by the BIMAS-NIH/Parker algorithm leaves only 6. The average density of epitopes in the protein is now only 0.032 per amino acid. Only a single pair overlap, at 167-180, with a ratio of 4.48. However the top ranked peptide is close to another single predicted epitope if that region, amino acids 41-65, is evaluated the ratio is 2.51, representing a substantial difference from the average. See FIG. 19.

SYFPEITHI/Rammensee algorithm for SSX-2/HOM-MEL-40
RankStartScore
110323
216722
34122
41621
59920
65919
72017
8517
917516
1010616
115716

[0387]

Calculations (Epitopes/AAs)
Calculations (Epitopes/AAs)
ClusterAAPeptidesClusterWhole proteinRatio
1 5 to 288, 4, 70.1250.0592.14
216-284, 70.150.0592.63
357-6711, 60.180.0593.11
4 99-1145, 1, 100.190.0593.20
5167-1832, 90.120.0592.01

[0388]

BIMAS-NIH/Parker algorithm
RankStartScoreLog(Score)
1411017.0623.01
216721.6721.34
35720.811.32
410310.4331.02
517210.0681.00
6166.4420.81

[0389]

Calculations(Epitopes/AAs)
ClusterAAPeptidesClusterWhole proteinRatio
141-651, 30.080.0322.51
2167-1802, 50.140.0324.48

Example 18

NY-ESO

[0390]

This tumor-associated antigen (TAA) is 180 amino acids in length. Of the 172 possible 9-mers, 25 are given a score ≧16 by the SYFPEITHI/Rammensee algorithm. Like Melan-A above, these represent 14.5% of the possible peptides and an average epitope density on the protein of 0.136 per amino acid. However the distribution is quite different. Nearly half the protein is empty with just one predicted epitope in the first 78 amino acids. Unlike Melan-A where there was a very tight cluster of highly overlapping peptides, in NY-ESO the overlaps are smaller and extend over most of the rest of the protein. One set of 19 overlapping peptides covers amino acids 108-174, resulting in a ratio of 2.04. Another 5 predicted epitopes cover 79-104, for a ratio of just 1.38.

[0391]

If instead one takes the approach of considering only the top 5% of predicted epitopes, in this case 9 peptides, one can examine whether good clusters are being obscured by peptides predicted to be less likely to bind to MHC. When just these predicted epitopes are considered we see that the region 108-140 contains 6 overlapping peptides with a ratio of 3.64. There are also 2 nearby peptides in the region 148-167 with a ratio of 2.00. Thus the large cluster 108-174 can be broken into two smaller clusters covering much of the same sequence.

[0392]

Restricting the analysis to the 9-mers predicted to have a half time of dissociation of ≧5 minutes by the BIMAS-NIH/Parker algorithm brings 14 peptides into consideration. The average density of epitopes in the protein is now 0.078 per amino acid. A single set of 10 overlapping peptides is observed, covering amino acids 144-171, with a ratio of 4.59. All 14 peptides fall in the region 86-171 which is still 2.09 times the average density of epitopes in the protein. While such a large cluster is larger than we consider ideal it still offers a significant advantage over working with the whole protein. See FIG. 20.

SYFPEITHI (Rammensee algorithm) Results for NY-ESO
RankStartScore
110825
214824
315921
412721
58621
613220
712220
812020
911520
109620
1111319
129119
1316618
1416118
1515718
1615118
1713718
187918
1913917
2013117
218717
2215216
2314416
2412916
251516

[0393]

Calculations(Epitopes/AAs)
ClusterAAPeptidesClusterWhole proteinRatio
1108-1401, 9, 8, 7, 4, 60.180.053.64
2148-1672, 30.100.052.00
3 79-1045 12, 10, 18, 210.190.141.38
4108-1741, 11, 9, 8, 7, 4,0.280.142.04
6, 17, 2, 16, 15,
3, 14, 13, 24, 20,
19, 23, 22

[0394]

BIMAS-NIH/Parker algorithm Results for NY-ESO
RankStartScoreLog(Score)
11591197.3213.08
286429.5782.63
3120130.6012.12
416183.5841.92
515552.7041.72
615449.5091.69
715742.2781.63
810821.3621.33
913219.4251.29
1014513.6241.13
1116311.9131.08
1214411.4261.06
131486.7560.83
141524.9680.70

[0395]

Calculations(Epitopes/AAs)
ClusterAAPeptidesClusterWhole proteinRatio
1 86-1712, 8, 3, 9, 10, 12,0.1630.0782.09
13, 14, 6, 5, 7,
1, 4, 11
2144-17110, 12, 13, 14, 6,0.360.0784.59
5, 7, 1, 4, 11

Example 19

Tyrosinase

[0396]

This melanoma tumor-associated antigen (TAA) is 529 amino acids in length. Of the 521 possible 9-mers, 52 are given a score ≧16 by the SYFPEITHI/Rammensee algorithm. These represent 10% of the possible peptides and an average epitope density on the protein of 0.098 per amino acid. There are 5 groups of overlapping peptides containing 2 to 13 predicted epitopes each, with ratios ranging from 2.03 to 4.41, respectively. There are an additional 7 groups of overlapping peptides, containing 2 to 4 predicted epitopes each, with ratios ranging from 1.20 to 1.85, respectively. The 17 peptides in the region 444-506, including the 13 overlapping peptides above, constitutes a cluster with a ratio of 2.20.

[0397]

Restricting the analysis to the 9-mers predicted to have a half time of dissociation of ≧5 minutes by the BIMAS-NIH/Parker algorithm brings 28 peptides into consideration. The average density of epitopes in the protein under this condition is 0.053 per amino acid. At this density any overlap represents more than twice the average density of epitopes. There are 5 groups of overlapping peptides containing 2 to 7 predicted epitopes each, with ratios ranging from 2.22 to 4.9, respectively. Only three of these clusters are common to the two algorithms. Several, but not all, of these clusters could be enlarged by evaluating a region containing them and nearby predicted epitopes.

SYFPEITHI/Rammensee algorithm Results for Tyrosinase
RankStartScore
149034
249131
348728
4127
5225
648223
738023
836923
921423
1050622
1134322
1220722
1313722
145722
1516920
1611820
17920
1848819
1948319
2048019
2147919
2247819
2347319
2436519
2528719
2620019
27519
2848418
2947618
3046318
3144418
3242518
3331618
3418718
3540217
3638817
3734617
3833617
3922517
4022417
4120817
4218617
4317117
4451416
4549416
4640616
4738516
4834916
4918416
5016716
5114516
5213916

[0398]

Calculations(Epitopes/AAs)
ClusterAAPeptidesClusterWhole proteinRatio
11 to 174, 5, 27, 170.240.0982.39
2137-15313, 52, 510.180.0981.80
3167-17915, 43, 500.230.0982.35
4184-19534, 42, 490.250.0982.54
5200-22226, 41, 9, 120.170.0981.77
6224-23339, 400.200.0982.03
7336-35738, 11, 37, 480.180.0981.85
8365-37724, 80.150.0981.57
9380-3967, 47, 360.180.0981.80
10402-41435, 460.150.0981.57
11473-50229, 28, 23, 22,0.430.0984.41
21, 20, 6, 19, 3,
18, 1, 2, 45
12506-52210, 440.120.0981.20
444-52231, 30, 23, 29,0.220.0982.20
22, 21, 20, 6, 19,
28, 3, 18, 1, 2,
45, 10, 44

[0399]

BIMAS-NIH/Parker algorithm Results
RankStartScoreLog(Score)
1207540.4692.73
2369531.4552.73
31309.052.49
49266.3742.43
5490181.7942.26
6214177.5662.25
7224143.4512.16
817193.6561.97
950687.5861.94
1048783.5271.92
1149183.5271.92
12254.4741.74
1313747.9911.68
1420030.7771.49
1520826.2481.42
1646021.9191.34
1747819.4251.29
1836517.141.23
1938016.2281.21
2044413.2181.12
2147313.041.12
225710.8681.04
234828.2520.92
244837.3090.86
2556.9930.84
262255.8580.77
273435.1950.72
285145.1790.71

[0400]

Calculations (Epitopes/AAs)
ClusterAAPeptidesClusterWhole proteinRatio
11 to 173, 12, 25, 40.240.0534.45
2200-22214, 1, 15, 60.170.0533.29
3224-2337, 260.200.0533.78
4365-37718, 20.150.0532.91
5473-49921, 17, 23, 24,0.260.0534.90
10, 5, 11
6506-5229, 280.120.0532.22
7365-38818, 2, 190.130.0532.36
8444-49920, 16, 21, 17,0.160.0533.03
23, 24, 10, 5, 11
9444-52220, 16, 21, 17,0.140.0532.63
23, 24, 10, 5, 11,
9, 28
10200-23314, 1, 15, 6, 7, 260.180.0533.33

Example 20

[0401]

The following tables (52-75) present 9-mer epitopes predicted for HLA-A2 binding using both the SYFPEITHI and NIH algorithms and the epitope density of regions of overlapping epitopes, and of epitopes in the whole protein, and the ratio of these two densities. (The ratio must exceed one for there to be a cluster by the above definition; requiring higher values of this ratio reflect preferred embodiments). Individual 9-mers are ranked by score and identified by the position of their first amino in the complete protein sequence. Each potential cluster from a protein is numbered. The range of amino acid positions within the complete sequence that the cluster covers is indicated as are the rankings of the individual predicted epitopes it is made up of.

BIMAS-NIH/Parker algorithm Results for gp100
RankStartScore
16191493
2602413
3162226
418118
5178118
6273117
760181
824363
960660
1037350
1154436
1229129
1359229
1426829
154727
1658526
1757621
1846521
1957020
20919
2141619
222518
2356617
2460315
2538414
261314
2729012
2863710
296399
304859
314538
321028
333998
344567
351137
366227
37697
386046
393506
405835

[0402]

SYFPEITHI (Rammensee algorithm) Results for gp100
RankStartScore
160630
216229
345628
41828
560227
659827
760126
859726
91326
1058525
1144925
12425
1360324
1457624
1545324
1617824
1717124
181124
1961923
2028023
2126823
2259222
2354422
2446522
2539922
2637322
2727322
2824322
2956621
3056321
3148521
3238421
3335021
34921
3546320
3639720
3729120
3826920
39220
4061019
4159419
4259119
4358319
4457019
4548819
4644619
4732219
4826719
4925019
5020519
5118019
5216919
538819
544719
551019
5664818
5760518
5860418
5959518
6057118
6156918
6245018
6340918
6440018
6537118
6634318
6729818
6820918
6910218
709718
717618
726918
736018
741718
7561317
7659917
7757217
7855717
7955617
8051217
8140617
8232417
8329017
8410117
859517
8663516
8758816
8858416
8957716
9055916
9153916
9249416
9348216
9446816
9544216
9641316
9740816
9840216
9928616
10023416
10121716
10221116
10317616
10410716
1059616
1068016
1071616
1081416
109716

[0403]

Prediction of clusters for gp100
Total AAs: 661
Total 9-mers: 653
SYFPEITHI ≧ 16: 109 9-mers
NIH ≧ 5: 40 9-mers
Epitopes/AA
Whole
Cluster #AAsEpitopes (by Rank)ClusterPrRatio
SYFPEITHI 12 to 2639, 12, 109, 34, 55, 11, 9,0.4400.1652.668
108, 107, 74, 4
 2 69-11572, 71, 106, 53, 85, 105,0.2130.1651.290
70, 84, 69, 104
 3 95-11585, 105, 70, 84, 690.2380.1651.444
 4162-1882, 52, 17, 103, 16, 510.2220.1651.348
 5205-22550, 68, 102, 1010.1900.1651.155
 6243-25828, 490.1250.1650.758
 7267-30648, 21, 38, 27, 20, 99, 83, 37, 670.2250.1651.364
 8322-33247, 820.1820.1651.103
 9343-35866, 330.1250.1650.758
10371-38165, 260.1820.1651.103
11397-42136, 25, 64, 98, 81, 97, 63, 960.3200.1651.941
12442-47695, 46, 11, 62, 15, 3, 35, 24, 940.2570.1651.559
13482-50293, 31, 45, 930.1900.1651.155
14539-55291, 230.1430.1650.866
15556-62779, 78, 90, 30, 29, 61, 44, 60, 77,0.4310.1652.611
14, 89, 43, 88, 10, 87, 42, 22, 41,
59, 8, 6, 76, 7, 5, 13, 58, 57, 1,
40, 75, 19
NIH 19 to 3320, 26, 4, 220.1600.0612.644
 2268-28114, 60.1430.0612.361
 3290-29927, 120.2000.0613.305
 4*102-12132, 350.1000.0611.653
 5*373-39210, 250.1000.0611.653
 6453-47331, 34, 180.1430.0612.361
 7566-60023, 19, 17, 40, 16, 130.1710.0612.833
 8601-6147, 2, 24, 38, 90.3570.0615.902
 9619-6301, 360.170.0612.754
10637-64728, 290.180.0613.005

*Nearby but not overlapping epitopes

[0404]

BIMAS-NIH/Parker algorithm Results for PSMA
RankStartScore
16631360
27111055
34485
427400
526375
6668261
7707251
8469193
9731177
103567
113364
1255459
1342750
1411547
152040
1621726
1758324
1841519
1919314
2024012
2162711
2226010
2313010
247419
2539
267338
277267
282866
291745
307005

[0405]

SYFPEITHI (Rammensee algorithm) Results for PSMA
RankStartScore
146927
22727
374126
471126
535425
6425
766324
813024
95724
1070723
1126023
122023
1360322
1421822
1510922
1673121
1766821
1866021
1950721
2045421
2142721
2235821
2328421
2411521
253321
2660620
2756820
2847320
2946120
3020020
312620
32320
3358319
3457919
3555419
3655019
3754719
3839019
3921919
4019319
4170018
4247218
4336418
4431718
4525318
469118
476118
481318
4973317
5067317
5167117
5264217
5357117
5449217
5544217
5644117
5739717
5839117
5935717
6034417
6130517
6230417
6328617
6428217
6516917
6614217
6712217
6873816
6963416
7063116
7151516
7245616
7344016
7438516
7537316
7636516
7736116
7828916
7927816
8025816
8124716
8221716
8310716
8410016
857516
863716
873016
882116

[0406]

Prediction of clusters for prostate-specific membrane antigen (PSMA)
Total AAs: 750
Total 9-mers: 742
SYFPEITHI ≧ 16: 88 9-mers
NIH ≧ 5: 30 9-mers
Epitopes/AA
Whole
Cluster #AasEpitopes (by rank)ClusterPrRatio
SYFPEITHI 13 to 1232, 60.2000.1171.705
 213-4513, 12, 88, 31, 2, 87, 25, 860.2420.1172.066
 357-699, 470.1540.1171.311
 4100-13884, 83, 15, 24, 67, 80.1540.1171.311
 5193-20840, 300.1250.1171.065
 6217-22782, 14, 390.2730.1172.324
 7247-26881, 45, 80, 110.1820.1171.550
 8278-29779, 64, 23, 63, 780.2500.1172.131
 9354-3815, 59, 22, 77, 43, 76, 750.2500.1172.131
10385-40574, 38, 58, 570.1900.1171.623
11440-45073, 56, 550.2730.1172.324
12454-48120, 72, 29, 1, 42, 280.2140.1171.826
13507-52317, 710.1180.1171.003
14547-56237, 36, 350.1880.1171.598
15568-59127, 53, 34, 330.1670.1171.420
16603-61413, 260.1670.1171.420
17631-65070, 69, 520.1500.1171.278
18660-68118, 7, 17, 51, 500.2270.1171.937
19700-71941, 10, 40.1500.1171.278
20731-74916, 49, 68, 30.2110.1171.794
NIH 13 to 1225, 30.2000.0405.000
 220-4315, 5, 4, 11, 100.2080.0405.208
 3*415-43518, 130.0950.0402.381
 4663-6761, 60.1430.0403.571
 5700-71530, 7, 30.1880.0404.688
 6726-74927, 9, 26, 240.1670.0404.167

*Nearby but not overlapping epitopes

[0407]

BIMAS-NIH/Parker algorithm Results for PSA
RankStartScore
17607
2170243
352124
453112
5195101
616523
77218
824518
9216
105916
1112215
1212515
1319113
1498
15146
161755
171305

[0408]

SYFPEITHI (Rammensee algorithm) Results for PSA
RankStartScore
17226
217022
35322
4722
523421
616621
714021
86621
924120
1017520
111220
124119
132019
141419
1513018
1612418
1712118
184718
191718
2021817
2113317
2212517
2312217
2411817
2511017
266717
275217
282117
291617
30217
3118416
3217916
3315816
347916
357316
36416

[0409]

Prediction of clusters for prostate specific antigen (PSA)
Total AAs: 261
Total 9-mers: 253
SYFPEITHI ≧ 16: 36 9-mers
NIH ≧ 5: 17 9-mers
Epitopes/AA
Whole
Cluster #AAsEpitopes (by rank)ClusterPrRatio
SYFPEITHI12 to 2930, 36, 4, 11, 14, 29, 19, 13, 280.3210.1382.330
241-6112, 18, 27, 30.1900.1381.381
366-878, 26, 1, 35, 340.2270.1381.648
4110-14825, 24, 17, 23, 16, 22, 15, 21, 70.1840.1381.332
5158-19233, 6, 2, 10, 32, 310.1710.1381.243
6234-2495, 90.1250.1380.906
 7*118-13324, 17, 23, 16, 220.3130.1382.266
 8*118-13824, 17, 23, 16, 22, 150.2860.1382.071
NIH1 2-229, 1, 14, 150.1900.0652.924
252-673, 4, 100.1880.0652.879
3122-13811, 12, 170.1760.0652.709
4165-1836, 2, 160.1580.0652.424
5191-20313, 50.1540.0652.362
 6**52-803, 4, 10, 70.1380.0652.118

*These clusters are internal to the less preferred cluster #4.

**Includes a nearby but not overlapping epitope.

[0410]

BIMAS-NIH/Parker algorithm Results for PSCA
RankStartScore
143153
2584
3779
410936
510525
610824
71421
82018
911517
104215
113615
12999
13588

[0411]

SYFPEITHI (Rammensee algorithm) Results for PSCA
RankStartScore
110830
21430
310529
4528
511526
69926
7726
810924
95323
1010721
112021
12821
131320
1410219
156019
165719
175419
181219
19419
20119
2111218
2210118
239818
245118
254318
2610617
2710417
288317
296317
305017
31317
32916
339216

[0412]

Prediction of clusters for prostate stem cell antigen (PSCA)
Total AAs: 123
Total 9-mers: 115
SYFPEITHI ≧ 16: 33;
SYFPEITHI ≧ 20: 13
NIH ≧ 5: 13
Epitopes/AA
Cluster #AAsEpitopes (by rank)ClusterWhole Pr.Ratio
SYFPEITHI > 1611 to 2820, 31, 19, 4, 7, 12, 33, 18, 13, 2,0.3930.2681.464
11
243-7125, 30, 24, 9, 17, 16, 15, 290.2760.2681.028
3 92-12332, 23, 6, 27, 14, 22, 3, 26, 10,0.4060.2681.514
1, 8, 21, 5
SYFPEITHI > 2015 to 284, 7, 12, 13, 2, 110.2500.1062.365
2 99-1236, 3, 10, 1, 8, 50.2400.1062.271
NIH15 to 282, 3, 7, 80.1670.1061.577
236-5111, 10, 10.1880.1061.774
3 99-12312, 5, 6, 4, 90.2000.1061.892
4*105-1165, 6, 40.2500.1062.365

*This cluster is internal to the less preferred cluster #3.

[0413]

In tables 49-60 epitope prediction and cluster analysis data for each algorithm are presented together in a single table.

Prediction of clusters for MAGE-1 (NIH algorithm)
Total AAs: 309
Total 9-mers: 301
NIH ≧ 5: 19 9-mers
Epitopes/AA
ClusterEpitopeStartNIHWhole
#AAsRankPositionScoreClusterPr.Ratio
118-32161890.1330.0632.112
19247
2101-11314101110.1540.0632.442
710544
3146-1599146320.1430.0632.263
3151169
4169-20210169320.1760.0632.796
1317416
181818
171878
618874
5194110
5264-27722641900.1430.0632.263
1226920
6278-29012787430.1540.0632.437
1128228

[0414]

Prediction of clusters for MAGE-1 (SYFPEITHI algorithm)
Total AAs: 309
Total 9-mers: 301
SYFPEITHI ≧ 16: 46 9-mers
Clus-Epi-Epitopes/AA
tertopeStartSYFPEITHIClus-
#AasRankPositionScoreterWholeRatio
1 7-49227190.2330.1531.522
91522
271818
162020
282218
292418
333117
303518
23826
174120
2 89-1321089220.2730.1531.783
189220
79323
239619
439816
410125
810523
3410717
3510817
3611317
3711817
1912420
3167-20344167160.2700.1531.766
2016920
1217421
2418119
618724
3118818
2519119
3819217
119427
1319521
4230-24614230210.1180.1530.769
3923817
5264-29715264210.2350.1531.538
3226918
4027017
2627119
4627516
327826
2128220
4128917

[0415]

Prediction of clusters for MAGE-2 (NIH algorithm)
Total AAs: 314
Total 9-mers: 308
NIH >= 5: 20 9-mers
Epi-Epitope/AA
ClustertopeStartNIHClus-Whole
#AAsRankPositionScoreterPr.Ratio
1101-120181015.3730.1500.0652.310
161086.756
11122800.697
2153-167815331.8830.2000.0653.080
4158168.552
715932.138
3169-211141698.5350.2090.0653.223
191745.346
617649.993
1118115.701
151887.536
1219512.809
520088.783
1020116.725
172035.609
4271-2843271398.3240.1430.0652.200
927619.658

[0416]

Prediction of clusters for MAGE-2 (SYFPEITHI algorithm)
Total AAs: 314
Total 9-mers: 308
SYFPEITHI ≧ 16: 52 9-mers
Clus-Epi-Epitopes/AA
tertopeStartSYFPEITHIClus-Whole
#AAsRankPositionScoreterPr.Ratio
115-321315210.2780.1691.645
291818
432016
302218
212419
237-563137180.2500.1691.481
164020
444416
144521
224819
3 96-1333696170.2110.1691.247
4610116
610825
4710916
211227
3712017
3812517
1713120
4153-21612153220.3440.1692.036
3915817
715925
2316119
2416219
4816416
4916716
3217018
5017116
417426
917624
5117716
1518121
2518819
1819420
3319518
1919820
320027
120128
4020217
1020323
5220816
5237-25426237190.1670.1690.987
2724519
3424618
6271-2998271250.2410.1691.430
3527618
4127717
1127823
2828319
2028520
4229117

[0417]

Prediction of clusters for MAGE-3 (NIH algorithm)
Total AAs: 314
Total 9-mers: 308
NIH ≧ 5: 22 9-mers
Epi-Epitopes/AA
ClustertopeStartNIHClus-Whole
#AAsRankPositionScoreterPr.Ratio
1101-1201510111.0020.2000.0712.800
211056.488
810849.134
2112339.313
2153-167181537.7760.2000.0712.800
615851.77
221595.599
3174-209171748.8320.1940.0712.722
717649.993
1318115.701
191887.536
1419512.809
520088.783
1220116.725
4237-2511623710.8680.2000.0712.800
4238148.896
202436.88
5271-28412712655.4950.1430.0712.000
1127619.658

[0418]

Prediction of clusters for MAGE-3 (SYFPEITHI algorithm)
Total AAs: 314
Total 9-mers: 308
SYFPEITHI ≧ 16: 47 9-mers
Clus-Epi-Epitopes/AA
tertopeStartSYFPEITHIClus-Whole
#AAsRankPositionScoreterPr.Ratio
115-321215210.2780.1531.820
261818
372016
272218
182419
238-563838160.2630.1531.725
154020
394416
134521
194819
3101-14228101180.1900.1531.248
4010516
110831
611225
3112017
3212517
1613120
4113416
4153-21620153190.3130.1532.048
2915618
3315817
2115919
3416117
4216416
4316716
1017422
817623
1418121
2218819
4419316
1119422
2319519
4519716
1719820
320027
220128
3520217
4620816
5220-2305220260.1820.1531.191
4722216
6237-2467237250.2000.1531.311
923823
7271-2934271270.2170.1531.425
3027618
2427819
3628317
2528519

[0419]

Prediction of clusters for PRAME (NIH algorithm)
Total AAs: 509
Total 9-mers: 501
NIH ≧ 5: 40 9-mers
Epitopes/AA
ClusterEpitopeStartNIHWhole
#AAsRankPositionScoreClusterPr.Ratio
133-472033180.1330.0801.670
173921
271-81971500.20.079842.505
32737
3 99-10823100150.20.079842.505
249913
4126-1353812650.20.079842.505
351276
5224-24652241240.1300.0801.634
823063
392385
6290-30318290180.2140.0802.684
1429223
729566
7305-32428305100.2000.0802.505
303088
2531213
363166
8394-40923941820.1880.0802.348
1239742
314017
9422-44310422490.2270.0802.847
3425182
344317
294329
4435160
10459-48715459210.1720.0802.159
1146245
2246615
404725
374796

[0420]

Prediction of clusters for PRAME (SYFPEITHI algorithm)
Total AAs: 509
Total 9-mers: 501
SYFPEITHI ≧ 17: 80 9-mers
Clus-EpiEpitopes/AA
tertopeStartSYFPEITHIClus-Whole
#AAsRankPositionScoreterPr.Ratio
118-596518170.2380.1601.491
502118
662617
353320
223422
513718
53927
234022
134424
465119
2 78-1153678200.2630.1601.648
678017
528418
248622
539118
259322
99925
810026
5410318
5510718
3191-20256191180.1670.1601.044
3819420
4205-21526205220.1820.1601.139
2720722
5222-23847222190.2350.1601.474
1422424
6922717
5723018
6241-27370241170.2120.1601.328
1524824
7125517
3025821
3925920
5826118
4026520
7290-34272290170.2080.1601.300
4829319
3129821
7330117
1830523
630827
1031225
1931623
2831922
4132620
7433417
8343-36359343180.2380.1601.491
6034818
7535117
2035323
7635517
9364-44749364190.2500.1601.566
3237121
1137225
6137518
7738217
2139023
7839117
139430
4239720
6240318
3341021
4341820
3441921
742227
242529
7942617
6342818
6443118
1243225
1643524
8043917
10455-47429455220.2000.1601.253
1745924
446228
346629

[0421]

Prediction of clusters for CEA (NIH algorithm)
Total AAs: 702
Total 9-mers: 694
NIH ≧ 5: 30 9-mers
Clus-Peptides/AAs
terPeptidesStartWhole
#AARankPositionScoreClusterPr.Ratio
117-3251779.0410.1880.0434.388
71846.873
202412.668
2113-1292113167.9910.1180.0432.753
1512121.362
3172-187251729.1650.1250.0432.925
1417927.995
4278-291302785.8180.1430.0433.343
1728319.301
5350-365935043.0750.1250.0432.925
1235727.995
6528-543852843.0750.1250.0432.925
1353527.995
7631-645236319.5630.2000.0434.680
1963413.381
246379.245
8691-7021691196.4070.1670.0433.900
276947.769

[0422]

Prediction of clusters for CEA (SYFPEITHI algorithm)
Total AAs: 702
Total 9-mers: 694
SYFPEITHI ≧ 16: 81 9-mers
Peptides/AAs
ClusterPeptidesStartWhole
#AARankPositionScoreClusterPr.Ratio
1 5-36675160.2500.1172.140
231219
241619
91722
251819
321918
682316
332818
237-624137170.2690.1172.305
204420
264519
424617
275019
435317
445417
3 99-1151499210.2350.1172.014
510023
4510417
3410718
4116-12969116160.1430.1171.223
2112120
5172-18746172170.1250.1171.070
7017916
6192-2023192240.1820.1171.557
4719417
7226-24148226170.1880.1171.605
4922917
1523321
8307-31811307220.2500.1172.140
7130816
5131017
9319-34952319170.1290.1171.105
5332717
7233516
3534118
10370-38812370220.2110.1171.802
5437217
7437516
638023
11403-41956403170.2350.1172.014
5740417
5840717
2841119
12427-44259427170.1880.1171.605
7543216
7643416
13450-46277450160.1540.1171.317
1345422
14488-50536488180.1670.1171.427
1849221
6049717
15548-5584548240.1820.1171.557
6155017
16565-57762565170.1540.1171.317
1956921
17579-59778579160.1430.1171.223
7958216
758923
18605-6182605250.1430.1171.223
3861018
19631-66929631190.1540.1171.317
6363717
8064416
6465217
3966018
8166116
20675-70222675200.2860.1172.446
3068319
3168719
4068818
6569017
169131
6669217
869423

[0423]

Prediction of clusters for SCP-1 (NIH algorithm)
Total AAs: 976
Total 9-mers: 968
NIH ≧ 5: 37 9-mers
Peptides/AAs
Clus-PeptidesStartClus-
ter #AARankPositionScoreterWhole Pr.Ratio
1101-1161510140.5890.1250.0383.270
1310857.255
 2*281-3051428144.9440.120.0383.139
2428815.203
1729732.857
3431-447843180.2170.0730.0381.914
2643811.861
4439148.896
4557-5791155764.3350.1740.0384.550
1956024.937
656487.586
1857132.765
5635-6501063569.5520.1250.0383.270
346426.542
6755-767367555.5990.1540.0384.025
357595.928
7838-8542838284.5170.1180.0383.078
2884611.426

[0424]

Prediction of clusters for SCP-1
Total AAs: 976
Total 9-mers: 968
Rammensee ≧ 16: 118 9-mers
Clus-PeptidesStartPeptides/AAs
ter #AARankPositionScoreClusterWhole Pr.Ratio
1 8-28998160.1430.1211.182
771517
1002016
263-807863170.2220.1211.838
506619
1026916
607218
3 94-1237994170.1330.1211.103
1210123
1710822
10311516
4126-15835126200.1820.1211.504
3613320
5113919
8014017
6114318
3715020
5161-18938161200.2070.1211.711
5216519
8117117
8217717
6217818
3918120
6213-23040213200.1670.1211.379
1322023
2822221
7235-25063235180.1250.1211.034
1824222
8260-29683260170.2430.1212.012
10526216
8426717
10626916
4127020
6427118
8527417
1928122
328825
9312-338108312160.1480.1211.225
2931921
3032321
6533018
10339-35566339180.2350.1211.946
3134021
4234420
5334719
11376-44754376190.1940.1211.608
4338220
4438620
2039022
5539719
640424
8640717
4541120
6741718
2142522
4643120
6843218
3243821
743924
12455-48833455210.2350.1211.946
4745920
5646219
8746317
8846617
1447023
10947316
3448021
13515-53057515190.1250.1211.034
2252222
14557-5908557240.1470.1211.216
2356422
957124
9057517
5858219
15610-62569610180.1250.1211.034
9161717
16633-66892633170.222
1063524
7063818
9364017
4864220
4964520
11165216
11266016
17674-68571674180.1670.1211.379
1167724
18687-7021687260.1250.1211.034
9469417
19744-767113744160.2500.1212.068
9574517
474525
2475222
275526
7275918
20812-82797812170.1250.1211.034
11581916
21838-857116838160.1500.1211.241
2584622
7484918
22896-913117896160.2220.1211.838
9889917
2690222
7690518

[0425]

The embodiments of the invention are applicable to and contemplate variations in the sequences of the target antigens provided herein, including those disclosed in the various databases that are accessible by the world wide web. Specifically for the specific sequences disclosed herein, variation in sequences can be found by using the provided accession numbers to access information for each antigen.

[0426]

All patents and publications mentioned in the specification are indicative of the levels of those skilled in the art to which the invention pertains. The entire contents of all patents and publications discussed herein are incorporated by reference in their entirety to the same extent as if each individual publication was specifically and individually indicated to be incorporated by reference in its entirety.

[0427]

The invention illustratively described herein suitably may be practiced in the absence of any element or elements, limitation or limitations which is not specifically disclosed herein. The terms and expressions which have been employed are used as terms of description and not of limitation, and there is no intention that in the use of such terms and expressions indicates the exclusion of equivalents of the features shown and described or portions thereof. It is recognized that various modifications are possible within the scope of the invention claimed. Thus, it should be understood that although the present invention has been specifically disclosed by preferred embodiments and optional features, modification and variation of the concepts herein disclosed may be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of this invention as defined by the appended claims.

Как компенсировать расходы
на инновационную разработку
Похожие патенты