EP3504328A1 - Host cell protein modification - Google Patents
- ️Wed Jul 03 2019
EP3504328A1 - Host cell protein modification - Google Patents
Host cell protein modificationInfo
-
Publication number
- EP3504328A1 EP3504328A1 EP17798018.2A EP17798018A EP3504328A1 EP 3504328 A1 EP3504328 A1 EP 3504328A1 EP 17798018 A EP17798018 A EP 17798018A EP 3504328 A1 EP3504328 A1 EP 3504328A1 Authority
- EP
- European Patent Office Prior art keywords
- protein
- seq
- modification
- host cell
- cell Prior art date
- 2016-08-24 Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 230000009145 protein modification Effects 0.000 title description 3
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 563
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 362
- 230000014509 gene expression Effects 0.000 claims abstract description 190
- 238000000034 method Methods 0.000 claims abstract description 165
- 239000000203 mixture Substances 0.000 claims abstract description 70
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims abstract description 31
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims abstract description 31
- 239000002773 nucleotide Substances 0.000 claims description 264
- 125000003729 nucleotide group Chemical group 0.000 claims description 262
- 230000004048 modification Effects 0.000 claims description 212
- 238000012986 modification Methods 0.000 claims description 212
- 108010013563 Lipoprotein Lipase Proteins 0.000 claims description 180
- 102000043296 Lipoprotein lipases Human genes 0.000 claims description 180
- 102100036164 Putative phospholipase B-like 2 Human genes 0.000 claims description 179
- 101001074414 Homo sapiens Putative phospholipase B-like 2 Proteins 0.000 claims description 173
- 108020005296 Acid Ceramidase Proteins 0.000 claims description 168
- 102000040430 polynucleotide Human genes 0.000 claims description 132
- 108091033319 polynucleotide Proteins 0.000 claims description 132
- 239000002157 polynucleotide Substances 0.000 claims description 132
- 230000000694 effects Effects 0.000 claims description 92
- 108090001060 Lipase Proteins 0.000 claims description 89
- 108091026890 Coding region Proteins 0.000 claims description 87
- 102000004882 Lipase Human genes 0.000 claims description 87
- 230000007423 decrease Effects 0.000 claims description 84
- 235000014113 dietary fatty acids Nutrition 0.000 claims description 72
- 239000000194 fatty acid Substances 0.000 claims description 72
- 229930195729 fatty acid Natural products 0.000 claims description 72
- 150000007523 nucleic acids Chemical group 0.000 claims description 70
- 239000012634 fragment Substances 0.000 claims description 59
- 102000004157 Hydrolases Human genes 0.000 claims description 54
- 108090000604 Hydrolases Proteins 0.000 claims description 54
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 53
- 108010064785 Phospholipases Proteins 0.000 claims description 53
- 102000015439 Phospholipases Human genes 0.000 claims description 52
- 238000003780 insertion Methods 0.000 claims description 50
- 230000037431 insertion Effects 0.000 claims description 50
- 230000008569 process Effects 0.000 claims description 48
- 238000012217 deletion Methods 0.000 claims description 45
- 230000037430 deletion Effects 0.000 claims description 45
- ZIIUUSVHCHPIQD-UHFFFAOYSA-N 2,4,6-trimethyl-N-[3-(trifluoromethyl)phenyl]benzenesulfonamide Chemical compound CC1=CC(C)=CC(C)=C1S(=O)(=O)NC1=CC=CC(C(F)(F)F)=C1 ZIIUUSVHCHPIQD-UHFFFAOYSA-N 0.000 claims description 43
- 230000027455 binding Effects 0.000 claims description 35
- 238000009472 formulation Methods 0.000 claims description 35
- 238000000746 purification Methods 0.000 claims description 35
- 102100026001 Lysosomal acid lipase/cholesteryl ester hydrolase Human genes 0.000 claims description 34
- 238000005349 anion exchange Methods 0.000 claims description 29
- 108700028369 Alleles Proteins 0.000 claims description 28
- 108010055297 Sterol Esterase Proteins 0.000 claims description 28
- 101001004953 Homo sapiens Lysosomal acid lipase/cholesteryl ester hydrolase Proteins 0.000 claims description 26
- 238000005341 cation exchange Methods 0.000 claims description 24
- 238000004587 chromatography analysis Methods 0.000 claims description 21
- 108091006020 Fc-tagged proteins Proteins 0.000 claims description 18
- -1 fatty acid ester Chemical class 0.000 claims description 18
- 238000004191 hydrophobic interaction chromatography Methods 0.000 claims description 12
- 230000002829 reductive effect Effects 0.000 claims description 12
- 239000000872 buffer Substances 0.000 claims description 10
- 239000003017 thermal stabilizer Substances 0.000 claims description 6
- 239000012501 chromatography medium Substances 0.000 claims description 5
- 102000025171 antigen binding proteins Human genes 0.000 claims description 4
- 108091000831 antigen binding proteins Proteins 0.000 claims description 4
- 102000006772 Acid Ceramidase Human genes 0.000 claims 69
- 101000836983 Homo sapiens Secretoglobin family 1D member 1 Proteins 0.000 claims 24
- 125000003275 alpha amino acid group Chemical group 0.000 claims 24
- 102100027280 Fanconi anemia group A protein Human genes 0.000 claims 6
- 210000004027 cell Anatomy 0.000 abstract description 438
- 210000003527 eukaryotic cell Anatomy 0.000 abstract description 14
- 238000002955 isolation Methods 0.000 abstract 1
- 235000018102 proteins Nutrition 0.000 description 276
- 102100024005 Acid ceramidase Human genes 0.000 description 103
- 239000004367 Lipase Substances 0.000 description 79
- 235000019421 lipase Nutrition 0.000 description 79
- 102000006486 Phosphoinositide Phospholipase C Human genes 0.000 description 69
- 108010044302 Phosphoinositide phospholipase C Proteins 0.000 description 69
- 108090000371 Esterases Proteins 0.000 description 51
- 241000699802 Cricetulus griseus Species 0.000 description 44
- 108020004414 DNA Proteins 0.000 description 41
- 239000013612 plasmid Substances 0.000 description 39
- 108020005004 Guide RNA Proteins 0.000 description 36
- 238000010362 genome editing Methods 0.000 description 36
- 101000609363 Homo sapiens Platelet-activating factor acetylhydrolase IB subunit alpha1 Proteins 0.000 description 34
- 102100039446 Platelet-activating factor acetylhydrolase IB subunit alpha1 Human genes 0.000 description 34
- 230000015556 catabolic process Effects 0.000 description 34
- 238000006731 degradation reaction Methods 0.000 description 34
- 239000013543 active substance Substances 0.000 description 33
- 102000004190 Enzymes Human genes 0.000 description 30
- 108090000790 Enzymes Proteins 0.000 description 30
- 102000013392 Carboxylesterase Human genes 0.000 description 29
- 108010051152 Carboxylesterase Proteins 0.000 description 29
- 108010058834 acylcarnitine hydrolase Proteins 0.000 description 29
- 239000000047 product Substances 0.000 description 29
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 28
- 238000004949 mass spectrometry Methods 0.000 description 28
- 102100022146 Arylsulfatase A Human genes 0.000 description 27
- 108010036867 Cerebroside-Sulfatase Proteins 0.000 description 27
- 102100030817 Liver carboxylesterase 1 Human genes 0.000 description 27
- 101710181187 Liver carboxylesterase 1 Proteins 0.000 description 27
- 101001019605 Homo sapiens Isoamyl acetate-hydrolyzing esterase 1 homolog Proteins 0.000 description 25
- 102100035008 Isoamyl acetate-hydrolyzing esterase 1 homolog Human genes 0.000 description 25
- 102100040865 Phospholipase A2 group XV Human genes 0.000 description 25
- 101710127148 Phospholipase A2 group XV Proteins 0.000 description 25
- 150000001413 amino acids Chemical group 0.000 description 25
- 238000004519 manufacturing process Methods 0.000 description 25
- 241000282414 Homo sapiens Species 0.000 description 24
- 239000012906 subvisible particle Substances 0.000 description 24
- 108091033409 CRISPR Proteins 0.000 description 23
- 101710163270 Nuclease Proteins 0.000 description 23
- 230000037433 frameshift Effects 0.000 description 23
- 238000011144 upstream manufacturing Methods 0.000 description 23
- 101150093509 PLBD2 gene Proteins 0.000 description 22
- 108010091086 Recombinases Proteins 0.000 description 22
- 102000018120 Recombinases Human genes 0.000 description 22
- 239000002245 particle Substances 0.000 description 22
- 238000010561 standard procedure Methods 0.000 description 22
- 229920000136 polysorbate Polymers 0.000 description 20
- 102000000019 Sterol Esterase Human genes 0.000 description 19
- 150000004665 fatty acids Chemical class 0.000 description 19
- 102000039446 nucleic acids Human genes 0.000 description 19
- 108020004707 nucleic acids Proteins 0.000 description 19
- 238000003753 real-time PCR Methods 0.000 description 19
- 241000700159 Rattus Species 0.000 description 17
- 229950008882 polysorbate Drugs 0.000 description 17
- 241000894007 species Species 0.000 description 17
- 108700023418 Amidases Proteins 0.000 description 16
- 102000005922 amidase Human genes 0.000 description 16
- 230000006801 homologous recombination Effects 0.000 description 16
- 238000002744 homologous recombination Methods 0.000 description 16
- CNWINRVXAYPOMW-FCNJXWMTSA-N 1-stearoyl-2-arachidonoyl-sn-glycero-3-phospho-1D-myo-inositol 4,5-biphosphate Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(=O)O[C@H](COC(=O)CCCCCCCCCCCCCCCCC)COP(O)(=O)O[C@@H]1[C@H](O)[C@H](O)[C@@H](OP(O)(O)=O)[C@H](OP(O)(O)=O)[C@H]1O CNWINRVXAYPOMW-FCNJXWMTSA-N 0.000 description 15
- 239000000427 antigen Substances 0.000 description 15
- 108091007433 antigens Proteins 0.000 description 15
- 102000036639 antigens Human genes 0.000 description 15
- 239000003814 drug Substances 0.000 description 15
- 238000002474 experimental method Methods 0.000 description 15
- 108090000765 processed proteins & peptides Proteins 0.000 description 15
- 230000007017 scission Effects 0.000 description 15
- 241000699800 Cricetinae Species 0.000 description 14
- 229920001213 Polysorbate 20 Polymers 0.000 description 14
- 230000002411 adverse Effects 0.000 description 14
- 238000003776 cleavage reaction Methods 0.000 description 14
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 14
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 14
- 229940068977 polysorbate 20 Drugs 0.000 description 14
- 230000008685 targeting Effects 0.000 description 14
- 108091093088 Amplicon Proteins 0.000 description 13
- 241000699666 Mus <mouse, genus> Species 0.000 description 13
- 229960000074 biopharmaceutical Drugs 0.000 description 12
- 230000006798 recombination Effects 0.000 description 12
- 238000005215 recombination Methods 0.000 description 12
- 238000003860 storage Methods 0.000 description 12
- 230000006870 function Effects 0.000 description 11
- 108020004999 messenger RNA Proteins 0.000 description 11
- 230000001105 regulatory effect Effects 0.000 description 11
- 239000000523 sample Substances 0.000 description 11
- 238000001890 transfection Methods 0.000 description 11
- 239000013598 vector Substances 0.000 description 11
- 108010051219 Cre recombinase Proteins 0.000 description 10
- 108700024394 Exon Proteins 0.000 description 10
- 102000014384 Type C Phospholipases Human genes 0.000 description 10
- 108010079194 Type C Phospholipases Proteins 0.000 description 10
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 10
- MMWCIQZXVOZEGG-HOZKJCLWSA-N [(1S,2R,3S,4S,5R,6S)-2,3,5-trihydroxy-4,6-diphosphonooxycyclohexyl] dihydrogen phosphate Chemical compound O[C@H]1[C@@H](O)[C@H](OP(O)(O)=O)[C@@H](OP(O)(O)=O)[C@H](O)[C@H]1OP(O)(O)=O MMWCIQZXVOZEGG-HOZKJCLWSA-N 0.000 description 10
- 239000000356 contaminant Substances 0.000 description 10
- 150000001982 diacylglycerols Chemical class 0.000 description 10
- 108091006047 fluorescent proteins Proteins 0.000 description 10
- 102000034287 fluorescent proteins Human genes 0.000 description 10
- 230000010354 integration Effects 0.000 description 10
- 230000001404 mediated effect Effects 0.000 description 10
- 238000012360 testing method Methods 0.000 description 10
- 108090000751 Ceramidases Proteins 0.000 description 9
- 102000004142 Trypsin Human genes 0.000 description 9
- 108090000631 Trypsin Proteins 0.000 description 9
- 238000010364 biochemical engineering Methods 0.000 description 9
- 230000015572 biosynthetic process Effects 0.000 description 9
- 239000008194 pharmaceutical composition Substances 0.000 description 9
- 239000000244 polyoxyethylene sorbitan monooleate Substances 0.000 description 9
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 9
- 229920000053 polysorbate 80 Polymers 0.000 description 9
- 229940068968 polysorbate 80 Drugs 0.000 description 9
- 102000004196 processed proteins & peptides Human genes 0.000 description 9
- 102000005962 receptors Human genes 0.000 description 9
- 108020003175 receptors Proteins 0.000 description 9
- 239000006228 supernatant Substances 0.000 description 9
- 239000012588 trypsin Substances 0.000 description 9
- 102000004201 Ceramidases Human genes 0.000 description 8
- 108010042407 Endonucleases Proteins 0.000 description 8
- 102000004533 Endonucleases Human genes 0.000 description 8
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 8
- 108700026244 Open Reading Frames Proteins 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 238000000684 flow cytometry Methods 0.000 description 8
- 238000010363 gene targeting Methods 0.000 description 8
- 210000004962 mammalian cell Anatomy 0.000 description 8
- 239000003550 marker Substances 0.000 description 8
- 230000004060 metabolic process Effects 0.000 description 8
- 230000006780 non-homologous end joining Effects 0.000 description 8
- 238000012163 sequencing technique Methods 0.000 description 8
- 239000012679 serum free medium Substances 0.000 description 8
- 230000001225 therapeutic effect Effects 0.000 description 8
- 238000010459 TALEN Methods 0.000 description 7
- 230000003321 amplification Effects 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 238000004113 cell culture Methods 0.000 description 7
- 229940079593 drug Drugs 0.000 description 7
- 230000002209 hydrophobic effect Effects 0.000 description 7
- 150000002632 lipids Chemical class 0.000 description 7
- 238000003199 nucleic acid amplification method Methods 0.000 description 7
- 108091005957 yellow fluorescent proteins Proteins 0.000 description 7
- 241000282412 Homo Species 0.000 description 6
- 101710200837 Putative phospholipase B-like 2 Proteins 0.000 description 6
- 230000009471 action Effects 0.000 description 6
- 230000001413 cellular effect Effects 0.000 description 6
- 239000010432 diamond Substances 0.000 description 6
- 230000005782 double-strand break Effects 0.000 description 6
- 229940126534 drug product Drugs 0.000 description 6
- 238000003197 gene knockdown Methods 0.000 description 6
- 239000012535 impurity Substances 0.000 description 6
- 239000000825 pharmaceutical preparation Substances 0.000 description 6
- 230000004962 physiological condition Effects 0.000 description 6
- 229920001184 polypeptide Polymers 0.000 description 6
- 238000003998 size exclusion chromatography high performance liquid chromatography Methods 0.000 description 6
- 238000001542 size-exclusion chromatography Methods 0.000 description 6
- 229940124597 therapeutic agent Drugs 0.000 description 6
- 239000003981 vehicle Substances 0.000 description 6
- 101100454808 Caenorhabditis elegans lgg-2 gene Proteins 0.000 description 5
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 5
- 241000699670 Mus sp. Species 0.000 description 5
- 108010093375 Phosphatidylinositol Diacylglycerol-Lyase Proteins 0.000 description 5
- 102000004861 Phosphoric Diester Hydrolases Human genes 0.000 description 5
- 108090001050 Phosphoric Diester Hydrolases Proteins 0.000 description 5
- 102000014450 RNA Polymerase III Human genes 0.000 description 5
- 108010078067 RNA Polymerase III Proteins 0.000 description 5
- 108091027544 Subgenomic mRNA Proteins 0.000 description 5
- 239000011575 calcium Substances 0.000 description 5
- 229910052791 calcium Inorganic materials 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 230000001419 dependent effect Effects 0.000 description 5
- 239000013604 expression vector Substances 0.000 description 5
- 108010035313 glycerophosphoinositol glycerophosphodiesterase Proteins 0.000 description 5
- 239000003446 ligand Substances 0.000 description 5
- 230000002132 lysosomal effect Effects 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 239000012528 membrane Substances 0.000 description 5
- 150000003905 phosphatidylinositols Chemical class 0.000 description 5
- 230000011664 signaling Effects 0.000 description 5
- 101100454807 Caenorhabditis elegans lgg-1 gene Proteins 0.000 description 4
- 101100476210 Caenorhabditis elegans rnt-1 gene Proteins 0.000 description 4
- 206010059866 Drug resistance Diseases 0.000 description 4
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 4
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 4
- 101000975753 Homo sapiens Acid ceramidase Proteins 0.000 description 4
- KLDXJTOLSGUMSJ-JGWLITMVSA-N Isosorbide Chemical compound O[C@@H]1CO[C@@H]2[C@@H](O)CO[C@@H]21 KLDXJTOLSGUMSJ-JGWLITMVSA-N 0.000 description 4
- 108020002496 Lysophospholipase Proteins 0.000 description 4
- 102100037883 Phospholipase B1, membrane-associated Human genes 0.000 description 4
- 102000009661 Repressor Proteins Human genes 0.000 description 4
- 108010073062 Transcription Activator-Like Effectors Proteins 0.000 description 4
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- 230000003833 cell viability Effects 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 229960001231 choline Drugs 0.000 description 4
- OEYIOHPDSNJKLS-UHFFFAOYSA-N choline Chemical compound C[N+](C)(C)CCO OEYIOHPDSNJKLS-UHFFFAOYSA-N 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 239000003636 conditioned culture medium Substances 0.000 description 4
- 125000004093 cyano group Chemical group *C#N 0.000 description 4
- 230000003247 decreasing effect Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 4
- 230000002255 enzymatic effect Effects 0.000 description 4
- 108091006104 gene-regulatory proteins Proteins 0.000 description 4
- 102000034356 gene-regulatory proteins Human genes 0.000 description 4
- 239000005090 green fluorescent protein Substances 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- 230000005847 immunogenicity Effects 0.000 description 4
- 238000002347 injection Methods 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- 229960002479 isosorbide Drugs 0.000 description 4
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 4
- 101150062900 lpl gene Proteins 0.000 description 4
- 239000013642 negative control Substances 0.000 description 4
- 210000001672 ovary Anatomy 0.000 description 4
- 150000003904 phospholipids Chemical class 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 239000004094 surface-active agent Substances 0.000 description 4
- 108700020534 tetracycline resistance-encoding transposon repressor Proteins 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- YDNKGFDKKRUKPY-JHOUSYSJSA-N C16 ceramide Natural products CCCCCCCCCCCCCCCC(=O)N[C@@H](CO)[C@H](O)C=CCCCCCCCCCCCCC YDNKGFDKKRUKPY-JHOUSYSJSA-N 0.000 description 3
- 230000004568 DNA-binding Effects 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- CRJGESKKUOMBCT-VQTJNVASSA-N N-acetylsphinganine Chemical compound CCCCCCCCCCCCCCC[C@@H](O)[C@H](CO)NC(C)=O CRJGESKKUOMBCT-VQTJNVASSA-N 0.000 description 3
- 229930193140 Neomycin Natural products 0.000 description 3
- 239000007983 Tris buffer Substances 0.000 description 3
- 239000008186 active pharmaceutical agent Substances 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 3
- 229940106189 ceramide Drugs 0.000 description 3
- ZVEQCJWYRWKARO-UHFFFAOYSA-N ceramide Natural products CCCCCCCCCCCCCCC(O)C(=O)NC(CO)C(O)C=CCCC=C(C)CCCCCCCCC ZVEQCJWYRWKARO-UHFFFAOYSA-N 0.000 description 3
- 229940088679 drug related substance Drugs 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 229960004927 neomycin Drugs 0.000 description 3
- VVGIYYKRAMHVLU-UHFFFAOYSA-N newbouldiamide Natural products CCCCCCCCCCCCCCCCCCCC(O)C(O)C(O)C(CO)NC(=O)CCCCCCCCCCCCCCCCC VVGIYYKRAMHVLU-UHFFFAOYSA-N 0.000 description 3
- 230000036961 partial effect Effects 0.000 description 3
- 229940068965 polysorbates Drugs 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 210000001938 protoplast Anatomy 0.000 description 3
- 230000008439 repair process Effects 0.000 description 3
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 230000014616 translation Effects 0.000 description 3
- WWUZIQQURGPMPG-UHFFFAOYSA-N (-)-D-erythro-Sphingosine Natural products CCCCCCCCCCCCCC=CC(O)C(N)CO WWUZIQQURGPMPG-UHFFFAOYSA-N 0.000 description 2
- JNYAEWCLZODPBN-JGWLITMVSA-N (2r,3r,4s)-2-[(1r)-1,2-dihydroxyethyl]oxolane-3,4-diol Chemical compound OC[C@@H](O)[C@H]1OC[C@H](O)[C@H]1O JNYAEWCLZODPBN-JGWLITMVSA-N 0.000 description 2
- 101150100998 Ace gene Proteins 0.000 description 2
- 108091079001 CRISPR RNA Proteins 0.000 description 2
- KRKNYBCHXYNGOX-UHFFFAOYSA-K Citrate Chemical compound [O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O KRKNYBCHXYNGOX-UHFFFAOYSA-K 0.000 description 2
- 102100022641 Coagulation factor IX Human genes 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 230000033616 DNA repair Effects 0.000 description 2
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 2
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 2
- 241000006867 Discosoma Species 0.000 description 2
- 102000009465 Growth Factor Receptors Human genes 0.000 description 2
- 108010009202 Growth Factor Receptors Proteins 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 101000773743 Homo sapiens Angiotensin-converting enzyme Proteins 0.000 description 2
- 101000619884 Homo sapiens Lipoprotein lipase Proteins 0.000 description 2
- 108010048946 IB1001 Proteins 0.000 description 2
- 108060003951 Immunoglobulin Proteins 0.000 description 2
- 108010002352 Interleukin-1 Proteins 0.000 description 2
- 101710085948 Isoamyl acetate-hydrolyzing esterase Proteins 0.000 description 2
- 208000025370 Middle East respiratory syndrome Diseases 0.000 description 2
- 101000773741 Mus musculus Angiotensin-converting enzyme Proteins 0.000 description 2
- 101000619881 Mus musculus Lipoprotein lipase Proteins 0.000 description 2
- 108020004485 Nonsense Codon Proteins 0.000 description 2
- 238000010222 PCR analysis Methods 0.000 description 2
- 108010033276 Peptide Fragments Proteins 0.000 description 2
- 102000007079 Peptide Fragments Human genes 0.000 description 2
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 2
- 229920001214 Polysorbate 60 Polymers 0.000 description 2
- 108010026552 Proteome Proteins 0.000 description 2
- 101710113948 Putative esterase Proteins 0.000 description 2
- 101000773749 Rattus norvegicus Angiotensin-converting enzyme Proteins 0.000 description 2
- 101100399284 Rattus norvegicus Lipa gene Proteins 0.000 description 2
- 101001003555 Rattus norvegicus Lipoprotein lipase Proteins 0.000 description 2
- 108700005075 Regulator Genes Proteins 0.000 description 2
- 108010034634 Repressor Proteins Proteins 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- 229930182558 Sterol Natural products 0.000 description 2
- 108091028113 Trans-activating crRNA Proteins 0.000 description 2
- 108700019146 Transgenes Proteins 0.000 description 2
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 2
- 108700005077 Viral Genes Proteins 0.000 description 2
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 2
- 238000005903 acid hydrolysis reaction Methods 0.000 description 2
- 239000012190 activator Substances 0.000 description 2
- 108010081667 aflibercept Proteins 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 2
- 238000004220 aggregation Methods 0.000 description 2
- 238000010923 batch production Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 229940126587 biotherapeutics Drugs 0.000 description 2
- 230000036760 body temperature Effects 0.000 description 2
- 239000006227 byproduct Substances 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 238000011109 contamination Methods 0.000 description 2
- 210000004748 cultured cell Anatomy 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 239000007857 degradation product Substances 0.000 description 2
- 230000003413 degradative effect Effects 0.000 description 2
- 230000000593 degrading effect Effects 0.000 description 2
- 230000001627 detrimental effect Effects 0.000 description 2
- 150000005690 diesters Chemical class 0.000 description 2
- 235000005911 diet Nutrition 0.000 description 2
- 230000000378 dietary effect Effects 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 230000002222 downregulating effect Effects 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000010931 ester hydrolysis Methods 0.000 description 2
- 150000002148 esters Chemical class 0.000 description 2
- 230000007717 exclusion Effects 0.000 description 2
- 108010021843 fluorescent protein 583 Proteins 0.000 description 2
- 235000012041 food component Nutrition 0.000 description 2
- 239000005417 food ingredient Substances 0.000 description 2
- 238000002825 functional assay Methods 0.000 description 2
- 230000009368 gene silencing by RNA Effects 0.000 description 2
- 108010013219 group XV phospholipase A2 Proteins 0.000 description 2
- 208000009429 hemophilia B Diseases 0.000 description 2
- 239000013628 high molecular weight specie Substances 0.000 description 2
- 239000000710 homodimer Substances 0.000 description 2
- 102000056252 human ACE Human genes 0.000 description 2
- 102000048394 human LIPA Human genes 0.000 description 2
- 102000045312 human LPL Human genes 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 102000018358 immunoglobulin Human genes 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 239000004615 ingredient Substances 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 description 2
- 238000004811 liquid chromatography Methods 0.000 description 2
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 2
- 238000011068 loading method Methods 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 239000002207 metabolite Substances 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 108010008500 mouse lysosomal acid lipase Proteins 0.000 description 2
- 239000002547 new drug Substances 0.000 description 2
- 239000013618 particulate matter Substances 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 238000009522 phase III clinical trial Methods 0.000 description 2
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 2
- 125000001095 phosphatidyl group Chemical group 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 235000004252 protein component Nutrition 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 229940124272 protein stabilizer Drugs 0.000 description 2
- 108010054624 red fluorescent protein Proteins 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 230000002207 retinal effect Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- WWUZIQQURGPMPG-KRWOKUGFSA-N sphingosine Chemical compound CCCCCCCCCCCCC\C=C\[C@@H](O)[C@@H](N)CO WWUZIQQURGPMPG-KRWOKUGFSA-N 0.000 description 2
- 230000010473 stable expression Effects 0.000 description 2
- 150000003432 sterols Chemical class 0.000 description 2
- 235000003702 sterols Nutrition 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000013589 supplement Substances 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 239000011701 zinc Substances 0.000 description 2
- 229910052725 zinc Inorganic materials 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- 102100020928 14 kDa phosphohistidine phosphatase Human genes 0.000 description 1
- 101710082470 14 kDa phosphohistidine phosphatase Proteins 0.000 description 1
- 101150028074 2 gene Proteins 0.000 description 1
- HVAUUPRFYPCOCA-AREMUKBSSA-N 2-O-acetyl-1-O-hexadecyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCCOC[C@@H](OC(C)=O)COP([O-])(=O)OCC[N+](C)(C)C HVAUUPRFYPCOCA-AREMUKBSSA-N 0.000 description 1
- 101710126783 Acetyl-hydrolase Proteins 0.000 description 1
- 102100034083 Alkaline ceramidase 1 Human genes 0.000 description 1
- 101710200052 Alkaline ceramidase 1 Proteins 0.000 description 1
- 102100034050 Alkaline ceramidase 2 Human genes 0.000 description 1
- 101710200054 Alkaline ceramidase 2 Proteins 0.000 description 1
- 102100034082 Alkaline ceramidase 3 Human genes 0.000 description 1
- 101710200049 Alkaline ceramidase 3 Proteins 0.000 description 1
- 102000004092 Amidohydrolases Human genes 0.000 description 1
- 108090000531 Amidohydrolases Proteins 0.000 description 1
- 241001123248 Arma Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 1
- 238000010453 CRISPR/Cas method Methods 0.000 description 1
- 101100217502 Caenorhabditis elegans lgg-3 gene Proteins 0.000 description 1
- 102000000844 Cell Surface Receptors Human genes 0.000 description 1
- 108010001857 Cell Surface Receptors Proteins 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 108091092236 Chimeric RNA Proteins 0.000 description 1
- 208000001528 Coronaviridae Infections Diseases 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- 201000011001 Ebola Hemorrhagic Fever Diseases 0.000 description 1
- 108700039887 Essential Genes Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108010054218 Factor VIII Proteins 0.000 description 1
- 102000001690 Factor VIII Human genes 0.000 description 1
- 208000001948 Farber Lipogranulomatosis Diseases 0.000 description 1
- 208000033149 Farber disease Diseases 0.000 description 1
- BDAGIHXWWSANSR-UHFFFAOYSA-N Formic acid Chemical compound OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 1
- 102100041003 Glutamate carboxypeptidase 2 Human genes 0.000 description 1
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 101000892862 Homo sapiens Glutamate carboxypeptidase 2 Proteins 0.000 description 1
- 101001137987 Homo sapiens Lymphocyte activation gene 3 protein Proteins 0.000 description 1
- 101100189088 Homo sapiens PAFAH1B3 gene Proteins 0.000 description 1
- 101001001852 Homo sapiens Phospholipase B-like 1 Proteins 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 108010067003 Interleukin-33 Proteins 0.000 description 1
- 102000017761 Interleukin-33 Human genes 0.000 description 1
- 108010038486 Interleukin-4 Receptors Proteins 0.000 description 1
- 102100039078 Interleukin-4 receptor subunit alpha Human genes 0.000 description 1
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 101710098556 Lipase A Proteins 0.000 description 1
- 102100020862 Lymphocyte activation gene 3 protein Human genes 0.000 description 1
- 101710099648 Lysosomal acid lipase/cholesteryl ester hydrolase Proteins 0.000 description 1
- 101100189089 Mus musculus Pafah1b3 gene Proteins 0.000 description 1
- 101001074413 Mus musculus Putative phospholipase B-like 2 Proteins 0.000 description 1
- 108010020856 N-terminal nucleophile hydrolase Proteins 0.000 description 1
- 102100023996 Neutral ceramidase Human genes 0.000 description 1
- 241001452677 Ogataea methanolica Species 0.000 description 1
- 101150105115 PA gene Proteins 0.000 description 1
- 102100036316 Phospholipase B-like 1 Human genes 0.000 description 1
- 108010003541 Platelet Activating Factor Proteins 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 102100040678 Programmed cell death protein 1 Human genes 0.000 description 1
- 101710089372 Programmed cell death protein 1 Proteins 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 101100189091 Rattus norvegicus Pafah1b3 gene Proteins 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 108090000787 Subtilisin Proteins 0.000 description 1
- 241000255993 Trichoplusia ni Species 0.000 description 1
- 208000035896 Twin-reversed arterial perfusion sequence Diseases 0.000 description 1
- 102000016548 Vascular Endothelial Growth Factor Receptor-1 Human genes 0.000 description 1
- 108010053096 Vascular Endothelial Growth Factor Receptor-1 Proteins 0.000 description 1
- 102000016549 Vascular Endothelial Growth Factor Receptor-2 Human genes 0.000 description 1
- 108010053099 Vascular Endothelial Growth Factor Receptor-2 Proteins 0.000 description 1
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 1
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 1
- 208000020329 Zika virus infectious disease Diseases 0.000 description 1
- LWZFANDGMFTDAV-BURFUSLBSA-N [(2r)-2-[(2r,3r,4s)-3,4-dihydroxyoxolan-2-yl]-2-hydroxyethyl] dodecanoate Chemical compound CCCCCCCCCCCC(=O)OC[C@@H](O)[C@H]1OC[C@H](O)[C@H]1O LWZFANDGMFTDAV-BURFUSLBSA-N 0.000 description 1
- 239000000488 activin Substances 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 229960002833 aflibercept Drugs 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 229960004539 alirocumab Drugs 0.000 description 1
- 150000001414 amino alcohols Chemical class 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 230000002391 anti-complement effect Effects 0.000 description 1
- 230000003540 anti-differentiation Effects 0.000 description 1
- 230000002942 anti-growth Effects 0.000 description 1
- 230000000781 anti-lymphocytic effect Effects 0.000 description 1
- 230000000702 anti-platelet effect Effects 0.000 description 1
- 230000001263 anti-prolactin effect Effects 0.000 description 1
- 230000003097 anti-respiratory effect Effects 0.000 description 1
- 238000011091 antibody purification Methods 0.000 description 1
- 210000000628 antibody-producing cell Anatomy 0.000 description 1
- 239000003146 anticoagulant agent Substances 0.000 description 1
- 108010008730 anticomplement Proteins 0.000 description 1
- 238000003782 apoptosis assay Methods 0.000 description 1
- 230000000923 atherogenic effect Effects 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000000975 bioactive effect Effects 0.000 description 1
- 238000010504 bond cleavage reaction Methods 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- SUHOQUVVVLNYQR-MRVPVSSYSA-N choline alfoscerate Chemical compound C[N+](C)(C)CCOP([O-])(=O)OC[C@H](O)CO SUHOQUVVVLNYQR-MRVPVSSYSA-N 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000001728 clone cell Anatomy 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 231100000433 cytotoxic Toxicity 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 238000003795 desorption Methods 0.000 description 1
- 230000001687 destabilization Effects 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 230000003292 diminished effect Effects 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- POULHZVOKOAJMA-UHFFFAOYSA-M dodecanoate Chemical compound CCCCCCCCCCCC([O-])=O POULHZVOKOAJMA-UHFFFAOYSA-M 0.000 description 1
- POULHZVOKOAJMA-UHFFFAOYSA-N dodecanoic acid Chemical class CCCCCCCCCCCC(O)=O POULHZVOKOAJMA-UHFFFAOYSA-N 0.000 description 1
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 229960003722 doxycycline Drugs 0.000 description 1
- XQTWDDCIUJNLTR-CVHRZJFOSA-N doxycycline monohydrate Chemical compound O.O=C1C2=C(O)C=CC=C2[C@H](C)[C@@H]2C1=C(O)[C@]1(O)C(=O)C(C(N)=O)=C(O)[C@@H](N(C)C)[C@@H]1[C@H]2O XQTWDDCIUJNLTR-CVHRZJFOSA-N 0.000 description 1
- 229950003468 dupilumab Drugs 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- 230000007071 enzymatic hydrolysis Effects 0.000 description 1
- 238000006047 enzymatic hydrolysis reaction Methods 0.000 description 1
- 229940116977 epidermal growth factor Drugs 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 229950004341 evinacumab Drugs 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 229950000335 fasinumab Drugs 0.000 description 1
- 239000003925 fat Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 235000013373 food additive Nutrition 0.000 description 1
- 239000002778 food additive Substances 0.000 description 1
- 235000019249 food preservative Nutrition 0.000 description 1
- 239000005452 food preservative Substances 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 235000021588 free fatty acids Nutrition 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 238000003205 genotyping method Methods 0.000 description 1
- 229960004666 glucagon Drugs 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 125000005456 glyceride group Chemical group 0.000 description 1
- 229960004956 glycerylphosphorylcholine Drugs 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 239000000833 heterodimer Substances 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 229940100601 interleukin-6 Drugs 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 229940070765 laurate Drugs 0.000 description 1
- 101150091094 lipA gene Proteins 0.000 description 1
- 230000006372 lipid accumulation Effects 0.000 description 1
- 230000002366 lipolytic effect Effects 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 150000004668 long chain fatty acids Chemical class 0.000 description 1
- 210000003712 lysosome Anatomy 0.000 description 1
- 230000001868 lysosomic effect Effects 0.000 description 1
- 238000001819 mass spectrum Methods 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 238000002715 modification method Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- AEMBWNDIEFEPTH-UHFFFAOYSA-N n-tert-butyl-n-ethylnitrous amide Chemical compound CCN(N=O)C(C)(C)C AEMBWNDIEFEPTH-UHFFFAOYSA-N 0.000 description 1
- 229940053128 nerve growth factor Drugs 0.000 description 1
- 229950002697 nesvacumab Drugs 0.000 description 1
- 239000012038 nucleophile Substances 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 230000009437 off-target effect Effects 0.000 description 1
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical class CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 210000004738 parenchymal cell Anatomy 0.000 description 1
- 102000042759 phospholipase A2 family Human genes 0.000 description 1
- 108091082051 phospholipase A2 family Proteins 0.000 description 1
- 229920001481 poly(stearyl methacrylate) Polymers 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 239000012521 purified sample Substances 0.000 description 1
- 238000004725 rapid separation liquid chromatography Methods 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 229960000160 recombinant therapeutic protein Drugs 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000007670 refining Methods 0.000 description 1
- 230000015227 regulation of liquid surface tension Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000001223 reverse osmosis Methods 0.000 description 1
- 108010046141 rilonacept Proteins 0.000 description 1
- 229960001886 rilonacept Drugs 0.000 description 1
- 229950005978 rinucumab Drugs 0.000 description 1
- 229950006348 sarilumab Drugs 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 229950006451 sorbitan laurate Drugs 0.000 description 1
- 235000011067 sorbitan monolaureate Nutrition 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000012421 spiking Methods 0.000 description 1
- 208000020045 spinal muscular atrophy with progressive myoclonic epilepsy Diseases 0.000 description 1
- 238000004885 tandem mass spectrometry Methods 0.000 description 1
- 229940126622 therapeutic monoclonal antibody Drugs 0.000 description 1
- 238000004809 thin layer chromatography Methods 0.000 description 1
- 238000001269 time-of-flight mass spectrometry Methods 0.000 description 1
- 229950006444 trevogrumab Drugs 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- DCXXMTOCNZCJGO-UHFFFAOYSA-N tristearoylglycerol Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(OC(=O)CCCCCCCCCCCCCCCCC)COC(=O)CCCCCCCCCCCCCCCCC DCXXMTOCNZCJGO-UHFFFAOYSA-N 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 210000003501 vero cell Anatomy 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 239000012905 visible particle Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000001993 wax Substances 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 229960002760 ziv-aflibercept Drugs 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/18—Carboxylic ester hydrolases (3.1.1)
- C12N9/20—Triglyceride splitting, e.g. by means of lipase
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/18—Carboxylic ester hydrolases (3.1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
- C12N9/80—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5) acting on amide bonds in linear amides (3.5.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/01—Carboxylic ester hydrolases (3.1.1)
- C12Y301/01013—Sterol esterase (3.1.1.13)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/01—Carboxylic ester hydrolases (3.1.1)
- C12Y301/01034—Lipoprotein lipase (3.1.1.34)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y305/00—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5)
- C12Y305/01—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5) in linear amides (3.5.1)
- C12Y305/01023—Ceramidase (3.5.1.23)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/10—Immunoglobulins specific features characterized by their source of isolation or production
- C07K2317/14—Specific host cells or culture conditions, e.g. components, pH or temperature
Definitions
- the invention provides for cells and methods for expression and purification of recombinant proteins in eukaryotic cells.
- the invention includes methods and compositions for expression of proteins in eukaryotic cells, particularly Chinese hamster (Cricetulus griseus) cell lines, that employ downregulating gene expression of endogenous proteins in order to control production of such unwanted "sticky" host cell proteins.
- the invention includes polynucleotides and modified cells that facilitate purification of an exogenous recombinant protein of interest.
- the methods of the invention efficiently target host cell proteins in the Chinese hamster cellular genome in order to facilitate enhanced and stable expression of recombinant proteins expressed by the modified cells.
- HCPs host cell proteins
- LC/MS Advanced liquid chromatography/mass spectrometry
- HCPs Changes in cell culture conditions of eukaryotic cells has been shown to impact the purity and stability of manufactured proteins, as seen by the increased quantity of HCPs of CHO cells upon downstream bioprocessing alterations (Tait, et al, 2013, Biotechnol Prog 29(3):688-696).
- the detrimental effect of leftover HCPs in any product may affect the overall quality or quantity, or both the quality and quantity of the product.
- HCPs if present even at low levels in a therapeutic product, may induce an undesired immune response which causes concern for patient safety and efficacy of the drug product (Singh SK. 201 1 .
- the invention provides a recombinant host cell, wherein the cell is modified to decrease the expression levels of two or more fatty acid hydrolases (FAHs) relative to the expression levels of FAH in an unmodified cell.
- FHs fatty acid hydrolases
- the invention provides a recombinant host cell, wherein the cell is modified to have no expression of two or more target FAHs.
- one of said two or more target FAHs is an esterase.
- the esterase is a lipase.
- the lipase is: (1 ) a
- one of said two or more target FAHs is an amidase.
- the amidase is a fatty acid acylase.
- the fatty acid acylase is an acid ceramidase.
- a gene of interest is exogenously added to the recombinant host cell.
- the exogenously added gene encodes a protein of interest (POI), for example the POI is selected form the group consisting of antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein.
- POI protein of interest
- the invention relates to a composition comprising one or more proteins of interest (POIs) obtainable by a method according to the invention.
- the protein may be a recombinant protein and may be e.g. selected form the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein, or any combinations thereof.
- the invention relates to a composition comprising one or more proteins of interest (POIs), wherein the adverse enzyme activity is ⁇ 50%, 40%, 48%, 47%, 46%, 45%, 44%, 43%, 42%, 41 %, 40%, 39%, 38%, 37%, 36%, 35%, 34%, 33%, 32%, 31 %, 30%, 29%, 28%, 26%, 25%, 24%, 23%, 22%, 21 %, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, 0.05%, or ⁇ 0.05% active relative to the adverse enzyme activity of a wild-type production system.
- POIs proteins of interest
- the phrase "adverse enzyme activity" refers to any enzyme and its action upon the resulting composition as a whole, wherein the action results in metabolism of any protein components which metabolites reduces the shelf-life of the composition or results in the formation of subvisible particles (SVPs) above prescribed regulations.
- the protein may be a recombinant protein and may e.g. be selected from the group consisting of antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein or any combinations thereof.
- the enzymes and their activities may be, e.g., various esterases, hydrolases, lipases, phospholipases, ceramidases and the likes, or any combinations thereof.
- Adverse enzyme activity may be measured using a functional assay ⁇ e.g., polysorbate fatty acid hydrolysis assay), or a structural assay ⁇ e.g., nano LC-MS of peptide fragments, liquid chromatography-tandem mass spectrometry (LC-MS/MS) or surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF-MS) or the like (deZafra et al, 2015, Biotechnol Bioeng;1 12(1 1 ):2284- 91 )).
- a functional assay e.g., polysorbate fatty acid hydrolysis assay
- a structural assay ⁇ e.g., nano LC-MS of peptide fragments, liquid chromatography-tandem mass spectrometry (LC-MS/MS) or surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF-
- the invention provides a cell comprising a nonfunctional PLBD2 protein and one or more additional nonfunctional fatty acid hydrolases (FAH).
- FAH nonfunctional fatty acid hydrolases
- nonfunctional FAH is a nonfunctional lipoprotein lipase (LPL), lysosomal acid lipase (LI PA), acid ceramidase (ACE), platelet-activating factor acetylhydrolase IB subunit gamma (PAFAHG), phosphoinositide phospholipase C fragment (PIPLCf), phosphoinositide phospholipase C (PIPLC), liver carboxylesterase 1 (LCE), isoamyl acetate-hydrolyzing esterase 1 -like (IAH1 ), group XV phospholipase A2 (LPLA2), carboxylic ester hydrolase (CEH), and/or arylsulfatase A (ASA).
- LPL lipoprotein lipase
- LI PA lysosomal acid lipase
- ACE acid ceramidase
- PAFAHG platelet-activating factor acetylhydrolase IB subunit gamma
- the invention provides making a cell by FAH target disruption.
- the method comprises a site-specific nuclease for disrupting or editing the cell genome at a target site or sequence.
- the FAH target site is (1 ) a PLBD2 target site, (2) an LPL target site, (3) an LI PA target site, (4) an ACE target site, (5) a PAFAHG site, (6) a PIPLCf or PIPLC site, (7) an LCE site, (8) an IAH1 site, (9) an LPLA2 site, (9) an CEH site, and /or (10) an ASA site.
- the PLBD2 target site comprises a position within SEQ ID NO:33, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:33, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:33, within exon 1 of SEQ ID NO:33, within exon 2 of SEQ ID NO:33, or within exon 3 of SEQ ID NO:33.
- the PLBD2 target site comprises a position within SEQ ID NO:33 or adjacent to a position within SEQ ID NO:33 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-230, 190-210, 200-220, 210-230, 220-240, 230-250, 240-260, and 250-270 of SEQ ID NO:33.
- the target site at a position within SEQ ID NO:33 or adjacent to a position within SEQ ID NO:33 is selected from the group consisting of nucleotides spanning positions numbered 37-56, 44-56, 33-62, 40-69, 1 10-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
- the PLBD2 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:33 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:33 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the LPL target site comprises a position within SEQ ID NO:61 , within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:61 , 100 nucleotides downstream of the 5-prime end of SEQ ID NO:61 , within exon 1 of SEQ ID NO:61 , within exon 2 of SEQ ID NO:61 , or within exon 3 of SEQ ID NO:61.
- the LPL target site comprises a position within SEQ ID NO:61 or adjacent to a position within SEQ ID NO:61 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440
- the target site at a position within SEQ ID NO:61 or adjacent to a position within SEQ ID NO:61 is selected from the group consisting of nucleotides spanning positions numbered 465-484, 558-577, and 593-612 of SEQ ID NO:61.
- the LPL target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:61 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:61 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the LIPA target site comprises a position within SEQ ID NO:69, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:69, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:69, within exon 1 of SEQ ID NO:69, within exon 2 of SEQ ID NO:69, or within exon 3 of SEQ ID NO:69.
- the LIPA target site comprises a position within SEQ ID NO:69 or adjacent to a position within SEQ ID NO:69 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350 and 340-360 of SEQ ID NO:69.
- the target site at a position within SEQ ID NO:69 or adjacent to a position within SEQ ID NO:69 is selected from the group consisting of nucleotides spanning positions numbered 180-199, 239-258, and 276-295 of SEQ ID NO:69.
- the LIPA target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:69 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:69 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the ACE target site comprises a position within SEQ ID NO:77, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:77, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:77, within exon 1 of SEQ ID NO:77, within exon 2 of SEQ ID NO:77, within exon 3 of SEQ ID NO:77, or within exon 4 of SEQ ID NO:77.
- the ACE target site comprises a position within SEQ ID NO:77 or adjacent to a position within SEQ ID NO:77 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370 and 360-380 of SEQ ID NO:77.
- the target site at a position within SEQ ID NO:77 or adjacent to a position within SEQ ID NO:77 is selected from the group consisting of nucleotides spanning positions numbered 135-154, 237-256, and 332-351 of SEQ ID NO:77.
- the ACE target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:77 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:77 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the PAFAHG target site comprises a position within SEQ ID NO:1 17, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 1 17, 100 nucleotides
- the PAFAHG target site comprises a position within SEQ ID NO:1 17 or adjacent to a position within SEQ ID NO:1 17 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410
- the target site at a position within SEQ ID NO:1 17 or adjacent to a position within SEQ ID NO:1 17 is selected from the group consisting of nucleotides spanning positions numbered 101-120, 1 1 1-130, 121 -140, 131-150, 141-160, and 150-169 of SEQ ID NO:
- the PAFAHG target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 17 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 17 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the PIPLC target site comprises a position within SEQ ID NO:1 18, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 1 18, 100 nucleotides
- the PIPLC target site comprises a position within SEQ ID NO:1 18 or adjacent to a position within SEQ ID NO:1 18 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410
- the target site at a position within SEQ ID NO:1 18 or adjacent to a position within SEQ ID NO:1 18 is selected from the group consisting of nucleotides spanning positions numbered 39-188, 39-58, 49-68, 59-78, 69-88, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 149-178, and 159-188 of SEQ ID NO:1 18.
- the PIPLC target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 18 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 18 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the LCE target site comprises a position within SEQ ID NO:1 19, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 1 19, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 1 19, within exon 1 of SEQ ID NO: 1 19, within exon 2 of SEQ ID NO: 1 19, within exon 3 of SEQ ID NO: 1 19, within exon 4 of SEQ ID NO: 1 19, or within exon 5 of SEQ ID NO:1 19.
- the LCE target site comprises a position within SEQ ID NO:1 19 or adjacent to a position within SEQ ID NO:1 19 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-
- the target site at a position within SEQ ID NO:1 19 or adjacent to a position within SEQ ID NO:1 19 is selected from the group consisting of nucleotides spanning positions numbered 89-140, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, and 139-158 of SEQ ID NO:1 19.
- the LCE target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 19 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 19 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the IAH1 target site comprises a position within SEQ ID NO:120, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 120, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 120, within exon 1 of SEQ ID NO: 120, within exon 2 of SEQ ID NO:120, within exon 3 of SEQ ID NO:120, or within exon 4 of SEQ ID NO:120.
- the IAH1 target site comprises a position within SEQ ID NO:120 or adjacent to a position within SEQ ID NO: 120 selected from the group consisting of nucleotides spanning positions numbered -20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430
- the target site at a position within SEQ ID NO:120 or adjacent to a position within SEQ ID NO: 120 is selected from the group consisting of nucleotides spanning positions numbered 104-325, 104-123, 1 14-133, 124-143, 134-153, 144-163, 154-173, 164-183, 174-193, 184-203, 194-213, 204-223, 214-233, 224-243, 234-253, 244-263, 254-273, 264-283, 274-293, 284-303, 294-313, 304-323, and 314-333 of SEQ ID NO:120.
- the IAH1 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO: 120 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:120 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the LPLA2 target site comprises a position within SEQ ID NO:121 , within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 121 , 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 121 , within exon 1 of SEQ ID NO: 121 , within exon 2 of SEQ ID NO: 121 , within exon 3 of SEQ ID NO: 121 , or within exon 4 of SEQ ID NO: 121.
- the LPLA2 target site comprises a position within SEQ ID NO:121 or adjacent to a position within SEQ ID NO:121 selected from the group consisting of nucleotides spanning positions numbered -20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-
- the target site at a position within SEQ ID NO:121 or adjacent to a position within SEQ ID NO:121 is selected from the group consisting of nucleotides spanning positions numbered 69-195, 69-88, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, 169-188, and 179-198 of SEQ ID NO:121.
- the LPLA2 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:121 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:121 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the CEH target site comprises a position within SEQ ID NO:122, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 122, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 122, within exon 1 of SEQ ID NO: 122, within exon 2 of SEQ ID NO: 122, within exon 3 of SEQ ID NO: 122, within exon 4 of SEQ ID NO: 122, or within exon 5 of SEQ ID NO: 122.
- the CEH target site comprises a position within SEQ ID NO:122 or adjacent to a position within SEQ ID NO: 122 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430
- the target site at a position within SEQ ID NO:122 or adjacent to a position within SEQ ID NO: 122 is selected from the group consisting of nucleotides spanning positions numbered 79-186, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, and 169-188 of SEQ ID NO:122.
- the CEH target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:122 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:122 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the ASA target site comprises a position within SEQ ID NO:123, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 123, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 123, within exon 1 of SEQ ID NO: 123, within exon 2 of SEQ ID NO: 123, within exon 3 of SEQ ID NO: 123, or within exon 4 of SEQ ID NO: 123.
- the ASA target site comprises a position within SEQ ID NO:123 or adjacent to a position within SEQ ID NO: 123 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430
- the target site at a position within SEQ ID NO:123 or adjacent to a position within SEQ ID NO: 123 is selected from the group consisting of nucleotides spanning positions numbered 79-296, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, 169-188, 179-198 , 189-208 , 199-218, 209-228, 219-238, 229-248, 239-258, 249-268, 259-278, 269-288, and 279-298 of SEQ ID NO:123.
- the ASA target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:123 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:123 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the cell further integrates an exogenous nucleic acid sequence.
- the cell is capable of producing an exogenous protein of interest.
- the altered protein resulting from a disrupted gene does not bind to the protein of interest produced by the cell.
- an isolated Chinese hamster ovary (CHO) cell comprising an engineered nucleic acid sequence comprising a variant of the PLBD2 gene (such as a variant of SEQ ID NO:33).
- the PLBD2 gene comprises GACAGTCACG TGGCCCGACT GAGGCACGCG , nucleotides 1-30 of SEQ ID NO:33 (SEQ ID NO: 44).
- the PLBD2 gene is engineered to disrupt expression of the open reading frame.
- the invention provides an isolated CHO cell comprising (a) a disrupted PLBD2 gene comprising GACAGTCACG TGGCCCGACT GAGGCACGCG (SEQ ID NO: 44, also nucleotides 1- 30 of SEQ ID NO:33), (b) a disrupted esterase gene comprising a nucleotide encoding any one of the amino acid sequences in Table 2, or (c) a protein fragment of Table 2 expressed by a disrupted PLBD2 gene; and an exogenous nucleic acid sequence comprising a gene of interest.
- the CHO cell that comprises an engineered nucleic acid sequence comprising a variant of the PLBD2 gene also comprises variants of one or more of the genes encoding (1 ) LPL (variant of SEQ ID NO:61 ), (2) LIPA (SEQ ID NO:69), (3) ACE (SEQ ID NO:77), (4) PAFAHG (SEQ ID NO:1 17), (5) PIPLC (SEQ ID NO:1 18), (6) LCE (SEQ ID NO:1 19), (7) IAH1 (SEQ ID NO:120), (8) LPLA2 (SEQ ID NO:121 ), (9) CEH (SEQ ID NO:122), and/or (10) ASA (SEQ ID NO:123).
- a method of producing a protein of interest using a recombinant host cell wherein the host cell is modified to decrease the expression levels of esterase relative to the expression levels of esterase in an unmodified cell.
- the method comprises the modified host cell having decreased esterase expression and an exogenous nucleic acid sequence comprising a gene of interest (GOI).
- GOI gene of interest
- the exogenous nucleic acid sequence comprises one or more genes of interest.
- the one or more genes of interest are selected from the group consisting of a first GOI, a second GOI and a third GOI.
- the invention provides expression systems comprising the recombinant host cell comprising modified or nonfunctional esterase.
- the cell comprises a GOI operably linked to a promoter capable of driving expression of the GOI, wherein the promoter comprises a eukaryotic promoter that can be regulated by an activator or inhibitor.
- the eukaryotic promoter is operably linked to a prokaryotic operator, and the eukaryotic cell optionally further comprises a prokaryotic repressor protein.
- one or more selectable markers are expressed by the modified host cell.
- the genes of interest and/or the one or more selectable markers are operably linked to a promoter, wherein the promoter may be the same or different.
- the promoter comprises a eukaryotic promoter (such as, for example, a CMV promoter or an SV40 late promoter), optionally controlled by a prokaryotic operator (such as, for example, a tet operator).
- the cell further comprises a gene encoding a prokaryotic repressor (such as, for example, a tet repressor).
- a CHO host cell comprising recombinase recognition sites.
- the recombinase recognition sites are selected from a LoxP site, a Lox5 1 1 site, a Lox2272 site, Lox2372, Lox5171 , and a frt site.
- the cell further comprises a gene capable of expressing a recombinase.
- the recombinase is a Cre recombinase.
- the selectable marker gene is a drug resistance gene.
- the drug resistance gene is a neomycin resistance gene or a hygromycin resistance gene.
- the second and third selectable marker genes encode two different fluorescent proteins.
- the two different fluorescent proteins are selected from the group consisting of Discosoma coral (DsRed), green fluorescent protein (GFP), enhanced green fluorescent protein (eGFP), cyano fluorescent protein (CFP), enhanced cyano fluorescent protein (eCFP), yellow fluorescent protein (YFP), enhanced yellow fluorescent protein (eYFP) and far-red fluorescent protein (mKate).
- the first, second, and third promoters are the same. In another embodiment, the first, second, and third promoters are different from each other. In another embodiment, the first promoter is different from the second and third promoters, and the second and third promoters are the same. In more embodiments, the first promoter is an SV40 late promoter, and the second and third promoters are each a human CMV promoter. In other embodiments, the first and second promoters are operably linked to a prokaryotic operator.
- the host cell line has an exogenously added gene encoding a recombinase integrated into its genome, operably linked to a promoter.
- the recombinase is Cre recombinase.
- the host cell has a gene encoding a regulatory protein integrated into its genome, operably linked to a promoter.
- the regulatory protein is a tet repressor protein.
- the first GOI and the second GOI encode a light chain, or fragment thereof, of an antibody or a heavy chain, or fragment thereof, of an antibody.
- the first GOI encodes a light chain of an antibody and the second GOI encodes a heavy chain of an antibody.
- the first, second and third GOI encode a polypeptide selected from the group consisting of a first light chain, or fragment thereof, a second light chain, or fragment thereof and a heavy chain, or fragment thereof.
- the first, second and third GOI encode a polypeptide selected from the group consisting of a light chain, or fragment thereof, a first heavy chain, or fragment thereof and a second heavy chain, or fragment thereof.
- a method for making a protein of interest comprising (a) introducing into a CHO host cell a gene of interest (GOI), wherein the GOI integrates into a specific locus such as a locus described in US Patent No. 7771997B2, issued August 10, 2010 or other stable integration and/or expression-enhancing locus; (b) culturing the cell of (a) under conditions that allow expression of the GOI; and (c) recovering the protein of interest.
- the protein of interest is selected from the group consisting of a subunit of an immunoglobulin, or fragment thereof, and a receptor, or ligand-binding fragment thereof.
- the protein of interest is selected from the group consisting of an antibody light chain, or antigen-binding fragment thereof, and an antibody heavy chain, or antigen-binding fragment thereof.
- the CHO host cell genome comprises further modifications, and comprises one or more recombinase recognition sites as described above, and the GOI is introduced into a specific locus through the action of a recombinase that recognizes the
- the GOI is introduced into the cell employing a targeting vector for recombinase-mediated cassette exchange (RMCE) when the CHO host cell genome comprises at least one exogenous recognition sequence within a specific locus.
- RMCE recombinase-mediated cassette exchange
- the GOI is introduced into the cell employing a targeting vector for homologous recombination, and wherein the targeting vector comprises a 5' homology arm homologous to a sequence present in the specific locus, a GOI, and a 3' homology arm
- the targeting vector further comprises two, three, four, or five or more genes of interest.
- one or more of the genes of interest are operably linked to a promoter.
- a method for modifying a CHO cell genome to integrate an exogenous nucleic acid sequence comprising the step of introducing into the cell a vehicle comprising an exogenous nucleic acid sequence wherein the exogenous nucleic acid integrates within a locus of the genome.
- the invention provides a process for manufacturing a stable protein formulation comprising the steps of: (a) extracting a protein fraction from the modified host cell of the invention having decreased or ablated expression of esterase, (b) contacting the protein fraction comprising a protein of interest with a column selected from the group consisting of protein A affinity (PA), cation exchange (CEX) and anion exchange (AEX) chromatography, (c) collecting the protein of interest from the media, wherein a reduced level of the esterase activity is associated with the protein fraction collected at step (c), thus providing a stable protein formulation.
- PA protein A affinity
- CEX cation exchange
- AEX anion exchange
- the invention provides a process for reducing esterase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate expression of esterase, (b) transfecting the host cell with a protein of interest, (c) extracting a protein fraction from the modified host cell, (c) contacting the protein fraction comprising the protein of interest with a column selected from the group consisting of protein A affinity (PA), cation exchange (CEX) and anion exchange (AEX) chromatography, (d) collecting the protein of interest from the media, and (e) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer, thus providing a protein formulation essentially free of detectable esterase activity.
- the protein formulation is essentially free of PLBD2 protein or PLBD2 activity.
- a method for modifying a CHO cell genome to express a therapeutic agent comprising a vehicle for introducing, into the genome, an exogenous nucleic acid comprising a sequence for expression of the therapeutic agent, wherein the vehicle comprises a 5' homology arm homologous to a sequence present in the nucleotide sequence of SEQ ID NO:33, a nucleic acid encoding the therapeutic agent, and a 3' homology arm homologous to a sequence present in the nucleotide sequence of SEQ ID NO:33.
- the invention provides a modified CHO host cell comprising a modified CHO genome wherein the CHO genome is modified by disruption of target sequence within a nucleotide sequence at least 90% identical to SEQ ID NO: 33.
- the modified CHO host cell further comprises another FAH target sequence disruption.
- the another FAH target sequence is within (1 ) a nucleotide sequence at least 90% identical to SEQ ID NO:61 , (1 ) a nucleotide sequence at least 90% identical to SEQ ID NO:69 and/or (3) a nucleotide sequence at least 90% identical to SEQ ID NO:77.
- the invention provides a modified eukaryotic host cell comprising a modified eukaryotic genome wherein the eukaryotic genome is modified at a target sequence in a coding region of the target gene by a site-specific nuclease.
- the site-specific nuclease comprises a zinc finger nuclease (ZFN), a ZFN dimer, a transcription activator-like effector nuclease (TALEN), a TAL effector domain fusion protein, or an RNA-guided DNA endonuclease.
- ZFN zinc finger nuclease
- TALEN transcription activator-like effector nuclease
- TAL effector domain fusion protein or an RNA-guided DNA endonuclease.
- the invention also provides methods of making such a modified eukaryotic host cell.
- the target sequence can be placed in the indicated orientation as in SEQ ID NO:33, 61 , 69, 77, 1 17, 1 18, 1 19, 120, 121 , 122, or 123; or in the reverse of the orientation of SEQ ID NO:33, 61 , 69, 77, 1 17, 1 18, 1 19, 120, 121 , 122, or 123.
- Fig. 1 depicts the results of Taqman® quantitative polymerase chain reaction (qPCR) experiments to detect genomic (gDNA) or transcripts (mRNA) of the modified clones.
- Primers and probes were designed to flank the sequences predicted as subject to targeted disruption within exon 1 , either starting at nucleotide 37 (sgRNAI ) or starting at nucleotide 44 (sgRNA2) of SEQ ID NO:33.
- Relative amount of amplicons from clones targeted by either sgRNAI or sgRNA2 are graphed (i.e., relative to amplicons derived from the negative control transfection clones which were subject to no sgRNA or unmatched sgRNA).
- Clone 1 for example, has relatively no amplified gDNA nor mRNA per qPCR of the targeted exon 1 region. Clone 1 and several other clones were selected for follow up analysis. The vertical axes represent the relative amount of template containing sgRNAI sequence (upper panel) or sgRNA2 sequence (lower panel).
- FIG. 2A and Fig. 2B illustrate the results of further PCR analysis of a Clone 1 cells population compared to wild type Chinese hamster overy (CHO) cells.
- Fig. 2A shows a PCR amplicon from Clone 1 that is shorter in length (bp) compared to the amplicon from genomic DNA of wild type cells.
- Fig. 2B shows a PCR amplicon from Clone 1 that is shorter in length (bp) compared to the amplicon from mRNA of wild type cells. Sequencing confirmed an 1 1 bp deletion in the PLBD2 gene of Clone 1 .
- the vertical axes represent the size of the PCR fragments in base-pairs (bp).
- Fig. 3 illustrates the relative protein titer of monoclonal antibody 1 (mAbl )-expressing Clone 1 cells (RS001 ) or mAb1 -expressing wild type CHO cells (RS0WT) subject to the same fed-batch culture conditions for 12 days. Samples of conditioned medium were extracted for each culture, and the Protein A binding fraction was quantified at Day 2, 4, 6, 9 and 12.
- Fig. 4 shows the results of RS001 or RSOWT cells following production culture and protein purification using either Protein A (PA) alone, or PA and anion exchange (AEX) chromatography. PA-purified mAb1 from RS001 and RSOWT was analyzed for lipase abundance using trypsin digest mass spectrometry. As such, trypsin digests of RS001 - and RSOWT-produced mAb1 were injected into a reverse phase liquid chromatography column coupled to a triple quadrupole mass
- spectrometer set to monitor a specific PLBD2 product fragment (as in Table 2).
- Control reactions containing reference samples of mAb1 (with no endogenous PLBD2) spiked with varying amounts of recombinant PLBD2 were also analyzed and plotted. The signals detected in the experiments were compared to the control reactions to determine concentration of PLBD2.
- mAb1 produced from Clone 1 shows no detectable amounts of PLBD2 when purified with PA alone.
- Fig. 5 is a line plot depicting percent cell viability as a function of time in days. Open circles (-0-) represent wildtype cells. Filled diamonds (- ⁇ -) represent PLBD2-KO cells. Filled squares (- ⁇ -) represent LPL-KO cells. Filled triangles (-A -) represent the double knock-out LPL-KO / PLBD2KO.
- Fig. 6 is a line plot depicting protein production (titer) in grams per liter as a function of time in days. Open circles (-o-) represent wildtype cells. Filled diamonds (- ⁇ -) represent PLBD2-KO cells. Filled squares (- ⁇ -) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
- Fig. 7 is a line plot depicting viable cell counts in cells per milliliter as a function of time in days. Open circles (-o-) represent wildtype cells. Filled diamonds (- ⁇ -) represent PLBD2-KO cells. Filled squares (- ⁇ -) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
- Fig. 8 shows a formatted alignment of LPL knock out constructs clone 19, clone 20, clone 21 , and clone 22, represented by SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161 , and SEQ ID NO: 162, respectively.
- the partial wildtype LPL sequence is represented by SEQ ID NO: 158.
- exogenously added gene refers to any DNA sequence or gene not present within the genome of the cell as found in nature.
- an "exogenously added gene” within a CHO genome can be a gene from any other species (e.g., a human gene), a chimeric gene (e.g., human/mouse), or a hamster gene not found in nature within the particular CHO locus in which the gene is inserted (i.e., a hamster gene from another locus in the hamster genome), or any other gene not found in nature to exist within a CHO locus of interest.
- Percent identity when describing an esterase, e.g., a hydrolase protein, such as SEQ ID NO:32, 34, 35, 37, 62,70, 74, 75, 76, 78, 82, 83, 84; 124, 125, 126, 127, 128, 129, and 130; or gene, such as SEQ ID NO:33, 61 , 69, 77, 1 17, 1 18, 1 19, 120, 121 , 122, and 123 includes homologous sequences that display the recited identity along regions of contiguous homology, but the presence of gaps, deletions, or insertions that have no homolog in the compared sequence are not taken into account in calculating percent identity.
- a hydrolase protein such as SEQ ID NO:32, 34, 35, 37, 62,70, 74, 75, 76, 78, 82, 83, 84; 124, 125, 126, 127, 128, 129, and 130; or gene, such as SEQ ID NO
- a "percent identity" determination between, e.g., SEQ ID NO:32 with a species homolog would not include a comparison of sequences where the species homolog has no homologous sequence to compare in an alignment (i.e., SEQ ID NO:32 compared to a fragment thereof, or the species homolog has a gap or deletion, as the case may be). Thus, “percent identity” does not include penalties for gaps, deletions, and insertions.
- Target sites are the sites within the genomic sequence selected for cleavage or break by a nuclease.
- the DNA break is normally repaired by the non-homologous end-joining (NHEJ) DNA repair pathway.
- NHEJ non-homologous end-joining
- InDels insertions or deletions
- ORF open reading frame
- Targeted insertion refers to gene targeting methods employed to direct insertion or integration of a gene or nucleic acid sequence to a specific location on the genome, i.e., to direct the DNA to a specific site between two nucleotides in a contiguous polynucleotide chain. Targeted insertion may also be performed to introduce a small number of nucleotides or to introduce an entire gene cassette, which includes multiple genes, regulatory elements, and/or nucleic acid sequences. "Insertion” and “integration” are used interchangeably.
- Recognition site or “recognition sequence” is a specific DNA sequence recognized by a nuclease or other enzyme to bind and direct site-specific cleavage of the DNA backbone.
- Endonucleases cleave DNA within a DNA molecule.
- Recognition sites are also referred to in the art as recognition target sites.
- Polysorbates are fatty acid esters of sorbitan or iso-sorbide (polyoxyethylene sorbitan or iso- sorbide mono- or di- esters).
- the polyoxyethylene serves as the hydrophilic head group and the fatty acid as the lipophilic hydrophobic tail.
- the effectiveness as a surfactant of the polysorbate depends upon the amphiphilic nature of the molecule with both hydrophilic head and hydrophobic tail present (in a single molecule).
- SVP subvisible particle
- SVPs may attribute to immunogenicity. Regulatory authorities like the United States Food and Drug Administration (USFDA) provide limitations on the number of subvisible particles (SVPs) allowed in a pharmaceutical formulation. United States Pharmacopeia (USP) publishes standards for strength, purity and quality of drugs and drug ingredients, as well as food ingredients and dietary
- USP 31 monograph ⁇ 788> sets the limit for number of particles allowed in parenteral formulations.
- USP 31 monograph ⁇ 788> is available at http://www.uspnf.com/official- text/revision-bulletins/particulate-matter-iniections: and as Revision Bulletin Official July 1 , 2012, ⁇ 788> Particulate Matter in Injections, The United States Pharmacopeial Convention.
- the limit is set at no more than 25 particles of at least 10 microns per ml_, and no more than 3 particles of at least 25 microns per ml_.
- the limit is set at no more than 6,000 particles of at least 10 microns per container, and no more than 600 particles of at least 25 microns per container.
- the term "stability” refers to the retention of an acceptable degree of physical structure (colloidal, nature), chemical structure or biological function of the biologically active agent (e.g., biotherapeutic or other protein produced in a cell-based bioprocess) over time during storage (a.k.a. "shelf-life"), during processing, or after administration and while in vivo.
- the biologically active agent may be stable even though it does not maintain 100% of its structure or function after storage or administration for a defined amount of time.
- the biologically active agent and formulation containing the biologically active agent may be regarded as "stable".
- Stability can be measured, inter alia, by determining the percentage of native molecule that remains in the formulation after storage or administration for a defined amount of time at a defined temperature.
- the percentage of native molecule can be determined by, inter alia, size exclusion chromatography (e.g., size exclusion high performance liquid chromatography [SE-HPLC]), such that native means non-aggregated and non-degraded.
- SE-HPLC size exclusion high performance liquid chromatography
- At least about 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% of the native form of the biologically active agent can be detected in the formulation after a defined amount of time at a defined temperature or under physiological conditions after administration.
- the defined amount of time after which stability is measured can be about 14 days, about 28 days, about 1 month, about 2 months, about 3 months, about 4 months, about 5 months, about 6 months, about 7 months, about 8 months, about 9 months, about 10 months, about 1 1 months, about 12 months, about 18 months, about 24 months, or more.
- the temperature at which the formulation containing the biologically active agent may be kept when assessing stability can be any temperature from about -80°C to about 45°C, e.g., storage at about - 80°C, about -30°C, about -20°C, about 0°C, about 4°-8°C, about 5°C, about 25°C, about 35°C, about 37°C or other physiological temperatures, or about 45°C.
- the biologically active agent may be deemed stable if after 3 months under physiological conditions, greater than about 75%, 80%, 85% or 90% of native molecule is detected in the soluble fraction by SE-HPLC or other size exclusion or size determination method.
- “Physiological temperature” includes the body temperature of any vertebrate.
- physiological temperature of humans is about 37°C.
- physiological temperature is between about 25°C and about 45°C.
- physiological temperature is about 25°C, about 26°C, about 27°C, about 28°C, about 29°C, about 30°C, about 31 °C, about 32°C, about 33°C, about 34°C, about 35°C, about 36°C, about 37°C, about 38°C, about 39°C, about 40°C, about 41 °C, about 42°C, about 43°C, about 44°C, and about 45°C.
- Stability can be measured, inter alia, by determining the percentage of biologically active agent, such as a protein, that forms an aggregate (i.e., high molecular weight species) after a defined amount of time at a defined temperature, wherein stability is inversely proportional to the percent high molecular weight (HMW) species that is formed of the biologically active agent (protein).
- the percentage of HMW species of the biologically active agent can be determined by, inter alia, size exclusion chromatography, as described above.
- a pharmaceutical formulation containing the biologically active agent may also be deemed stable if after three months at physiological conditions less than about 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, or 0.1 % of the biologically active agent is detected in a HMW form.
- Stability can be measured, inter alia, by determining the percentage of a biologically active agent, such as a protein, that is degraded or otherwise is found as a low molecular weight (LMW) species after a defined amount of time at a defined temperature. Stability is inversely proportional to the percent LMW species that is formed in the soluble fraction. The percentage of LMW species of the biologically active agent in the soluble fraction can be determined by, inter alia, size exclusion chromatography, as described above.
- LMW low molecular weight
- a pharmaceutical formulation may also be deemed stable if after three months under storage conditions less than about 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, or 0.1 % of the biologically active agent is detected in a LMW form.
- compositions prepared in host cells as described herein are envisioned to comply with standards set forth by multiple regulatory authorities and ICH guidelines.
- the composition complies if tested for subvisible particles and the test results in the average number of particles present in the units tested does not exceed 12 per milliliter equal to or greater than 10 um in size, and does not exceed 2 per milliliter equal to or greater than 25 um.
- Various test for microscopic particles in solution are well-known in the art, including but not limited to tests recommended in ICH Guideline Q4B Annex 3(R1 ), dated 27 September 2010 (Evaluation And Recommendation Of Pharmacopoeial Texts For Use In The lch Regions On Test For Particulate Contamination: Sub-Visible Particles General Chapter).
- HCPs proteins produced or encoded by the host organisms used to produce recombinant therapeutic proteins. HCPs are generally process-related impurities during biologies production. The amount of residual HCPs in drug product is generally considered a critical quality attribute (CQA), due to their potential to affect product safety and efficacy. Regulatory authorities require a product sponsor to monitor the removal of HCPs in drug product during bioprocess development. A sensitive assay e.g., immunoassay, capable of detecting a wide range of protein impurities is generally utilized. This testing can include verification at commercial scale in accordance with regional regulations and may be done at the time of submission of a marketing approval application.
- CQA critical quality attribute
- ICH Specifications (Q6A and Q6B, section 2.3), if a drug substance or drug product does not contain any impurity in the specific formulation, i.e. if efficient control or removal to acceptable levels is demonstrated by suitable studies, further testing may be reduced or eliminated upon approval by the regulatory authorities.
- compositions prepared in host cells as described herein are envisioned to comply with standards set forth by multiple regulatory authorities and ICH guidelines.
- the composition prepared by the host cells described herein comprise less than 100 ng/mg (ppm), less than about 90 ppm, less than about 80 ppm, less than about 70 ppm, less than about 60 ppm, less than about 50 ppm, less than about 40 ppm, less than about 30 ppm, less than about 20 ppm, less than about 10 ppm, less than about 5 ppm, or 0 ppm of the target host cell protein, i.e. fatty acid hydrolase.
- fatty acid hydrolase or “FAH” refers to any hydrolytic enzyme that cleaves at a carbonyl group creating a carboxylic acid product in which the carboxylic acid comprises an R- group that is lipophilic or otherwise hydrophobic.
- the carboxylic acid product is a fatty acid.
- Esterases and “fatty acid acylases/amidases” are included as subgenera of fatty acid hydrolase.
- Lipases are a subgenus of esterases that cleave lipids (fats, waxes, sterols, glycerides and phospholipids.
- Phospholipases are a subgenus of lipases that cleave
- phospholipids phospholipids. Esterases cleave fatty acid esters into fatty acids and alcohols. Lipases include PLBD2, LPL and LIPA. "Ceramidase” is a subgenus of fatty acid acylase that cleaves ceramide and releases a fatty acid and sphingosine, which is an amino alcohol. Examples of ceramidases include acid ceramidase, neutral ceramidase, alkaline ceramidase 1 , alkaline ceramidase 2 and alkaline ceramidase 3.
- Protein A-binding fraction refers to the fraction of cell lysate from cultured cells expressing a protein of interest which binds to a Protein A affinity format. It is well understood in the art that Protein A affinity chromatography, such as Protein A chromatography medium, such as resins, beads, columns and the like, are utilized to capture Fc-containing proteins due to their affinity to Protein A.
- Protein A affinity chromatography such as Protein A chromatography medium, such as resins, beads, columns and the like
- Phospholipase B-like 2 refers to the homologs of a phospholipase gene known as NCBI RefSeq. XM_003510812.2 (SEQ ID NO:33) or protein known as NCBI RefSeq. XP_003510860.1 (SEQ ID NO:32), and further described herein.
- PLBD2 is also referred to in the art as putative phospholipase B-like 2 (PLBL2), 76 kDa protein, LAMA-like protein 2, PLB homolog 2, lamina ancestor homolog 2, mannose-6-phosphate protein associated protein p76, p76, phospholipase B-like 2 32 kDa form, phospholipase B-like 2 45 kDa form, or Lysosomal 66.3 kDa protein.
- Lipoprotein lipase is a glycosylated homodimer secreted by parenchymal cells and associated with endothelial cells of the capillary lumen.
- Exemplary LPL proteins include Chinese hamster LPL (SEQ ID NO:62), mouse LPL (SEQ ID NO:66), ), rat LPL (SEQ ID NO:67) and human LPL (SEQ ID NO:68).
- Mouse LPL is 92% identical to Chinese hamster LPL.
- Rat LPL is 92% identical to Chinese hamster LPL.
- Human LPL is 88% identical to Chinese hamster LPL.
- Chinese hamster LPL is encoded by a polynucleotide sequence of SEQ ID NO:61 .
- Lysosomal acid lipase also known as lysosomal lipase, lipase A, lysosomal acid and cholesterol esterase is an intracellular lipase that functions in the lysosome.
- LI PA reversibly catalyzes cholesteryl ester bond formation and cleavage.
- LI PA is also a glycosylated homodimer.
- Exemplary LIPA proteins include Chinese hamster LIPA (SEQ ID NO:70), mouse LIPA (SEQ ID NO:74), ), rat LIPA (SEQ ID NO:75) and human LIPA (SEQ ID NO:76).
- Mouse LIPA is 72% identical to Chinese hamster LIPA.
- Rat LIPA is 75% identical to Chinese hamster LIPA.
- Human LIPA is 74% identical to Chinese hamster LIPA.
- Chinese hamster LIPA is encoded by a polynucleotide sequence of SEQ ID NO:69.
- Acid ceramidase also known as ASAH1 , AC, ACDase, ASAH, PHP, PHP32, SMAPME, N-acylsphingosine amidohydrolase (acid ceramidase) 1 is an acylase that cleaves ceramide to produce fatty acid and sphingosine. It is a heterodimer comprising a non-glycosylated alpha subunit and a glycosylated beta subunit. Acid ceramidase has an acid pH optimum. The lipid accumulation disease, Farber Lipogranulomatosis, is associated with a deficiency in acid ceramidase activity.
- Exemplary acid ceramidases include Chinese hamster ACE (SEQ ID NO:78), mouse ACE (SEQ ID NO:82), ), rat ACE (SEQ ID NO:83) and human ACE (SEQ ID NO:84).
- Mouse ACE is 87% identical to Chinese hamster ACE.
- Rat ACE is 89% identical to Chinese hamster ACE.
- Human ACE is 83% identical to Chinese hamster ACE.
- Chinese hamster ACE is encoded by a polynucleotide sequence of SEQ ID NO:77.
- Platelet-activating factor acetylhydrolase IB subunit gamma (PAFAHG), also known as PAFAH1 B3, PAFAHG, and platelet activating factor acetylhydrolase 1 b catalytic subunit 3 is one of the catalytic subunits along with beta of the cytosolic tetrameric platelet-activating factor
- acetylhydrolase IB belongs to the phospholipase A2 family and catalyzes the hydrolysis of the acyl group at position 2 of glycerol in bioactive phospholipids (see Stafforini et al., Journal of Biological Chemistry, 272:17895-17898, July 1997).
- Chinese hamster PAFAHG (SEQ ID NO:124) is 98% identical to both rat and mouse PAFAHG, and 96% identical to human PAFAHG.
- Chinese hamster PAFAHG is encoded by a polynucleotide sequence of SEQ ID NO:1 17.
- Phosphoinositide phospholipase C also known as triphosphoinositide
- inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling.
- PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG).
- IP3 inositol triphosphate
- DAG diacylglycerol
- Chinese hamster PIPLC (SEQ ID NO: 125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC.
- Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
- Phosphoinositide phospholipase C also known as triphosphoinositide
- inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling.
- PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG).
- IP3 inositol triphosphate
- DAG diacylglycerol
- Chinese hamster PIPLC (SEQ ID NO: 125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC.
- Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
- Phosphoinositide phospholipase C also known as triphosphoinositide
- inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling.
- PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG).
- IP3 inositol triphosphate
- DAG diacylglycerol
- Chinese hamster PIPLC (SEQ ID NO: 125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC.
- Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
- Phosphoinositide phospholipase C also known as triphosphoinositide
- inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling.
- PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG).
- IP3 inositol triphosphate
- DAG diacylglycerol
- Chinese hamster PIPLC (SEQ ID NO: 125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC.
- Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
- Phosphoinositide phospholipase C also known as triphosphoinositide
- inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling.
- PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG).
- IP3 inositol triphosphate
- DAG diacylglycerol
- Chinese hamster PIPLC (SEQ ID NO: 125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC.
- Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
- Additional fatty acid hydrolases that can serve as targets for deletion, either individually, or in combination with one of more additional fatty acid hydrolases are listed in Table 1. Any of the following hydrolases, or their equivalents, listed in Table A may be the target protein in the creation of a knockout cell line, where removal of the host cell protein is necessitated due to contamination in the preparation of a biopharmaceutical product.
- cell or “cell line” includes any cell that is suitable for expressing a recombinant nucleic acid sequence.
- Cells include those of prokaryotes and eukaryotes (single-cell or multiple- cell), bacterial cells (e.g. , strains of E. coli, Bacillus spp., Streptomyces spp., etc.), mycobacteria cells, fungal cells, yeast cells (e.g.,S. cerevisiae, S. pombe, P. partoris, P.
- the cell is a human, monkey, ape, hamster, rat or mouse cell.
- the cell is eukaryotic and is selected from the following cells: CHO (e.g.,CHO K1 , DXB-1 1 CHO, Veggie-CHO), COS (e.g.,COS-7), retinal cells, Vero, CV1 , kidney (e.g.,HEK293, 293 EBNA, MSR 293, MDCK, HaK, BHK21 ), HeLa, HepG2, WI38, MRC 5, Colo25, HB 8065, HL-60, Jurkat, Daudi, A431 (epidermal), CV-1 , U937, 3T3, L cell, C127 cell, SP2/0, NS-0, MMT cell, tumor cell, and a cell line derived from an aforementioned cell.
- the cell comprises one or more viral genes, e.g., a retinal cell that expresses a viral gene (e.g., a PER.C6® cell).
- the invention is based at least in part on a recombinant host cell and cell expression system thereof that decreases expression of two or more an endogenous host cell fatty acid hydrolases (FAHs), decreases the enzymatic function or binding ability of two or more endogenous host cell FAHs, or lacks detectable expression of two or more FAHs.
- FHs endogenous host cell fatty acid hydrolases
- the inventors discovered that the disruption of genes encoding at least two FAHs allows for the optimized and efficient production and purification of biopharmaceutical products expressed in such expression systems.
- the invention may be employed in several ways, such as 1 ) utilizing gene editing tools to totally knockout FAH expression, whereas no measurable full-length FAH enzyme is expressed in the cell due to disruption of the gene encoding the FAH; 2) utilizing gene editing tools to eliminate or reduce enzymatic activity, whereas the FAH protein is expressed but rendered nonfunctional due to disruptions in its gene; and 3) utilizing gene editing tools to eliminate or reduce the ability of an endogenous host cell FAH to bind exogenous recombinant protein produced by the cell.
- FAH activity was determined in protein fractions of certain antibody-producing cells. Several particular fatty acid hydrolases were determined as contaminants in these protein fractions, including three carboxylic esterases (a.k.a.
- esterases phospholipase B-like (PLBD2), lipoprotein lipase (LPL) and lysosomal acid lipase (LIPA), and acid ceramidase, a carboxylic amidase (an acylase).
- PLBD2 phospholipase B-like
- LPL lipoprotein lipase
- LIPA lysosomal acid lipase
- acid ceramidase a carboxylic amidase (an acylase).
- Gene editing target sites were identified in hamster PLBD2, LPL, LIPA and acid ceramidase (ACE) genes that enable targeted disruption of those genes in a hamster cell (i.e., CHO) genome.
- ACE acid ceramidase
- An optimized host cell comprising a combination of genetic modifications that affect the expression of genes encoding (1 ) PLBD2 and LPL; (2) PLBD2 and LIPA; (3) PLBD2 and ACE; (4) LPL and LIPA; (5) LPL and ACE; (6) LIPA and ACE; (7) PLBD2, LPL and LIPA; (8) PLBD2, LPL and ACE; (9) PLBD2, LIPA and ACE; (10) LPL, LIPA and ACE, (1 1 ) PLBD2, LPL, LIPA and ACE, (12) PLBD2 and PIPLC, (13) PLBD2 and CEH, (14) PLBD2 and PAFAHG, (15) PLBD2 and LCE, (16) PLBD2 and ASA, (17) PLBD2 and IAH1 , (18) PLBD2 and LPLA2, (19) LPL and PIPLC, (20) LPL and CEH, (21 ) LPL and PAFAHG, (22) LPL and LCE, (23)
- Such a cell is envisioned to reduce the burden of certain purification steps, thereby reducing time and cost, while increasing production yield. Also, the formulated protein is expected to have improved stability due to the reduced hydrolase burden.
- the invention is also based on the specific targeting of an exogenous gene to the integration site.
- the methods of the invention allow efficient modification of the cell genome, thus producing a modified or recombinant host cell useful as a cell expression system for the bioprocessing of therapeutic or other commercial protein products.
- the methods of the invention employ cellular genome gene editing strategies for the alteration of particular genes of interest that otherwise may diminish or contaminate the quality of recombinant protein formulations, or require multiple purification steps.
- compositions of the invention can also be included in expression constructs for example, in expression vectors for cloning and engineering new cell lines. These cell lines comprise the modifications described herein, and further modifications for optimal incorporation of expression constructs for the purpose of protein expression are envisioned.
- Expression vectors comprising polynucleotides can be used to express proteins of interest transiently, or can be integrated into the cellular genome by random or targeted recombination such as, for example, homologous recombination or recombination mediated by recombinases that recognize specific recombination sites (e.g., Cre-lox-mediated recombination).
- Target sites for disruption or insertion of DNA are typically identified with the maximum effect of the gene disruption or insertion in mind.
- target sequences may be chosen near the N-terminus of the coding region of the gene of interest whereas a DNA break is introduced within the first or second exon of the gene.
- Introns non-coding regions
- the changes introduced by these modifications are permanent to the genomic DNA of the organism.
- GOI expressible gene of interest
- any other desirable elements such as, e.g., promoters, enhancers, markers, operators, ribosome binding sites (e.g., internal ribosome entry sites), etc. are also employed.
- the resulting recombinant cell line conveniently provides more efficient downstream bioprocess methods with respect to expressible exogenous genes of interest (GOIs), since purification steps for exogenous proteins of interest are eliminated due to the absence of the contaminant host cell protein. Eliminating or refining purification procedures also results in higher amounts (titer) of the recovered protein of interest.
- GOIs expressible exogenous genes of interest
- Applicants have discovered enzymatic activities associated with the destabilization of polysorbates (including polysorbate 20 and polysorbate 80) and/or enzymatic activities co-purifying with highly concentrated, multimerized or aggregated protein. Those activities were found to be associated with one or more fatty acid hydrolases (FAHs).
- FAHs fatty acid hydrolases
- One such FAH was identified from the peptide sequences listed in Table 2.
- a BLAST search of those peptide sequences revealed identity with a putative phospholipase B-like 2 (PLBD2, also referred to as PLBL2).
- PLBD2 is highly conserved in hamster (SEQ ID NO:32), mice (SEQ ID NO:34), rat (SEQ ID NO:35), human (SEQ ID NO:36), and bovine (SEQ ID NO:37).
- the applicants discovered that PLBD2 which co-purifies under certain processes with some classes of proteins-of-interest manufactured in a mammalian cell line, has enzymatic activity responsible for the hydrolysis of polysorbate 20 and 80.
- Other FAH species, of which PLBD2 is an example may contribute to polysorbate instability or persist as hydrophobic "sticky" proteins that bind protein multimers or aggregates during purification and ultimate formulation, depending upon the particular protein-of-interest and/or background of the host cell.
- IgGs formulated with polysorbate 20 did not form particles and the putative esterase did not hydrolyze the polysorbate 20.
- the author reported that the putative lipase associated with the IgG did not affect saturated C12 fatty acid (i.e., laurate) (Id at 7.)
- LPL lipoprotein lipase
- LIPA lysosomal acid lipase
- ACE acid ceramidase
- LPL is a triacylglycerol/diacylglycerol hydrolase of the carboxylic ester hydrolase (esterase) family (see Hide ef al., "Structure and evolution of the lipase superfamily," J Lipid Res. 1992 Feb; 33(2): 167-78).
- LIPA is a sterol esterase and synthase that acts on esters of sterols and long-chain fatty acids (see Dubland and Francis, “Lysosomal acid lipase: at the crossroads of normal and atherogenic cholesterol metabolism,” Front. Cell Dev. Biol. 2015 Feb 2; 3(3): 1-1 1 ).
- Acid ceramidase is not a carboxylic ester hydrolase, but rather an amide hydrolase that cleaves fatty acids from ceramide at the amide bond (carboxylic amide hydrolase) (see Park and Schuchman, "Acid ceramidase and human disease,” Biochim. Biophys. Acta. 2006 Dec; 1758(12): 2133-8).
- Phospholipases are a family of esterase enzymes that catalyze the cleavage of phospholipids. Each phospholipase subclass has different substrate specificity based on its target cleavage site.
- Phospholipase B was identified as related to a group of prokaryotic and eukaryotic lipase proteins by virtue of the presence of a highly conserved amino acid sequence motif, Gly-Asp-Ser-Leu (GDSL) (Upton, C, and Buckley, JT. A new family of lipolytic enzymes? Trends Biochem Sci. 1995; 20:178-179).
- phospholipase B is also classified with known GDSL(S) hydrolases, and has little sequence homology to true lipases, differentiating itself structurally from phospholipases by having a serine-containing motif closer to the N-terminus than other lipases.
- phospholipase B-like proteins are also classified as N-terminal nucleophile (Ntn) hydrolases.
- Ntn N-terminal nucleophile hydrolases.
- PLB-like proteins such as phospholipase B-like protein 1 (PLBD1 ) and phospholipase B-like protein 2 (PLBD2), also have amidase activity, similar to other Ntn hydrolases (Repo, H. et al, Proteins 2014; 82:300-31 1 ).
- Lipoprotein lipases have also been demonstrated to cleave carboxylic ester bonds of polysorbate 20 and 80; and to associate with some monoclonal antibodies during production (see N. Levy, "Host cell protein impurities and protein-protein interactions in downstream purification of monoclonal antibodies," Dissertation submitted to the Faculty of the University of Delaware, Summer 2014, UMI 3642330, Published by ProQuest LLC, 2014).
- Cell-cultures of a CHO-K1 RNAi knock-down of LPL revealed diminished polysorbate esterase activity. The effect of such a knockdown on overall cell viability or the production of useful titers of ectopic protein has not been investigated.
- Knockout of a host cell gene such as an FAH, more particularly one or more of phospholipase B-like protein 2, lipoprotein lipase, lysosomal acid lipase and acid ceramidase may be accomplished in several ways. Rendering the FAH encoding gene nonfunctional, or reducing the functional activity of the target FAH protein may be done by introducing point mutations in the FAH genomic sequence, particularly in the exons (coding regions).
- nucleic acid sequences of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 were identified and sequences upstream and downstream of the target site (i.e., homologous arms) may be utilized to integrate an expression cassette comprising a mutated gene by homologous recombination. Further gene editing tools are described herein in accordance with the invention.
- Cell lines devoid of multiple FAH activities are useful for the production of therapeutic proteins to be purified and stored long term, and such cell lines solve problems associated with long term storage of pharmaceutical compositions in a formulation containing a fatty acid ester surfactant by maintaining protein stability and reducing subvisible particle (SVP) formation (see also PCT International Application No. PCT/US 15/54600 filed October 8, 2015, which is hereby incorporated in its entirety into the specification).
- SVP subvisible particle
- Assays to detect FAH activity include polysorbate degradation measurements. Unpurified protein supernatants or fractions from CHO cells, and supernatant at each step or sequence of steps when subjected to sequential purification steps, is tested for stability of polysorbate, such as polysorbate 20 or 80. The measurement of percent intact polysorbate reported is inversely proportional to the amount of contaminating FAH activity. Other measurements for detection of FAH activity or presence of FAH in a protein sample are known in the art. Detection of FAH protein ⁇ e.g., lipase, phospholipase, PLBD2, LPL, LIPA, acylase, ACE) may be done by trypsin digest mass spectrometry.
- micrometers in diameter may be counted in the protein formulation in order to detect esterase or other FAH activity.
- glycerophospho[ 3 H]choline formation from phosphatidyl[3H]choline following incubation of phosphatidyl[ 3 H]choline and protein supernatant may be determined by thin-layer chromatography (following similar protocols according to Kanoh, H. et al. 1991 Comp Biochem Physiol 102B(2):367- 369).
- SEQ ID NO:32 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of PLBD2, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:32, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical
- compositions and are candidates for targeted disruption in the engineered cell line are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:32 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
- the mammalian PLBD2 sequences are conserved among hamster, human, mouse and rat genomes.
- Table 3 identifies exemplary mammalian PLBD2 proteins and their degree of homology.
- the targeted disruption of SEQ ID NO:33 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 33-62, 37-56, 40-69, 44-63, 1 10-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
- the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 33-62, 37-56, 40-69, 44-63, 1 10-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
- the PLBD2 nucleic acid sequence is at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:33 or target sequence thereof.
- SEQ ID NO:62 disclosed herein was identified from proteins expressed in CHO cells.
- Other mammalian species such as, for example, humans, rats, mice
- Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques.
- variants of LPL can then be tested for FAH activity as described herein.
- DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:62, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:62 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
- the mammalian LPL sequences are conserved among hamster, human, mouse and rat genomes.
- Table 4 identifies exemplary mammalian LPL proteins and their degree of homology.
- the targeted disruption of SEQ ID NO:61 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400- 420, 410-430, 420-440, 430-450, 440-460
- the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410- 430, 420-440, 430-450, 440-460, 450-470
- the LPL nucleic acid sequence is at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:61 or target sequence thereof.
- SEQ ID NO:70 disclosed herein was identified from proteins expressed in CHO cells.
- Other mammalian species such as, for example, humans, rats, mice
- Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques.
- variants of LIPA can then be tested for FAH activity as described herein.
- DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:70, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:70 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
- the mammalian LIPA sequences are conserved among hamster, human, mouse and rat genomes.
- Table 5 identifies exemplary mammalian LIPA proteins and their degree of homology.
- the targeted disruption of SEQ ID NO:69 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 180-199, 239-258, and 276-295 of SEQ ID NO:69.
- the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 180-199, 239-258, and 276-295 of SEQ ID NO:69.
- the LIPA nucleic acid sequence is at least about 70% identical, at least about 71 % identical, at least about 72% identical, at least about 73% identical, at least about 74% identical, at least about 75% identical, at least about 76% identical, at least about 77% identical, at least about 78% identical, or at least about 79% identical, at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:69 or target sequence thereof.
- SEQ ID NO:78 disclosed herein was identified from proteins expressed in CHO cells.
- Other mammalian species such as, for example, humans, rats, mice
- Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques.
- variants of LPL can then be tested for FAH activity as described herein.
- DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:78, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:78 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
- ACE mammalian acid ceramidase
- the targeted disruption of SEQ ID NO:77 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 135-154, 237-256, and 332-351 of SEQ ID NO:77.
- the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 350-370, 360-380, 135-154, 237-256, and 332-351 of SEQ I D NO:77.
- the ACE nucleic acid sequence is at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:77 or target sequence thereof.
- Cell populations expressing enhanced levels of a protein of interest can be developed using the cell lines and methods provided herein.
- the isolated commercial protein, protein supernatant or fraction thereof, produced by the cells of the invention have no detectable esterase or esterase activity.
- Cell pools further modified with exogenous sequence(s) integrated within the genome of the modified cells of the invention are expected to be stable over time, and can be treated as stable cell lines for most purposes. Recombination steps can also be delayed until later in the process of development of the cell lines of the invention.
- Methods for genetically engineering a host cell genome in a particular location may be achieved in several ways. Genetic editing techniques were used to modify a nucleic acid sequence in a eukaryotic cell, wherein the nucleic acid sequence is an endogenous sequence normally found in such cells and expressing a contaminant host cell protein. Clonal expansion is necessary to ensure that the cell progeny will share the identical genotypic and phenotypic characteristics of the engineered cell line.
- native cells are modified by a homologous recombination technique to integrate a nonfunctional or mutated target nucleic acid sequence encoding a host cell protein, such as a variant of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and/or SEQ ID NO:77.
- One such method of editing the CHO PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and ASA genomic sequences involves the use of guide RNAs and a type II Cas enzyme to specifically target an exon of PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and/or ASA.
- RNAs directed to particular exons of CHO PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and ASA have been employed (Table 7) in a site-specific nuclease editing method as described herein.
- Other methods of targeted genome editing, for example nucleases, recombination-based methods, or RNA interference, to modify the FAH genes may be employed for the targeted disruption of the CHO genome.
- the engineered mammalian host cell comprises one or more disruptions within gene sequences selected from the group consisting of nucleotides 37-63 of SEQ ID NO:33, nucleotides 465-612 of SEQ ID NO:61 , nucleotides 180-295 of SEQ ID NO:69, nucleotides 135-351 of SEQ ID NO:77, nucleotides 249-388 of SEQ ID NO:1 17, nucleotides 1624-2157 of SEQ ID NO:163, nucleotides 372-1399 of SEQ ID NO:1 18, nucleotides 1 155-1600 of SEQ ID NO:1 19, nucleotides 423-615 of SEQ ID NO:120, nucleotides 753-1 141 of SEQ ID NO:121 , nucleotides 31 1 -581 of SEQ ID NO:164, nucleotides 1 155-1443 of SEQ ID NO:122, and nucle
- methods and compositions for knockout or downregulation of a nucleic acid molecule encoding an ortholog of a host cell FAH protein having at least 80% identity to SEQ ID NO:33 (PLBD2), SEQ ID NO:61 (LPL), SEQ ID NO:69 (LIPA) and/or SEQ ID NO:77 (ACE); or antibody-binding variant thereof, are via homologous recombination.
- a nucleic acid molecule encoding an FAH protein (or any protein of interest in general) can be targeted by homologous recombination or by using site-specific nuclease methods that specifically target sequences at the FAH-expressing site of the host cell genome.
- homologous polynucleotide molecules i.e., homologous arms
- a transgene can be introduced during this exchange if the transgene is flanked by homologous genomic sequences.
- a recombinase recognition site can also be introduced into the host cell genome at the integration sites.
- Homologous recombination in eukaryotic cells can be facilitated by introducing a break in the chromosomal DNA at the integration site.
- Model systems have demonstrated that the frequency of homologous recombination during gene targeting increases if a double-strand break is introduced within the chromosomal target sequence. This may be accomplished by targeting certain nucleases to the specific site of integration. DNA-binding proteins that recognize DNA sequences at the target gene are known in the art. Gene targeting vectors are also employed to facilitate homologous recombination.
- NHEJ non-homologous end-joining
- ZFNs zinc finger nucleases
- Some embodiments can utilize ZFNs with a combination of individual zinc finger domains targeting multiple target sequences.
- ZFN methods to target disruption of two or more FAH genes are also embodied by the invention.
- TAL effector nucleases may also be employed for site-specific genome editing.
- TAL effector protein DNA-binding domain is typically utilized in combination with a non-specific cleavage domain of a restriction nuclease, such as Fokl.
- a fusion protein comprising a TAL effector protein DNA-binding domain and a restriction nuclease cleavage domain is employed to recognize and cleave DNA at a target sequence within an exon of the gene encoding the target host cell protein, for example an esterase, such as a phospholipase B-like 2 protein(or other mammalian phospholipase), a lipoprotein lipase and a lysosomal acid lipase, and/or a fatty acylase, such as acid ceramidase (or other mammalian ceramidase).
- an esterase such as a phospholipase B-like 2 protein(or other mammalian phospholipase), a lipoprotein lipase and a lysosomal acid lipase, and/or a fatty acylase, such as acid ceramidase (or other mammalian ceramidase).
- Targeted disruption or insertion of exogenous sequences into a specific exon of the CHO protein encoded by SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and/or SEQ ID NO:77 may be done by employing a TALE nuclease (TALEN) targeted to locations within exon 1 , exon 2, exon 3, etc. of the fatty acid hydrolase genomic DNA (see Tables 6 and 7).
- TALE nuclease TALEN
- the TALEN target cleavage site within the gene sequences may be selected based on ZiFit.partners.org (ZiFit Targeter Version 4.2) and then TALENs are designed based on known methods (Boch J et al., 2009 Science 326:1509-1512; Bogdanove, A. J. & Voytas, D. F. 201 1 Science 333, 1843-1846; Miller, J. C. et al., 201 1 Nat Biotechnol 29, 143-148).
- TALEN methods to target disruption of two or more of the PLBD2 gene ⁇ e.g., exon 1 or exon 2), LPL gene ⁇ e.g., exon 2, 3 or 4), LI PA gene ⁇ e.g., exon 1 or 2) and ACE gene ⁇ e.g., exon 1 , 3 or 4) are also embodied by the invention.
- RNA-guided endonucleases are programmable genome engineering tools that were developed from bacterial adaptive immune machinery.
- CRISPR clustered regularly interspaced short palindromic repeats
- Cas CRISPR-associated
- the protein Cas9 forms a sequence-specific endonuclease when complexed with two RNAs, one of which guides target selection.
- RGENs consist of components (Cas9 and tracrRNA) and a target- specific CRISPR RNA (crRNA).
- CRISPR-Cas9 methods to target disruption of two or more of the PLBD2 gene e.g., exon 1 or exon 2
- LPL gene e.g., exon 2, 3 or 4
- LIPA gene e.g., exon 1 or 2
- ACE gene e.g., exon 1 , 3 or 4
- BuD-derived nucleases with precise DNA-binding specificities (Stella, S. et al. Acta Cryst. 2014, D70, 2042-2052).
- a single residue-to-nucleotide code guides the BuDN to the specific DNA target within polynucleotide of interest ⁇ e.g., SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and or SEQ ID NO:77).
- Sequence-specific endonucleases may be directed to a target sequence at any one of the exons encoding PLBD2, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003614971 .1 , at: Exon 1 within nucleotides (nt) 175367 to 175644 (SEQ ID NO:47); Exon 2 within nt 168958 to 169051 (SEQ ID NO:48); Exon 3 within nt 166451 to166609 (SEQ ID NO:49); Exon 4 within nt 164966 to 165066 (SEQ ID NO:50); Exon 5 within nt 164564 to164778 (SEQ ID NO:51 ); Exon 6 within nt 162682 to162779 (SEQ ID NO:52); Exon 7 within nt 160036 to160196 (SEQ ID NO:53); Exon 8 within nt 159733 to 159828 (S
- Sequence-specific endonucleases may be directed to a target sequence at any one of the exons encoding LPL, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003613760.1 , at: Exon 1 within nucleotides (nt) 1257424 to 1257507 (SEQ ID NO:85); Exon 2 within nt 1266450 to 1266610 (SEQ ID NO:86); Exon 3 within nt 1270069 1270248 (SEQ ID NO:87); Exon 4 within nt 1271770 to 1271881 (SEQ ID NO:88); Exon 5 within nt 12283518 12283751 (SEQ ID NO:89); Exon 6 within nt 123715 1273957 (SEQ ID NO:90); Exon 7 within nt 1276672 1276792 (SEQ ID NO:91 ); Exon 8 within nt 1278328 to 12
- Sequence-specific endonucleases may be directed to a target sequence at any one of the exons encoding LIPA, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003614200.1 , at: Exon 1 within nucleotides (nt) 985778 to 985674 (SEQ ID NO:94); Exon 2 within nt 984375 to 984258 (SEQ ID NO:95); Exon 3 within nt 970771 970573 (SEQ ID NO:96); Exon 4 within nt 969327 to 969218 (SEQ ID NO:97); Exon 5 within nt 968139 968003 (SEQ ID NO:98); Exon 6 within nt 961871 to 961725 (SEQ ID NO:99); Exon 7 within nt 960826 to 960755 (SEQ ID NO:100); Exon 8 within nt 95
- Sequence-specific endonucleases may be directed to a target sequence at any one of the exons encoding ACE, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003613654.1 , at: Exon 1 within nucleotides (nt) 1378167 to 1378244 (SEQ ID NO: 103); Exon 2 within nt 1393746 to 1393792 (SEQ ID NO: 104); Exon 3 within nt 1398208 to 1398298 (SEQ ID NO:105); Exon 4 within nt 1399171 to 1399257 (SEQ ID NO: 106); Exon 5 within nt 1402147 to 1402225 (SEQ ID NO: 107); Exon 6 within nt 1404854 to 1404928 (SEQ ID NO:108); Exon 7 within nt 1405714 to 1405759 (SEQ ID NO:109); Exon 8 within nt 140
- Precise genome modification methods are chosen based on the tools available compatible with unique target sequences within SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 so that disruption of the cell phenotype is avoided.
- any protein of interest suitable for expression in prokaryotic or eukaryotic cells can be used in the engineered host cell systems provided.
- the protein of interest includes, but is not limited to, an antibody or antigen-binding fragment thereof, a chimeric antibody or antigen- binding fragment thereof, an ScFv or fragment thereof, an Fc-fusion protein or fragment thereof, a growth factor or a fragment thereof, a cytokine or a fragment thereof, or an extracellular domain of a cell surface receptor or a fragment thereof.
- Proteins of interest may be simple polypeptides consisting of a single subunit, or complex multisubunit proteins comprising two or more subunits.
- the protein of interest may be a biopharmaceutical product, food additive or preservative, or any protein product subject to purification and quality standards.
- the protein product is an antibody, a human antibody, a humanized antibody, a chimeric antibody, a monoclonal antibody, a multispecific antibody, a bispecific antibody, an antigen binding antibody fragment, a single chain antibody, a diabody, triabody or tetrabody, a Fab fragment or a F(ab')2 fragment, an IgD antibody, an IgE antibody, an IgM antibody, an IgG antibody, an lgG1 antibody, an lgG2 antibody, an lgG3 antibody, or an lgG4 antibody.
- the antibody is an lgG1 antibody.
- the antibody is an lgG2 antibody.
- the antibody is an lgG4 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG4 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG1 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG1/lgG4 antibody. [00172] In some embodiments, the antibody is selected from the group consisting of an anti- Programmed Cell Death 1 antibody (e.g. an anti-PD1 antibody as described in U.S. Pat. Appln. Pub. No. US2015/0203579A1 ), an anti-Programmed Cell Death Ligand-1 (e.g. an anti-PD-L1 antibody as described in in U.S. Pat. Appln. Pub. No. US2015/0203580A1 ), an anti-DII4 antibody, an anti-Angiopoetin-2 antibody (e.g. an anti-ANG2 antibody as described in U.S. Pat. No.
- an anti- Angiopoetin-Like 3 antibody e.g. an anti-AngPtl3 antibody as described in U.S. Pat. No. 9,018,356
- an anti-platelet derived growth factor receptor antibody e.g. an anti-PDGFR antibody as described in U.S. Pat. No. 9,265,827
- an anti-Erb3 antibody an anti- Prolactin
- Receptor antibody e.g. anti-PRLR antibody as described in U.S. Pat. No. 9,302,015
- an anti- Complement 5 antibody e.g. an anti-C5 antibody as described in U.S. Pat. Appln. Pub. No
- an anti-TNF antibody an anti-epidermal growth factor receptor antibody (e.g. an anti-EGFR antibody as described in U.S. Pat. No. 9,132,192 or an anti-EGFRvlll antibody as described in U.S. Pat. Appln. Pub. No. US2015/0259423A1 ), an anti-Proprotein Convertase Subtilisin Kexin-9 antibody (e.g. an anti-PCSK9 antibody as described in U.S. Pat. No. 8,062,640 or U.S. Pat. Appln. Pub. No. US2014/0044730A1 ), an anti-Growth And Differentiation Factor-8 antibody (e.g.
- an anti-GDF8 antibody also known as anti-myostatin antibody, as described in U.S. Pat Nos. 8,871 ,209 or 9,260,515)
- an anti-Glucagon Receptor e.g. anti-GCGR antibody as described in U.S. Pat. Appln. Pub. Nos. US2015/0337045A1 or US2016/0075778A1
- an anti-VEGF antibody e.g. anti-IL1 R antibody
- an interleukin 4 receptor antibody e.g an anti-IL4R antibody as described in U.S. Pat. Appln. Pub. No. US2014/0271681 A1 or U.S. Pat Nos. 8,735,095 or
- an anti-interleukin 6 receptor antibody e.g. an anti-IL6R antibody as described in U.S. Pat. Nos. 7,582,298, 8,043,617 or 9,173,880
- an anti-IL1 antibody e.g. an anti-IL2 antibody as described in U.S. Pat. Nos. 7,582,298, 8,043,617 or 9,173,880
- an anti-IL1 antibody e.g. an anti-IL2 antibody as described in U.S. Pat. Nos. 7,582,298, 8,043,617 or 9,173,880
- an anti-IL1 antibody e.g. an anti-IL2 antibody, an anti-IL3 antibody, an anti-l L4 antibody, an anti-IL5 antibody, an anti-l L6 antibody, an anti-l L7 antibody
- an anti-interleukin 33 e.g. anti- IL33 antibody as described in U.S. Pat. Appln. Pub. Nos.
- an anti-Respiratory syncytial virus antibody e.g. anti- RSV antibody as described in U.S. Pat. Appln. Pub. No. US2014/0271653A1
- an anti-Cluster of differentiation 3 e.g. an anti-CD3 antibody, as described in U.S. Pat. Appln. Pub. Nos.
- an anti- Cluster of differentiation 20 e.g. an anti-CD20 antibody as described in U.S. Pat. Appln. Pub. Nos. US2014/0088295A1 and US20150266966A1 , and in U.S. Pat. No. 7,879,984
- an anti-CD19 antibody, an anti-CD28 antibody, an anti- Cluster of Differentiation-48 e.g. anti-CD48 antibody as described in U.S. Pat. No. 9,228,014
- an anti-Fel d1 antibody e.g. as described in U.S. Pat. No.
- an anti-Middle East Respiratory Syndrome virus e.g. an anti-MERS antibody as described in U.S. Pat. Appln. Pub. No. US2015/0337029A1
- an anti-Ebola virus antibody e.g. as described in U.S. Pat. Appln. Pub. No. US2016/0215040
- an anti-Zika virus antibody e.g. an anti- Lymphocyte Activation Gene 3 antibody, or an anti-CD223 antibody
- an anti-Nerve Growth Factor antibody e.g. an anti-NGF antibody as described in U.S. Pat. Appln. Pub. No. US2016/0017029 and U.S. Pat. Nos.
- the bispecific antibody is selected from the group consisting of an anti-CD3 x anti-CD20 bispecific antibody (as described in U.S. Pat. Appln. Pub. Nos.
- the protein of interest is selected from the group consisting of alirocumab, sarilumab, fasinumab, nesvacumab, dupilumab, trevogrumab, evinacumab, and rinucumab. All publications mentioned throughout this disclosure are incorporated herein by reference in their entirety.
- the protein of interest is a recombinant protein that contains an Fc moiety and another domain, (e.g., an Fc-fusion protein).
- an Fc-fusion protein is a receptor Fc-fusion protein, which contains one or more extracellular domain(s) of a receptor coupled to an Fc moiety.
- the Fc moiety comprises a hinge region followed by a CH2 and CH3 domain of an IgG.
- the receptor Fc-fusion protein contains two or more distinct receptor chains that bind to either a single ligand or multiple ligands.
- an Fc-fusion protein is a TRAP protein, such as for example an IL-1 trap (e.g., rilonacept, which contains the IL-1 RAcP ligand binding region fused to the 11-1 R1 extracellular region fused to Fc of hlgG1 ; see U.S. Pat. No. 6,927,004, which is herein incorporated by reference in its entirety), or a VEGF trap (e.g., aflibercept or ziv-aflibercept, which contains the Ig domain 2 of the VEGF receptor Flt1 fused to the Ig domain 3 of the VEGF receptor Flk1 fused to Fc of hlgG1 ; see U.S. Pat. Nos.
- IL-1 trap e.g., rilonacept, which contains the IL-1 RAcP ligand binding region fused to the 11-1 R1 extracellular region fused to Fc of hlgG1 ; see U.S. Pat. No
- an Fc-fusion protein is a ScFv-Fc-fusion protein, which contains one or more of one or more antigen-binding domain(s), such as a variable heavy chain fragment and a variable light chain fragment, of an antibody coupled to an Fc moiety.
- the host cells used in the methods of the invention are eukaryotic host cells including, for example, Chinese hamster ovary (CHO) cells, human cells, rat cells and mouse cells.
- the invention provides a cell comprising a disrupted nucleic acid sequence fragment of SEQ ID NO:33 and at least one more of SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77.
- the invention includes an engineered mammalian host cell further transfected with an expression vector comprising an exogenous gene of interest, such gene encoding the
- biopharmaceutical product While any mammalian cell may be used, in one particular embodiment the host cell is a CHO cell.
- Transfected host cells include cells that have been transfected with expression vectors that comprise a sequence encoding a protein or polypeptide. Expressed proteins will preferably be secreted into the culture medium for use in the invention, depending on the nucleic acid sequence selected, but may be retained in the cell or deposited in the cell membrane. Various mammalian cell culture systems can be employed to express recombinant proteins. Other cell lines developed for specific selection or amplification schemes will also be useful with the methods and
- compositions provided herein provided that at least two genes encoding a different fatty acid hydrolase (FAH) having at least 80% homology to at least two of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 have been downregulated, knocked out or otherwise disrupted in accordance with the invention.
- An embodied cell line is the CHO cell line designated K1 .
- the host cell line may be pre-adapted to bioreactor medium in the appropriate case.
- transfection protocols are known in the art, and are reviewed in Kaufman (1988) Meth. Enzymology 185:537.
- the transfection protocol chosen will depend on the host cell type and the nature of the GOI, and can be chosen based upon routine experimentation. The basic requirements of any such protocol are first to introduce DNA encoding the protein of interest into a suitable host cell, and then to identify and isolate host cells which have incorporated the
- heterologous DNA in a relatively stable, expressible manner.
- Electroporation can also be used to introduce DNA directly into the cytoplasm of a host cell, for example, as described by Potter ef al. (Proc. Natl. Acad. Sci. USA 81 :7161 , 1988) or Shigekawa ef al. (BioTechniques 6:742, 1988). Unlike protoplast fusion, electroporation does not require the selection marker and the GOI to be on the same plasmid.
- reagents useful for introducing heterologous DNA into a mammalian cell have been described, such as LipofectinTM Reagent and LipofectamineTM Reagent (Gibco BRL, Gaithersburg, Md.). Both of these commercially available reagents are used to form lipid-nucleic acid complexes (or liposomes) which, when applied to cultured cells, facilitate uptake of the nucleic acid into the cells.
- Methods for amplifying the GOI are also desirable for expression of the recombinant protein of interest, and typically involves the use of a selection marker (reviewed in Kaufman supra). Resistance to cytotoxic drugs is the characteristic most frequently used as a selection marker, and can be the result of either a dominant trait ⁇ e.g., can be used independent of host cell type) or a recessive trait ⁇ e.g., useful in particular host cell types that are deficient in whatever activity is being selected for).
- amplifiable markers are suitable for use in the cell lines of the invention and may be introduced by expression vectors and techniques well known in the art (e.g., as described in Sambrook, Molecular Biology: A Laboratory Manual, Cold Spring Harbor Laboratory, NY, 1989; pgs 16.9-16.14).
- Useful selectable markers and other tools for gene amplification such as regulatory elements, described previously or known in the art, can also be included in the nucleic acid constructs used to transfect mammalian cells.
- the transfection protocol chosen and the elements selected for use therein will depend on the type of host cell used. Those of skill in the art are aware of numerous different protocols and host cells in order to adapt the invention for a particular use, and can select an appropriate system for expression of a desired protein, based on the
- the invention relates to the following items:
- a recombinant host cell comprising a modification in two or more genes encoding two or more fatty acid hydrolases (FAH).
- the recombinant host cell according to item 1 wherein the two or more FAHs are selected from the group consisting of phospholipase B-like 2 (PLBD2) protein, lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and acid ceramidase (ACE).
- PLBD2 phospholipase B-like 2
- LPL lipoprotein lipase
- LI PA lysosomal acid lipase
- ACE acid ceramidase
- [00191] a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification;
- [00194] a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification;
- PLBD2 protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:35, and SEQ ID NO:36.
- the LPL protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:62.
- LPL protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:62, SEQ ID NO:66, SEQ ID NO:67, and SEQ ID NO:68.
- LI PA protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:70, SEQ ID NO:74, SEQ ID NO:75, and SEQ ID NO:76.
- ACE protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:78, SEQ ID NO:82, SEQ ID NO:83, and SEQ ID NO:84.
- the recombinant host cell according to any of previous items further comprising a polynucleotide encoding an exogenous protein of interest.
- the exogenous protein of interest is selected from the group consisting of an antibody heavy chain, an antibody light chain, an antigen-binding fragment, an antigen-binding protein, and an Fc-fusion protein.
- a method of producing a recombinant protein comprising the steps of: (a) obtaining a sample comprising a recombinant protein and a plurality of host cell proteins from a host cell that is modified to produce reduced levels of fatty acid hydrolase compared to an unmodified cell; and (b) subjecting the sample to at least one purification step to remove at least one host cell protein.
- a. does not comprise a detectable amount of a phospholipase B-like 2 (PLBD2) protein
- b a modification in a coding sequence of a polynucleotide encoding a fatty acid hydrolase selected from the group consisting of lipoprotein lipase (LPL), lysosomeal acid lipase (LI PA), acid ceramidase (ACE), or a combination thereof.
- LPL lipoprotein lipase
- LI PA lysosomeal acid lipase
- ACE acid ceramidase
- a process for reducing fatty acid hydrolase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein; (b) modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and/or acid ceramidase (ACE); (c) transfecting the host cell with a polynucleotide encoding a protein of interest; (d) extracting a protein fraction comprising the protein of interest from the host cell; (e) contacting the protein fraction with a chromatography media selected from the group consisting of protein A affinity (PA) media, cation exchange (CEX) media, and anion exchange (AEX) media; (f) collecting the protein of interest from the media; and (g) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer.
- PA protein A
- step of modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein comprises inserting or deleting at least one nucleotide within exon 1 of a polynucleotide encoding the PLBD2 protein.
- step of modifying the host cell to decrease or ablate the expression of lipoprotein lipase comprises inserting or deleting at least one nucleotide within exon 2, exon 3 or exon 4 of a polynucleotide encoding the LPL protein.
- step of modifying the host cell to decrease or ablate the expression of acid ceramidase comprises inserting or deleting at least one nucleotide within exon 1 , exon 3 or exon 4 of a polynucleotide encoding the ACE protein.
- composition comprising one or more recombinant proteins obtainable by the method according to any of items 31-36 or the process according to any of items 37-45.
- composition obtainable according to any of items 46-47, wherein the stable composition is characterized by one or more of:
- composition obtainable according to any of items 46-48, wherein the adverse enzyme activity is the activity of one or more of esterases, hydrolases, lipases, phospholipases, ceramidases.
- composition comprising one or more recombinant proteins, wherein the composition is stable.
- composition according to item 50, wherein the stable composition is characterized by one or more of:
- composition according to any of items 50-51 , wherein the adverse enzyme activity is the activity of one or more of esterases, hydrolases, lipases, phospholipases, ceramidases.
- a Type II CRISPR/Cas system which requires at least 20 nucleotides (nt) of homology between a chimeric RNA (i.e., guide RNA) and its genomic target was used.
- Guide RNA sequences were designed for specific targeting of an exon within the CHO phospholipase B-like 2 (PLBD2) nucleic acid (SEQ ID NO:33) and are considered unique (to minimize off-target effects in the genome).
- sgRNA small guide RNAs
- the sgRNA expression plasmid (System Biosciences, CAS940A-1 ) contains a human H1 promoter that drives expression of the small guide RNA and the tracrRNA following the sgRNA.
- Immortalized Chinese hamster ovary (CHO) cells were transfected with the plasmid encoding Cas9- H1 enzyme followed by one of the sgRNA sequences, for instance sgRNAI (SEQ ID NO:45) or sgRNA2 (SEQ ID NO:46), designed to target the first exon of CHO PLBD2.
- sgRNAI and sgRNA2 were predicted to generate a double strand break (DSB) at or around nucleotides 53 and 59 of SEQ ID NO:33, respectively.
- a DSB was therefore predicted to occur approx. 23 or 29 nucleotides downstream of the PLBD2 start codon. (Note that nucleotides 1-30 of SEQ ID NO:33 encode a signal peptide.)
- a negative control transfection was performed where the parental CHO line was transfected with the plasmid encoding Cas9-H1 enzyme without a proceeding sgRNA, or an sgRNA encoding a gene sequence not present in the CHO genome.
- Genomic DNA gDNA
- messenger RNA mRNA
- qPCR primers and probes were designed to overlap with the sgRNA sequence used for the double strand break targeting event, in order to detect disruption of the genomic DNA and its transcription.
- the relative abundance of PLBD2 gene or transcript in the candidate clones was determined using relative qPCR method, where the clones derived from the negative control transfection were used as a calibrator. See Figure 1.
- the qPCR primers and probes were designed to detect sequences either in the sgRNAI or sgRNA2 position in PLBD2 exon 1 .
- gDNA and RNA isolated from clone 1 failed to support qPCR amplification of PLBD2 exon 1 in either sgRNAI or sgRNA2 regions, but amplification of the housekeeping gene, GAPDH, was detected. Based on this data, clone 1 was identified as a potential knock out of PLBD2 in which both genomic alleles of PLBD2 of exonl were disrupted. It is noted that amplification of genomic DNA and mRNA was not detected in Clone 8 using primers overlapping with sgRNA2, however, sgRNAI primers/probes detected genomic DNA above control values. Clone 8, and others were further analyzed in order to understand the performance of the site- directed nuclease method.
- the inventors also unexpectedly identified Clone 8 as a PLBD2 knockout despite the fact that genomic DNA fragments were identified by qPCR primers overlapping with the sgRNAI sequence.
- the identification of a clone that has no detectable phospholipase activity or no detectable phospholipase protein was technically challenging and time-consuming.
- Site-directed nuclease techniques may provide an ease-of-use, however, careful screening and elimination of false positives is necessary and still there may be unpredictable outcomes with regard to the identity of a single clone having two disrupted alleles.
- Clone 1 and the wild type control host cell line were transfected with plasmids encoding the light and heavy chains of mAb1 , a fully human IgG, in the presence of Cre recombinase to facilitate recombination mediated cassette exchange (RMCE) into EESYR locus (US Patent No. 7771997B2, issued August 10, 2010).
- the transfected cultures were selected for 1 1 days in serum-free medium containing 400 ug/mL hygromycin. Cells that underwent RMCE, were isolated by flow cytometry.
- PLBD2 knock out clone 1 and the wild type host cell line produced equivalent observed
- the clone 1 derived isogenic cell line expressing mAb1 was designated RS001
- the mAb1 expressing cell line originated from the PLBD2 wild type host was designated RS0WT.
- Polysorbate 20 or polysorbate 80 degradation was measured to detect putative esterase activity in the supernatants of PLBD2 mutants. Unpurified protein supernatant from CHO cells, and supernatant taken at each step or sequence of steps when subjected to sequential purification steps, was tested for stability of polysorbate. The percent intact polysorbate reported was inversely proportional to the amount of contaminant esterase activity. Unpurified protein supernatant from CHO cells, and supernatant at each step or sequence of steps when subjected to sequential purification steps, was tested in assays measuring polysorbate degradation. The relative levels of intact polysorbate reported is inversely proportional to levels of contaminant esterase activity.
- Monoclonal antibody was produced in an unmodified CHO cell and purified by different processes according to Table 1 1 , and the esterase activity measured by percent intact polysorbate 20, as in Table 9.
- HIC Hydrophobic interaction chromatography
- Example 4 Esterase protein abundance and activity detection in mAbl purified from modified compared to unmodified CHO cells.
- mAbl was produced from RS001 and RSOWT and purified from the conditioned media using either PA alone, or PA and AEX chromatography
- the PA-purified mAB1 from RS001 and RSOWT were analyzed for lipase abundance using trypsin digest mass spectrometry.
- the trypsin digests of RS001 and RSOWT mAbl were injected into a reverse phase liquid chromatography column coupled to a triple quadrupole mass spectrometer set to monitor a specific product ion fragmented from SEQ ID NO:32.
- a hydrophobic interaction column (Phenyl Sepharose® High Performance [GE Healthcare, Little Chalfont, Buckinghamshire, UK]) was used to generate a "HIC strip” fraction containing a protein of interest (i.e., mAb2) and associated host cell proteins.
- the column was first equilibrated with two column volumes (CV) of buffer containing 40 mM Tris, 200 mM citrate at pH8.0.
- the monoclonal antibody-containing load material from an anion exchange pool (“Q pool”) was adjusted to 200 mM sodium citrate, pH8.0, then loaded onto the column at a loading amount of 20-40 grams of protein per liter of the phenyl sepharose resin.
- the column was washed with six CVs of 40 mM Tris, 200 mM citrate, at pH8.0. Then the column was stripped with three CVs of reverse osmosis deionized water. The stripped fraction was collected for subsequent analysis.
- a 50 ⁇ g aliquot of the mAb2-containing HIC strip sample was denatured and reduced in a solution containing 5 mM acetic acid and 5 mM tris(2-carboxyethyl)phosphine-HCI by heating at 80°C for 10 minutes.
- the sample was then diluted in 50 mM Tris-HCI buffer (pH 8.0) and alkylated with 1 .5 mM iodoacetamide (IAA) and digested with trypsin (modified, sequencing grade from Promega, Madison, Wl) with an enzyme to substrate ratio of 1 :20 (w/w) at 37°C in the dark for three hours.
- the digestion was then stopped by addition of 10% trifluoroacetic acid (TFA).
- Peptides were separated using a linear gradient from 1 % mobile phase B (0.1 % FA in acetonitrile) to 7% mobile phase B for the first 5 minutes, followed by a second linear increase from 7% to 27% mobile phase B over the next 1 10 minutes, and another subsequent linear increase from 27%-40% in 10 minutes and a final increase to 90% in 5 minutes. The gradient was held at 90% for 20 minutes.
- a Thermo Q ExactiveTM Plus mass spectrometer (Thermo Scientific, Waltham, MA) was used for peptide mass analyses, with high-energy collisional dissociation (HCD) employed for peptide fragmentation for MS/MS experiments.
- Thermo XcaliburTM version 2.2.42
- Proteome Discoverer version 1.4
- Thermo Scientific, Waltham, MA was also used to perform the peptide identification using both Mascot and Sequent search engines.
- the peptide spectra from the Proteome Discovery was manually examined to confirm the spectral assignment and protein identification.
- Lipoprotein lipase (UniProtein ID: P06858), lysosomal acid lipase (UniProtein ID: P38571 ) and acid ceramidase (UniProtein ID: Q13510) were identified as potential active fatty acid hydrolases by more than three unique peptides per protein in the mAb2-containing HIC strip fraction.
- Clone 1 in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) were transfected with a pair of plasmids.
- One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lipoprotein lipase (LPL) gene (SEQ ID NO:61 ): sgRNA3 (SEQ ID NO:63), sgRNA4 (SEQ ID NO:64) or sgRNA5 (SEQ ID NO:65) (Table 7).
- Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used.
- a single plasmid encoding both the Cas9 and the sgRNA expression cassette was used in some
- Cre recombinase was co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016).
- the transfected cultures were selected for 1 1 days in serum-free medium containing 400 ⁇ g/mL hygromycin. Cells that underwent RMCE were isolated by flow cytometry.
- the desired knock out genotype was confirmed by standard molecular biology methods such as qPCR and sequencing.
- the resultant clone containing only the LPL knock-out loci is designated as "clone 9".
- the resultant clone containing both the PLBD2 and LPL knock-out loci is designated as "clone 10".
- the wild-type host cell, Clone 1 (PLBD2-KO), Clone 9 (LPL-KO) and Clone 10 (LPL+PLBD2 KO) were cultured in 2L bioreactors under fed-batch conditions at 36.5°C (pH 6.9-7.4) with dissolved oxygen.
- Cells were inoculated in the production bioreactor (i.e. transferred from a seed train culture (N-1 ) at a cell titer of 5.0 x 10 6 - 7.0 x 10 6 cells/mL) and protein production was induced by the addition of doxycycline.
- Cells were cultured for 14 days in chemically defined media supplemented with nutrient feeds as needed during the batch culture up to day 12.
- FIG. 8 depicts an alignment of LPL-KO clones 19-22 (SEQ ID NOs:159-162, respectively) compared to a partial Chinese hamster LPL sequence (SEQ ID NO:158) showing the gaps in the respective clone sequences.
- LPL-KO clones (as well as any other fatty acid hydrolase knock-out clone), plasmids encoding for Cas9 nuclease, eYFP and site-specific sgRNA were stably integrated into the CHO genome using Lipofectamine-based transfection protocol followed by selection for neomycin resistant cells. Seventeen days post transfection, the YFP positive cells were enriched by flow cytometry prior to single cell-sorting. After a 21 -day expansion, mRNA was isolated and the single cell clones were analyzed by qPCR for the presence of gene disruption.
- 1 , 2 or 3 different sgRNA expression cassettes may be placed in the same plasmid.
- Some constructs are manufactured using at least 2 sgRNA cassettes per lipase in a single plasmid.
- Clone 1 in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) are transfected with a pair of plasmids.
- One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lysosomal acid lipase (LIPA) gene (SEQ ID NO:69): sgRNA6 (SEQ ID NO:71 ), sgRNA7 (SEQ ID NO:72) or sgRNA8 (SEQ ID NO:73) (Table 7).
- Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used.
- a single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs.
- Cre recombinase is co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent
- the transfected cultures are selected for 1 1 days in serum-free medium containing 400 ⁇ g/mL hygromycin.
- Cells that undergo RMCE are isolated by flow cytometry.
- the desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing.
- the resultant clone containing only the LI PA knock-out or knock-down loci is designated as "clone 1 1 ".
- the resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 12".
- Clone 1 in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) are transfected with a pair of plasmids.
- One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the acid ceramidase (ACE) gene (SEQ ID NO:77): sgRNA9 (SEQ ID NO:79), sgRNAI O (SEQ ID NO:80) or sgRNA1 1 (SEQ ID NO:81 ) (Table 7).
- Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used.
- a single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs.
- Cre recombinase is co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016).
- the transfected cultures are selected for 1 1 days in serum-free medium containing 400 ⁇ g/mL hygromycin.
- Cells that undergo RMCE are isolated by flow cytometry.
- the desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing.
- the resultant clone containing only the ACE knock-out or knock-down loci is designated as "clone 13".
- the resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 14".
- Clone 9 in which the LPL gene and is modified
- Clone 10 in which both the LPL gene and the PLBD2 gene are modified
- plasmids encodes the Cas9 protein
- the other plasmid encodes the guide RNA designed to target the lysosomal acid lipase (LIPA) gene (SEQ ID NO:69): sgRNA6 (SEQ ID NO:71 ), sgRNA7 (SEQ ID NO:72) and/or sgRNA8 (SEQ ID NO:73) (Table 7).
- Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used.
- a single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs.
- Cre recombinase is co- transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016).
- the transfected cultures are selected for 1 1 days in serum-free medium containing 400 ⁇ g/mL hygromycin.
- Cells that undergo RMCE are isolated by flow cytometry.
- the desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing.
- the resultant clone containing the LIPA modified locus and the LPL modified locus designated as "clone 15".
- the resultant clone containing the PLBD2, LPL and LIPA modified loci is designated as "clone 16".
- LIPA Lipase
- ACE Acid Ceramidase
- Clone 15 in which the LPL and LIPA genes and are modified
- Clone 16 in which the PLBD2, LPL and LIPA genes are modified
- a pair of plasmids One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the acid ceramidase (ACE) gene (SEQ ID NO:77): sgRNA9 (SEQ ID NO:79), sgRNAI O (SEQ ID NO:80) or sgRNA1 1 (SEQ ID NO:81 ) (Table 7).
- Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used.
- a single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs.
- Cre recombinase is co- transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016).
- the transfected cultures are selected for 1 1 days in serum-free medium containing 400 ⁇ g/mL hygromycin.
- Cells that undergo RMCE are isolated by flow cytometry.
- the desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing.
- the resultant clone containing the LIPA modified locus, the LPL modified locus and the ACE modified locus is designated as "clone 17".
- the resultant clone containing the PLBD2 modified locus, the LIPA modified locus, the LPL modified locus and the ACE modified locus is designated as "clone 18".
- the invention provides for cells and methods for expression and purification of recombinant proteins in eukaryotic cells.
- the invention includes methods and compositions for expression of proteins in eukaryotic cells, particularly Chinese hamster (Cricetulus griseus) cell lines, that employ downregulating gene expression of endogenous proteins in order to control production of such unwanted "sticky" host cell proteins.
- the invention includes polynucleotides and modified cells that facilitate purification of an exogenous recombinant protein of interest.
- the methods of the invention efficiently target host cell proteins in the Chinese hamster cellular genome in order to facilitate enhanced and stable expression of recombinant proteins expressed by the modified cells.
- HCPs host cell proteins
- LC/MS Advanced liquid chromatography/mass spectrometry
- HCPs Changes in cell culture conditions of eukaryotic cells has been shown to impact the purity and stability of manufactured proteins, as seen by the increased quantity of HCPs of CHO cells upon downstream bioprocessing alterations (Tait, et al, 2013, Biotechnol Prog 29(3):688-696).
- the detrimental effect of leftover HCPs in any product may affect the overall quality or quantity, or both the quality and quantity of the product.
- HCPs if present even at low levels in a therapeutic product, may induce an undesired immune response which causes concern for patient safety and efficacy of the drug product (Singh SK. 201 1.
- the invention provides a recombinant host cell, wherein the cell is modified to decrease the expression levels of two or more fatty acid hydrolases (FAHs) relative to the expression levels of FAH in an unmodified cell.
- FHs fatty acid hydrolases
- the invention provides a recombinant host cell, wherein the cell is modified to have no expression of two or more target FAHs.
- one of said two or more target FAHs is an esterase.
- the esterase is a lipase.
- the lipase is: (1 ) a phospholipase, such as a phospholipase B-like protein or a phospholipase B-like 2 protein, (2) a lipoprotein lipase or (3) a lysosomal acid lipase.
- one of said two or more target FAHs is an amidase.
- the amidase is a fatty acid acylase.
- the fatty acid acylase is an acid ceramidase.
- a gene of interest is exogenously added to the recombinant host cell.
- the exogenously added gene encodes a protein of interest (POI), for example the POI is selected form the group consisting of antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein.
- POI protein of interest
- the invention relates to a composition comprising one or more proteins of interest (POIs) obtainable by a method according to the invention.
- the protein may be a recombinant protein and may be e.g. selected form the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein, or any combinations thereof.
- the invention relates to a composition comprising one or more proteins of interest (POIs), wherein the adverse enzyme activity is ⁇ 50%, 40%, 48%, 47%, 46%, 45%, 44%, 43%, 42%, 41 %, 40%, 39%, 38%, 37%, 36%, 35%, 34%, 33%, 32%, 31 %, 30%, 29%, 28%, 26%, 25%, 24%, 23%, 22%, 21%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, 0.05%, or ⁇ 0.05% active relative to the adverse enzyme activity of a wild-type production system.
- POIs proteins of interest
- the phrase "adverse enzyme activity" refers to any enzyme and its action upon the resulting composition as a whole, wherein the action results in metabolism of any protein components which metabolites reduces the shelf-life of the composition or results in the formation of subvisible particles (SVPs) above prescribed regulations.
- the protein may be a recombinant protein and may e.g. be selected from the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein or any combinations thereof.
- the enzymes and their activities may be, e.g., various esterases, hydrolases, lipases, phospholipases, ceramidases and the likes, or any combinations thereof.
- Adverse enzyme activity may be measured using a functional assay ⁇ e.g., polysorbate fatty acid hydrolysis assay), or a structural assay ⁇ e.g., nano LC-MS of peptide fragments, or the like).
- the invention provides a cell comprising a nonfunctional PLBD2 protein and one or more additional nonfunctional fatty acid hydrolases (FAH).
- FAH nonfunctional fatty acid hydrolases
- nonfunctional FAH is a nonfunctional lipoprotein lipase (LPL), lysosomal acid lipase (LI PA), acid ceramidase (ACE), platelet-activating factor acetylhydrolase IB subunit gamma (PAFAHG), phosphoinositide phospholipase C fragment (PIPLCf), phosphoinositide phospholipase C (PIPLC), liver carboxylesterase 1 (LCE), isoamyl acetate-hydrolyzing esterase 1 -like (IAH1 ), group XV phospholipase A2 (LPLA2), carboxylic ester hydrolase (CEH), and/or arylsulfatase A (ASA).
- LPL lipoprotein lipase
- LI PA lysosomal acid lipase
- ACE acid ceramidase
- PAFAHG platelet-activating factor acetylhydrolase IB subunit gamma
- the invention provides making a cell by FAH target disruption.
- the method comprises a site-specific nuclease for disrupting or editing the cell genome at a target site or sequence.
- the FAH target site is (1 ) a PLBD2 target site, (2) an LPL target site, (3) an LI PA target site, (4) an ACE target site, (5) a PAFAHG site, (6) a PIPLCf or PIPLC site, (7) an LCE site, (8) an IAH1 site, (9) an LPLA2 site, (9) an CEH site, and /or (10) an ASA site.
- the PLBD2 target site comprises a position within SEQ ID NO:33, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:33, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:33, within exon 1 of SEQ ID NO:33, within exon 2 of SEQ ID NO:33, or within exon 3 of SEQ ID NO:33.
- the PLBD2 target site comprises a position within SEQ ID NO:33 or adjacent to a position within SEQ ID NO:33 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-230, 190-210, 200-220, 210-230, 220-240, 230-250, 240-260, and 250-270 of SEQ ID NO:33.
- the target site at a position within SEQ ID NO:33 or adjacent to a position within SEQ ID NO:33 is selected from the group consisting of nucleotides spanning positions numbered 37-56, 44-56, 33-62, 40-69, 110-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
- the PLBD2 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:33 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:33 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the LPL target site comprises a position within SEQ ID NO:61 , within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:61 , 100 nucleotides downstream of the 5-prime end of SEQ ID NO:61 , within exon 1 of SEQ ID NO:61 , within exon 2 of SEQ ID NO:61 , or within exon 3 of SEQ ID NO:61.
- the LPL target site comprises a position within SEQ ID NO:61 or adjacent to a position within SEQ ID NO:61 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440
- the target site at a position within SEQ ID NO:61 or adjacent to a position within SEQ ID NO:61 is selected from the group consisting of nucleotides spanning positions numbered 465-484, 558-577, and 593-612 of SEQ ID NO:61.
- the LPL target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:61 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:61 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the LIPA target site comprises a position within SEQ ID NO:69, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:69, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:69, within exon 1 of SEQ ID NO:69, within exon 2 of SEQ ID NO:69, or within exon 3 of SEQ ID NO:69.
- the LIPA target site comprises a position within SEQ ID NO:69 or adjacent to a position within SEQ ID NO:69 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350 and 340-360 of SEQ ID NO:69.
- the target site at a position within SEQ ID NO:69 or adjacent to a position within SEQ ID NO:69 is selected from the group consisting of nucleotides spanning positions numbered 180-199, 239-258, and 276-295 of SEQ ID NO:69.
- the LIPA target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:69 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:69 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the ACE target site comprises a position within SEQ ID NO:77, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:77, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:77, within exon 1 of SEQ ID NO:77, within exon 2 of SEQ ID NO:77, within exon 3 of SEQ ID NO:77, or within exon 4 of SEQ ID NO:77.
- the ACE target site comprises a position within SEQ ID NO:77 or adjacent to a position within SEQ ID NO:77 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370 and 360-380 of SEQ ID NO:77.
- the target site at a position within SEQ ID NO:77 or adjacent to a position within SEQ ID NO:77 is selected from the group consisting of nucleotides spanning positions numbered 135-154, 237-256, and 332-351 of SEQ ID NO:77.
- the ACE target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:77 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:77 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the PAFAHG target site comprises a position within SEQ ID NO:1 17, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:117, 100 nucleotides
- the PAFAHG target site comprises a position within SEQ ID NO:1 17 or adjacent to a position within SEQ ID NO:1 17 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410
- the target site at a position within SEQ ID NO:1 17 or adjacent to a position within SEQ ID NO:117 is selected from the group consisting of nucleotides spanning positions numbered 101-120, 1 1 1-130, 121 -140, 131-150, 141-160, and 150-169 of SEQ ID NO:1 17.
- the PAFAHG target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 17 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 17 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the PIPLC target site comprises a position within SEQ ID NO:1 18, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:118, 100 nucleotides
- SEQ ID NO:1 downstream of the 5-prime end of SEQ ID NO: 118, within exon 1 of SEQ ID NO:1 18, within exon 2 of SEQ ID NO:118, within exon 3 of SEQ ID NO:118, within exon 4 of SEQ ID NO:1 18, within exon 5 of SEQ ID NO:1 18, within exon 6 of SEQ ID NO:1 18, within exon 7 of SEQ ID NO:1 18, within exon 8 of SEQ ID NO: 1 18, within exon 9 of SEQ ID NO: 1 18, within exon 10 of SEQ ID NO: 1 18, within exon 1 1 of SEQ ID NO:1 18, within exon 12 of SEQ ID NO:1 18, or within exon 13 of SEQ ID NO:1 18.
- the PIPLC target site comprises a position within SEQ ID NO:1 18 or adjacent to a position within SEQ ID NO:1 18 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410
- the target site at a position within SEQ ID NO:1 18 or adjacent to a position within SEQ ID NO:118 is selected from the group consisting of nucleotides spanning positions numbered 39-188, 39-58, 49-68, 59-78, 69-88, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 149-178, and 159-188 of SEQ ID NO:118.
- the PIPLC target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 18 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 18 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the LCE target site comprises a position within SEQ ID NO:1 19, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 1 19, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:1 19, within exon 1 of SEQ ID NO:1 19, within exon 2 of SEQ ID NO:1 19, within exon 3 of SEQ ID NO:119, within exon 4 of SEQ ID NO:1 19, or within exon 5 of SEQ ID NO:1 19.
- the LCE target site comprises a position within SEQ ID NO:1 19 or adjacent to a position within SEQ ID NO:1 19 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-
- the target site at a position within SEQ ID NO:1 19 or adjacent to a position within SEQ ID NO:119 is selected from the group consisting of nucleotides spanning positions numbered 89-140, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, and 139-158 of SEQ ID NO:1 19.
- the LCE target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 19 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 19 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the IAH1 target site comprises a position within SEQ ID NO:120, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 120, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:120, within exon 1 of SEQ ID NO:120, within exon 2 of SEQ ID NO:120, within exon 3 of SEQ ID NO:120, or within exon 4 of SEQ ID NO:120.
- the IAH1 target site comprises a position within SEQ ID NO:120 or adjacent to a position within SEQ ID NO:120 selected from the group consisting of nucleotides spanning positions numbered -20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430
- the target site at a position within SEQ ID NO:120 or adjacent to a position within SEQ ID NO:120 is selected from the group consisting of nucleotides spanning positions numbered 104-325, 104-123, 1 14-133, 124-143, 134-153, 144-163, 154-173, 164-183, 174-193, 184-203, 194-213, 204-223, 214-233, 224-243, 234-253, 244-263, 254-273, 264-283, 274-293, 284-303, 294-313, 304-323, and 314-333 of SEQ ID NO:120.
- the IAH1 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO: 120 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:120 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the LPLA2 target site comprises a position within SEQ ID NO:121 , within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 121 , 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 121 , within exon 1 of SEQ ID NO: 121 , within exon 2 of SEQ ID NO: 121 , within exon 3 of SEQ ID NO: 121 , or within exon 4 of SEQ ID NO: 121.
- the LPLA2 target site comprises a position within SEQ ID NO:121 or adjacent to a position within SEQ ID NO:121 selected from the group consisting of nucleotides spanning positions numbered -20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-
- the target site at a position within SEQ ID NO:121 or adjacent to a position within SEQ ID NO:121 is selected from the group consisting of nucleotides spanning positions numbered 69-195, 69-88, 79-98, 89-108, 99-118, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, 169-188, and 179-198 of SEQ ID NO:121.
- the LPLA2 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO: 121 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:121 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- a standard method such as mass spectrometry
- the CEH target site comprises a position within SEQ ID NO:122, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 122, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 122, within exon 1 of SEQ ID NO: 122, within exon 2 of SEQ ID NO: 122, within exon 3 of SEQ ID NO: 122, within exon 4 of SEQ ID NO: 122, or within exon 5 of SEQ ID NO: 122.
- the CEH target site comprises a position within SEQ ID NO:122 or adjacent to a position within SEQ ID NO:122 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430,
- the target site at a position within SEQ ID NO:122 or adjacent to a position within SEQ ID NO:122 is selected from the group consisting of nucleotides spanning positions numbered 79-186, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, and 169-188 of SEQ ID NO:122.
- the CEH target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:122 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO: 122 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the ASA target site comprises a position within SEQ ID NO:123, within 100 nucleotides upstream of the 5 prime end of SEQ ID NO: 123 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 123, within exon 1 of SEQ ID NO: 123, within exon 2 of SEQ ID NO: 123, within exon 3 of SEQ ID NO: 123, or within exon 4 of SEQ ID NO: 123.
- the ASA target site comprises a position within SEQ ID NO:123 or adjacent to a position within SEQ ID NO:123 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430,
- the target site at a position within SEQ ID NO:123 or adjacent to a position within SEQ ID NO:123 is selected from the group consisting of nucleotides spanning positions numbered 79-296, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, 169-188, 179-198 , 189-208 , 199-218, 209-228, 219-238, 229-248, 239-258, 249-268, 259-278, 269-288, and 279-298 of SEQ ID NO:123.
- the ASA target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:123 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO: 123 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the cell further integrates an exogenous nucleic acid sequence.
- the cell is capable of producing an exogenous protein of interest.
- the altered protein resulting from a disrupted gene does not bind to the protein of interest produced by the cell.
- an isolated Chinese hamster ovary (CHO) cell comprising an engineered nucleic acid sequence comprising a variant of the PLBD2 gene (such as a variant of SEQ ID NO:33).
- the PLBD2 gene comprises GACAGTCACG TGGCCCGACT GAGGCACGCG , nucleotides 1-30 of SEQ ID NO:33 (SEQ ID NO: 44).
- the PLBD2 gene is engineered to disrupt expression of the open reading frame.
- the invention provides an isolated CHO cell comprising (a) a disrupted PLBD2 gene comprising GACAGTCACG TGGCCCGACT GAGGCACGCG (SEQ ID NO: 44, also nucleotides 1- 30 of SEQ ID NO:33), (b) a disrupted esterase gene comprising a nucleotide encoding any one of the amino acid sequences in Table 2, or (c) a protein fragment of Table 2 expressed by a disrupted PLBD2 gene; and an exogenous nucleic acid sequence comprising a gene of interest.
- the CHO cell that comprises an engineered nucleic acid sequence comprising a variant of the PLBD2 gene also comprises variants of one or more of the genes encoding (1 ) LPL (variant of SEQ ID NO:61 ), (2) LIPA (SEQ ID NO:69), (3) ACE (SEQ ID NO:77), (4) PAFAHG (SEQ ID NO:1 17), (5) PIPLC (SEQ ID NO:1 18), (6) LCE (SEQ ID NO:1 19), (7) IAH1 (SEQ ID NO:120), (8) LPLA2 (SEQ ID NO:121 ), (9) CEH (SEQ ID NO:122), and/or (10) ASA (SEQ ID NO:123).
- a method of producing a protein of interest using a recombinant host cell wherein the host cell is modified to decrease the expression levels of esterase relative to the expression levels of esterase in an unmodified cell.
- the method comprises the modified host cell having decreased esterase expression and an exogenous nucleic acid sequence comprising a gene of interest (GOI).
- GOI gene of interest
- the exogenous nucleic acid sequence comprises one or more genes of interest.
- the one or more genes of interest are selected from the group consisting of a first GOI, a second GOI and a third GOI.
- the invention provides expression systems comprising the recombinant host cell comprising modified or nonfunctional esterase.
- the cell comprises a GOI operably linked to a promoter capable of driving expression of the GOI, wherein the promoter comprises a eukaryotic promoter that can be regulated by an activator or inhibitor.
- the eukaryotic promoter is operably linked to a prokaryotic operator, and the eukaryotic cell optionally further comprises a prokaryotic repressor protein.
- one or more selectable markers are expressed by the modified host cell.
- the genes of interest and/or the one or more selectable markers are operably linked to a promoter, wherein the promoter may be the same or different.
- the promoter comprises a eukaryotic promoter (such as, for example, a CMV promoter or an SV40 late promoter), optionally controlled by a prokaryotic operator (such as, for example, a tet operator).
- the cell further comprises a gene encoding a prokaryotic repressor (such as, for example, a tet repressor).
- a CHO host cell comprising recombinase recognition sites.
- the recombinase recognition sites are selected from a LoxP site, a /_ox51 1 site, a Lox2272 site, Lox2372, Lox5171 , and a frt site.
- the cell further comprises a gene capable of expressing a recombinase.
- the recombinase is a Cre recombinase.
- the selectable marker gene is a drug resistance gene.
- the drug resistance gene is a neomycin resistance gene or a hygromycin resistance gene.
- the second and third selectable marker genes encode two different fluorescent proteins.
- the two different fluorescent proteins are selected from the group consisting of Discosoma coral (DsRed), green fluorescent protein (GFP), enhanced green fluorescent protein (eGFP), cyano fluorescent protein (CFP), enhanced cyano fluorescent protein (eCFP), yellow fluorescent protein (YFP), enhanced yellow fluorescent protein (eYFP) and far-red fluorescent protein (mKate).
- the first, second, and third promoters are the same. In another embodiment, the first, second, and third promoters are different from each other. In another embodiment, the first promoter is different from the second and third promoters, and the second and third promoters are the same. In more embodiments, the first promoter is an SV40 late promoter, and the second and third promoters are each a human CMV promoter. In other embodiments, the first and second promoters are operably linked to a prokaryotic operator.
- the host cell line has an exogenously added gene encoding a recombinase integrated into its genome, operably linked to a promoter.
- the recombinase is Cre recombinase.
- the host cell has a gene encoding a regulatory protein integrated into its genome, operably linked to a promoter.
- the regulatory protein is a tet repressor protein.
- the first GOI and the second GOI encode a light chain, or fragment thereof, of an antibody or a heavy chain, or fragment thereof, of an antibody.
- the first GOI encodes a light chain of an antibody and the second GOI encodes a heavy chain of an antibody.
- the first, second and third GOI encode a polypeptide selected from the group consisting of a first light chain, or fragment thereof, a second light chain, or fragment thereof and a heavy chain, or fragment thereof.
- the first, second and third GOI encode a polypeptide selected from the group consisting of a light chain, or fragment thereof, a first heavy chain, or fragment thereof and a second heavy chain, or fragment thereof.
- a method for making a protein of interest comprising (a) introducing into a CHO host cell a gene of interest (GOI), wherein the GOI integrates into a specific locus such as a locus described in US Patent No. 7771997B2, issued August 10, 2010 or other stable integration and/or expression-enhancing locus; (b) culturing the cell of (a) under conditions that allow expression of the GOI; and (c) recovering the protein of interest.
- the protein of interest is selected from the group consisting of a subunit of an immunoglobulin, or fragment thereof, and a receptor, or ligand-binding fragment thereof.
- the protein of interest is selected from the group consisting of an antibody light chain, or antigen-binding fragment thereof, and an antibody heavy chain, or antigen-binding fragment thereof.
- the CHO host cell genome comprises further modifications, and comprises one or more recombinase recognition sites as described above, and the GOI is introduced into a specific locus through the action of a recombinase that recognizes the
- the GOI is introduced into the cell employing a targeting vector for recombinase-mediated cassette exchange (RMCE) when the CHO host cell genome comprises at least one exogenous recognition sequence within a specific locus.
- RMCE recombinase-mediated cassette exchange
- the GOI is introduced into the cell employing a targeting vector for homologous recombination, and wherein the targeting vector comprises a 5' homology arm homologous to a sequence present in the specific locus, a GOI, and a 3' homology arm
- the targeting vector further comprises two, three, four, or five or more genes of interest.
- one or more of the genes of interest are operably linked to a promoter.
- a method for modifying a CHO cell genome to integrate an exogenous nucleic acid sequence comprising the step of introducing into the cell a vehicle comprising an exogenous nucleic acid sequence wherein the exogenous nucleic acid integrates within a locus of the genome.
- the invention provides a process for manufacturing a stable protein formulation comprising the steps of: (a) extracting a protein fraction from the modified host cell of the invention having decreased or ablated expression of esterase, (b) contacting the protein fraction comprising a protein of interest with a column selected from the group consisting of protein A affinity (PA), cation exchange (CEX) and anion exchange (AEX) chromatography, (c) collecting the protein of interest from the media, wherein a reduced level of the esterase activity is associated with the protein fraction collected at step (c), thus providing a stable protein formulation.
- PA protein A affinity
- CEX cation exchange
- AEX anion exchange
- the invention provides a process for reducing esterase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate expression of esterase, (b) transfecting the host cell with a protein of interest, (c) extracting a protein fraction from the modified host cell, (c) contacting the protein fraction comprising the protein of interest with a column selected from the group consisting of protein A affinity (PA), cation exchange (CEX) and anion exchange (AEX) chromatography, (d) collecting the protein of interest from the media, and (e) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer, thus providing a protein formulation essentially free of detectable esterase activity.
- the protein formulation is essentially free of PLBD2 protein or PLBD2 activity.
- a method for modifying a CHO cell genome to express a therapeutic agent comprising a vehicle for introducing, into the genome, an exogenous nucleic acid comprising a sequence for expression of the therapeutic agent, wherein the vehicle comprises a 5' homology arm homologous to a sequence present in the nucleotide sequence of SEQ ID NO:33, a nucleic acid encoding the therapeutic agent, and a 3' homology arm homologous to a sequence present in the nucleotide sequence of SEQ ID NO:33.
- the invention provides a modified CHO host cell comprising a modified CHO genome wherein the CHO genome is modified by disruption of target sequence within a nucleotide sequence at least 90% identical to SEQ ID NO: 33.
- the modified CHO host cell further comprises another FAH target sequence disruption.
- the another FAH target sequence is within (1 ) a nucleotide sequence at least 90% identical to SEQ ID NO:61 , (1 ) a nucleotide sequence at least 90% identical to SEQ ID NO:69 and/or (3) a nucleotide sequence at least 90% identical to SEQ ID NO:77.
- the invention provides a modified eukaryotic host cell comprising a modified eukaryotic genome wherein the eukaryotic genome is modified at a target sequence in a coding region of the target gene by a site-specific nuclease.
- the site-specific nuclease comprises a zinc finger nuclease (ZFN), a ZFN dimer, a transcription activator-like effector nuclease (TALEN), a TAL effector domain fusion protein, or an RNA-guided DNA endonuclease.
- ZFN zinc finger nuclease
- TALEN transcription activator-like effector nuclease
- TAL effector domain fusion protein or an RNA-guided DNA endonuclease.
- the invention also provides methods of making such a modified eukaryotic host cell.
- the target sequence can be placed in the indicated orientation as in SEQ ID NO:33, 61 , 69, 77, 1 17, 118, 119, 120, 121 , 122, or 123; or in the reverse of the orientation of SEQ ID NO:33, 61 , 69, 77, 1 17, 1 18, 119, 120, 121 , 122, or 123.
- Fig. 1 depicts the results of Taqman® quantitative polymerase chain reaction (qPCR) experiments to detect genomic (gDNA) or transcripts (mRNA) of the modified clones.
- Primers and probes were designed to flank the sequences predicted as subject to targeted disruption within exon 1 , either starting at nucleotide 37 (sgRNAI ) or starting at nucleotide 44 (sgRNA2) of SEQ ID NO:33.
- Relative amount of amplicons from clones targeted by either sgRNAI or sgRNA2 are graphed (i.e., relative to amplicons derived from the negative control transfection clones which were subject to no sgRNA or unmatched sgRNA).
- Clone 1 for example, has relatively no amplified gDNA nor mRNA per qPCR of the targeted exon 1 region. Clone 1 and several other clones were selected for follow up analysis. The vertical axes represent the relative amount of template containing sgRNAI sequence (upper panel) or sgRNA2 sequence (lower panel).
- FIG. 2A and Fig. 2B illustrate the results of further PCR analysis of a Clone 1 cells population compared to wild type Chinese hamster overy (CHO) cells.
- Fig. 2A shows a PCR amplicon from Clone 1 that is shorter in length (bp) compared to the amplicon from genomic DNA of wild type cells.
- Fig. 2B shows a PCR amplicon from Clone 1 that is shorter in length (bp) compared to the amplicon from mRNA of wild type cells. Sequencing confirmed an 1 1 bp deletion in the PLBD2 gene of Clone 1 .
- the vertical axes represent the size of the PCR fragments in base-pairs (bp).
- Fig. 3 illustrates the relative protein titer of monoclonal antibody 1 (mAbl )-expressing Clone 1 cells (RS001 ) or mAb1 -expressing wild type CHO cells (RS0WT) subject to the same fed-batch culture conditions for 12 days. Samples of conditioned medium were extracted for each culture, and the Protein A binding fraction was quantified at Day 2, 4, 6, 9 and 12.
- Fig. 4 shows the results of RS001 or RS0WT cells following production culture and protein purification using either Protein A (PA) alone, or PA and anion exchange (AEX) chromatography.
- PA-purified mAb1 from RS001 and RS0WT was analyzed for lipase abundance using trypsin digest mass spectrometry.
- trypsin digests of RS001 - and RSOWT-produced mAb1 were injected into a reverse phase liquid chromatography column coupled to a triple quadrupole mass spectrometer set to monitor a specific PLBD2 product fragment (as in Table 2).
- Control reactions containing reference samples of mAb1 (with no endogenous PLBD2) spiked with varying amounts of recombinant PLBD2 were also analyzed and plotted. The signals detected in the experiments were compared to the control reactions to determine concentration of PLBD2.
- mAb1 produced from Clone 1 shows no detectable amounts of PLBD2 when purified with PA alone.
- Fig. 5 is a line plot depicting percent cell viability as a function of time in days. Open circles (-0-) represent wildtype cells. Filled diamonds (- ⁇ -) represent PLBD2-KO cells. Filled squares (- ⁇ -) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
- Fig. 6 is a line plot depicting protein production (titer) in grams per liter as a function of time in days. Open circles (-o-) represent wildtype cells. Filled diamonds (- ⁇ -) represent PLBD2-KO cells. Filled squares (- ⁇ -) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
- Fig. 7 is a line plot depicting viable cell counts in cells per milliliter as a function of time in days. Open circles (-o-) represent wildtype cells. Filled diamonds (- ⁇ -) represent PLBD2-KO cells. Filled squares (- ⁇ -) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
- Fig. 8 shows a formatted alignment of LPL knock out constructs clone 19, clone 20, clone 21 , and clone 22, represented by SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161 , and SEQ ID NO:162, respectively.
- the partial wildtype LPL sequence is represented by SEQ ID NO:158.
- exogenously added gene refers to any DNA sequence or gene not present within the genome of the cell as found in nature.
- an "exogenously added gene” within a CHO genome can be a gene from any other species (e.g., a human gene), a chimeric gene (e.g., human/mouse), or a hamster gene not found in nature within the particular CHO locus in which the gene is inserted (i.e., a hamster gene from another locus in the hamster genome), or any other gene not found in nature to exist within a CHO locus of interest.
- Percent identity when describing an esterase, e.g., a hydrolase protein, such as SEQ ID NO:32, 34, 35, 37, 62,70, 74, 75, 76, 78, 82, 83, 84; 124, 125, 126, 127, 128, 129, and 130; or gene, such as SEQ ID NO:33, 61 , 69, 77, 1 17, 118, 119, 120, 121 , 122, and 123 includes homologous sequences that display the recited identity along regions of contiguous homology, but the presence of gaps, deletions, or insertions that have no homolog in the compared sequence are not taken into account in calculating percent identity.
- a hydrolase protein such as SEQ ID NO:32, 34, 35, 37, 62,70, 74, 75, 76, 78, 82, 83, 84; 124, 125, 126, 127, 128, 129, and 130; or gene, such as SEQ
- a "percent identity" determination between, e.g., SEQ ID NO:32 with a species homolog would not include a comparison of sequences where the species homolog has no homologous sequence to compare in an alignment (i.e., SEQ ID NO:32 compared to a fragment thereof, or the species homolog has a gap or deletion, as the case may be). Thus, “percent identity” does not include penalties for gaps, deletions, and insertions.
- “Targeted disruption” of a gene or nucleic acid sequence refers to gene targeting methods that direct cleavage or breaks (such as double stranded breaks) in genomic DNA and thus cause a modification to the coding sequence of such gene or nucleic acid sequence.
- Gene target sites are the sites selected for cleavage or break by a nuclease.
- the DNA break is normally repaired by the non-homologous end-joining (NHEJ) DNA repair pathway.
- NHEJ non-homologous end-joining
- InDels insertions or deletions
- a small number of nucleotides are either inserted or deleted at random at the site of the break and these InDels may shift or disrupt the open reading frame (ORF) of the target gene. Shifts in the ORF may cause significant changes in the resulting amino acid sequence downstream of the DNA break, or may introduce a premature stop codon, therefore the expressed protein, if any, is rendered nonfunctional or subject to degradation.
- Targeted insertion refers to gene targeting methods employed to direct insertion or integration of a gene or nucleic acid sequence to a specific location on the genome, i.e., to direct the DNA to a specific site between two nucleotides in a contiguous polynucleotide chain. Targeted insertion may also be performed to introduce a small number of nucleotides or to introduce an entire gene cassette, which includes multiple genes, regulatory elements, and/or nucleic acid sequences. "Insertion” and “integration” are used interchangeably.
- Recognition site or “recognition sequence” is a specific DNA sequence recognized by a nuclease or other enzyme to bind and direct site-specific cleavage of the DNA backbone.
- Endonucleases cleave DNA within a DNA molecule.
- Recognition sites are also referred to in the art as recognition target sites.
- Polysorbates are fatty acid esters of sorbitan or iso-sorbide (polyoxyethylene sorbitan or iso- sorbide mono- or di- esters).
- the polyoxyethylene serves as the hydrophilic head group and the fatty acid as the lipophilic hydrophobic tail.
- the effectiveness as a surfactant of the polysorbate depends upon the amphiphilic nature of the molecule with both hydrophilic head and hydrophobic tail present (in a single molecule).
- SVP subvisible particle
- SVPs may attribute to immunogenicity. Regulatory authorities like the United States Food and Drug Administration (USFDA) provide limitations on the number of subvisible particles (SVPs) allowed in a pharmaceutical formulation. United States Pharmacopeia (USP) publishes standards for strength, purity and quality of drugs and drug ingredients, as well as food ingredients and dietary
- USP 31 monograph ⁇ 788> sets the limit for number of particles allowed in parenteral formulations.
- USP 31 monograph ⁇ 788> is available at http://www.uspnf.com/official- text/revision-bulletins/particulate-matter-injections; and as Revision Bulletin Official July 1 , 2012, ⁇ 788> Particulate Matter in Injections, The United States Pharmacopeial Convention.
- the limit is set at no more than 25 particles of at least 10 microns per ml_, and no more than 3 particles of at least 25 microns per ml_.
- the limit is set at no more than 6,000 particles of at least 10 microns per container, and no more than 600 particles of at least 25 microns per container.
- the term "stability” refers to the retention of an acceptable degree of physical structure (colloidal, nature), chemical structure or biological function of the biologically active agent (e.g., biotherapeutic or other protein produced in a cell-based bioprocess) over time during storage (a.k.a. "shelf-life"), during processing, or after administration and while in vivo.
- the biologically active agent may be stable even though it does not maintain 100% of its structure or function after storage or administration for a defined amount of time.
- the biologically active agent and formulation containing the biologically active agent may be regarded as "stable".
- Stability can be measured, inter alia, by determining the percentage of native molecule that remains in the formulation after storage or administration for a defined amount of time at a defined temperature.
- the percentage of native molecule can be determined by, inter alia, size exclusion chromatography (e.g., size exclusion high performance liquid chromatography [SE-HPLC]), such that native means non-aggregated and non-degraded.
- SE-HPLC size exclusion high performance liquid chromatography
- At least about 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% of the native form of the biologically active agent can be detected in the formulation after a defined amount of time at a defined temperature or under physiological conditions after administration.
- the defined amount of time after which stability is measured can be about 14 days, about 28 days, about 1 month, about 2 months, about 3 months, about 4 months, about 5 months, about 6 months, about 7 months, about 8 months, about 9 months, about 10 months, about 1 1 months, about 12 months, about 18 months, about 24 months, or more.
- the temperature at which the formulation containing the biologically active agent may be kept when assessing stability can be any temperature from about -80°C to about 45°C, e.g., storage at about - 80°C, about -30°C, about -20°C, about 0°C, about 4°-8°C, about 5°C, about 25°C, about 35°C, about 37°C or other physiological temperatures, or about 45°C.
- the biologically active agent may be deemed stable if after 3 months under physiological conditions, greater than about 75%, 80%, 85% or 90% of native molecule is detected in the soluble fraction by SE-HPLC or other size exclusion or size determination method.
- “Physiological temperature” includes the body temperature of any vertebrate.
- physiological temperature of humans is about 37°C.
- physiological temperature is between about 25°C and about 45°C.
- physiological temperature is about 25°C, about 26°C, about 27°C, about 28°C, about 29°C, about 30°C, about 31 °C, about 32°C, about 33°C, about 34°C, about 35°C, about 36°C, about 37°C, about 38°C, about 39°C, about 40°C, about 41 °C, about 42°C, about 43°C, about 44°C, and about 45°C.
- Stability can be measured, inter alia, by determining the percentage of biologically active agent, such as a protein, that forms an aggregate (i.e., high molecular weight species) after a defined amount of time at a defined temperature, wherein stability is inversely proportional to the percent high molecular weight (HMW) species that is formed of the biologically active agent (protein).
- the percentage of HMW species of the biologically active agent can be determined by, inter alia, size exclusion chromatography, as described above.
- a pharmaceutical formulation containing the biologically active agent may also be deemed stable if after three months at physiological conditions less than about 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, or 0.1% of the biologically active agent is detected in a HMW form.
- Stability can be measured, inter alia, by determining the percentage of a biologically active agent, such as a protein, that is degraded or otherwise is found as a low molecular weight (LMW) species after a defined amount of time at a defined temperature. Stability is inversely proportional to the percent LMW species that is formed in the soluble fraction. The percentage of LMW species of the biologically active agent in the soluble fraction can be determined by, inter alia, size exclusion chromatography, as described above.
- LMW low molecular weight
- a pharmaceutical formulation may also be deemed stable if after three months under storage conditions less than about 15%, 14%, 13%, 12%, 1 1%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, or 0.1% of the biologically active agent is detected in a LMW form.
- compositions prepared in host cells as described herein are envisioned to comply with standards set forth by multiple regulatory authorities and ICH guidelines.
- the composition complies if tested for subvisible particles and the test results in the average number of particles present in the units tested does not exceed 12 per milliliter equal to or greater than 10 um in size, and does not exceed 2 per milliliter equal to or greater than 25 um.
- Various test for microscopic particles in solution are well-known in the art, including but not limited to tests recommended in ICH Guideline Q4B Annex 3(R1 ), dated 27 September 2010 (Evaluation And Recommendation Of Pharmacopoeial Texts For Use In The lch Regions On Test For Particulate Contamination: Sub-Visible Particles General Chapter).
- HCPs proteins produced or encoded by the host organisms used to produce recombinant therapeutic proteins. HCPs are generally process-related impurities during biologies production. The amount of residual HCPs in drug product is generally considered a critical quality attribute (CQA), due to their potential to affect product safety and efficacy. Regulatory authorities require a product sponsor to monitor the removal of HCPs in drug product during bioprocess development. A sensitive assay e.g., immunoassay, capable of detecting a wide range of protein impurities is generally utilized. This testing can include verification at commercial scale in accordance with regional regulations and may be done at the time of submission of a marketing approval application.
- CQA critical quality attribute
- ICH Specifications (Q6A and Q6B, section 2.3), if a drug substance or drug product does not contain any impurity in the specific formulation, i.e. if efficient control or removal to acceptable levels is demonstrated by suitable studies, further testing may be reduced or eliminated upon approval by the regulatory authorities.
- compositions prepared in host cells as described herein are envisioned to comply with standards set forth by multiple regulatory authorities and ICH guidelines.
- the composition prepared by the host cells described herein comprise less than 100 ng/mg (ppm), less than about 90 ppm, less than about 80 ppm, less than about 70 ppm, less than about 60 ppm, less than about 50 ppm, less than about 40 ppm, less than about 30 ppm, less than about 20 ppm, less than about 10 ppm, less than about 5 ppm, or 0 ppm of the target host cell protein, i.e. fatty acid hydrolase.
- fatty acid hydrolase or “FAH” refers to any hydrolytic enzyme that cleaves at a carbonyl group creating a carboxylic acid product in which the carboxylic acid comprises an R- group that is lipophilic or otherwise hydrophobic.
- the carboxylic acid product is a fatty acid.
- Esterases and “fatty acid acylases/amidases” are included as subgenera of fatty acid hydrolase.
- Lipases are a subgenus of esterases that cleave lipids (fats, waxes, sterols, glycerides and phospholipids.
- Phospholipases are a subgenus of lipases that cleave
- phospholipids phospholipids. Esterases cleave fatty acid esters into fatty acids and alcohols. Lipases include PLBD2, LPL and LIPA. "Ceramidase” is a subgenus of fatty acid acylase that cleaves ceramide and releases a fatty acid and sphingosine, which is an amino alcohol. Examples of ceramidases include acid ceramidase, neutral ceramidase, alkaline ceramidase 1 , alkaline ceramidase 2 and alkaline ceramidase 3.
- Protein A-binding fraction refers to the fraction of cell lysate from cultured cells expressing a protein of interest which binds to a Protein A affinity format. It is well understood in the art that Protein A affinity chromatography, such as Protein A chromatography medium, such as resins, beads, columns and the like, are utilized to capture Fc-containing proteins due to their affinity to Protein A.
- Protein A affinity chromatography such as Protein A chromatography medium, such as resins, beads, columns and the like
- Phospholipase B-like 2 refers to the homologs of a phospholipase gene known as NCBI RefSeq. XM_003510812.2 (SEQ ID NO:33) or protein known as NCBI RefSeq.
- PLBD2 is also referred to in the art as putative phospholipase B-like 2 (PLBL2), 76 kDa protein, LAMA-like protein 2, PLB homolog 2, lamina ancestor homolog 2, mannose-6-phosphate protein associated protein p76, p76, phospholipase B-like 2 32 kDa form, phospholipase B-like 2 45 kDa form, or Lysosomal 66.3 kDa protein.
- Lipoprotein lipase is a glycosylated homodimer secreted by parenchymal cells and associated with endothelial cells of the capillary lumen.
- Exemplary LPL proteins include Chinese hamster LPL (SEQ ID NO:62), mouse LPL (SEQ ID NO:66), ), rat LPL (SEQ ID NO:67) and human LPL (SEQ ID NO:68).
- Mouse LPL is 92% identical to Chinese hamster LPL.
- Rat LPL is 92% identical to Chinese hamster LPL.
- Human LPL is 88% identical to Chinese hamster LPL.
- Chinese hamster LPL is encoded by a polynucleotide sequence of SEQ ID NO:61 .
- Lysosomal acid lipase also known as lysosomal lipase, lipase A, lysosomal acid and cholesterol esterase is an intracellular lipase that functions in the lysosome.
- LIPA reversibly catalyzes cholesteryl ester bond formation and cleavage.
- LIPA is also a glycosylated homodimer.
- Exemplary LIPA proteins include Chinese hamster LIPA (SEQ ID NO:70), mouse LIPA (SEQ ID NO:74), ), rat LIPA (SEQ ID NO:75) and human LIPA (SEQ ID NO:76).
- Mouse LIPA is 72% identical to Chinese hamster LIPA.
- Rat LIPA is 75% identical to Chinese hamster LIPA.
- Human LIPA is 74% identical to Chinese hamster LIPA.
- Chinese hamster LIPA is encoded by a polynucleotide sequence of SEQ ID NO:69.
- Acid ceramidase also known as ASAH1 , AC, ACDase, ASAH, PHP, PHP32, SMAPME, N-acylsphingosine amidohydrolase (acid ceramidase) 1 is an acylase that cleaves ceramide to produce fatty acid and sphingosine. It is a heterodimer comprising a non-glycosylated alpha subunit and a glycosylated beta subunit. Acid ceramidase has an acid pH optimum. The lipid accumulation disease, Farber Lipogranulomatosis, is associated with a deficiency in acid ceramidase activity.
- Exemplary acid ceramidases include Chinese hamster ACE (SEQ ID NO:78), mouse ACE (SEQ ID NO:82), ), rat ACE (SEQ ID NO:83) and human ACE (SEQ ID NO:84).
- Mouse ACE is 87% identical to Chinese hamster ACE.
- Rat ACE is 89% identical to Chinese hamster ACE.
- Human ACE is 83% identical to Chinese hamster ACE.
- Chinese hamster ACE is encoded by a polynucleotide sequence of SEQ ID NO:77.
- Platelet-activating factor acetylhydrolase IB subunit gamma (PAFAHG), also known as PAFAH1 B3, PAFAHG, and platelet activating factor acetylhydrolase 1 b catalytic subunit 3 is one of the catalytic subunits along with beta of the cytosolic tetrameric platelet-activating factor acetylhydrolase IB.
- PAFAHG belongs to the phospholipase A2 family and catalyzes the hydrolysis of the acyl group at position 2 of glycerol in bioactive phospholipids (see Stafforini et al., Journal of Biological Chemistry, 272:17895-17898, July 1997).
- Chinese hamster PAFAHG (SEQ ID NO:124) is 98% identical to both rat and mouse PAFAHG, and 96% identical to human PAFAHG.
- Chinese hamster PAFAHG is encoded by a polynucleotide sequence of SEQ ID NO:1 17.
- Phosphoinositide phospholipase C also known as triphosphoinositide
- inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling.
- PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG).
- IP3 inositol triphosphate
- DAG diacylglycerol
- Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC.
- Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
- Phosphoinositide phospholipase C also known as triphosphoinositide
- inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling.
- PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG).
- IP3 inositol triphosphate
- DAG diacylglycerol
- Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC.
- Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
- Phosphoinositide phospholipase C also known as triphosphoinositide
- inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling.
- PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG).
- IP3 inositol triphosphate
- DAG diacylglycerol
- Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC.
- Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
- Phosphoinositide phospholipase C also known as triphosphoinositide
- phosphodiesterase monophosphatidylinositol phosphodiesterase, phosphatidylinositol phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
- inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling.
- PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG).
- IP3 inositol triphosphate
- DAG diacylglycerol
- Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC.
- Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
- Phosphoinositide phospholipase C also known as triphosphoinositide
- inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling.
- PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG).
- IP3 inositol triphosphate
- DAG diacylglycerol
- Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC.
- Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
- Additional fatty acid hydrolases that can serve as targets for deletion, either individually, or in combination with one of more additional fatty acid hydrolases are listed in Table 1. Any of the following hydrolases, or their equivalents, listed in Table A may be the target protein in the creation of a knockout cell line, where removal of the host cell protein is necessitated due to contamination in the preparation of a biopharmaceutical product.
- cell or “cell line” includes any cell that is suitable for expressing a recombinant nucleic acid sequence.
- Cells include those of prokaryotes and eukaryotes (single-cell or multiple- cell), bacterial cells (e.g., strains of E. coli, Bacillus spp., Streptomyces spp., etc.), mycobacteria cells, fungal cells, yeast cells (e.g.,S. cerevisiae, S. pombe, P. partoris, P.
- the cell is a human, monkey, ape, hamster, rat or mouse cell.
- the cell is eukaryotic and is selected from the following cells: CHO (e.g.,CHO K1 , DXB-1 1 CHO, Veggie-CHO), COS (e.g.,COS-7), retinal cells, Vero, CV1 , kidney (e.g.,HEK293, 293 EBNA, MSR 293, MDCK, HaK, BHK21 ), HeLa, HepG2, WI38, MRC 5, Colo25, HB 8065, HL-60, Jurkat, Daudi, A431 (epidermal), CV-1 , U937, 3T3, L cell, C127 cell, SP2/0, NS-0, MMT cell, tumor cell, and a cell line derived from an aforementioned cell.
- the cell comprises one or more viral genes, e.g., a retinal cell that expresses a viral gene (e.g.,a PER.C6® cell).
- the invention is based at least in part on a recombinant host cell and cell expression system thereof that decreases expression of two or more an endogenous host cell fatty acid hydrolases (FAHs), decreases the enzymatic function or binding ability of two or more endogenous host cell FAHs, or lacks detectable expression of two or more FAHs.
- FHs endogenous host cell fatty acid hydrolases
- the inventors discovered that the disruption of genes encoding at least two FAHs allows for the optimized and efficient production and purification of biopharmaceutical products expressed in such expression systems.
- the invention may be employed in several ways, such as 1 ) utilizing gene editing tools to totally knockout FAH expression, whereas no measurable full-length FAH enzyme is expressed in the cell due to disruption of the gene encoding the FAH; 2) utilizing gene editing tools to eliminate or reduce enzymatic activity, whereas the FAH protein is expressed but rendered nonfunctional due to disruptions in its gene; and 3) utilizing gene editing tools to eliminate or reduce the ability of an endogenous host cell FAH to bind exogenous recombinant protein produced by the cell.
- FAH activity was determined in protein fractions of certain antibody-producing cells. Several particular fatty acid hydrolases were determined as contaminants in these protein fractions, including three carboxylic esterases (a.k.a.
- esterases phospholipase B-like (PLBD2), lipoprotein lipase (LPL) and lysosomal acid lipase (LIPA), and acid ceramidase, a carboxylic amidase (an acylase).
- PLBD2 phospholipase B-like
- LPL lipoprotein lipase
- LIPA lysosomal acid lipase
- acid ceramidase a carboxylic amidase (an acylase).
- Gene editing target sites were identified in hamster PLBD2, LPL, LIPA and acid ceramidase (ACE) genes that enable targeted disruption of those genes in a hamster cell ⁇ i.e., CHO) genome.
- ACE acid ceramidase
- An optimized host cell comprising a combination of genetic modifications that affect the expression of genes encoding (1 ) PLBD2 and LPL; (2) PLBD2 and LIPA; (3) PLBD2 and ACE; (4) LPL and LIPA; (5) LPL and ACE; (6) LIPA and ACE; (7) PLBD2, LPL and LIPA; (8) PLBD2, LPL and ACE; (9) PLBD2, LIPA and ACE; (10) LPL, LIPA and ACE, (1 1 ) PLBD2, LPL, LIPA and ACE, (12) PLBD2 and PIPLC, (13) PLBD2 and CEH, (14) PLBD2 and PAFAHG, (15) PLBD2 and LCE, (16) PLBD2 and ASA, (17) PLBD2 and IAH1 , (18) PLBD2 and LPLA2, (19) LPL and PIPLC, (20) LPL and CEH, (21 ) LPL and PAFAHG, (22) LPL and LCE, (23)
- Such a cell is envisioned to reduce the burden of certain purification steps, thereby reducing time and cost, while increasing production yield. Also, the formulated protein is expected to have improved stability due to the reduced hydrolase burden.
- the invention is also based on the specific targeting of an exogenous gene to the integration site.
- the methods of the invention allow efficient modification of the cell genome, thus producing a modified or recombinant host cell useful as a cell expression system for the
- the methods of the invention employ cellular genome gene editing strategies for the alteration of particular genes of interest that otherwise may diminish or contaminate the quality of recombinant protein formulations, or require multiple purification steps.
- compositions of the invention can also be included in expression constructs for example, in expression vectors for cloning and engineering new cell lines. These cell lines comprise the modifications described herein, and further modifications for optimal incorporation of expression constructs for the purpose of protein expression are envisioned.
- Expression vectors comprising polynucleotides can be used to express proteins of interest transiently, or can be integrated into the cellular genome by random or targeted recombination such as, for example, homologous recombination or recombination mediated by recombinases that recognize specific recombination sites (e.g., Cre-lox-mediated recombination).
- Target sites for disruption or insertion of DNA are typically identified with the maximum effect of the gene disruption or insertion in mind.
- target sequences may be chosen near the N-terminus of the coding region of the gene of interest whereas a DNA break is introduced within the first or second exon of the gene.
- Introns non-coding regions
- the changes introduced by these modifications are permanent to the genomic DNA of the organism.
- RNA editing protocols were employed to render a nonfunctional versions of two or more of those genes.
- the one of those genes encodes PLBD2 (e.g., SEQ ID NO:33).
- protocols known in the art for introducing an expressible gene of interest (GOI), such as a multi- subunit antibody, along with any other desirable elements such as, e.g., promoters, enhancers, markers, operators, ribosome binding sites (e.g., internal ribosome entry sites), efc. are also employed.
- the resulting recombinant cell line conveniently provides more efficient downstream bioprocess methods with respect to expressible exogenous genes of interest (GOIs), since purification steps for exogenous proteins of interest are eliminated due to the absence of the contaminant host cell protein. Eliminating or refining purification procedures also results in higher amounts (titer) of the recovered protein of interest.
- GOIs expressible exogenous genes of interest
- Applicants have discovered enzymatic activities associated with the destabilization of polysorbates (including polysorbate 20 and polysorbate 80) and/or enzymatic activities co-purifying with highly concentrated, multimerized or aggregated protein. Those activities were found to be associated with one or more fatty acid hydrolases (FAHs).
- FAHs fatty acid hydrolases
- One such FAH was identified from the peptide sequences listed in Table 2.
- a BLAST search of those peptide sequences revealed identity with a putative phospholipase B-like 2 (PLBD2, also referred to as PLBL2).
- PLBD2 is highly conserved in hamster (SEQ ID NO:32), mice (SEQ ID NO:34), rat (SEQ ID NO:35), human (SEQ ID NO:36), and bovine (SEQ ID NO:37).
- the applicants discovered that PLBD2 which co-purifies under certain processes with some classes of proteins-of-interest manufactured in a mammalian cell line, has enzymatic activity responsible for the hydrolysis of polysorbate 20 and 80.
- Other FAH species, of which PLBD2 is an example may contribute to polysorbate instability or persist as hydrophobic "sticky" proteins that bind protein multimers or aggregates during purification and ultimate formulation, depending upon the particular protein-of-interest and/or background of the host cell.
- IgGs formulated with polysorbate 20 did not form particles and the putative esterase did not hydrolyze the polysorbate 20.
- the author reported that the putative lipase associated with the IgG did not affect saturated C 12 fatty acid ⁇ i.e., laurate) (Id at 7.)
- LPL lipoprotein lipase
- LI PA lysosomal acid lipase
- ACE acid ceramidase
- LPL is a triacylglycerol/diacylglycerol hydrolase of the carboxylic ester hydrolase (esterase) family (see Hide et al., "Structure and evolution of the lipase superfamily," J Lipid Res. 1992 Feb; 33(2): 167-78).
- LI PA is a sterol esterase and synthase that acts on esters of sterols and long-chain fatty acids (see Dubland and Francis, “Lysosomal acid lipase: at the crossroads of normal and atherogenic cholesterol metabolism,” Front. Cell Dev. Biol. 2015 Feb 2; 3(3): 1-11 ).
- Acid ceramidase is not a carboxylic ester hydrolase, but rather an amide hydrolase that cleaves fatty acids from ceramide at the amide bond (carboxylic amide hydrolase) (see Park and Schuchman, "Acid ceramidase and human disease,” Biochim. Biophys. Acta. 2006 Dec; 1758(12): 2133-8).
- Phospholipases are a family of esterase enzymes that catalyze the cleavage of phospholipids. Each phospholipase subclass has different substrate specificity based on its target cleavage site.
- Phospholipase B was identified as related to a group of prokaryotic and eukaryotic lipase proteins by virtue of the presence of a highly conserved amino acid sequence motif, Gly-Asp-Ser-Leu (GDSL) (Upton, C, and Buckley, JT. A new family of lipolytic enzymes? Trends Biochem Sci. 1995; 20:178-179).
- phospholipase B is also classified with known GDSL(S) hydrolases, and has little sequence homology to true lipases, differentiating itself structurally from phospholipases by having a serine-containing motif closer to the N-terminus than other lipases.
- phospholipase B-like proteins are also classified as N-terminal nucleophile (Ntn) hydrolases.
- Ntn N-terminal nucleophile hydrolases.
- PLB-like proteins such as phospholipase B-like protein 1 (PLBD1 ) and phospholipase B-like protein 2 (PLBD2), also have amidase activity, similar to other Ntn hydrolases (Repo, H. et al, Proteins 2014; 82:300-31 1 ).
- Lipoprotein lipases have also been demonstrated to cleave carboxylic ester bonds of polysorbate 20 and 80; and to associate with some monoclonal antibodies during production (see N. Levy, "Host cell protein impurities and protein-protein interactions in downstream purification of monoclonal antibodies," Dissertation submitted to the Faculty of the University of Delaware, Summer 2014, UMI 3642330, Published by ProQuest LLC, 2014).
- Cell-cultures of a CHO-K1 RNAi knock-down of LPL revealed diminished polysorbate esterase activity. The effect of such a knockdown on overall cell viability or the production of useful titers of ectopic protein has not been investigated.
- Knockout of a host cell gene such as an FAH, more particularly one or more of phospholipase B-like protein 2, lipoprotein lipase, lysosomal acid lipase and acid ceramidase may be accomplished in several ways. Rendering the FAH encoding gene nonfunctional, or reducing the functional activity of the target FAH protein may be done by introducing point mutations in the FAH genomic sequence, particularly in the exons (coding regions).
- nucleic acid sequences of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 were identified and sequences upstream and downstream of the target site ⁇ i.e., homologous arms) may be utilized to integrate an expression cassette comprising a mutated gene by homologous recombination. Further gene editing tools are described herein in accordance with the invention.
- Cell lines devoid of multiple FAH activities are useful for the production of therapeutic proteins to be purified and stored long term, and such cell lines solve problems associated with long term storage of pharmaceutical compositions in a formulation containing a fatty acid ester surfactant by maintaining protein stability and reducing subvisible particle (SVP) formation (see also PCT International Application No. PCT/US 15/54600 filed October 8, 2015, which is hereby incorporated in its entirety into the specification).
- SVP subvisible particle
- Assays to detect FAH activity include polysorbate degradation measurements. Unpurified protein supernatants or fractions from CHO cells, and supernatant at each step or sequence of steps when subjected to sequential purification steps, is tested for stability of polysorbate, such as polysorbate 20 or 80. The measurement of percent intact polysorbate reported is inversely proportional to the amount of contaminating FAH activity. Other measurements for detection of FAH activity or presence of FAH in a protein sample are known in the art. Detection of FAH protein (e.g., lipase, phospholipase, PLBD2, LPL, LIPA, acylase, ACE) may be done by trypsin digest mass spectrometry.
- Detection of FAH protein e.g., lipase, phospholipase, PLBD2, LPL, LIPA, acylase, ACE
- micrometers in diameter may be counted in the protein formulation in order to detect esterase or other FAH activity.
- glycerophospho[ 3 H]choline formation from phosphatidyl[3H]choline following incubation of phosphatidyl[ 3 H]choline and protein supernatant may be determined by thin-layer chromatography (following similar protocols according to Kanoh, H. et al. 1991 Comp Biochem Physiol 102B(2):367- 369).
- SEQ ID NO:32 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of PLBD2, can then be tested for FAH activity as described herein.
- DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:32, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:32 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
- the mammalian PLBD2 sequences are conserved among hamster, human, mouse and rat genomes.
- Table 3 identifies exemplary mammalian PLBD2 proteins and their degree of homology.
- the targeted disruption of SEQ ID NO:33 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 33-62, 37-56, 40-69, 44-63, 1 10-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
- the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 33-62, 37-56, 40-69, 44-63, 1 10-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
- the PLBD2 nucleic acid sequence is at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:33 or target sequence thereof.
- SEQ ID NO:62 disclosed herein was identified from proteins expressed in CHO cells.
- Other mammalian species such as, for example, humans, rats, mice
- Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques.
- variants of LPL can then be tested for FAH activity as described herein.
- DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:62, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:62 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
- the mammalian LPL sequences are conserved among hamster, human, mouse and rat genomes.
- Table 4 identifies exemplary mammalian LPL proteins and their degree of homology. TABLE 4: Amino acid identity of LPL homologs
- the targeted disruption of SEQ ID NO:61 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400- 420, 410-430, 420-440, 430-450, 440-460
- the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410- 430, 420-440, 430-450, 440-460, 450-470
- the LPL nucleic acid sequence is at least about 80% identical, at least about 81% identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:61 or target sequence thereof.
- SEQ ID NO:70 disclosed herein was identified from proteins expressed in CHO cells.
- Other mammalian species such as, for example, humans, rats, mice
- Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques.
- variants of LI PA can then be tested for FAH activity as described herein.
- DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:70, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:70 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
- the mammalian LIPA sequences are conserved among hamster, human, mouse and rat genomes.
- Table 5 identifies exemplary mammalian LIPA proteins and their degree of homology.
- the targeted disruption of SEQ ID NO:69 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 180-199, 239-258, and 276-295 of SEQ ID NO:69.
- the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 180-199, 239-258, and 276-295 of SEQ ID NO:69.
- the LIPA nucleic acid sequence is at least about 70% identical, at least about 71% identical, at least about 72% identical, at least about 73% identical, at least about 74% identical, at least about 75% identical, at least about 76% identical, at least about 77% identical, at least about 78% identical, or at least about 79% identical, at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:69 or target sequence thereof.
- SEQ ID NO:78 disclosed herein was identified from proteins expressed in CHO cells.
- Other mammalian species such as, for example, humans, rats, mice
- Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques.
- variants of LPL can then be tested for FAH activity as described herein.
- DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:78, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:78 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
- ACE mammalian acid ceramidase
- the targeted disruption of SEQ ID NO:77 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 135-154, 237-256, and 332-351 of SEQ ID NO:77.
- the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 350-370, 360-380, 135-154, 237-256, and 332-351 of SEQ I D NO:77.
- the ACE nucleic acid sequence is at least about 80% identical, at least about 81% identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:77 or target sequence thereof.
- Cell populations expressing enhanced levels of a protein of interest can be developed using the cell lines and methods provided herein.
- the isolated commercial protein, protein supernatant or fraction thereof, produced by the cells of the invention have no detectable esterase or esterase activity.
- Cell pools further modified with exogenous sequence(s) integrated within the genome of the modified cells of the invention are expected to be stable over time, and can be treated as stable cell lines for most purposes. Recombination steps can also be delayed until later in the process of development of the cell lines of the invention.
- Methods for genetically engineering a host cell genome in a particular location may be achieved in several ways. Genetic editing techniques were used to modify a nucleic acid sequence in a eukaryotic cell, wherein the nucleic acid sequence is an endogenous sequence normally found in such cells and expressing a contaminant host cell protein. Clonal expansion is necessary to ensure that the cell progeny will share the identical genotypic and phenotypic characteristics of the engineered cell line.
- native cells are modified by a homologous recombination technique to integrate a nonfunctional or mutated target nucleic acid sequence encoding a host cell protein, such as a variant of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and/or SEQ ID NO:77.
- One such method of editing the CHO PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and ASA genomic sequences involves the use of guide RNAs and a type II Cas enzyme to specifically target an exon of PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and/or ASA.
- RNAs directed to particular exons of CHO PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and ASA have been employed (Table 7) in a site-specific nuclease editing method as described herein.
- Other methods of targeted genome editing, for example nucleases, recombination-based methods, or RNA interference, to modify the FAH genes may be employed for the targeted disruption of the CHO genome.
- the engineered mammalian host cell comprises one or more disruptions within gene sequences selected from the group consisting of nucleotides 37-63 of SEQ ID NO:33, nucleotides 465-612 of SEQ ID NO:61 , nucleotides 180-295 of SEQ ID NO:69, nucleotides 135-351 of SEQ ID NO:77, nucleotides 249-388 of SEQ ID NO:1 17, nucleotides 1624-2157 of SEQ ID NO:163, nucleotides 372-1399 of SEQ ID NO:1 18, nucleotides 1 155-1600 of SEQ ID NO:1 19, nucleotides 423-615 of SEQ ID NO:120, nucleotides 753-1 141 of SEQ ID NO:121 , nucleotides 311 -581 of SEQ ID NO:164, nucleotides 1 155-1443 of SEQ ID NO:122, and nucle
- methods and compositions for knockout or downregulation of a nucleic acid molecule encoding an ortholog of a host cell FAH protein having at least 80% identity to SEQ ID NO:33 (PLBD2), SEQ ID NO:61 (LPL), SEQ ID NO:69 (LIPA) and/or SEQ ID NO:77 (ACE); or antibody-binding variant thereof, are via homologous recombination.
- a nucleic acid molecule encoding an FAH protein (or any protein of interest in general) can be targeted by homologous recombination or by using site-specific nuclease methods that specifically target sequences at the FAH-expressing site of the host cell genome.
- homologous polynucleotide molecules ⁇ i.e., homologous arms
- a transgene can be introduced during this exchange if the transgene is flanked by homologous genomic sequences.
- a recombinase recognition site can also be introduced into the host cell genome at the integration sites.
- Homologous recombination in eukaryotic cells can be facilitated by introducing a break in the chromosomal DNA at the integration site.
- Model systems have demonstrated that the frequency of homologous recombination during gene targeting increases if a double-strand break is introduced within the chromosomal target sequence. This may be accomplished by targeting certain nucleases to the specific site of integration. DNA-binding proteins that recognize DNA sequences at the target gene are known in the art. Gene targeting vectors are also employed to facilitate homologous recombination.
- NHEJ non-homologous end-joining
- ZFNs zinc finger nucleases
- Some embodiments can utilize ZFNs with a combination of individual zinc finger domains targeting multiple target sequences.
- ZFN methods to target disruption of two or more FAH genes are also embodied by the invention.
- TAL effector nucleases may also be employed for site-specific genome editing.
- TAL effector protein DNA-binding domain is typically utilized in combination with a non-specific cleavage domain of a restriction nuclease, such as Fokl.
- a fusion protein comprising a TAL effector protein DNA-binding domain and a restriction nuclease cleavage domain is employed to recognize and cleave DNA at a target sequence within an exon of the gene encoding the target host cell protein, for example an esterase, such as a phospholipase B-like 2 protein(or other mammalian phospholipase), a lipoprotein lipase and a lysosomal acid lipase, and/or a fatty acylase, such as acid ceramidase (or other mammalian ceramidase).
- an esterase such as a phospholipase B-like 2 protein(or other mammalian phospholipase), a lipoprotein lipase and a lysosomal acid lipase, and/or a fatty acylase, such as acid ceramidase (or other mammalian ceramidase).
- Targeted disruption or insertion of exogenous sequences into a specific exon of the CHO protein encoded by SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and/or SEQ ID NO:77 may be done by employing a TALE nuclease (TALEN) targeted to locations within exon 1 , exon 2, exon 3, etc. of the fatty acid hydrolase genomic DNA (see Tables 6 and 7).
- TALE nuclease TALEN
- the TALEN target cleavage site within the gene sequences may be selected based on ZiFit.partners.org (ZiFit Targeter Version 4.2) and then TALENs are designed based on known methods (Boch J et al., 2009 Science 326:1509-1512; Bogdanove, A. J. & Voytas, D. F. 201 1 Science 333, 1843-1846; Miller, J. C. et al., 201 1 Nat Biotechnol 29, 143-148).
- TALEN methods to target disruption of two or more of the PLBD2 gene ⁇ e.g., exon 1 or exon 2), LPL gene ⁇ e.g., exon 2, 3 or 4), LI PA gene ⁇ e.g., exon 1 or 2) and ACE gene ⁇ e.g., exon 1 , 3 or 4) are also embodied by the invention.
- RNA-guided endonucleases are programmable genome engineering tools that were developed from bacterial adaptive immune machinery.
- CRISPR clustered regularly interspaced short palindromic repeats
- Cas CRISPR-associated
- the protein Cas9 forms a sequence-specific endonuclease when complexed with two RNAs, one of which guides target selection.
- RGENs consist of components (Cas9 and tracrRNA) and a target- specific CRISPR RNA (crRNA).
- CRISPR-Cas9 methods to target disruption of two or more of the PLBD2 gene e.g., exon 1 or exon 2
- LPL gene e.g., exon 2, 3 or 4
- LIPA gene e.g., exon 1 or 2
- ACE gene e.g., exon 1 , 3 or 4
- BuD-derived nucleases with precise DNA-binding specificities (Stella, S. et al. Acta Cryst. 2014, D70, 2042-2052).
- a single residue-to-nucleotide code guides the BuDN to the specific DNA target within polynucleotide of interest ⁇ e.g., SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and or SEQ ID NO:77).
- Sequence-specific endonucleases may be directed to a target sequence at any one of the exons encoding PLBD2, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003614971 .1 , at: Exon 1 within nucleotides (nt) 175367 to 175644 (SEQ ID NO:47); Exon 2 within nt 168958 to 169051 (SEQ ID NO:48); Exon 3 within nt 166451 to166609 (SEQ ID NO:49); Exon 4 within nt 164966 to 165066 (SEQ ID NO:50); Exon 5 within nt 164564 to164778 (SEQ ID NO:51 ); Exon 6 within nt 162682 to162779 (SEQ ID NO:52); Exon 7 within nt 160036 to160196 (SEQ ID NO:53); Exon 8 within nt 159733 to 159828 (S
- Sequence-specific endonucleases may be directed to a target sequence at any one of the exons encoding LPL, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003613760.1 , at: Exon 1 within nucleotides (nt) 1257424 to 1257507 (SEQ ID NO:85); Exon 2 within nt 1266450 to 1266610 (SEQ ID NO:86); Exon 3 within nt 1270069 1270248 (SEQ ID NO:87); Exon 4 within nt 1271770 to 1271881 (SEQ ID NO:88); Exon 5 within nt 12283518 12283751 (SEQ ID NO:89); Exon 6 within nt 123715 1273957 (SEQ ID NO:90); Exon 7 within nt 1276672 1276792 (SEQ ID NO:91 ); Exon 8 within nt 1278328 to 12
- Sequence-specific endonucleases may be directed to a target sequence at any one of the exons encoding LI PA, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003614200.1 , at: Exon 1 within nucleotides (nt) 985778 to 985674 (SEQ ID NO:94); Exon 2 within nt 984375 to 984258 (SEQ ID NO:95); Exon 3 within nt 970771 970573 (SEQ ID NO:96); Exon 4 within nt 969327 to 969218 (SEQ ID NO:97); Exon 5 within nt 968139 968003 (SEQ ID NO:98); Exon 6 within nt 961871 to 961725 (SEQ ID NO:99); Exon 7 within nt 960826 to 960755 (SEQ ID NO:100); Exon 8 within nt 95
- Sequence-specific endonucleases may be directed to a target sequence at any one of the exons encoding ACE, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003613654.1 , at: Exon 1 within nucleotides (nt) 1378167 to 1378244 (SEQ ID NO: 103); Exon 2 within nt 1393746 to 1393792 (SEQ ID NO: 104); Exon 3 within nt 1398208 to 1398298 (SEQ ID NO:105); Exon 4 within nt 1399171 to 1399257 (SEQ ID NO: 106); Exon 5 within nt 1402147 to 1402225 (SEQ ID NO:107); Exon 6 within nt 1404854 to 1404928 (SEQ ID NO:108); Exon 7 within nt 1405714 to 1405759 (SEQ ID NO:109); Exon 8 within nt 14067
- Precise genome modification methods are chosen based on the tools available compatible with unique target sequences within SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 so that disruption of the cell phenotype is avoided.
- any protein of interest suitable for expression in prokaryotic or eukaryotic cells can be used in the engineered host cell systems provided.
- the protein of interest includes, but is not limited to, an antibody or antigen-binding fragment thereof, a chimeric antibody or antigen- binding fragment thereof, an ScFv or fragment thereof, an Fc-fusion protein or fragment thereof, a growth factor or a fragment thereof, a cytokine or a fragment thereof, or an extracellular domain of a cell surface receptor or a fragment thereof.
- Proteins of interest may be simple polypeptides consisting of a single subunit, or complex multisubunit proteins comprising two or more subunits.
- the protein of interest may be a biopharmaceutical product, food additive or preservative, or any protein product subject to purification and quality standards.
- the protein product is an antibody, a human antibody, a humanized antibody, a chimeric antibody, a monoclonal antibody, a multispecific antibody, a bispecific antibody, an antigen binding antibody fragment, a single chain antibody, a diabody, triabody or tetrabody, a Fab fragment or a F(ab')2 fragment, an IgD antibody, an IgE antibody, an IgM antibody, an IgG antibody, an lgG1 antibody, an lgG2 antibody, an lgG3 antibody, or an lgG4 antibody.
- the antibody is an lgG1 antibody.
- the antibody is an lgG2 antibody.
- the antibody is an lgG4 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG4 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG1 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG1/lgG4 antibody.
- the antibody is selected from the group consisting of an anti- Programmed Cell Death 1 antibody (e.g. an anti-PD1 antibody as described in U.S. Pat. Appln. Pub. No. US2015/0203579A1 ), an anti-Programmed Cell Death Ligand-1 (e.g. an anti-PD-L1 antibody as described in in U.S. Pat. Appln. Pub. No. US2015/0203580A1 ), an anti-DII4 antibody, an anti-Angiopoetin-2 antibody (e.g. an anti-ANG2 antibody as described in U.S. Pat. No.
- an anti- Programmed Cell Death 1 antibody e.g. an anti-PD1 antibody as described in U.S. Pat. Appln. Pub. No. US2015/0203579A1
- an anti-Programmed Cell Death Ligand-1 e.g. an anti-PD-L1 antibody as described in in U.S. Pat. Appln. Pub. No. US2015/0203580
- an anti- Angiopoetin-Like 3 antibody e.g. an anti-AngPtl3 antibody as described in U.S. Pat. No. 9,018,356
- an anti-platelet derived growth factor receptor antibody e.g. an anti-PDGFR antibody as described in U.S. Pat. No. 9,265,827
- an anti-Erb3 antibody e.g. an anti- Prolactin Receptor antibody
- an anti- Complement 5 antibody e.g. an anti-C5 antibody as described in U.S. Pat. Appln. Pub. No
- an anti-TNF antibody an anti-epidermal growth factor receptor antibody (e.g. an anti-EGFR antibody as described in U.S. Pat. No. 9,132,192 or an anti-EGFRvlll antibody as described in U.S. Pat. Appln. Pub. No. US2015/0259423A1 ), an anti-Proprotein Convertase Subtilisin Kexin-9 antibody (e.g. an anti-PCSK9 antibody as described in U.S. Pat. No. 8,062,640 or U.S. Pat. Appln. Pub. No. US2014/0044730A1 ), an anti-Growth And Differentiation Factor-8 antibody (e.g.
- an anti-GDF8 antibody also known as anti-myostatin antibody, as described in U.S. Pat Nos. 8,871 ,209 or 9,260,515)
- an anti-Glucagon Receptor e.g. anti-GCGR antibody as described in U.S. Pat. Appln. Pub. Nos. US2015/0337045A1 or US2016/0075778A1
- an anti-VEGF antibody e.g. anti-VEGF antibody
- an anti-l L1 R antibody e.g an anti-IL4R antibody as described in U.S. Pat. Appln. Pub. No. US2014/0271681 A1 or U.S. Pat Nos. 8,735,095 or
- an anti-interleukin 6 receptor antibody e.g. an anti-IL6R antibody as described in U.S. Pat. Nos. 7,582,298, 8,043,617 or 9,173,880
- an anti-IL1 antibody e.g. an anti-IL2 antibody as described in U.S. Pat. Nos. 7,582,298, 8,043,617 or 9,173,880
- an anti-IL1 antibody e.g. an anti-IL2 antibody as described in U.S. Pat. Nos. 7,582,298, 8,043,617 or 9,173,880
- an anti-IL1 antibody e.g. an anti-IL2 antibody, an anti-IL3 antibody, an anti-l L4 antibody, an anti-IL5 antibody, an anti-l L6 antibody, an anti-l L7 antibody
- an anti-interleukin 33 e.g. anti- IL33 antibody as described in U.S. Pat. Appln. Pub. Nos.
- an anti-Respiratory syncytial virus antibody e.g. anti- RSV antibody as described in U.S. Pat. Appln. Pub. No. US2014/0271653A1
- an anti-Cluster of differentiation 3 e.g. an anti-CD3 antibody, as described in U.S. Pat. Appln. Pub. Nos.
- an anti- Cluster of differentiation 20 e.g. an anti-CD20 antibody as described in U.S. Pat. Appln. Pub. Nos. US2014/0088295A1 and US20150266966A1 , and in U.S. Pat. No. 7,879,984
- an anti-CD19 antibody, an anti-CD28 antibody, an anti- Cluster of Differentiation-48 e.g. anti-CD48 antibody as described in U.S. Pat. No. 9,228,014
- an anti-Fel d1 antibody e.g. as described in U.S. Pat. No.
- an anti-Middle East Respiratory Syndrome virus e.g. an anti-MERS antibody as described in U.S. Pat. Appln. Pub. No. US2015/0337029A1
- an anti-Ebola virus antibody e.g. as described in U.S. Pat. Appln. Pub. No. US2016/0215040
- an anti-Zika virus antibody e.g. an anti- Lymphocyte Activation Gene 3 antibody, or an anti-CD223 antibody
- an anti-Nerve Growth Factor antibody e.g. an anti-NGF antibody as described in U.S. Pat. Appln. Pub. No. US2016/0017029 and U.S. Pat. Nos.
- the bispecific antibody is selected from the group consisting of an anti-CD3 x anti-CD20 bispecific antibody (as described in U.S. Pat. Appln. Pub. Nos.
- the protein of interest is selected from the group consisting of alirocumab, sarilumab, fasinumab, nesvacumab, dupilumab, trevogrumab, evinacumab, and rinucumab. All publications mentioned throughout this disclosure are incorporated herein by reference in their entirety.
- the protein of interest is a recombinant protein that contains an Fc moiety and another domain, (e.g., an Fc-fusion protein).
- an Fc-fusion protein is a receptor Fc-fusion protein, which contains one or more extracellular domain(s) of a receptor coupled to an Fc moiety.
- the Fc moiety comprises a hinge region followed by a CH2 and CH3 domain of an IgG.
- the receptor Fc-fusion protein contains two or more distinct receptor chains that bind to either a single ligand or multiple ligands.
- an Fc-fusion protein is a TRAP protein, such as for example an IL-1 trap (e.g., rilonacept, which contains the IL-1 RAcP ligand binding region fused to the 11-1 R1 extracellular region fused to Fc of hlgG1 ; see U.S. Pat. No. 6,927,004, which is herein incorporated by reference in its entirety), or a VEGF trap (e.g., aflibercept or ziv-aflibercept, which contains the Ig domain 2 of the VEGF receptor Flt1 fused to the Ig domain 3 of the VEGF receptor Flk1 fused to Fc of hlgG1 ; see U.S. Pat. Nos.
- IL-1 trap e.g., rilonacept, which contains the IL-1 RAcP ligand binding region fused to the 11-1 R1 extracellular region fused to Fc of hlgG1 ; see U.S. Pat. No
- an Fc-fusion protein is a ScFv-Fc-fusion protein, which contains one or more of one or more antigen-binding domain(s), such as a variable heavy chain fragment and a variable light chain fragment, of an antibody coupled to an Fc moiety.
- the host cells used in the methods of the invention are eukaryotic host cells including, for example, Chinese hamster ovary (CHO) cells, human cells, rat cells and mouse cells.
- the invention provides a cell comprising a disrupted nucleic acid sequence fragment of SEQ ID NO:33 and at least one more of SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77.
- the invention includes an engineered mammalian host cell further transfected with an expression vector comprising an exogenous gene of interest, such gene encoding the
- biopharmaceutical product While any mammalian cell may be used, in one particular embodiment the host cell is a CHO cell.
- Transfected host cells include cells that have been transfected with expression vectors that comprise a sequence encoding a protein or polypeptide. Expressed proteins will preferably be secreted into the culture medium for use in the invention, depending on the nucleic acid sequence selected, but may be retained in the cell or deposited in the cell membrane. Various mammalian cell culture systems can be employed to express recombinant proteins.
- cell lines developed for specific selection or amplification schemes will also be useful with the methods and compositions provided herein, provided that at least two genes encoding a different fatty acid hydrolase (FAH) having at least 80% homology to at least two of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 have been downregulated, knocked out or otherwise disrupted in accordance with the invention.
- An embodied cell line is the CHO cell line designated K1 .
- the host cell line may be pre-adapted to bioreactor medium in the appropriate case.
- transfection protocols are known in the art, and are reviewed in Kaufman (1988) Meth. Enzymology 185:537.
- the transfection protocol chosen will depend on the host cell type and the nature of the GOI, and can be chosen based upon routine experimentation. The basic requirements of any such protocol are first to introduce DNA encoding the protein of interest into a suitable host cell, and then to identify and isolate host cells which have incorporated the
- heterologous DNA in a relatively stable, expressible manner.
- Electroporation can also be used to introduce DNA directly into the cytoplasm of a host cell, for example, as described by Potter et al. (Proc. Natl. Acad. Sci. USA 81 :7161 , 1988) or Shigekawa et al. (BioTechniques 6:742, 1988). Unlike protoplast fusion, electroporation does not require the selection marker and the GOI to be on the same plasmid.
- reagents useful for introducing heterologous DNA into a mammalian cell have been described, such as LipofectinTM Reagent and LipofectamineTM Reagent (Gibco BRL, Gaithersburg, Md.). Both of these commercially available reagents are used to form lipid-nucleic acid complexes (or liposomes) which, when applied to cultured cells, facilitate uptake of the nucleic acid into the cells.
- Methods for amplifying the GOI are also desirable for expression of the recombinant protein of interest, and typically involves the use of a selection marker (reviewed in Kaufman supra). Resistance to cytotoxic drugs is the characteristic most frequently used as a selection marker, and can be the result of either a dominant trait (e.g., can be used independent of host cell type) or a recessive trait (e.g., useful in particular host cell types that are deficient in whatever activity is being selected for).
- a dominant trait e.g., can be used independent of host cell type
- a recessive trait e.g., useful in particular host cell types that are deficient in whatever activity is being selected for.
- amplifiable markers are suitable for use in the cell lines of the invention and may be introduced by expression vectors and techniques well known in the art (e.g., as described in Sambrook, Molecular Biology: A Laboratory Manual, Cold Spring Harbor Laboratory, NY, 1989; pgs 16.9-16.14).
- Useful selectable markers and other tools for gene amplification such as regulatory elements, described previously or known in the art, can also be included in the nucleic acid constructs used to transfect mammalian cells.
- the transfection protocol chosen and the elements selected for use therein will depend on the type of host cell used. Those of skill in the art are aware of numerous different protocols and host cells in order to adapt the invention for a particular use, and can select an appropriate system for expression of a desired protein, based on the
- the invention relates to the following items:
- a recombinant host cell comprising a modification in two or more genes encoding two or more fatty acid hydrolases (FAH).
- the recombinant host cell according to item 1 wherein the two or more FAHs are selected from the group consisting of phospholipase B-like 2 (PLBD2) protein, lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and acid ceramidase (ACE).
- PLBD2 phospholipase B-like 2
- LPL lipoprotein lipase
- LI PA lysosomal acid lipase
- ACE acid ceramidase
- [00191] a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and [00192] b. a modification in a coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LI PA in a cell lacking the modification.
- [00194] a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification;
- [00202] a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and [00203] b. a modification in a coding sequence of a polynucleotide encoding the ACE protein, wherein the modification decreases the expression level of the ACE protein relative to the expression level of a ACE protein in a cell lacking the modification.
- PLBD2 protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:35, and SEQ ID NO:36.
- LPL protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:62, SEQ ID NO:66, SEQ ID NO:67, and SEQ ID NO:68.
- LI PA protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:70, SEQ ID NO:74, SEQ ID NO:75, and SEQ ID NO:76.
- ACE protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:78, SEQ ID NO:82, SEQ ID NO:83, and SEQ ID NO:84.
- exogenous protein of interest is selected from the group consisting of an antibody heavy chain, an antibody light chain, an antigen-binding fragment, an antigen-binding protein, and an Fc-fusion protein.
- a method of producing a recombinant protein comprising the steps of: (a) obtaining a sample comprising a recombinant protein and a plurality of host cell proteins from a host cell that is modified to produce reduced levels of fatty acid hydrolase compared to an unmodified cell; and (b) subjecting the sample to at least one purification step to remove at least one host cell protein.
- a. does not comprise a detectable amount of a phospholipase B-like 2 (PLBD2) protein
- b a modification in a coding sequence of a polynucleotide encoding a fatty acid hydrolase selected from the group consisting of lipoprotein lipase (LPL), lysosomeal acid lipase (LI PA), acid ceramidase (ACE), or a combination thereof.
- LPL lipoprotein lipase
- LI PA lysosomeal acid lipase
- ACE acid ceramidase
- a process for reducing fatty acid hydrolase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein; (b) modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and/or acid ceramidase (ACE); (c) transfecting the host cell with a polynucleotide encoding a protein of interest; (d) extracting a protein fraction comprising the protein of interest from the host cell; (e) contacting the protein fraction with a chromatography media selected from the group consisting of protein A affinity (PA) media, cation exchange (CEX) media, and anion exchange (AEX) media; (f) collecting the protein of interest from the media; and (g) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer.
- PA protein A
- step of modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein comprises inserting or deleting at least one nucleotide within exon 1 of a polynucleotide encoding the PLBD2 protein.
- step of modifying the host cell to decrease or ablate the expression of acid ceramidase comprises inserting or deleting at least one nucleotide within exon 1 , exon 3 or exon 4 of a polynucleotide encoding the ACE protein.
- composition comprising one or more recombinant proteins obtainable by the method according to any of items 31-36 or the process according to any of items 37-45.
- composition obtainable according to any of items 46-47, wherein the stable composition is characterized by one or more of:
- composition obtainable according to any of items 46-48, wherein the adverse enzyme activity is the activity of one or more of esterases, hydrolases, lipases, phospholipases, ceramidases.
- composition comprising one or more recombinant proteins, wherein the composition is stable.
- stable composition is characterized by one or more of:
- composition according to any of items 50-51 , wherein the adverse enzyme activity is the activity of one or more of esterases, hydrolases, lipases, phospholipases, ceramidases.
- a Type II CRISPR/Cas system which requires at least 20 nucleotides (nt) of homology between a chimeric RNA (i.e., guide RNA) and its genomic target was used.
- Guide RNA sequences were designed for specific targeting of an exon within the CHO phospholipase B-like 2 (PLBD2) nucleic acid (SEQ ID NO:33) and are considered unique (to minimize off-target effects in the genome).
- sgRNA small guide RNAs
- the sgRNA expression plasmid (System Biosciences, CAS940A-1 ) contains a human H1 promoter that drives expression of the small guide RNA and the tracrRNA following the sgRNA.
- Immortalized Chinese hamster ovary (CHO) cells were transfected with the plasmid encoding Cas9- H1 enzyme followed by one of the sgRNA sequences, for instance sgRNAI (SEQ ID NO:45) or sgRNA2 (SEQ ID NO:46), designed to target the first exon of CHO PLBD2.
- sgRNAI and sgRNA2 were predicted to generate a double strand break (DSB) at or around nucleotides 53 and 59 of SEQ ID NO:33, respectively.
- a DSB was therefore predicted to occur approx. 23 or 29 nucleotides downstream of the PLBD2 start codon. (Note that nucleotides 1-30 of SEQ ID NO:33 encode a signal peptide.)
- a negative control transfection was performed where the parental CHO line was transfected with the plasmid encoding Cas9-H1 enzyme without a proceeding sgRNA, or an sgRNA encoding a gene sequence not present in the CHO genome.
- Genomic DNA gDNA
- messenger RNA mRNA
- qPCR primers and probes were designed to overlap with the sgRNA sequence used for the double strand break targeting event, in order to detect disruption of the genomic DNA and its transcription.
- the relative abundance of PLBD2 gene or transcript in the candidate clones was determined using relative qPCR method, where the clones derived from the negative control transfection were used as a calibrator. See Figure 1.
- the qPCR primers and probes were designed to detect sequences either in the sgRNAI or sgRNA2 position in PLBD2 exon 1 .
- gDNA and RNA isolated from clone 1 failed to support qPCR amplification of PLBD2 exon 1 in either sgRNAI or sgRNA2 regions, but amplification of the housekeeping gene, GAPDH, was detected. Based on this data, clone 1 was identified as a potential knock out of PLBD2 in which both genomic alleles of PLBD2 of exonl were disrupted. It is noted that amplification of genomic DNA and mRNA was not detected in Clone 8 using primers overlapping with sgRNA2, however, sgRNAI primers/probes detected genomic DNA above control values. Clone 8, and others were further analyzed in order to understand the performance of the site- directed nuclease method.
- the inventors also unexpectedly identified Clone 8 as a PLBD2 knockout despite the fact that genomic DNA fragments were identified by qPCR primers overlapping with the sgRNAI sequence.
- the identification of a clone that has no detectable phospholipase activity or no detectable phospholipase protein was technically challenging and time-consuming.
- Site-directed nuclease techniques may provide an ease-of-use, however, careful screening and elimination of false positives is necessary and still there may be unpredictable outcomes with regard to the identity of a single clone having two disrupted alleles.
- Clone 1 and the wild type control host cell line were transfected with plasmids encoding the light and heavy chains of mAbl , a fully human IgG, in the presence of Cre recombinase to facilitate recombination mediated cassette exchange (RMCE) into EESYR locus (US Patent No. 7771997B2, issued August 10, 2010).
- the transfected cultures were selected for 1 1 days in serum-free medium containing 400 ug/mL hygromycin. Cells that underwent RMCE, were isolated by flow cytometry.
- PLBD2 knock out clone 1 and the wild type host cell line produced equivalent observed
- the clone 1 derived isogenic cell line expressing mAbl was designated RS001
- the mAbl expressing cell line originated from the PLBD2 wild type host was designated RS0WT.
- Polysorbate 20 or polysorbate 80 degradation was measured to detect putative esterase activity in the supernatants of PLBD2 mutants. Unpurified protein supernatant from CHO cells, and supernatant taken at each step or sequence of steps when subjected to sequential purification steps, was tested for stability of polysorbate. The percent intact polysorbate reported was inversely proportional to the amount of contaminant esterase activity. Unpurified protein supernatant from CHO cells, and supernatant at each step or sequence of steps when subjected to sequential purification steps, was tested in assays measuring polysorbate degradation. The relative levels of intact polysorbate reported is inversely proportional to levels of contaminant esterase activity.
- Monoclonal antibody was produced in an unmodified CHO cell and purified by different processes according to Table 1 1 , and the esterase activity measured by percent intact polysorbate 20, as in Table 9.
- HIC Hydrophobic interaction chromatography
- Example 4 Esterase protein abundance and activity detection in mAbl purified from modified compared to unmodified CHO cells.
- mAbl was produced from RS001 and RSOWT and purified from the conditioned media using either PA alone, or PA and AEX chromatography
- the PA-purified mAB1 from RS001 and RSOWT were analyzed for lipase abundance using trypsin digest mass spectrometry.
- the trypsin digests of RS001 and RSOWT mAbl were injected into a reverse phase liquid chromatography column coupled to a triple quadrupole mass spectrometer set to monitor a specific product ion fragmented from SEQ ID NO:32.
- a hydrophobic interaction column (Phenyl Sepharose® High Performance [GE Healthcare, Little Chalfont, Buckinghamshire, UK]) was used to generate a "HIC strip” fraction containing a protein of interest ⁇ i.e., mAb2) and associated host cell proteins.
- the column was first equilibrated with two column volumes (CV) of buffer containing 40 mM Tris, 200 mM citrate at pH8.0.
- the monoclonal antibody-containing load material from an anion exchange pool (“Q pool”) was adjusted to 200 mM sodium citrate, pH8.0, then loaded onto the column at a loading amount of 20-40 grams of protein per liter of the phenyl sepharose resin.
- the column was washed with six CVs of 40 mM Tris, 200 mM citrate, at pH8.0. Then the column was stripped with three CVs of reverse osmosis deionized water. The stripped fraction was collected for subsequent analysis.
- a 50 pg aliquot of the mAb2-containing HIC strip sample was denatured and reduced in a solution containing 5 mM acetic acid and 5 mM ir/ ' s(2-carboxyethyl)phosphine-HCI by heating at 80°C for 10 minutes.
- the sample was then diluted in 50 mM Tris-HCI buffer (pH 8.0) and alkylated with 1 .5 mM iodoacetamide (IAA) and digested with trypsin (modified, sequencing grade from Promega, Madison, Wl) with an enzyme to substrate ratio of 1 :20 (w/w) at 37°C in the dark for three hours.
- the digestion was then stopped by addition of 10% trifluoroacetic acid (TFA).
- Peptides were separated using a linear gradient from 1 % mobile phase B (0.1 % FA in acetonitrile) to 7% mobile phase B for the first 5 minutes, followed by a second linear increase from 7% to 27% mobile phase B over the next 1 10 minutes, and another subsequent linear increase from 27%-40% in 10 minutes and a final increase to 90% in 5 minutes. The gradient was held at 90% for 20 minutes.
- a Thermo Q ExactiveTM Plus mass spectrometer (Thermo Scientific, Waltham, MA) was used for peptide mass analyses, with high-energy collisional dissociation (HCD) employed for peptide fragmentation for MS/MS experiments.
- Thermo XcaliburTM version 2.2.42
- Proteome Discoverer version 1.4
- Thermo Scientific, Waltham, MA was also used to perform the peptide identification using both Mascot and Sequent search engines.
- the peptide spectra from the Proteome Discovery was manually examined to confirm the spectral assignment and protein identification.
- Lipoprotein lipase (UniProtein ID: P06858), lysosomal acid lipase (UniProtein ID: P38571 ) and acid ceramidase (UniProtein ID: Q13510) were identified as potential active fatty acid hydrolases by more than three unique peptides per protein in the mAb2-containing HIC strip fraction.
- Clone 1 in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) were transfected with a pair of plasmids.
- One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lipoprotein lipase (LPL) gene (SEQ ID NO:61 ): sgRNA3 (SEQ ID NO:63), sgRNA4 (SEQ ID NO:64) or sgRNA5 (SEQ ID NO:65) (Table 7).
- Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used.
- a single plasmid encoding both the Cas9 and the sgRNA expression cassette was used in some
- Cre recombinase was co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/0115502A1 , published April 28, 2016).
- the transfected cultures were selected for 1 1 days in serum-free medium containing 400 ⁇ g mL hygromycin. Cells that underwent RMCE were isolated by flow cytometry.
- the desired knock out genotype was confirmed by standard molecular biology methods such as qPCR and sequencing.
- the resultant clone containing only the LPL knock-out or knock-down loci is designated as "clone 9".
- the resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 10".
- FIG. 8 depicts an alignment of LPL-KO clones 19-22 (SEQ ID NOs:159-162, respectively) compared to a partial Chinese hamster LPL sequence (SEQ ID NO:158) showing the gaps in the respective clone sequences.
- LPL-KO clones (as well as any other fatty acid hydrolase knock-out clone), plasmids encoding for Cas9 nuclease, eYFP and site-specific sgRNA were stably integrated into the CHO genome using Lipofectamine-based transfection protocol followed by selection for neomycin resistant cells. Seventeen days post transfection, the YFP positive cells were enriched by flow cytometry prior to single cell-sorting. After a 21 -day expansion, mRNA was isolated and the single cell clones were analyzed by qPCR for the presence of gene disruption.
- 1 , 2 or 3 different sgRNA expression cassettes may be placed in the same plasmid.
- Some constructs are manufactured using at least 2 sgRNA cassettes per lipase in a single plasmid.
- Clone 1 in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) are transfected with a pair of plasmids.
- One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lysosomal acid lipase (LIPA) gene (SEQ ID NO:69): sgRNA6 (SEQ ID NO:71 ), sgRNA7 (SEQ ID NO:72) or sgRNA8 (SEQ ID NO:73) (Table 7).
- Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used.
- a single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs.
- Cre recombinase is co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent
- the transfected cultures are selected for 11 days in serum-free medium containing 400 pg/mL hygromycin.
- Cells that undergo RMCE are isolated by flow cytometry.
- the desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing.
- the resultant clone containing only the LIPA knock-out or knock-down loci is designated as "clone 1 1 ".
- the resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 12".
- Clone 1 in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) are transfected with a pair of plasmids.
- One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the acid ceramidase (ACE) gene (SEQ ID NO:77): sgRNA9 (SEQ ID NO:79), sgRNAI O (SEQ ID NO:80) or sgRNA11 (SEQ ID NO:81 ) (Table 7).
- Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used.
- a single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs.
- Cre recombinase is co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/0115502A1 , published April 28, 2016).
- the transfected cultures are selected for 1 1 days in serum-free medium containing 400 pg/mL hygromycin.
- Cells that undergo RMCE are isolated by flow cytometry.
- the desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing.
- the resultant clone containing only the ACE knock-out or knock-down loci is designated as "clone 13".
- the resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 14".
- Clone 9 in which the LPL gene and is modified
- Clone 10 in which both the LPL gene and the PLBD2 gene are modified
- plasmids encodes the Cas9 protein
- the other plasmid encodes the guide RNA designed to target the lysosomal acid lipase (LIPA) gene (SEQ ID NO:69): sgRNA6 (SEQ ID NO:71 ), sgRNA7 (SEQ ID NO:72) and/or sgRNA8 (SEQ ID NO:73) (Table 7).
- Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used.
- a single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs.
- Cre recombinase is co- transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016).
- the transfected cultures are selected for 1 1 days in serum-free medium containing 400 ⁇ g/mL hygromycin.
- Cells that undergo RMCE are isolated by flow cytometry.
- the desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing.
- the resultant clone containing the LIPA modified locus and the LPL modified locus designated as "clone 15".
- the resultant clone containing the PLBD2, LPL and LIPA modified loci is designated as "clone 16".
- Example 10 Targeted Disruption of a Lipoprotein Lipase (LPL) Gene, a Lysosomal Acid Lipase (LIPA) Gene and an Acid Ceramidase (ACE) Gene
- Clone 15 in which the LPL and LIPA genes and are modified
- Clone 16 in which the PLBD2, LPL and LIPA genes are modified
- a pair of plasmids One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the acid ceramidase (ACE) gene (SEQ ID NO:77): sgRNA9 (SEQ ID NO:79), sgRNAI O (SEQ ID NO:80) or sgRNA1 1 (SEQ ID NO:81 ) (Table 7).
- Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used.
- a single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs.
- Cre recombinase is co- transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016).
- the transfected cultures are selected for 1 1 days in serum-free medium containing 400 ⁇ g/mL hygromycin.
- Cells that undergo RMCE are isolated by flow cytometry.
- the desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing.
- the resultant clone containing the LIPA modified locus, the LPL modified locus and the ACE modified locus is designated as "clone 17".
- the resultant clone containing the PLBD2 modified locus, the LIPA modified locus, the LPL modified locus and the ACE modified locus is designated as "clone 18".
- the invention provides for cells and methods for expression and purification of recombinant proteins in eukaryotic cells.
- the invention includes methods and compositions for expression of proteins in eukaryotic cells, particularly Chinese hamster (Cricetulus griseus) cell lines, that employ downregulating gene expression of endogenous proteins in order to control production of such unwanted "sticky" host cell proteins.
- the invention includes polynucleotides and modified cells that facilitate purification of an exogenous recombinant protein of interest.
- the methods of the invention efficiently target host cell proteins in the Chinese hamster cellular genome in order to facilitate enhanced and stable expression of recombinant proteins expressed by the modified cells.
- HCPs host cell proteins
- LC/MS Advanced liquid chromatography/mass spectrometry
- HCPs Changes in cell culture conditions of eukaryotic cells has been shown to impact the purity and stability of manufactured proteins, as seen by the increased quantity of HCPs of CHO cells upon downstream bioprocessing alterations (Tait, et al, 2013, Biotechnol Prog 29(3):688-696).
- the detrimental effect of leftover HCPs in any product may affect the overall quality or quantity, or both the quality and quantity of the product.
- HCPs if present even at low levels in a therapeutic product, may induce an undesired immune response which causes concern for patient safety and efficacy of the drug product (Singh SK. 201 1.
- the invention provides a recombinant host cell, wherein the cell is modified to decrease the expression levels of two or more fatty acid hydrolases (FAHs) relative to the expression levels of FAH in an unmodified cell.
- FHs fatty acid hydrolases
- the invention provides a recombinant host cell, wherein the cell is modified to have no expression of two or more target FAHs.
- one of said two or more target FAHs is an esterase.
- the esterase is a lipase.
- the lipase is: (1 ) a phospholipase, such as a phospholipase B-like protein or a phospholipase B-like 2 protein, (2) a lipoprotein lipase or (3) a lysosomal acid lipase.
- one of said two or more target FAHs is an amidase.
- the amidase is a fatty acid acylase.
- the fatty acid acylase is an acid ceramidase.
- a gene of interest is exogenously added to the recombinant host cell.
- the exogenously added gene encodes a protein of interest (POI), for example the POI is selected form the group consisting of antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein.
- POI protein of interest
- the invention relates to a composition comprising one or more proteins of interest (POIs) obtainable by a method according to the invention.
- the protein may be a recombinant protein and may be e.g. selected form the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein, or any combinations thereof.
- the invention relates to a composition comprising one or more proteins of interest (POIs), wherein the adverse enzyme activity is ⁇ 50%, 40%, 48%, 47%, 46%, 45%, 44%, 43%, 42%, 41 %, 40%, 39%, 38%, 37%, 36%, 35%, 34%, 33%, 32%, 31 %, 30%, 29%, 28%, 26%, 25%, 24%, 23%, 22%, 21%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, 0.05%, or ⁇ 0.05% active relative to the adverse enzyme activity of a wild-type production system.
- POIs proteins of interest
- the phrase "adverse enzyme activity" refers to any enzyme and its action upon the resulting composition as a whole, wherein the action results in metabolism of any protein components which metabolites reduces the shelf-life of the composition or results in the formation of subvisible particles (SVPs) above prescribed regulations.
- the protein may be a recombinant protein and may e.g. be selected from the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein or any combinations thereof.
- the enzymes and their activities may be, e.g., various esterases, hydrolases, lipases, phospholipases, ceramidases and the likes, or any combinations thereof.
- Adverse enzyme activity may be measured using a functional assay ⁇ e.g., polysorbate fatty acid hydrolysis assay), or a structural assay ⁇ e.g., nano LC-MS of peptide fragments, or the like).
- the invention provides a cell comprising a nonfunctional PLBD2 protein and one or more additional nonfunctional fatty acid hydrolases (FAH).
- FAH nonfunctional fatty acid hydrolases
- nonfunctional FAH is a nonfunctional lipoprotein lipase (LPL), lysosomal acid lipase (LI PA), acid ceramidase (ACE), platelet-activating factor acetylhydrolase IB subunit gamma (PAFAHG), phosphoinositide phospholipase C fragment (PIPLCf), phosphoinositide phospholipase C (PIPLC), liver carboxylesterase 1 (LCE), isoamyl acetate-hydrolyzing esterase 1 -like (IAH1 ), group XV phospholipase A2 (LPLA2), carboxylic ester hydrolase (CEH), and/or arylsulfatase A (ASA).
- LPL lipoprotein lipase
- LI PA lysosomal acid lipase
- ACE acid ceramidase
- PAFAHG platelet-activating factor acetylhydrolase IB subunit gamma
- the invention provides making a cell by FAH target disruption.
- the method comprises a site-specific nuclease for disrupting or editing the cell genome at a target site or sequence.
- the FAH target site is (1 ) a PLBD2 target site, (2) an LPL target site, (3) an LI PA target site, (4) an ACE target site, (5) a PAFAHG site, (6) a PIPLCf or PIPLC site, (7) an LCE site, (8) an IAH1 site, (9) an LPLA2 site, (9) an CEH site, and /or (10) an ASA site.
- the PLBD2 target site comprises a position within SEQ ID NO:33, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:33, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:33, within exon 1 of SEQ ID NO:33, within exon 2 of SEQ ID NO:33, or within exon 3 of SEQ ID NO:33.
- the PLBD2 target site comprises a position within SEQ ID NO:33 or adjacent to a position within SEQ ID NO:33 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-230, 190-210, 200-220, 210-230, 220-240, 230-250, 240-260, and 250-270 of SEQ ID NO:33.
- the target site at a position within SEQ ID NO:33 or adjacent to a position within SEQ ID NO:33 is selected from the group consisting of nucleotides spanning positions numbered 37-56, 44-56, 33-62, 40-69, 110-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
- the PLBD2 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:33 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:33 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the LPL target site comprises a position within SEQ ID NO:61 , within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:61 , 100 nucleotides downstream of the 5-prime end of SEQ ID NO:61 , within exon 1 of SEQ ID NO:61 , within exon 2 of SEQ ID NO:61 , or within exon 3 of SEQ ID NO:61.
- the LPL target site comprises a position within SEQ ID NO:61 or adjacent to a position within SEQ ID NO:61 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440
- the target site at a position within SEQ ID NO:61 or adjacent to a position within SEQ ID NO:61 is selected from the group consisting of nucleotides spanning positions numbered 465-484, 558-577, and 593-612 of SEQ ID NO:61.
- the LPL target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:61 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:61 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the LIPA target site comprises a position within SEQ ID NO:69, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:69, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:69, within exon 1 of SEQ ID NO:69, within exon 2 of SEQ ID NO:69, or within exon 3 of SEQ ID NO:69.
- the LIPA target site comprises a position within SEQ ID NO:69 or adjacent to a position within SEQ ID NO:69 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350 and 340-360 of SEQ ID NO:69.
- the target site at a position within SEQ ID NO:69 or adjacent to a position within SEQ ID NO:69 is selected from the group consisting of nucleotides spanning positions numbered 180-199, 239-258, and 276-295 of SEQ ID NO:69.
- the LIPA target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:69 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:69 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the ACE target site comprises a position within SEQ ID NO:77, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:77, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:77, within exon 1 of SEQ ID NO:77, within exon 2 of SEQ ID NO:77, within exon 3 of SEQ ID NO:77, or within exon 4 of SEQ ID NO:77.
- the ACE target site comprises a position within SEQ ID NO:77 or adjacent to a position within SEQ ID NO:77 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370 and 360-380 of SEQ ID NO:77.
- the target site at a position within SEQ ID NO:77 or adjacent to a position within SEQ ID NO:77 is selected from the group consisting of nucleotides spanning positions numbered 135-154, 237-256, and 332-351 of SEQ ID NO:77.
- the ACE target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:77 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:77 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the PAFAHG target site comprises a position within SEQ ID NO:1 17, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:117, 100 nucleotides
- the PAFAHG target site comprises a position within SEQ ID NO:1 17 or adjacent to a position within SEQ ID NO:1 17 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410
- the target site at a position within SEQ ID NO:1 17 or adjacent to a position within SEQ ID NO:117 is selected from the group consisting of nucleotides spanning positions numbered 101-120, 1 1 1-130, 121 -140, 131-150, 141-160, and 150-169 of SEQ ID NO:1 17.
- the PAFAHG target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 17 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 17 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the PIPLC target site comprises a position within SEQ ID NO:1 18, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:118, 100 nucleotides
- SEQ ID NO:1 downstream of the 5-prime end of SEQ ID NO: 118, within exon 1 of SEQ ID NO:1 18, within exon 2 of SEQ ID NO:118, within exon 3 of SEQ ID NO:118, within exon 4 of SEQ ID NO:1 18, within exon 5 of SEQ ID NO:1 18, within exon 6 of SEQ ID NO:1 18, within exon 7 of SEQ ID NO:1 18, within exon 8 of SEQ ID NO: 1 18, within exon 9 of SEQ ID NO: 1 18, within exon 10 of SEQ ID NO: 1 18, within exon 1 1 of SEQ ID NO:1 18, within exon 12 of SEQ ID NO:1 18, or within exon 13 of SEQ ID NO:1 18.
- the PIPLC target site comprises a position within SEQ ID NO:1 18 or adjacent to a position within SEQ ID NO:1 18 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410
- the target site at a position within SEQ ID NO:1 18 or adjacent to a position within SEQ ID NO:118 is selected from the group consisting of nucleotides spanning positions numbered 39-188, 39-58, 49-68, 59-78, 69-88, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 149-178, and 159-188 of SEQ ID NO:118.
- the PIPLC target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 18 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 18 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the LCE target site comprises a position within SEQ ID NO:1 19, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 1 19, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:1 19, within exon 1 of SEQ ID NO:1 19, within exon 2 of SEQ ID NO:1 19, within exon 3 of SEQ ID NO:119, within exon 4 of SEQ ID NO:1 19, or within exon 5 of SEQ ID NO:1 19.
- the LCE target site comprises a position within SEQ ID NO:1 19 or adjacent to a position within SEQ ID NO:1 19 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-
- the target site at a position within SEQ ID NO:1 19 or adjacent to a position within SEQ ID NO:119 is selected from the group consisting of nucleotides spanning positions numbered 89-140, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, and 139-158 of SEQ ID NO:1 19.
- the LCE target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 19 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 19 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the IAH1 target site comprises a position within SEQ ID NO:120, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 120, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:120, within exon 1 of SEQ ID NO:120, within exon 2 of SEQ ID NO:120, within exon 3 of SEQ ID NO:120, or within exon 4 of SEQ ID NO:120.
- the IAH1 target site comprises a position within SEQ ID NO:120 or adjacent to a position within SEQ ID NO:120 selected from the group consisting of nucleotides spanning positions numbered -20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430
- the target site at a position within SEQ ID NO:120 or adjacent to a position within SEQ ID NO:120 is selected from the group consisting of nucleotides spanning positions numbered 104-325, 104-123, 1 14-133, 124-143, 134-153, 144-163, 154-173, 164-183, 174-193, 184-203, 194-213, 204-223, 214-233, 224-243, 234-253, 244-263, 254-273, 264-283, 274-293, 284-303, 294-313, 304-323, and 314-333 of SEQ ID NO:120.
- the IAH1 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO: 120 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:120 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the LPLA2 target site comprises a position within SEQ ID NO:121 , within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 121 , 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 121 , within exon 1 of SEQ ID NO: 121 , within exon 2 of SEQ ID NO: 121 , within exon 3 of SEQ ID NO: 121 , or within exon 4 of SEQ ID NO: 121.
- the LPLA2 target site comprises a position within SEQ ID NO:121 or adjacent to a position within SEQ ID NO:121 selected from the group consisting of nucleotides spanning positions numbered -20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-
- the target site at a position within SEQ ID NO:121 or adjacent to a position within SEQ ID NO:121 is selected from the group consisting of nucleotides spanning positions numbered 69-195, 69-88, 79-98, 89-108, 99-118, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, 169-188, and 179-198 of SEQ ID NO:121.
- the LPLA2 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO: 121 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:121 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the CEH target site comprises a position within SEQ ID NO:122, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 122, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 122, within exon 1 of SEQ ID NO: 122, within exon 2 of SEQ ID NO: 122, within exon 3 of SEQ ID NO: 122, within exon 4 of SEQ ID NO: 122, or within exon 5 of SEQ ID NO: 122.
- the CEH target site comprises a position within SEQ ID NO:122 or adjacent to a position within SEQ ID NO:122 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430,
- the target site at a position within SEQ ID NO:122 or adjacent to a position within SEQ ID NO:122 is selected from the group consisting of nucleotides spanning positions numbered 79-186, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, and 169-188 of SEQ ID NO:122.
- the CEH target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:122 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO: 122 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell ⁇ i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the ASA target site comprises a position within SEQ ID NO:123, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 123, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 123, within exon 1 of SEQ ID NO: 123, within exon 2 of SEQ ID NO: 123, within exon 3 of SEQ ID NO: 123, or within exon 4 of SEQ ID NO: 123.
- the ASA target site comprises a position within SEQ ID NO:123 or adjacent to a position within SEQ ID NO:123 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430,
- the target site at a position within SEQ ID NO:123 or adjacent to a position within SEQ ID NO:123 is selected from the group consisting of nucleotides spanning positions numbered 79-296, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, 169-188, 179-198 , 189-208 , 199-218, 209-228, 219-238, 229-248, 239-258, 249-268, 259-278, 269-288, and 279-298 of SEQ ID NO:123.
- the ASA target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:123 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO: 123 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing).
- the subsequent transcript of an altered gene results in a frameshift of the translated protein.
- the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity.
- the targeted disruption or editing occurs on both alleles of the gene.
- the cell further integrates an exogenous nucleic acid sequence.
- the cell is capable of producing an exogenous protein of interest.
- the altered protein resulting from a disrupted gene does not bind to the protein of interest produced by the cell.
- an isolated Chinese hamster ovary (CHO) cell comprising an engineered nucleic acid sequence comprising a variant of the PLBD2 gene (such as a variant of SEQ ID NO:33).
- the PLBD2 gene comprises GACAGTCACG TGGCCCGACT GAGGCACGCG , nucleotides 1-30 of SEQ ID NO:33 (SEQ ID NO: 44).
- the PLBD2 gene is engineered to disrupt expression of the open reading frame.
- the invention provides an isolated CHO cell comprising (a) a disrupted PLBD2 gene comprising GACAGTCACG TGGCCCGACT GAGGCACGCG (SEQ ID NO: 44, also nucleotides 1- 30 of SEQ ID NO:33), (b) a disrupted esterase gene comprising a nucleotide encoding any one of the amino acid sequences in Table 2, or (c) a protein fragment of Table 2 expressed by a disrupted PLBD2 gene; and an exogenous nucleic acid sequence comprising a gene of interest.
- the CHO cell that comprises an engineered nucleic acid sequence comprising a variant of the PLBD2 gene also comprises variants of one or more of the genes encoding (1 ) LPL (variant of SEQ ID NO:61 ), (2) LIPA (SEQ ID NO:69), (3) ACE (SEQ ID NO:77), (4) PAFAHG (SEQ ID NO:1 17), (5) PIPLC (SEQ ID NO:1 18), (6) LCE (SEQ ID NO:1 19), (7) IAH1 (SEQ ID NO:120), (8) LPLA2 (SEQ ID NO:121 ), (9) CEH (SEQ ID NO:122), and/or (10) ASA (SEQ ID NO:123).
- a method of producing a protein of interest using a recombinant host cell wherein the host cell is modified to decrease the expression levels of esterase relative to the expression levels of esterase in an unmodified cell.
- the method comprises the modified host cell having decreased esterase expression and an exogenous nucleic acid sequence comprising a gene of interest (GOI).
- GOI gene of interest
- the exogenous nucleic acid sequence comprises one or more genes of interest.
- the one or more genes of interest are selected from the group consisting of a first GOI, a second GOI and a third GOI.
- the invention provides expression systems comprising the recombinant host cell comprising modified or nonfunctional esterase.
- the cell comprises a GOI operably linked to a promoter capable of driving expression of the GOI, wherein the promoter comprises a eukaryotic promoter that can be regulated by an activator or inhibitor.
- the eukaryotic promoter is operably linked to a prokaryotic operator, and the eukaryotic cell optionally further comprises a prokaryotic repressor protein.
- one or more selectable markers are expressed by the modified host cell.
- the genes of interest and/or the one or more selectable markers are operably linked to a promoter, wherein the promoter may be the same or different.
- the promoter comprises a eukaryotic promoter (such as, for example, a CMV promoter or an SV40 late promoter), optionally controlled by a prokaryotic operator (such as, for example, a tet operator).
- the cell further comprises a gene encoding a prokaryotic repressor (such as, for example, a tet repressor).
- a CHO host cell comprising recombinase recognition sites.
- the recombinase recognition sites are selected from a LoxP site, a /_ox51 1 site, a Lox2272 site, Lox2372, Lox5171 , and a frt site.
- the cell further comprises a gene capable of expressing a recombinase.
- the recombinase is a Cre recombinase.
- the selectable marker gene is a drug resistance gene.
- the drug resistance gene is a neomycin resistance gene or a hygromycin resistance gene.
- the second and third selectable marker genes encode two different fluorescent proteins.
- the two different fluorescent proteins are selected from the group consisting of Discosoma coral (DsRed), green fluorescent protein (GFP), enhanced green fluorescent protein (eGFP), cyano fluorescent protein (CFP), enhanced cyano fluorescent protein (eCFP), yellow fluorescent protein (YFP), enhanced yellow fluorescent protein (eYFP) and far-red fluorescent protein (mKate).
- the first, second, and third promoters are the same. In another embodiment, the first, second, and third promoters are different from each other. In another embodiment, the first promoter is different from the second and third promoters, and the second and third promoters are the same. In more embodiments, the first promoter is an SV40 late promoter, and the second and third promoters are each a human CMV promoter. In other embodiments, the first and second promoters are operably linked to a prokaryotic operator.
- the host cell line has an exogenously added gene encoding a recombinase integrated into its genome, operably linked to a promoter.
- the recombinase is Cre recombinase.
- the host cell has a gene encoding a regulatory protein integrated into its genome, operably linked to a promoter.
- the regulatory protein is a tet repressor protein.
- the first GOI and the second GOI encode a light chain, or fragment thereof, of an antibody or a heavy chain, or fragment thereof, of an antibody.
- the first GOI encodes a light chain of an antibody and the second GOI encodes a heavy chain of an antibody.
- the first, second and third GOI encode a polypeptide selected from the group consisting of a first light chain, or fragment thereof, a second light chain, or fragment thereof and a heavy chain, or fragment thereof.
- the first, second and third GOI encode a polypeptide selected from the group consisting of a light chain, or fragment thereof, a first heavy chain, or fragment thereof and a second heavy chain, or fragment thereof.
- a method for making a protein of interest comprising (a) introducing into a CHO host cell a gene of interest (GOI), wherein the GOI integrates into a specific locus such as a locus described in US Patent No. 7771997B2, issued August 10, 2010 or other stable integration and/or expression-enhancing locus; (b) culturing the cell of (a) under conditions that allow expression of the GOI; and (c) recovering the protein of interest.
- the protein of interest is selected from the group consisting of a subunit of an immunoglobulin, or fragment thereof, and a receptor, or ligand-binding fragment thereof.
- the protein of interest is selected from the group consisting of an antibody light chain, or antigen-binding fragment thereof, and an antibody heavy chain, or antigen-binding fragment thereof.
- the CHO host cell genome comprises further modifications, and comprises one or more recombinase recognition sites as described above, and the GOI is introduced into a specific locus through the action of a recombinase that recognizes the
- the GOI is introduced into the cell employing a targeting vector for recombinase-mediated cassette exchange (RMCE) when the CHO host cell genome comprises at least one exogenous recognition sequence within a specific locus.
- RMCE recombinase-mediated cassette exchange
- the GOI is introduced into the cell employing a targeting vector for homologous recombination, and wherein the targeting vector comprises a 5' homology arm homologous to a sequence present in the specific locus, a GOI, and a 3' homology arm
- the targeting vector further comprises two, three, four, or five or more genes of interest.
- one or more of the genes of interest are operably linked to a promoter.
- a method for modifying a CHO cell genome to integrate an exogenous nucleic acid sequence comprising the step of introducing into the cell a vehicle comprising an exogenous nucleic acid sequence wherein the exogenous nucleic acid integrates within a locus of the genome.
- the invention provides a process for manufacturing a stable protein formulation comprising the steps of: (a) extracting a protein fraction from the modified host cell of the invention having decreased or ablated expression of esterase, (b) contacting the protein fraction comprising a protein of interest with a column selected from the group consisting of protein A affinity (PA), cation exchange (CEX) and anion exchange (AEX) chromatography, (c) collecting the protein of interest from the media, wherein a reduced level of the esterase activity is associated with the protein fraction collected at step (c), thus providing a stable protein formulation.
- PA protein A affinity
- CEX cation exchange
- AEX anion exchange
- the invention provides a process for reducing esterase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate expression of esterase, (b) transfecting the host cell with a protein of interest, (c) extracting a protein fraction from the modified host cell, (c) contacting the protein fraction comprising the protein of interest with a column selected from the group consisting of protein A affinity (PA), cation exchange (CEX) and anion exchange (AEX) chromatography, (d) collecting the protein of interest from the media, and (e) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer, thus providing a protein formulation essentially free of detectable esterase activity.
- the protein formulation is essentially free of PLBD2 protein or PLBD2 activity.
- a method for modifying a CHO cell genome to express a therapeutic agent comprising a vehicle for introducing, into the genome, an exogenous nucleic acid comprising a sequence for expression of the therapeutic agent, wherein the vehicle comprises a 5' homology arm homologous to a sequence present in the nucleotide sequence of SEQ ID NO:33, a nucleic acid encoding the therapeutic agent, and a 3' homology arm homologous to a sequence present in the nucleotide sequence of SEQ ID NO:33.
- the invention provides a modified CHO host cell comprising a modified CHO genome wherein the CHO genome is modified by disruption of target sequence within a nucleotide sequence at least 90% identical to SEQ ID NO: 33.
- the modified CHO host cell further comprises another FAH target sequence disruption.
- the another FAH target sequence is within (1 ) a nucleotide sequence at least 90% identical to SEQ ID NO:61 , (1 ) a nucleotide sequence at least 90% identical to SEQ ID NO:69 and/or (3) a nucleotide sequence at least 90% identical to SEQ ID NO:77.
- the invention provides a modified eukaryotic host cell comprising a modified eukaryotic genome wherein the eukaryotic genome is modified at a target sequence in a coding region of the target gene by a site-specific nuclease.
- the site-specific nuclease comprises a zinc finger nuclease (ZFN), a ZFN dimer, a transcription activator-like effector nuclease (TALEN), a TAL effector domain fusion protein, or an RNA-guided DNA endonuclease.
- ZFN zinc finger nuclease
- TALEN transcription activator-like effector nuclease
- TAL effector domain fusion protein or an RNA-guided DNA endonuclease.
- the invention also provides methods of making such a modified eukaryotic host cell.
- the target sequence can be placed in the indicated orientation as in SEQ ID NO:33, 61 , 69, 77, 1 17, 118, 119, 120, 121 , 122, or 123; or in the reverse of the orientation of SEQ ID NO:33, 61 , 69, 77, 1 17, 1 18, 119, 120, 121 , 122, or 123.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Immunology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
Compositions and methods for engineered cell lines and expressions systems are provided that allow for expression of recombinant proteins in eukaryotic cells and their ease of isolation. Cell expression systems capable of expressing a protein of interest essentially free of a bound host cell protein are also provided.
Description
HOST CELL PROTEIN MODIFICATION
FIELD
[0001] The invention provides for cells and methods for expression and purification of recombinant proteins in eukaryotic cells. In particular, the invention includes methods and compositions for expression of proteins in eukaryotic cells, particularly Chinese hamster (Cricetulus griseus) cell lines, that employ downregulating gene expression of endogenous proteins in order to control production of such unwanted "sticky" host cell proteins. The invention includes polynucleotides and modified cells that facilitate purification of an exogenous recombinant protein of interest. The methods of the invention efficiently target host cell proteins in the Chinese hamster cellular genome in order to facilitate enhanced and stable expression of recombinant proteins expressed by the modified cells.
BACKGROUND
[0002] Cellular expression systems aim to provide a reliable and efficient source for the
manufacture of biopharmaceutical products for therapeutic use. Purification of any recombinant protein produced by either eukaryotic or prokaryotic cells in such systems is an ongoing challenge due to, for example, the plethora of host cell proteins and nucleic acid molecules that need to be eliminated from the final pharmaceutical grade product.
[0003] Certain dynamics of host cell proteins (HCPs), viewed as impure byproducts, have been surveyed during various stages of bioprocessing. Advanced liquid chromatography/mass spectrometry (LC/MS) was done to detect and monitor E. coli HCPs accompanying peptibodies produced by cell culture (Schenauer, MR., et al, 2013, Biotech nol Prog 29(4):951-7). The information obtained by HCP profiles is useful for monitoring process development and assessing quality and purity of the product in order to assess safety risks posed by any one or more HCP(s).
Changes in cell culture conditions of eukaryotic cells has been shown to impact the purity and stability of manufactured proteins, as seen by the increased quantity of HCPs of CHO cells upon downstream bioprocessing alterations (Tait, et al, 2013, Biotechnol Prog 29(3):688-696). The detrimental effect of leftover HCPs in any product may affect the overall quality or quantity, or both the quality and quantity of the product. HCPs, if present even at low levels in a therapeutic product, may induce an undesired immune response which causes concern for patient safety and efficacy of the drug product (Singh SK. 201 1 . "Impact of product-related factors on immunogenicity of biotherapeutics." J Pharm Sci 100:354-387; Ipsen Press Release "Ipsen's partner Inspiration Biopharmaceuticals announces hold of phase III clinical trials evaluating IB1001 for the treatment and prevention of hemophilia B", 10 July 2012). Current protocols seek to alter the protein of interest produced by the cell (e.g., therapeutic antibody) to eliminate differential binding or interaction with the protein of interest and the host cell protein (Zhang, Q. et al, mAbs, Published online: 1 1 Feb 2014). Alternative bioprocessing or purification techniques may be warranted in order to minimize the risk of excess impurities (Yuk, et al. 2015, Biotechnol. Bioeng. 9999: 1-16).
[0004] Despite the availability of numerous cell expression systems, engineered cell lines and systems that do not negatively impact the biological properties of an expressed protein of interest are particularly advantageous. Accordingly, there is a need in the art for improved methods towards preparation of quality protein samples for downstream bioprocessing and subsequently commercial use.
BRIEF SUMMARY
[0005] The use of gene editing tools to eliminate two or more contaminant host cell proteins is contemplated, and thus, engineered host cells for more efficient manufacturing processing of proteins is provided.
[0006] In one aspect, the invention provides a recombinant host cell, wherein the cell is modified to decrease the expression levels of two or more fatty acid hydrolases (FAHs) relative to the expression levels of FAH in an unmodified cell.
[0007] In another aspect, the invention provides a recombinant host cell, wherein the cell is modified to have no expression of two or more target FAHs.
[0008] In some embodiments, one of said two or more target FAHs is an esterase. In more specific embodiments, the esterase is a lipase. In more specific embodiments, the lipase is: (1 ) a
phospholipase, such as a phospholipase B-like protein or a phospholipase B-like 2 protein, (2) a lipoprotein lipase or (3) a lysosomal acid lipase. In some embodiments, one of said two or more target FAHs is an amidase. In more specific embodiments, the amidase is a fatty acid acylase. In a more specific embodiment, the fatty acid acylase is an acid ceramidase.
[0009] In some embodiments, a gene of interest is exogenously added to the recombinant host cell. In other embodiments, the exogenously added gene encodes a protein of interest (POI), for example the POI is selected form the group consisting of antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein.
[0010] In further embodiments, the invention relates to a composition comprising one or more proteins of interest (POIs) obtainable by a method according to the invention. The protein may be a recombinant protein and may be e.g. selected form the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein, or any combinations thereof.
[0011] In some embodiments, the invention relates to a composition comprising one or more proteins of interest (POIs), wherein the adverse enzyme activity is < 50%, 40%, 48%, 47%, 46%, 45%, 44%, 43%, 42%, 41 %, 40%, 39%, 38%, 37%, 36%, 35%, 34%, 33%, 32%, 31 %, 30%, 29%, 28%, 26%, 25%, 24%, 23%, 22%, 21 %, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, 0.05%, or <0.05% active relative to the adverse enzyme activity of a wild-type production system. The phrase "adverse enzyme activity" refers to any enzyme and its action upon the resulting composition as a whole, wherein the action results in metabolism of any protein components which metabolites reduces the shelf-life of the composition or results in the formation of subvisible particles (SVPs) above prescribed regulations. The protein may be a recombinant protein and may e.g. be selected from the group consisting of antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein or any combinations thereof. The enzymes and their activities may be, e.g., various esterases, hydrolases, lipases, phospholipases, ceramidases and the likes, or any combinations thereof. Adverse enzyme activity may be measured using a functional assay {e.g., polysorbate fatty acid hydrolysis assay), or a structural assay {e.g., nano LC-MS of peptide fragments, liquid chromatography-tandem mass spectrometry (LC-MS/MS) or surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF-MS) or the like (deZafra et al, 2015, Biotechnol Bioeng;1 12(1 1 ):2284- 91 )).
[0012] The invention provides a cell comprising a nonfunctional PLBD2 protein and one or more additional nonfunctional fatty acid hydrolases (FAH). In one embodiment, the additional
nonfunctional FAH is a nonfunctional lipoprotein lipase (LPL), lysosomal acid lipase (LI PA), acid ceramidase (ACE), platelet-activating factor acetylhydrolase IB subunit gamma (PAFAHG), phosphoinositide phospholipase C fragment (PIPLCf), phosphoinositide phospholipase C (PIPLC), liver carboxylesterase 1 (LCE), isoamyl acetate-hydrolyzing esterase 1 -like (IAH1 ), group XV phospholipase A2 (LPLA2), carboxylic ester hydrolase (CEH), and/or arylsulfatase A (ASA).
[0013] The invention provides making a cell by FAH target disruption. In some embodiments, the method comprises a site-specific nuclease for disrupting or editing the cell genome at a target site or sequence. In some embodiments, the FAH target site is (1 ) a PLBD2 target site, (2) an LPL target site, (3) an LI PA target site, (4) an ACE target site, (5) a PAFAHG site, (6) a PIPLCf or PIPLC site, (7) an LCE site, (8) an IAH1 site, (9) an LPLA2 site, (9) an CEH site, and /or (10) an ASA site.
[0014] In one embodiment, the PLBD2 target site comprises a position within SEQ ID NO:33, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:33, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:33, within exon 1 of SEQ ID NO:33, within exon 2 of SEQ ID NO:33, or within exon 3 of SEQ ID NO:33.
[0015] In one embodiment, the PLBD2 target site comprises a position within SEQ ID NO:33 or adjacent to a position within SEQ ID NO:33 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-230, 190-210, 200-220, 210-230, 220-240, 230-250, 240-260, and 250-270 of SEQ ID NO:33.
[0016] In another embodiment, the target site at a position within SEQ ID NO:33 or adjacent to a position within SEQ ID NO:33 is selected from the group consisting of nucleotides spanning positions numbered 37-56, 44-56, 33-62, 40-69, 1 10-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33. In this regard, the PLBD2 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:33 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:33 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0017] In one embodiment, the LPL target site comprises a position within SEQ ID NO:61 , within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:61 , 100 nucleotides downstream of the 5-prime end of SEQ ID NO:61 , within exon 1 of SEQ ID NO:61 , within exon 2 of SEQ ID NO:61 , or within exon 3 of SEQ ID NO:61.
[0018] In one embodiment, the LPL target site comprises a position within SEQ ID NO:61 or adjacent to a position within SEQ ID NO:61 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490- 510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590- 610, 600-620, 610-630, 620-640 and 630-650 of SEQ ID NO:61.
[0019] In another embodiment, the target site at a position within SEQ ID NO:61 or adjacent to a position within SEQ ID NO:61 is selected from the group consisting of nucleotides spanning positions numbered 465-484, 558-577, and 593-612 of SEQ ID NO:61. In this regard, the LPL target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:61 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:61 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0020] In one embodiment, the LIPA target site comprises a position within SEQ ID NO:69, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:69, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:69, within exon 1 of SEQ ID NO:69, within exon 2 of SEQ ID NO:69, or within exon 3 of SEQ ID NO:69.
[0021] In one embodiment, the LIPA target site comprises a position within SEQ ID NO:69 or adjacent to a position within SEQ ID NO:69 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350 and 340-360 of SEQ ID NO:69.
[0022] In another embodiment, the target site at a position within SEQ ID NO:69 or adjacent to a position within SEQ ID NO:69 is selected from the group consisting of nucleotides spanning positions numbered 180-199, 239-258, and 276-295 of SEQ ID NO:69. In this regard, the LIPA target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:69 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:69 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0023] In one embodiment, the ACE target site comprises a position within SEQ ID NO:77, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:77, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:77, within exon 1 of SEQ ID NO:77, within exon 2 of SEQ ID NO:77, within exon 3 of SEQ ID NO:77, or within exon 4 of SEQ ID NO:77.
[0024] In one embodiment, the ACE target site comprises a position within SEQ ID NO:77 or adjacent to a position within SEQ ID NO:77 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370 and 360-380 of SEQ ID NO:77.
[0025] In another embodiment, the target site at a position within SEQ ID NO:77 or adjacent to a position within SEQ ID NO:77 is selected from the group consisting of nucleotides spanning positions numbered 135-154, 237-256, and 332-351 of SEQ ID NO:77. In this regard, the ACE target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:77 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:77 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0026] In one embodiment, the PAFAHG target site comprises a position within SEQ ID NO:1 17, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 1 17, 100 nucleotides
downstream of the 5-prime end of SEQ ID NO: 1 17, within exon 1 of SEQ ID NO: 1 17, within exon 2 of SEQ ID NO:1 17, or within exon 3 of SEQ ID NO:1 17.
[0027] In one embodiment, the PAFAHG target site comprises a position within SEQ ID NO:1 17 or adjacent to a position within SEQ ID NO:1 17 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, 760-780, and 770-787 of SEQ ID NO: 1 17.
[0028] In another embodiment, the target site at a position within SEQ ID NO:1 17 or adjacent to a position within SEQ ID NO:1 17 is selected from the group consisting of nucleotides spanning positions numbered 101-120, 1 1 1-130, 121 -140, 131-150, 141-160, and 150-169 of SEQ ID
NO:1 17. In this regard, the PAFAHG target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 17 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 17 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0029] In one embodiment, the PIPLC target site comprises a position within SEQ ID NO:1 18, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 1 18, 100 nucleotides
downstream of the 5-prime end of SEQ ID NO: 1 18, within exon 1 of SEQ ID NO: 1 18, within exon 2 of SEQ ID NO: 1 18, within exon 3 of SEQ ID NO: 1 18, within exon 4 of SEQ ID NO: 1 18, within exon 5 of SEQ ID NO:1 18, within exon 6 of SEQ ID NO:1 18, within exon 7 of SEQ ID NO:1 18, within exon 8 of SEQ ID NO:1 18, within exon 9 of SEQ ID NO:1 18, within exon 10 of SEQ ID NO:1 18, within exon 1 1 of SEQ ID NO:1 18, within exon 12 of SEQ ID NO:1 18, or within exon 13 of SEQ ID NO:1 18.
[0030] In one embodiment, the PIPLC target site comprises a position within SEQ ID NO:1 18 or adjacent to a position within SEQ ID NO:1 18 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO: 1 18.
[0031] In another embodiment, the target site at a position within SEQ ID NO:1 18 or adjacent to a position within SEQ ID NO:1 18 is selected from the group consisting of nucleotides spanning positions numbered 39-188, 39-58, 49-68, 59-78, 69-88, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 149-178, and 159-188 of SEQ ID NO:1 18. In this regard, the PIPLC target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 18 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 18 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0032] In one embodiment, the LCE target site comprises a position within SEQ ID NO:1 19, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 1 19, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 1 19, within exon 1 of SEQ ID NO: 1 19, within exon 2 of SEQ ID NO: 1 19, within exon 3 of SEQ ID NO: 1 19, within exon 4 of SEQ ID NO: 1 19, or within exon 5 of SEQ ID NO:1 19.
[0033] In one embodiment, the LCE target site comprises a position within SEQ ID NO:1 19 or adjacent to a position within SEQ ID NO:1 19 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO: 1 19.
[0034] In another embodiment, the target site at a position within SEQ ID NO:1 19 or adjacent to a position within SEQ ID NO:1 19 is selected from the group consisting of nucleotides spanning positions numbered 89-140, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, and 139-158 of SEQ ID NO:1 19. In this regard, the LCE target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 19 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 19 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0035] In one embodiment, the IAH1 target site comprises a position within SEQ ID NO:120, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 120, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 120, within exon 1 of SEQ ID NO: 120, within exon 2 of SEQ ID NO:120, within exon 3 of SEQ ID NO:120, or within exon 4 of SEQ ID NO:120.
[0036] In one embodiment, the IAH1 target site comprises a position within SEQ ID NO:120 or adjacent to a position within SEQ ID NO: 120 selected from the group consisting of nucleotides spanning positions numbered -20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO:120.
[0037] In another embodiment, the target site at a position within SEQ ID NO:120 or adjacent to a position within SEQ ID NO: 120 is selected from the group consisting of nucleotides spanning positions numbered 104-325, 104-123, 1 14-133, 124-143, 134-153, 144-163, 154-173, 164-183, 174-193, 184-203, 194-213, 204-223, 214-233, 224-243, 234-253, 244-263, 254-273, 264-283, 274-293, 284-303, 294-313, 304-323, and 314-333 of SEQ ID NO:120. In this regard, the IAH1 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO: 120 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:120 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0038] In one embodiment, the LPLA2 target site comprises a position within SEQ ID NO:121 , within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 121 , 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 121 , within exon 1 of SEQ ID NO: 121 , within exon 2 of SEQ ID NO: 121 , within exon 3 of SEQ ID NO: 121 , or within exon 4 of SEQ ID NO: 121.
[0039] In one embodiment, the LPLA2 target site comprises a position within SEQ ID NO:121 or adjacent to a position within SEQ ID NO:121 selected from the group consisting of nucleotides spanning positions numbered -20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO:121.
[0040] In another embodiment, the target site at a position within SEQ ID NO:121 or adjacent to a position within SEQ ID NO:121 is selected from the group consisting of nucleotides spanning positions numbered 69-195, 69-88, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, 169-188, and 179-198 of SEQ ID NO:121. In this regard, the LPLA2 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:121 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:121 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0041] In one embodiment, the CEH target site comprises a position within SEQ ID NO:122, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 122, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 122, within exon 1 of SEQ ID NO: 122, within exon 2 of SEQ ID NO: 122, within exon 3 of SEQ ID NO: 122, within exon 4 of SEQ ID NO: 122, or within exon 5 of SEQ ID NO: 122.
[0042] In one embodiment, the CEH target site comprises a position within SEQ ID NO:122 or adjacent to a position within SEQ ID NO: 122 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO: 122.
[0043] In another embodiment, the target site at a position within SEQ ID NO:122 or adjacent to a position within SEQ ID NO: 122 is selected from the group consisting of nucleotides spanning positions numbered 79-186, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, and 169-188 of SEQ ID NO:122. In this regard, the CEH target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:122 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:122 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0044] In one embodiment, the ASA target site comprises a position within SEQ ID NO:123, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 123, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 123, within exon 1 of SEQ ID NO: 123, within exon 2 of SEQ ID NO: 123, within exon 3 of SEQ ID NO: 123, or within exon 4 of SEQ ID NO: 123.
[0045] In one embodiment, the ASA target site comprises a position within SEQ ID NO:123 or adjacent to a position within SEQ ID NO: 123 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO:123.
[0046] In another embodiment, the target site at a position within SEQ ID NO:123 or adjacent to a position within SEQ ID NO: 123 is selected from the group consisting of nucleotides spanning positions numbered 79-296, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, 169-188, 179-198 , 189-208 , 199-218, 209-228, 219-238, 229-248, 239-258, 249-268, 259-278, 269-288, and 279-298 of SEQ ID NO:123. In this regard, the ASA target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:123 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:123 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0047] In certain embodiments, the cell further integrates an exogenous nucleic acid sequence. In other embodiments, the cell is capable of producing an exogenous protein of interest. In still other embodiments, the altered protein resulting from a disrupted gene does not bind to the protein of interest produced by the cell.
[0048] In another aspect, an isolated Chinese hamster ovary (CHO) cell is provided that comprises an engineered nucleic acid sequence comprising a variant of the PLBD2 gene (such as a variant of SEQ ID NO:33). In one embodiment, the PLBD2 gene comprises GACAGTCACG TGGCCCGACT GAGGCACGCG , nucleotides 1-30 of SEQ ID NO:33 (SEQ ID NO: 44). In another embodiment, the PLBD2 gene is engineered to disrupt expression of the open reading frame. In other embodiments, the invention provides an isolated CHO cell comprising (a) a disrupted PLBD2 gene comprising GACAGTCACG TGGCCCGACT GAGGCACGCG (SEQ ID NO: 44, also nucleotides 1- 30 of SEQ ID NO:33), (b) a disrupted esterase gene comprising a nucleotide encoding any one of the amino acid sequences in Table 2, or (c) a protein fragment of Table 2 expressed by a disrupted PLBD2 gene; and an exogenous nucleic acid sequence comprising a gene of interest.
[0049] In one embodiment, the CHO cell that comprises an engineered nucleic acid sequence comprising a variant of the PLBD2 gene also comprises variants of one or more of the genes encoding (1 ) LPL (variant of SEQ ID NO:61 ), (2) LIPA (SEQ ID NO:69), (3) ACE (SEQ ID NO:77), (4) PAFAHG (SEQ ID NO:1 17), (5) PIPLC (SEQ ID NO:1 18), (6) LCE (SEQ ID NO:1 19), (7) IAH1 (SEQ ID NO:120), (8) LPLA2 (SEQ ID NO:121 ), (9) CEH (SEQ ID NO:122), and/or (10) ASA (SEQ ID NO:123).
[0050] In another aspect, a method of producing a protein of interest using a recombinant host cell is provided, wherein the host cell is modified to decrease the expression levels of esterase relative to the expression levels of esterase in an unmodified cell.
[0051] In another embodiment, the method comprises the modified host cell having decreased esterase expression and an exogenous nucleic acid sequence comprising a gene of interest (GOI).
[0052] In certain embodiments, the exogenous nucleic acid sequence comprises one or more genes of interest. In some embodiments, the one or more genes of interest are selected from the group consisting of a first GOI, a second GOI and a third GOI.
[0053] In another aspect, the invention provides expression systems comprising the recombinant host cell comprising modified or nonfunctional esterase.
[0054] In yet another embodiment, the cell comprises a GOI operably linked to a promoter capable of driving expression of the GOI, wherein the promoter comprises a eukaryotic promoter that can be regulated by an activator or inhibitor. In other embodiments, the eukaryotic promoter is operably linked to a prokaryotic operator, and the eukaryotic cell optionally further comprises a prokaryotic repressor protein.
[0055] In another embodiment, one or more selectable markers are expressed by the modified host cell. In some embodiments, the genes of interest and/or the one or more selectable markers are operably linked to a promoter, wherein the promoter may be the same or different. In another embodiment, the promoter comprises a eukaryotic promoter (such as, for example, a CMV promoter or an SV40 late promoter), optionally controlled by a prokaryotic operator (such as, for example, a tet operator). In other embodiments, the cell further comprises a gene encoding a prokaryotic repressor (such as, for example, a tet repressor).
[0056] In one aspect, a CHO host cell is provided, comprising recombinase recognition sites. In some embodiments, the recombinase recognition sites are selected from a LoxP site, a Lox5 1 1 site, a Lox2272 site, Lox2372, Lox5171 , and a frt site.
[0057] In another embodiment, the cell further comprises a gene capable of expressing a recombinase. In some embodiments, the recombinase is a Cre recombinase.
[0058] In one embodiment, the selectable marker gene is a drug resistance gene. In another embodiment, the drug resistance gene is a neomycin resistance gene or a hygromycin resistance gene. In another embodiment, the second and third selectable marker genes encode two different fluorescent proteins. In one embodiment, the two different fluorescent proteins are selected from the group consisting of Discosoma coral (DsRed), green fluorescent protein (GFP), enhanced green fluorescent protein (eGFP), cyano fluorescent protein (CFP), enhanced cyano fluorescent protein (eCFP), yellow fluorescent protein (YFP), enhanced yellow fluorescent protein (eYFP) and far-red fluorescent protein (mKate).
[0059] In one embodiment, the first, second, and third promoters are the same. In another embodiment, the first, second, and third promoters are different from each other. In another embodiment, the first promoter is different from the second and third promoters, and the second and third promoters are the same. In more embodiments, the first promoter is an SV40 late promoter, and the second and third promoters are each a human CMV promoter. In other embodiments, the first and second promoters are operably linked to a prokaryotic operator.
[0060] In one embodiment, the host cell line has an exogenously added gene encoding a recombinase integrated into its genome, operably linked to a promoter. In another embodiment, the recombinase is Cre recombinase. In another embodiment, the host cell has a gene encoding a regulatory protein integrated into its genome, operably linked to a promoter. In more embodiments, the regulatory protein is a tet repressor protein.
[0061] In one embodiment, the first GOI and the second GOI encode a light chain, or fragment thereof, of an antibody or a heavy chain, or fragment thereof, of an antibody. In another embodiment, the first GOI encodes a light chain of an antibody and the second GOI encodes a heavy chain of an antibody.
[0062] In certain embodiments, the first, second and third GOI encode a polypeptide selected from the group consisting of a first light chain, or fragment thereof, a second light chain, or fragment thereof and a heavy chain, or fragment thereof. In yet another embodiment, the first, second and third GOI encode a polypeptide selected from the group consisting of a light chain, or fragment thereof, a first heavy chain, or fragment thereof and a second heavy chain, or fragment thereof.
[0063] In one aspect, a method is provided for making a protein of interest, comprising (a) introducing into a CHO host cell a gene of interest (GOI), wherein the GOI integrates into a specific locus such as a locus described in US Patent No. 7771997B2, issued August 10, 2010 or other stable integration and/or expression-enhancing locus; (b) culturing the cell of (a) under conditions that allow expression of the GOI; and (c) recovering the protein of interest. In one embodiment, the protein of interest is selected from the group consisting of a subunit of an immunoglobulin, or fragment thereof, and a receptor, or ligand-binding fragment thereof. In certain embodiments, the protein of interest is selected from the group consisting of an antibody light chain, or antigen-binding fragment thereof, and an antibody heavy chain, or antigen-binding fragment thereof.
[0064] In certain embodiments, the CHO host cell genome comprises further modifications, and comprises one or more recombinase recognition sites as described above, and the GOI is introduced into a specific locus through the action of a recombinase that recognizes the
recombinase recognition site.
[0065] In some embodiments, the GOI is introduced into the cell employing a targeting vector for recombinase-mediated cassette exchange (RMCE) when the CHO host cell genome comprises at least one exogenous recognition sequence within a specific locus.
[0066] In another embodiment, the GOI is introduced into the cell employing a targeting vector for homologous recombination, and wherein the targeting vector comprises a 5' homology arm homologous to a sequence present in the specific locus, a GOI, and a 3' homology arm
homologous to a sequence present in the specific locus. In another embodiment, the targeting vector further comprises two, three, four, or five or more genes of interest. In another embodiment, one or more of the genes of interest are operably linked to a promoter.
[0067] In another aspect, a method is provided for modifying a CHO cell genome to integrate an exogenous nucleic acid sequence, comprising the step of introducing into the cell a vehicle comprising an exogenous nucleic acid sequence wherein the exogenous nucleic acid integrates within a locus of the genome.
[0068] In yet another aspect, the invention provides a process for manufacturing a stable protein formulation comprising the steps of: (a) extracting a protein fraction from the modified host cell of the invention having decreased or ablated expression of esterase, (b) contacting the protein fraction comprising a protein of interest with a column selected from the group consisting of protein A affinity (PA), cation exchange (CEX) and anion exchange (AEX) chromatography, (c) collecting the protein of interest from the media, wherein a reduced level of the esterase activity is associated with the protein fraction collected at step (c), thus providing a stable protein formulation.
[0069] In yet another aspect, the invention provides a process for reducing esterase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate expression of esterase, (b) transfecting the host cell with a protein of interest, (c) extracting a protein fraction from the modified host cell, (c) contacting the protein fraction comprising the protein of interest with a column selected from the group consisting of protein A affinity (PA), cation exchange (CEX) and anion exchange (AEX) chromatography, (d) collecting the protein of interest from the media, and (e) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer, thus providing a protein formulation essentially free of detectable esterase activity. In some embodiments, the protein formulation is essentially free of PLBD2 protein or PLBD2 activity.
[0070] In yet another aspect, a method is provided for modifying a CHO cell genome to express a therapeutic agent comprising a vehicle for introducing, into the genome, an exogenous nucleic acid comprising a sequence for expression of the therapeutic agent, wherein the vehicle comprises a 5' homology arm homologous to a sequence present in the nucleotide sequence of SEQ ID NO:33, a nucleic acid encoding the therapeutic agent, and a 3' homology arm homologous to a sequence present in the nucleotide sequence of SEQ ID NO:33.
[0071] In one more aspect, the invention provides a modified CHO host cell comprising a modified CHO genome wherein the CHO genome is modified by disruption of target sequence within a nucleotide sequence at least 90% identical to SEQ ID NO: 33. In one embodiment, the modified CHO host cell further comprises another FAH target sequence disruption. In some embodiments, the another FAH target sequence is within (1 ) a nucleotide sequence at least 90% identical to SEQ ID NO:61 , (1 ) a nucleotide sequence at least 90% identical to SEQ ID NO:69 and/or (3) a nucleotide sequence at least 90% identical to SEQ ID NO:77.
[0072] In another aspect, the invention provides a modified eukaryotic host cell comprising a modified eukaryotic genome wherein the eukaryotic genome is modified at a target sequence in a coding region of the target gene by a site-specific nuclease. In some embodiments, the site-specific nuclease comprises a zinc finger nuclease (ZFN), a ZFN dimer, a transcription activator-like effector nuclease (TALEN), a TAL effector domain fusion protein, or an RNA-guided DNA endonuclease. The invention also provides methods of making such a modified eukaryotic host cell.
[0073] In any of the aspects and embodiments described above, the target sequence can be placed in the indicated orientation as in SEQ ID NO:33, 61 , 69, 77, 1 17, 1 18, 1 19, 120, 121 , 122, or 123; or in the reverse of the orientation of SEQ ID NO:33, 61 , 69, 77, 1 17, 1 18, 1 19, 120, 121 , 122, or 123.
[0074] Any of the aspects and embodiments of the invention can be used in conjunction with any other aspect or embodiment of the invention, unless otherwise specified or apparent from the context.
[0075] Other objects and advantages will become apparent from a review of the ensuing detailed description.
BRIEF DESCRIPTION OF THE DRAWINGS
[0076] Fig. 1 depicts the results of Taqman® quantitative polymerase chain reaction (qPCR) experiments to detect genomic (gDNA) or transcripts (mRNA) of the modified clones. Primers and probes were designed to flank the sequences predicted as subject to targeted disruption within exon 1 , either starting at nucleotide 37 (sgRNAI ) or starting at nucleotide 44 (sgRNA2) of SEQ ID NO:33. Relative amount of amplicons from clones targeted by either sgRNAI or sgRNA2 are graphed (i.e., relative to amplicons derived from the negative control transfection clones which were subject to no sgRNA or unmatched sgRNA). Clone 1 , for example, has relatively no amplified gDNA nor mRNA per qPCR of the targeted exon 1 region. Clone 1 and several other clones were selected for follow up analysis. The vertical axes represent the relative amount of template containing sgRNAI sequence (upper panel) or sgRNA2 sequence (lower panel).
[0077] Fig. 2A and Fig. 2B illustrate the results of further PCR analysis of a Clone 1 cells population compared to wild type Chinese hamster overy (CHO) cells. Fig. 2A shows a PCR amplicon from Clone 1 that is shorter in length (bp) compared to the amplicon from genomic DNA of wild type cells. Fig. 2B shows a PCR amplicon from Clone 1 that is shorter in length (bp) compared to the amplicon from mRNA of wild type cells. Sequencing confirmed an 1 1 bp deletion in the PLBD2 gene of Clone 1 . The vertical axes represent the size of the PCR fragments in base-pairs (bp).
[0078] Fig. 3 illustrates the relative protein titer of monoclonal antibody 1 (mAbl )-expressing Clone 1 cells (RS001 ) or mAb1 -expressing wild type CHO cells (RS0WT) subject to the same fed-batch culture conditions for 12 days. Samples of conditioned medium were extracted for each culture, and the Protein A binding fraction was quantified at Day 2, 4, 6, 9 and 12. [0079] Fig. 4 shows the results of RS001 or RSOWT cells following production culture and protein purification using either Protein A (PA) alone, or PA and anion exchange (AEX) chromatography. PA-purified mAb1 from RS001 and RSOWT was analyzed for lipase abundance using trypsin digest mass spectrometry. As such, trypsin digests of RS001 - and RSOWT-produced mAb1 were injected into a reverse phase liquid chromatography column coupled to a triple quadrupole mass
spectrometer set to monitor a specific PLBD2 product fragment (as in Table 2). Control reactions containing reference samples of mAb1 (with no endogenous PLBD2) spiked with varying amounts of recombinant PLBD2 were also analyzed and plotted. The signals detected in the experiments were compared to the control reactions to determine concentration of PLBD2. mAb1 produced from Clone 1 shows no detectable amounts of PLBD2 when purified with PA alone.
[0080] Fig. 5 is a line plot depicting percent cell viability as a function of time in days. Open circles (-0-) represent wildtype cells. Filled diamonds (-♦-) represent PLBD2-KO cells. Filled squares (-■-) represent LPL-KO cells. Filled triangles (-A -) represent the double knock-out LPL-KO / PLBD2KO.
[0081] Fig. 6 is a line plot depicting protein production (titer) in grams per liter as a function of time in days. Open circles (-o-) represent wildtype cells. Filled diamonds (-♦-) represent PLBD2-KO cells. Filled squares (-■-) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
[0082] Fig. 7 is a line plot depicting viable cell counts in cells per milliliter as a function of time in days. Open circles (-o-) represent wildtype cells. Filled diamonds (-♦-) represent PLBD2-KO cells. Filled squares (-■-) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
[0083] Fig. 8 shows a formatted alignment of LPL knock out constructs clone 19, clone 20, clone 21 , and clone 22, represented by SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161 , and SEQ ID NO: 162, respectively. The partial wildtype LPL sequence is represented by SEQ ID NO: 158.
DETAILED DESCRIPTION
[0084] Before the present methods are described, it is to be understood that this invention is not limited to particular methods, and experimental conditions described, as such methods and conditions may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present invention will be limited only by the appended claims.
[0085] As used in this specification and the appended claims, the singular forms "a", "an", and "the" include plural references unless the context clearly dictates otherwise. Thus for example, a reference to "a method" includes one or more methods, and/or steps of the type described herein and/or which will become apparent to those persons skilled in the art upon reading this disclosure.
[0086] Unless defined otherwise, or otherwise specified, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
[0087] Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, particular methods and materials are now described. All publications mentioned herein are incorporated herein by reference in their entirety.
Definitions
[0088] The phrase "exogenously added gene" or "exogenously added nucleic acid" refers to any DNA sequence or gene not present within the genome of the cell as found in nature. For example, an "exogenously added gene" within a CHO genome, can be a gene from any other species (e.g., a human gene), a chimeric gene (e.g., human/mouse), or a hamster gene not found in nature within the particular CHO locus in which the gene is inserted (i.e., a hamster gene from another locus in the hamster genome), or any other gene not found in nature to exist within a CHO locus of interest.
[0089] Percent identity, when describing an esterase, e.g., a hydrolase protein, such as SEQ ID NO:32, 34, 35, 37, 62,70, 74, 75, 76, 78, 82, 83, 84; 124, 125, 126, 127, 128, 129, and 130; or gene, such as SEQ ID NO:33, 61 , 69, 77, 1 17, 1 18, 1 19, 120, 121 , 122, and 123 includes homologous sequences that display the recited identity along regions of contiguous homology, but the presence of gaps, deletions, or insertions that have no homolog in the compared sequence are not taken into account in calculating percent identity.
[0090] A "percent identity" determination between, e.g., SEQ ID NO:32 with a species homolog would not include a comparison of sequences where the species homolog has no homologous sequence to compare in an alignment (i.e., SEQ ID NO:32 compared to a fragment thereof, or the species homolog has a gap or deletion, as the case may be). Thus, "percent identity" does not include penalties for gaps, deletions, and insertions.
[0091] "Targeted disruption" of a gene or nucleic acid sequence refers to gene targeting methods that direct cleavage or breaks (such as double stranded breaks) in genomic DNA and thus cause a modification to the coding sequence of such gene or nucleic acid sequence. Gene target sites (i.e. target sequences) are the sites within the genomic sequence selected for cleavage or break by a nuclease. The DNA break is normally repaired by the non-homologous end-joining (NHEJ) DNA repair pathway. During NHEJ repair, insertions or deletions (InDels) may occur, as such, a small number of nucleotides are either inserted or deleted at random at the site of the break and these InDels may shift or disrupt the open reading frame (ORF) of the target gene. Shifts in the ORF may cause significant changes in the resulting amino acid sequence downstream of the DNA break, or may introduce a premature stop codon, therefore the expressed protein, if any, is rendered nonfunctional or subject to degradation.
[0092] "Targeted insertion" refers to gene targeting methods employed to direct insertion or integration of a gene or nucleic acid sequence to a specific location on the genome, i.e., to direct the DNA to a specific site between two nucleotides in a contiguous polynucleotide chain. Targeted insertion may also be performed to introduce a small number of nucleotides or to introduce an entire gene cassette, which includes multiple genes, regulatory elements, and/or nucleic acid sequences. "Insertion" and "integration" are used interchangeably.
[0093] "Recognition site" or "recognition sequence" is a specific DNA sequence recognized by a nuclease or other enzyme to bind and direct site-specific cleavage of the DNA backbone.
Endonucleases cleave DNA within a DNA molecule. Recognition sites are also referred to in the art as recognition target sites.
[0094] Polysorbates are fatty acid esters of sorbitan or iso-sorbide (polyoxyethylene sorbitan or iso- sorbide mono- or di- esters). The polyoxyethylene serves as the hydrophilic head group and the fatty acid as the lipophilic hydrophobic tail. The effectiveness as a surfactant of the polysorbate depends upon the amphiphilic nature of the molecule with both hydrophilic head and hydrophobic tail present (in a single molecule). When a polysorbate degrades (hydrolyzes) into its component head group and fatty acid tail, it loses its effectiveness as a protein stabilizer, potentially allowing for aggregation and subsequent subvisible particle (SVP) formation is an indicator of such degradation. SVPs may attribute to immunogenicity. Regulatory authorities like the United States Food and Drug Administration (USFDA) provide limitations on the number of subvisible particles (SVPs) allowed in a pharmaceutical formulation. United States Pharmacopeia (USP) publishes standards for strength, purity and quality of drugs and drug ingredients, as well as food ingredients and dietary
supplements. For example, USP 31 monograph <788> sets the limit for number of particles allowed in parenteral formulations. USP 31 monograph <788> is available at http://www.uspnf.com/official- text/revision-bulletins/particulate-matter-iniections: and as Revision Bulletin Official July 1 , 2012, <788> Particulate Matter in Injections, The United States Pharmacopeial Convention. For large volume parenterals (greater than 100 ml_), the limit is set at no more than 25 particles of at least 10 microns per ml_, and no more than 3 particles of at least 25 microns per ml_. For small volume parenterals (100 ml. or less), the limit is set at no more than 6,000 particles of at least 10 microns per container, and no more than 600 particles of at least 25 microns per container.
[0095] The importance of maintaining a stable composition to minimize losses of the biologically active agent due to any contaminant or degradative process is emphasized by the International Conference on Harmonisation of Technical Requirements For Registration of Pharmaceuticals For Human Use (ICH). According to ICH Specifications (Q6A and Q6B), if a drug substance does not degrade in the specific formulation and under the specific storage conditions proposed in a new drug application, as demonstrated via appropriate analytical methodology, then degradation product testing may be reduced or eliminated upon approval by the regulatory authorities.
[0096] The term "stability" refers to the retention of an acceptable degree of physical structure (colloidal, nature), chemical structure or biological function of the biologically active agent (e.g., biotherapeutic or other protein produced in a cell-based bioprocess) over time during storage (a.k.a. "shelf-life"), during processing, or after administration and while in vivo. The biologically active agent may be stable even though it does not maintain 100% of its structure or function after storage or administration for a defined amount of time. Under certain circumstances, if about 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97% 98%, 99%, or > 99% of the biologically active agents have a native conformation, structure or function, the biologically active agent and formulation containing the biologically active agent may be regarded as "stable".
[0097] Stability can be measured, inter alia, by determining the percentage of native molecule that remains in the formulation after storage or administration for a defined amount of time at a defined temperature. The percentage of native molecule can be determined by, inter alia, size exclusion chromatography (e.g., size exclusion high performance liquid chromatography [SE-HPLC]), such that native means non-aggregated and non-degraded. In certain embodiments, at least about 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% of the native form of the biologically active agent can be detected in the formulation after a defined amount of time at a defined temperature or under physiological conditions after administration. The defined amount of time after which stability is measured can be about 14 days, about 28 days, about 1 month, about 2 months, about 3 months, about 4 months, about 5 months, about 6 months, about 7 months, about 8 months, about 9 months, about 10 months, about 1 1 months, about 12 months, about 18 months, about 24 months, or more. The temperature at which the formulation containing the biologically active agent may be kept when assessing stability can be any temperature from about -80°C to about 45°C, e.g., storage at about - 80°C, about -30°C, about -20°C, about 0°C, about 4°-8°C, about 5°C, about 25°C, about 35°C, about 37°C or other physiological temperatures, or about 45°C. For example, the biologically active agent may be deemed stable if after 3 months under physiological conditions, greater than about 75%, 80%, 85% or 90% of native molecule is detected in the soluble fraction by SE-HPLC or other size exclusion or size determination method. "Physiological temperature" includes the body temperature of any vertebrate. For example, the physiological temperature of humans is about 37°C. In some embodiments of the invention, physiological temperature is between about 25°C and about 45°C. In some embodiments, physiological temperature is about 25°C, about 26°C, about 27°C, about 28°C, about 29°C, about 30°C, about 31 °C, about 32°C, about 33°C, about 34°C, about 35°C, about 36°C, about 37°C, about 38°C, about 39°C, about 40°C, about 41 °C, about 42°C, about 43°C, about 44°C, and about 45°C.
[0098] Stability can be measured, inter alia, by determining the percentage of biologically active agent, such as a protein, that forms an aggregate (i.e., high molecular weight species) after a defined amount of time at a defined temperature, wherein stability is inversely proportional to the percent high molecular weight (HMW) species that is formed of the biologically active agent (protein). The percentage of HMW species of the biologically active agent can be determined by, inter alia, size exclusion chromatography, as described above. A pharmaceutical formulation containing the biologically active agent may also be deemed stable if after three months at physiological conditions less than about 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, or 0.1 % of the biologically active agent is detected in a HMW form.
[0099] Stability can be measured, inter alia, by determining the percentage of a biologically active agent, such as a protein, that is degraded or otherwise is found as a low molecular weight (LMW) species after a defined amount of time at a defined temperature. Stability is inversely proportional to the percent LMW species that is formed in the soluble fraction. The percentage of LMW species of the biologically active agent in the soluble fraction can be determined by, inter alia, size exclusion chromatography, as described above. A pharmaceutical formulation may also be deemed stable if after three months under storage conditions less than about 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, or 0.1 % of the biologically active agent is detected in a LMW form.
[00100] Pharmaceutical compositions prepared in host cells as described herein are envisioned to comply with standards set forth by multiple regulatory authorities and ICH guidelines. The composition complies if tested for subvisible particles and the test results in the average number of particles present in the units tested does not exceed 12 per milliliter equal to or greater than 10 um in size, and does not exceed 2 per milliliter equal to or greater than 25 um. Various test for microscopic particles in solution are well-known in the art, including but not limited to tests recommended in ICH Guideline Q4B Annex 3(R1 ), dated 27 September 2010 (Evaluation And Recommendation Of Pharmacopoeial Texts For Use In The lch Regions On Test For Particulate Contamination: Sub-Visible Particles General Chapter).
[00101] "Host cell proteins" or "HCPs" refer to proteins produced or encoded by the host organisms used to produce recombinant therapeutic proteins. HCPs are generally process-related impurities during biologies production. The amount of residual HCPs in drug product is generally considered a critical quality attribute (CQA), due to their potential to affect product safety and efficacy. Regulatory authorities require a product sponsor to monitor the removal of HCPs in drug product during bioprocess development. A sensitive assay e.g., immunoassay, capable of detecting a wide range of protein impurities is generally utilized. This testing can include verification at commercial scale in accordance with regional regulations and may be done at the time of submission of a marketing approval application. According to ICH Specifications (Q6A and Q6B, section 2.3), if a drug substance or drug product does not contain any impurity in the specific formulation, i.e. if efficient control or removal to acceptable levels is demonstrated by suitable studies, further testing may be reduced or eliminated upon approval by the regulatory authorities.
[00102] Pharmaceutical compositions prepared in host cells as described herein are envisioned to comply with standards set forth by multiple regulatory authorities and ICH guidelines. The composition prepared by the host cells described herein comprise less than 100 ng/mg (ppm), less than about 90 ppm, less than about 80 ppm, less than about 70 ppm, less than about 60 ppm, less than about 50 ppm, less than about 40 ppm, less than about 30 ppm, less than about 20 ppm, less than about 10 ppm, less than about 5 ppm, or 0 ppm of the target host cell protein, i.e. fatty acid hydrolase.
[00103] The term "fatty acid hydrolase" or "FAH" refers to any hydrolytic enzyme that cleaves at a carbonyl group creating a carboxylic acid product in which the carboxylic acid comprises an R- group that is lipophilic or otherwise hydrophobic. In some embodiments, the carboxylic acid product is a fatty acid. "Esterases" and "fatty acid acylases/amidases" are included as subgenera of fatty acid hydrolase. "Lipases" are a subgenus of esterases that cleave lipids (fats, waxes, sterols, glycerides and phospholipids. "Phospholipases" are a subgenus of lipases that cleave
phospholipids. Esterases cleave fatty acid esters into fatty acids and alcohols. Lipases include PLBD2, LPL and LIPA. "Ceramidase" is a subgenus of fatty acid acylase that cleaves ceramide and releases a fatty acid and sphingosine, which is an amino alcohol. Examples of ceramidases include acid ceramidase, neutral ceramidase, alkaline ceramidase 1 , alkaline ceramidase 2 and alkaline ceramidase 3.
[00104] Protein A-binding fraction refers to the fraction of cell lysate from cultured cells expressing a protein of interest which binds to a Protein A affinity format. It is well understood in the art that Protein A affinity chromatography, such as Protein A chromatography medium, such as resins, beads, columns and the like, are utilized to capture Fc-containing proteins due to their affinity to Protein A.
[00105] Phospholipase B-like 2 (PLBD2) refers to the homologs of a phospholipase gene known as NCBI RefSeq. XM_003510812.2 (SEQ ID NO:33) or protein known as NCBI RefSeq. XP_003510860.1 (SEQ ID NO:32), and further described herein. PLBD2 is also referred to in the art as putative phospholipase B-like 2 (PLBL2), 76 kDa protein, LAMA-like protein 2, PLB homolog 2, lamina ancestor homolog 2, mannose-6-phosphate protein associated protein p76, p76, phospholipase B-like 2 32 kDa form, phospholipase B-like 2 45 kDa form, or Lysosomal 66.3 kDa protein.
[00106] Lipoprotein lipase (LPL) is a glycosylated homodimer secreted by parenchymal cells and associated with endothelial cells of the capillary lumen. Exemplary LPL proteins include Chinese hamster LPL (SEQ ID NO:62), mouse LPL (SEQ ID NO:66), ), rat LPL (SEQ ID NO:67) and human LPL (SEQ ID NO:68). Mouse LPL is 92% identical to Chinese hamster LPL. Rat LPL is 92% identical to Chinese hamster LPL. Human LPL is 88% identical to Chinese hamster LPL. In one embodiment, Chinese hamster LPL is encoded by a polynucleotide sequence of SEQ ID NO:61 .
[00107] Lysosomal acid lipase (LI PA), also known as lysosomal lipase, lipase A, lysosomal acid and cholesterol esterase is an intracellular lipase that functions in the lysosome. LI PA reversibly catalyzes cholesteryl ester bond formation and cleavage. LI PA is also a glycosylated homodimer. Exemplary LIPA proteins include Chinese hamster LIPA (SEQ ID NO:70), mouse LIPA (SEQ ID NO:74), ), rat LIPA (SEQ ID NO:75) and human LIPA (SEQ ID NO:76). Mouse LIPA is 72% identical to Chinese hamster LIPA. Rat LIPA is 75% identical to Chinese hamster LIPA. Human LIPA is 74% identical to Chinese hamster LIPA. In one embodiment, Chinese hamster LIPA is encoded by a polynucleotide sequence of SEQ ID NO:69.
[00108] Acid ceramidase (CE), also known as ASAH1 , AC, ACDase, ASAH, PHP, PHP32, SMAPME, N-acylsphingosine amidohydrolase (acid ceramidase) 1 is an acylase that cleaves ceramide to produce fatty acid and sphingosine. It is a heterodimer comprising a non-glycosylated alpha subunit and a glycosylated beta subunit. Acid ceramidase has an acid pH optimum. The lipid accumulation disease, Farber Lipogranulomatosis, is associated with a deficiency in acid ceramidase activity. Exemplary acid ceramidases (ACE) include Chinese hamster ACE (SEQ ID NO:78), mouse ACE (SEQ ID NO:82), ), rat ACE (SEQ ID NO:83) and human ACE (SEQ ID NO:84). Mouse ACE is 87% identical to Chinese hamster ACE. Rat ACE is 89% identical to Chinese hamster ACE. Human ACE is 83% identical to Chinese hamster ACE. In one embodiment, Chinese hamster ACE is encoded by a polynucleotide sequence of SEQ ID NO:77.
[00109] Platelet-activating factor acetylhydrolase IB subunit gamma (PAFAHG), also known as PAFAH1 B3, PAFAHG, and platelet activating factor acetylhydrolase 1 b catalytic subunit 3 is one of the catalytic subunits along with beta of the cytosolic tetrameric platelet-activating factor
acetylhydrolase IB. PAFAHG belongs to the phospholipase A2 family and catalyzes the hydrolysis of the acyl group at position 2 of glycerol in bioactive phospholipids (see Stafforini et al., Journal of Biological Chemistry, 272:17895-17898, July 1997). Chinese hamster PAFAHG (SEQ ID NO:124) is 98% identical to both rat and mouse PAFAHG, and 96% identical to human PAFAHG. In one embodiment, Chinese hamster PAFAHG is encoded by a polynucleotide sequence of SEQ ID NO:1 17.
[00110] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO: 125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00111] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO: 125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00112] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO: 125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00113] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO: 125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00114] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO: 125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00115] Additional fatty acid hydrolases that can serve as targets for deletion, either individually, or in combination with one of more additional fatty acid hydrolases are listed in Table 1. Any of the following hydrolases, or their equivalents, listed in Table A may be the target protein in the creation of a knockout cell line, where removal of the host cell protein is necessitated due to contamination in the preparation of a biopharmaceutical product.
Table 1
[00116] The term "cell" or "cell line" includes any cell that is suitable for expressing a recombinant nucleic acid sequence. Cells include those of prokaryotes and eukaryotes (single-cell or multiple- cell), bacterial cells (e.g. , strains of E. coli, Bacillus spp., Streptomyces spp., etc.), mycobacteria cells, fungal cells, yeast cells (e.g.,S. cerevisiae, S. pombe, P. partoris, P. methanolica, etc.), plant cells, insect cells (e.g.,SF-9, SF-21 , baculovirus-infected insect cells, Trichoplusia ni, etc.), non- human animal cells, mammalian cells, human cells, or cell fusions such as, for example, hybridomas or quadromas. In certain embodiments, the cell is a human, monkey, ape, hamster, rat or mouse cell. In certain embodiments, the cell is eukaryotic and is selected from the following cells: CHO (e.g.,CHO K1 , DXB-1 1 CHO, Veggie-CHO), COS (e.g.,COS-7), retinal cells, Vero, CV1 , kidney (e.g.,HEK293, 293 EBNA, MSR 293, MDCK, HaK, BHK21 ), HeLa, HepG2, WI38, MRC 5, Colo25, HB 8065, HL-60, Jurkat, Daudi, A431 (epidermal), CV-1 , U937, 3T3, L cell, C127 cell, SP2/0, NS-0, MMT cell, tumor cell, and a cell line derived from an aforementioned cell. In some embodiments, the cell comprises one or more viral genes, e.g., a retinal cell that expresses a viral gene (e.g., a PER.C6® cell).
General Description
[00117] The invention is based at least in part on a recombinant host cell and cell expression system thereof that decreases expression of two or more an endogenous host cell fatty acid hydrolases (FAHs), decreases the enzymatic function or binding ability of two or more endogenous host cell FAHs, or lacks detectable expression of two or more FAHs. The inventors discovered that the disruption of genes encoding at least two FAHs allows for the optimized and efficient production and purification of biopharmaceutical products expressed in such expression systems. The invention may be employed in several ways, such as 1 ) utilizing gene editing tools to totally knockout FAH expression, whereas no measurable full-length FAH enzyme is expressed in the cell due to disruption of the gene encoding the FAH; 2) utilizing gene editing tools to eliminate or reduce enzymatic activity, whereas the FAH protein is expressed but rendered nonfunctional due to disruptions in its gene; and 3) utilizing gene editing tools to eliminate or reduce the ability of an endogenous host cell FAH to bind exogenous recombinant protein produced by the cell. FAH activity was determined in protein fractions of certain antibody-producing cells. Several particular fatty acid hydrolases were determined as contaminants in these protein fractions, including three carboxylic esterases (a.k.a. esterases): phospholipase B-like (PLBD2), lipoprotein lipase (LPL) and lysosomal acid lipase (LIPA), and acid ceramidase, a carboxylic amidase (an acylase). Gene editing target sites were identified in hamster PLBD2, LPL, LIPA and acid ceramidase (ACE) genes that enable targeted disruption of those genes in a hamster cell (i.e., CHO) genome.
[00118] An optimized host cell comprising a combination of genetic modifications that affect the expression of genes encoding (1 ) PLBD2 and LPL; (2) PLBD2 and LIPA; (3) PLBD2 and ACE; (4) LPL and LIPA; (5) LPL and ACE; (6) LIPA and ACE; (7) PLBD2, LPL and LIPA; (8) PLBD2, LPL and ACE; (9) PLBD2, LIPA and ACE; (10) LPL, LIPA and ACE, (1 1 ) PLBD2, LPL, LIPA and ACE, (12) PLBD2 and PIPLC, (13) PLBD2 and CEH, (14) PLBD2 and PAFAHG, (15) PLBD2 and LCE, (16) PLBD2 and ASA, (17) PLBD2 and IAH1 , (18) PLBD2 and LPLA2, (19) LPL and PIPLC, (20) LPL and CEH, (21 ) LPL and PAFAHG, (22) LPL and LCE, (23) LPL and ASA, (24) LPL and IAH1 , (25) LPL and LPLA2, (26) LIPA and PIPLC, (27) LIPA and CEH, (28) LIPA and PAFAHG, (29) LIPA AND LCE, (30) LIPA and ASA, (31 ) LIPA and IAH1 , (32) LIPA and LPLA2, (33) ACE and PIPLC, (34) ACE and CEH, (35) ACE and PAFAHG, (36) ACE and LCE, (37) ACE and ASA, (38) ACE and IAH1 , (39) ACE and LPLA2, (40) PIPLC and CEH, (41 ) PIPLC and PAFAHG, (42) PIPLC and LCE, (43) PIPLC and ASA, (44) PIPLC and IAH1 , (45) PIPLC and LPLA2, (46) CEH and PAFAHG, (47) CEH and LCE, (48) CEH and ASA, (49) CEH and IAH1 , (50) CEH and LPLA2, (51 ) PAFAHG and LCE, (53) PAFAHG and ASA, (54) PAFAHG and IAH1 , (55) PAFAHG and LPLA2, (56) LCE and ASA, (57) LCE and IAH1 , (58) LCE and LPLA2, (56) ASA, and IAH1 , (58) ASA and LPLA2, or (59) IAH1 and LPLA2, and the like is/are useful for the biological production of high-quality proteins.
[00119] Such a cell is envisioned to reduce the burden of certain purification steps, thereby reducing time and cost, while increasing production yield. Also, the formulated protein is expected to have improved stability due to the reduced hydrolase burden.
[00120] The invention is also based on the specific targeting of an exogenous gene to the integration site. The methods of the invention allow efficient modification of the cell genome, thus producing a modified or recombinant host cell useful as a cell expression system for the bioprocessing of therapeutic or other commercial protein products. To this end, the methods of the invention employ cellular genome gene editing strategies for the alteration of particular genes of interest that otherwise may diminish or contaminate the quality of recombinant protein formulations, or require multiple purification steps.
[00121] The compositions of the invention, e.g., gene editing tools, can also be included in expression constructs for example, in expression vectors for cloning and engineering new cell lines. These cell lines comprise the modifications described herein, and further modifications for optimal incorporation of expression constructs for the purpose of protein expression are envisioned.
Expression vectors comprising polynucleotides can be used to express proteins of interest transiently, or can be integrated into the cellular genome by random or targeted recombination such as, for example, homologous recombination or recombination mediated by recombinases that recognize specific recombination sites (e.g., Cre-lox-mediated recombination).
[00122] Target sites for disruption or insertion of DNA are typically identified with the maximum effect of the gene disruption or insertion in mind. For example, target sequences may be chosen near the N-terminus of the coding region of the gene of interest whereas a DNA break is introduced within the first or second exon of the gene. Introns (non-coding regions) are not typically targeted for disruption as repair of the DNA break in that region may not disrupt the target gene. The changes introduced by these modifications are permanent to the genomic DNA of the organism.
[00123] Essentially, following identification of target sites within SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77, gene editing protocols were employed to render a nonfunctional versions of two or more of those genes. In some embodiments, one of the two or more genes encodes PLBD2 (e.g., SEQ ID NO:33). Once the contaminant FAH host cell proteins are
eliminated, protocols known in the art for introducing an expressible gene of interest (GOI), such as a multi-subunit antibody, along with any other desirable elements such as, e.g., promoters, enhancers, markers, operators, ribosome binding sites (e.g., internal ribosome entry sites), etc. are also employed.
[00124] The resulting recombinant cell line conveniently provides more efficient downstream bioprocess methods with respect to expressible exogenous genes of interest (GOIs), since purification steps for exogenous proteins of interest are eliminated due to the absence of the contaminant host cell protein. Eliminating or refining purification procedures also results in higher amounts (titer) of the recovered protein of interest.
Physical and Functional Characterization of Modified CHO Cells
[00125] Applicants have discovered enzymatic activities associated with the destabilization of polysorbates (including polysorbate 20 and polysorbate 80) and/or enzymatic activities co-purifying with highly concentrated, multimerized or aggregated protein. Those activities were found to be associated with one or more fatty acid hydrolases (FAHs). One such FAH was identified from the peptide sequences listed in Table 2. A BLAST search of those peptide sequences revealed identity with a putative phospholipase B-like 2 (PLBD2, also referred to as PLBL2). PLBD2 is highly conserved in hamster (SEQ ID NO:32), mice (SEQ ID NO:34), rat (SEQ ID NO:35), human (SEQ ID NO:36), and bovine (SEQ ID NO:37). The applicants discovered that PLBD2, which co-purifies under certain processes with some classes of proteins-of-interest manufactured in a mammalian cell line, has enzymatic activity responsible for the hydrolysis of polysorbate 20 and 80. Other FAH species, of which PLBD2 is an example, may contribute to polysorbate instability or persist as hydrophobic "sticky" proteins that bind protein multimers or aggregates during purification and ultimate formulation, depending upon the particular protein-of-interest and/or background of the host cell.
Table 2
[00126] Ester hydrolysis of polysorbate 80 was recently reported (see Labrenz, S.R., "Ester hydrolysis of polysorbate 80 in mAb drug product: evidence in support of the hypothesized risk after observation of visible particulate in mAb formulations," J. P arma. Sci. 103(8):2268-77 (2014)). That paper reported the formation of visible particles in a formulation containing IgG. The author postulated that the colloidal IgG particles formed due to the enzymatic hydrolysis of oleate esters of polysorbate 80. Although no esterase was directly identified, the author speculates that a lipase or tweenase co-purified with the IgG, which was responsible for degrading the polysorbate 80.
Interestingly, IgGs formulated with polysorbate 20 did not form particles and the putative esterase did not hydrolyze the polysorbate 20. The author reported that the putative lipase associated with the IgG did not affect saturated C12 fatty acid (i.e., laurate) (Id at 7.)
[00127] The applicants discovered three other FAH proteins in addition to PLBD2 that co-purify with antibodies produced in mammalian cells: lipoprotein lipase(LPL), lysosomal acid lipase (LIPA) and acid ceramidase (ACE). LPL is a triacylglycerol/diacylglycerol hydrolase of the carboxylic ester hydrolase (esterase) family (see Hide ef al., "Structure and evolution of the lipase superfamily," J Lipid Res. 1992 Feb; 33(2): 167-78). LIPA is a sterol esterase and synthase that acts on esters of sterols and long-chain fatty acids (see Dubland and Francis, "Lysosomal acid lipase: at the crossroads of normal and atherogenic cholesterol metabolism," Front. Cell Dev. Biol. 2015 Feb 2; 3(3): 1-1 1 ). Acid ceramidase is not a carboxylic ester hydrolase, but rather an amide hydrolase that cleaves fatty acids from ceramide at the amide bond (carboxylic amide hydrolase) (see Park and Schuchman, "Acid ceramidase and human disease," Biochim. Biophys. Acta. 2006 Dec; 1758(12): 2133-8).
[00128] Phospholipases are a family of esterase enzymes that catalyze the cleavage of phospholipids. Each phospholipase subclass has different substrate specificity based on its target cleavage site. Phospholipase B (PLB) was identified as related to a group of prokaryotic and eukaryotic lipase proteins by virtue of the presence of a highly conserved amino acid sequence motif, Gly-Asp-Ser-Leu (GDSL) (Upton, C, and Buckley, JT. A new family of lipolytic enzymes? Trends Biochem Sci. 1995; 20:178-179). However, phospholipase B is also classified with known GDSL(S) hydrolases, and has little sequence homology to true lipases, differentiating itself structurally from phospholipases by having a serine-containing motif closer to the N-terminus than other lipases. Thus, phospholipase B-like proteins are also classified as N-terminal nucleophile (Ntn) hydrolases. Functionally, phospholipase B-like enzymes hydrolyze their target substrate (fatty acid esters such as diacylglycerophospholipids) to produce free fatty acids and ester-containing compounds {e.g., produces glycerophosphocholine), in a similar in manner as other
phospholipases. It has been suggested that PLB-like proteins, such as phospholipase B-like protein 1 (PLBD1 ) and phospholipase B-like protein 2 (PLBD2), also have amidase activity, similar to other Ntn hydrolases (Repo, H. et al, Proteins 2014; 82:300-31 1 ).
[00129] Lipoprotein lipases have also been demonstrated to cleave carboxylic ester bonds of polysorbate 20 and 80; and to associate with some monoclonal antibodies during production (see N. Levy, "Host cell protein impurities and protein-protein interactions in downstream purification of monoclonal antibodies," Dissertation submitted to the Faculty of the University of Delaware, Summer 2014, UMI 3642330, Published by ProQuest LLC, 2014). Cell-cultures of a CHO-K1 RNAi knock-down of LPL reveled diminished polysorbate esterase activity. The effect of such a knockdown on overall cell viability or the production of useful titers of ectopic protein has not been investigated.
[00130] Knockout of a host cell gene, such as an FAH, more particularly one or more of phospholipase B-like protein 2, lipoprotein lipase, lysosomal acid lipase and acid ceramidase may be accomplished in several ways. Rendering the FAH encoding gene nonfunctional, or reducing the functional activity of the target FAH protein may be done by introducing point mutations in the FAH genomic sequence, particularly in the exons (coding regions). In some embodiments, the nucleic acid sequences of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 were identified and sequences upstream and downstream of the target site (i.e., homologous arms) may be utilized to integrate an expression cassette comprising a mutated gene by homologous recombination. Further gene editing tools are described herein in accordance with the invention.
[00131] Cell lines devoid of multiple FAH activities, particularly PLBD2, LPL, LIPA and ACE activity, are useful for the production of therapeutic proteins to be purified and stored long term, and such cell lines solve problems associated with long term storage of pharmaceutical compositions in a formulation containing a fatty acid ester surfactant by maintaining protein stability and reducing subvisible particle (SVP) formation (see also PCT International Application No. PCT/US 15/54600 filed October 8, 2015, which is hereby incorporated in its entirety into the specification).
[00132] Assays to detect FAH activity include polysorbate degradation measurements. Unpurified protein supernatants or fractions from CHO cells, and supernatant at each step or sequence of steps when subjected to sequential purification steps, is tested for stability of polysorbate, such as polysorbate 20 or 80. The measurement of percent intact polysorbate reported is inversely proportional to the amount of contaminating FAH activity. Other measurements for detection of FAH activity or presence of FAH in a protein sample are known in the art. Detection of FAH protein {e.g., lipase, phospholipase, PLBD2, LPL, LIPA, acylase, ACE) may be done by trypsin digest mass spectrometry.
[00133] It is hypothesized that the instability of the non-ionic detergent, i.e., surfactant such as polysorbate, in a protein (e.g., antibody) formulation contributes to the formation of subvisible particles. Thus, degradation of the polysorbate incurs loss of surfactant activity, and therefore allows the protein to aggregate and form subvisible particles. Additionally or alternatively, the fatty acids released by the degrading sorbitan fatty acid esters may also contribute to subvisible particle formation as immiscible fatty acid droplets. Therefore, levels of subvisible particles≥ 10
micrometers in diameter may be counted in the protein formulation in order to detect esterase or other FAH activity.
[00134] Other assays for detecting FAH, especially esterase activity are known in the art. For example, glycerophospho[3H]choline formation from phosphatidyl[3H]choline following incubation of phosphatidyl[3H]choline and protein supernatant may be determined by thin-layer chromatography (following similar protocols according to Kanoh, H. et al. 1991 Comp Biochem Physiol 102B(2):367- 369).
[00135] SEQ ID NO:32 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of PLBD2, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:32, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical
compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:32 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
[00136] The mammalian PLBD2 sequences (nucleic acid and amino acid) are conserved among hamster, human, mouse and rat genomes. Table 3 identifies exemplary mammalian PLBD2 proteins and their degree of homology.
TABLE 3: Amino acid identity of PLBD2 homologs
[00137] In certain embodiments, the targeted disruption of SEQ ID NO:33 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 33-62, 37-56, 40-69, 44-63, 1 10-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
[00138] In another embodiment, the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 33-62, 37-56, 40-69, 44-63, 1 10-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
[00139] In another embodiment, the PLBD2 nucleic acid sequence is at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:33 or target sequence thereof.
[00140] SEQ ID NO:62 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of LPL, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:62, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:62 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
[00141] The mammalian LPL sequences (nucleic acid and amino acid) are conserved among hamster, human, mouse and rat genomes. Table 4 identifies exemplary mammalian LPL proteins and their degree of homology.
TABLE 4: Amino acid identity of LPL homologs
[00142] In certain embodiments, the targeted disruption of SEQ ID NO:61 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400- 420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490- 510, 500- 520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590- 610, 600- 620, 610-630, 620-640, 630-650, 465-484, 558-577, and 593-612 of SEQ ID NO:61.
[00143] In another embodiment, the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410- 430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490- 510, 500-520, 510- 530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590- 610, 600-620, 610- 630, 620-640, 630-650, 465-484, 558-577, and 593-612 of SEQ ID NO:61.
[00144] In another embodiment, the LPL nucleic acid sequence is at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:61 or target sequence thereof.
[00145] SEQ ID NO:70 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of LIPA, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:70, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:70 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
[00146] The mammalian LIPA sequences (nucleic acid and amino acid) are conserved among hamster, human, mouse and rat genomes. Table 5 identifies exemplary mammalian LIPA proteins and their degree of homology.
TABLE 5: Amino acid identity of LIPA homologs
[00147] In certain embodiments, the targeted disruption of SEQ ID NO:69 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 180-199, 239-258, and 276-295 of SEQ ID NO:69.
[00148] In another embodiment, the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 180-199, 239-258, and 276-295 of SEQ ID NO:69.
[00149] In another embodiment, the LIPA nucleic acid sequence is at least about 70% identical, at least about 71 % identical, at least about 72% identical, at least about 73% identical, at least about 74% identical, at least about 75% identical, at least about 76% identical, at least about 77% identical, at least about 78% identical, or at least about 79% identical, at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:69 or target sequence thereof.
[00150] SEQ ID NO:78 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of LPL, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:78, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:78 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
[00151] The mammalian acid ceramidase (ACE) sequences (nucleic acid and amino acid) are conserved among hamster, human, mouse and rat genomes. Table 6 identifies exemplary mammalian ACE proteins and their degree of homology. TABLE 6: Amino acid identity of ACE homologs
[00152] In certain embodiments, the targeted disruption of SEQ ID NO:77 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 135-154, 237-256, and 332-351 of SEQ ID NO:77.
[00153] In another embodiment, the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 350-370, 360-380, 135-154, 237-256, and 332-351 of SEQ I D NO:77.
[00154] In another embodiment, the ACE nucleic acid sequence is at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:77 or target sequence thereof.
[00155] Cell populations expressing enhanced levels of a protein of interest can be developed using the cell lines and methods provided herein. The isolated commercial protein, protein supernatant or fraction thereof, produced by the cells of the invention have no detectable esterase or esterase activity. Cell pools further modified with exogenous sequence(s) integrated within the genome of the modified cells of the invention are expected to be stable over time, and can be treated as stable cell lines for most purposes. Recombination steps can also be delayed until later in the process of development of the cell lines of the invention.
Genetically Modifying the Target Host Cell Protein
[00156] Methods for genetically engineering a host cell genome in a particular location (i.e., target host cell protein) may be achieved in several ways. Genetic editing techniques were used to modify a nucleic acid sequence in a eukaryotic cell, wherein the nucleic acid sequence is an endogenous sequence normally found in such cells and expressing a contaminant host cell protein. Clonal expansion is necessary to ensure that the cell progeny will share the identical genotypic and phenotypic characteristics of the engineered cell line. In some examples, native cells are modified by a homologous recombination technique to integrate a nonfunctional or mutated target nucleic acid sequence encoding a host cell protein, such as a variant of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and/or SEQ ID NO:77.
[00157] One such method of editing the CHO PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and ASA genomic sequences involves the use of guide RNAs and a type II Cas enzyme to specifically target an exon of PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and/or ASA. Specific guide RNAs directed to particular exons of CHO PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and ASA have been employed (Table 7) in a site-specific nuclease editing method as described herein. Other methods of targeted genome editing, for example nucleases, recombination-based methods, or RNA interference, to modify the FAH genes may be employed for the targeted disruption of the CHO genome.
[00158] In some embodiments, the engineered mammalian host cell (i.e. non-natural cell) comprises one or more disruptions within gene sequences selected from the group consisting of nucleotides 37-63 of SEQ ID NO:33, nucleotides 465-612 of SEQ ID NO:61 , nucleotides 180-295 of SEQ ID NO:69, nucleotides 135-351 of SEQ ID NO:77, nucleotides 249-388 of SEQ ID NO:1 17, nucleotides 1624-2157 of SEQ ID NO:163, nucleotides 372-1399 of SEQ ID NO:1 18, nucleotides 1 155-1600 of SEQ ID NO:1 19, nucleotides 423-615 of SEQ ID NO:120, nucleotides 753-1 141 of SEQ ID NO:121 , nucleotides 31 1 -581 of SEQ ID NO:164, nucleotides 1 155-1443 of SEQ ID NO:122, and nucleotides 813-1414 of SEQ ID NO:123.
Table 7
[00159] In one aspect, methods and compositions for knockout or downregulation of a nucleic acid molecule encoding an ortholog of a host cell FAH protein having at least 80% identity to SEQ ID NO:33 (PLBD2), SEQ ID NO:61 (LPL), SEQ ID NO:69 (LIPA) and/or SEQ ID NO:77 (ACE); or antibody-binding variant thereof, are via homologous recombination. A nucleic acid molecule encoding an FAH protein (or any protein of interest in general) can be targeted by homologous recombination or by using site-specific nuclease methods that specifically target sequences at the FAH-expressing site of the host cell genome. For homologous recombination, homologous polynucleotide molecules (i.e., homologous arms) line up and exchange a stretch of their sequences. A transgene can be introduced during this exchange if the transgene is flanked by homologous genomic sequences. In one example, a recombinase recognition site can also be introduced into the host cell genome at the integration sites.
[00160] Homologous recombination in eukaryotic cells can be facilitated by introducing a break in the chromosomal DNA at the integration site. Model systems have demonstrated that the frequency of homologous recombination during gene targeting increases if a double-strand break is introduced within the chromosomal target sequence. This may be accomplished by targeting certain nucleases to the specific site of integration. DNA-binding proteins that recognize DNA sequences at the target gene are known in the art. Gene targeting vectors are also employed to facilitate homologous recombination. In the absence of a gene targeting vector for homology directed repair, the cells frequently close the double-strand break by non-homologous end-joining (NHEJ) which may lead to deletion or insertion of multiple nucleotides at the cleavage site. Gene targeting vector construction and nuclease selection are within the skill of the artisan to whom this invention pertains.
[00161] In some examples, zinc finger nucleases (ZFNs), which have a modular structure and contain individual zinc finger domains, recognize a particular 3-nucleotide sequence in the target sequence. Some embodiments can utilize ZFNs with a combination of individual zinc finger domains targeting multiple target sequences. ZFN methods to target disruption of two or more FAH genes are also embodied by the invention.
[00162] Transcription activator-like (TAL) effector nucleases (TALENs) may also be employed for site-specific genome editing. TAL effector protein DNA-binding domain is typically utilized in combination with a non-specific cleavage domain of a restriction nuclease, such as Fokl. In some embodiments, a fusion protein comprising a TAL effector protein DNA-binding domain and a restriction nuclease cleavage domain is employed to recognize and cleave DNA at a target sequence within an exon of the gene encoding the target host cell protein, for example an esterase, such as a phospholipase B-like 2 protein(or other mammalian phospholipase), a lipoprotein lipase and a lysosomal acid lipase, and/or a fatty acylase, such as acid ceramidase (or other mammalian ceramidase). Targeted disruption or insertion of exogenous sequences into a specific exon of the CHO protein encoded by SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and/or SEQ ID NO:77 may be done by employing a TALE nuclease (TALEN) targeted to locations within exon 1 , exon 2, exon 3, etc. of the fatty acid hydrolase genomic DNA (see Tables 6 and 7). The TALEN target cleavage site within the gene sequences {e.g., SEQ ID NOs:33, 61 , 69 and 77) may be selected based on ZiFit.partners.org (ZiFit Targeter Version 4.2) and then TALENs are designed based on known methods (Boch J et al., 2009 Science 326:1509-1512; Bogdanove, A. J. & Voytas, D. F. 201 1 Science 333, 1843-1846; Miller, J. C. et al., 201 1 Nat Biotechnol 29, 143-148). TALEN methods to target disruption of two or more of the PLBD2 gene {e.g., exon 1 or exon 2), LPL gene {e.g., exon 2, 3 or 4), LI PA gene {e.g., exon 1 or 2) and ACE gene {e.g., exon 1 , 3 or 4) are also embodied by the invention.
[00163] RNA-guided endonucleases (RGENs) are programmable genome engineering tools that were developed from bacterial adaptive immune machinery. In this system— the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) immune response— the protein Cas9 forms a sequence-specific endonuclease when complexed with two RNAs, one of which guides target selection. RGENs consist of components (Cas9 and tracrRNA) and a target- specific CRISPR RNA (crRNA). Both the efficiency of DNA target cleavage and the location of the cleavage sites vary based on the position of a protospacer adjacent motif (PAM), an additional requirement for target recognition (Chen, H. et al, J. Biol. Chem. published online March 14, 2014, as Manuscript M1 13.539726). CRISPR-Cas9 methods to target disruption of two or more of the PLBD2 gene (e.g., exon 1 or exon 2), LPL gene (e.g., exon 2, 3 or 4), LIPA gene (e.g., exon 1 or 2) and ACE gene (e.g., exon 1 , 3 or 4) are also embodied by the invention..
[00164] Still other methods of homologous recombination are available to the skilled artisan, such as BuD-derived nucleases (BuDNs) with precise DNA-binding specificities (Stella, S. et al. Acta Cryst. 2014, D70, 2042-2052). A single residue-to-nucleotide code guides the BuDN to the specific DNA target within polynucleotide of interest {e.g., SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and or SEQ ID NO:77). [00165] Sequence-specific endonucleases, or any homologous recombination technique, may be directed to a target sequence at any one of the exons encoding PLBD2, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003614971 .1 , at: Exon 1 within nucleotides (nt) 175367 to 175644 (SEQ ID NO:47); Exon 2 within nt 168958 to 169051 (SEQ ID NO:48); Exon 3 within nt 166451 to166609 (SEQ ID NO:49); Exon 4 within nt 164966 to 165066 (SEQ ID NO:50); Exon 5 within nt 164564 to164778 (SEQ ID NO:51 ); Exon 6 within nt 162682 to162779 (SEQ ID NO:52); Exon 7 within nt 160036 to160196 (SEQ ID NO:53); Exon 8 within nt 159733 to 159828 (SEQ ID NO:54); Exon 9 within nt 159491 to 159562 (SEQ ID NO:55); Exon 10 within nt 158726 to 158878 (SEQ ID NO:56); Exon 1 1 within nt 158082 to 158244 (SEQ ID NO:57); or Exon 12 at nucleotides (nt) 157747 to 157914 (SEQ ID NO:58), wherein PLBD2 exons 1 -12 are described on the minus strand gene and the complement of each sequence is also incorporated herewith.
[00166] Sequence-specific endonucleases, or any homologous recombination technique, may be directed to a target sequence at any one of the exons encoding LPL, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003613760.1 , at: Exon 1 within nucleotides (nt) 1257424 to 1257507 (SEQ ID NO:85); Exon 2 within nt 1266450 to 1266610 (SEQ ID NO:86); Exon 3 within nt 1270069 1270248 (SEQ ID NO:87); Exon 4 within nt 1271770 to 1271881 (SEQ ID NO:88); Exon 5 within nt 12283518 12283751 (SEQ ID NO:89); Exon 6 within nt 123715 1273957 (SEQ ID NO:90); Exon 7 within nt 1276672 1276792 (SEQ ID NO:91 ); Exon 8 within nt 1278328 to 1278510 (SEQ ID NO:92); or Exon 9 within nt 1279637 to 1279738 (SEQ ID NO:93), wherein LPL exons 1-9 are described on the minus strand gene and the complement of each sequence is also incorporated herewith.
[00167] Sequence-specific endonucleases, or any homologous recombination technique, may be directed to a target sequence at any one of the exons encoding LIPA, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003614200.1 , at: Exon 1 within nucleotides (nt) 985778 to 985674 (SEQ ID NO:94); Exon 2 within nt 984375 to 984258 (SEQ ID NO:95); Exon 3 within nt 970771 970573 (SEQ ID NO:96); Exon 4 within nt 969327 to 969218 (SEQ ID NO:97); Exon 5 within nt 968139 968003 (SEQ ID NO:98); Exon 6 within nt 961871 to 961725 (SEQ ID NO:99); Exon 7 within nt 960826 to 960755 (SEQ ID NO:100); Exon 8 within nt 955312 to 955241 (SEQ ID NO: 101 ); or Exon 9 within nt 954457 to 954224 (SEQ ID NO: 102), wherein LIPA exons 1-9 are described on the minus strand gene and the complement of each sequence is also incorporated herewith.
[00168] Sequence-specific endonucleases, or any homologous recombination technique, may be directed to a target sequence at any one of the exons encoding ACE, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003613654.1 , at: Exon 1 within nucleotides (nt) 1378167 to 1378244 (SEQ ID NO: 103); Exon 2 within nt 1393746 to 1393792 (SEQ ID NO: 104); Exon 3 within nt 1398208 to 1398298 (SEQ ID NO:105); Exon 4 within nt 1399171 to 1399257 (SEQ ID NO: 106); Exon 5 within nt 1402147 to 1402225 (SEQ ID NO: 107); Exon 6 within nt 1404854 to 1404928 (SEQ ID NO:108); Exon 7 within nt 1405714 to 1405759 (SEQ ID NO:109); Exon 8 within nt 1406783 to 1406927 (SEQ ID NO:1 10); Exon 9 within nt 1407840 to 1407894 (SEQ ID NO:1 1 1 ); Exon 10 within nt 1408004 to 1408085 (SEQ ID NO:1 12); Exon 1 1 within nt 1409791 to 1409922 (SEQ ID NO:1 13); Exon 12 within nt 1410031 to 1410154 (SEQ ID NO:1 14); Exon 13 within nt 1410568 to 1410624 (SEQ ID NO: 1 15); or Exon 14 at nucleotides (nt) 141 1555 to 141 1644 (SEQ ID NO:1 16), wherein ACE exons 1 -14 are described on the minus strand gene and the complement of each sequence is also incorporated herewith.
[00169] Precise genome modification methods are chosen based on the tools available compatible with unique target sequences within SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 so that disruption of the cell phenotype is avoided.
Proteins of Interest
[00170] Any protein of interest suitable for expression in prokaryotic or eukaryotic cells can be used in the engineered host cell systems provided. For example, the protein of interest includes, but is not limited to, an antibody or antigen-binding fragment thereof, a chimeric antibody or antigen- binding fragment thereof, an ScFv or fragment thereof, an Fc-fusion protein or fragment thereof, a growth factor or a fragment thereof, a cytokine or a fragment thereof, or an extracellular domain of a cell surface receptor or a fragment thereof. Proteins of interest may be simple polypeptides consisting of a single subunit, or complex multisubunit proteins comprising two or more subunits. The protein of interest may be a biopharmaceutical product, food additive or preservative, or any protein product subject to purification and quality standards.
[00171] In some embodiments, the protein product (protein of interest) is an antibody, a human antibody, a humanized antibody, a chimeric antibody, a monoclonal antibody, a multispecific antibody, a bispecific antibody, an antigen binding antibody fragment, a single chain antibody, a diabody, triabody or tetrabody, a Fab fragment or a F(ab')2 fragment, an IgD antibody, an IgE antibody, an IgM antibody, an IgG antibody, an lgG1 antibody, an lgG2 antibody, an lgG3 antibody, or an lgG4 antibody. In one embodiment, the antibody is an lgG1 antibody. In one embodiment, the antibody is an lgG2 antibody. In one embodiment, the antibody is an lgG4 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG4 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG1 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG1/lgG4 antibody. [00172] In some embodiments, the antibody is selected from the group consisting of an anti- Programmed Cell Death 1 antibody (e.g. an anti-PD1 antibody as described in U.S. Pat. Appln. Pub. No. US2015/0203579A1 ), an anti-Programmed Cell Death Ligand-1 (e.g. an anti-PD-L1 antibody as described in in U.S. Pat. Appln. Pub. No. US2015/0203580A1 ), an anti-DII4 antibody, an anti-Angiopoetin-2 antibody (e.g. an anti-ANG2 antibody as described in U.S. Pat. No.
9,402,898), an anti- Angiopoetin-Like 3 antibody (e.g. an anti-AngPtl3 antibody as described in U.S. Pat. No. 9,018,356), an anti-platelet derived growth factor receptor antibody (e.g. an anti-PDGFR antibody as described in U.S. Pat. No. 9,265,827), an anti-Erb3 antibody, an anti- Prolactin
Receptor antibody (e.g. anti-PRLR antibody as described in U.S. Pat. No. 9,302,015), an anti- Complement 5 antibody (e.g. an anti-C5 antibody as described in U.S. Pat. Appln. Pub. No
US2015/0313194A1 ), an anti-TNF antibody, an anti-epidermal growth factor receptor antibody (e.g. an anti-EGFR antibody as described in U.S. Pat. No. 9,132,192 or an anti-EGFRvlll antibody as described in U.S. Pat. Appln. Pub. No. US2015/0259423A1 ), an anti-Proprotein Convertase Subtilisin Kexin-9 antibody (e.g. an anti-PCSK9 antibody as described in U.S. Pat. No. 8,062,640 or U.S. Pat. Appln. Pub. No. US2014/0044730A1 ), an anti-Growth And Differentiation Factor-8 antibody (e.g. an anti-GDF8 antibody, also known as anti-myostatin antibody, as described in U.S. Pat Nos. 8,871 ,209 or 9,260,515), an anti-Glucagon Receptor (e.g. anti-GCGR antibody as described in U.S. Pat. Appln. Pub. Nos. US2015/0337045A1 or US2016/0075778A1 ), an anti-VEGF antibody, an anti-IL1 R antibody, an interleukin 4 receptor antibody (e.g an anti-IL4R antibody as described in U.S. Pat. Appln. Pub. No. US2014/0271681 A1 or U.S. Pat Nos. 8,735,095 or
8,945,559), an anti-interleukin 6 receptor antibody (e.g. an anti-IL6R antibody as described in U.S. Pat. Nos. 7,582,298, 8,043,617 or 9,173,880), an anti-IL1 antibody, an anti-IL2 antibody, an anti-IL3 antibody, an anti-l L4 antibody, an anti-IL5 antibody, an anti-l L6 antibody, an anti-l L7 antibody, an anti-interleukin 33 (e.g. anti- IL33 antibody as described in U.S. Pat. Appln. Pub. Nos.
US2014/0271658A1 or US2014/0271642A1 ), an anti-Respiratory syncytial virus antibody (e.g. anti- RSV antibody as described in U.S. Pat. Appln. Pub. No. US2014/0271653A1 ), an anti-Cluster of differentiation 3 (e.g. an anti-CD3 antibody, as described in U.S. Pat. Appln. Pub. Nos.
US2014/0088295A1 and US20150266966A1 , and in U.S. Application No. 62/222,605), an anti- Cluster of differentiation 20 (e.g. an anti-CD20 antibody as described in U.S. Pat. Appln. Pub. Nos. US2014/0088295A1 and US20150266966A1 , and in U.S. Pat. No. 7,879,984), an anti-CD19 antibody, an anti-CD28 antibody, an anti- Cluster of Differentiation-48 (e.g. anti-CD48 antibody as described in U.S. Pat. No. 9,228,014), an anti-Fel d1 antibody (e.g. as described in U.S. Pat. No. 9,079,948), an anti-Middle East Respiratory Syndrome virus (e.g. an anti-MERS antibody as described in U.S. Pat. Appln. Pub. No. US2015/0337029A1 ), an anti-Ebola virus antibody (e.g. as described in U.S. Pat. Appln. Pub. No. US2016/0215040), an anti-Zika virus antibody, an anti- Lymphocyte Activation Gene 3 antibody (e.g. an anti-LAG3 antibody, or an anti-CD223 antibody), an anti-Nerve Growth Factor antibody (e.g. an anti-NGF antibody as described in U.S. Pat. Appln. Pub. No. US2016/0017029 and U.S. Pat. Nos. 8,309,088 and 9,353,176) and an anti-Activin A antibody. In some embodiments, the bispecific antibody is selected from the group consisting of an anti-CD3 x anti-CD20 bispecific antibody (as described in U.S. Pat. Appln. Pub. Nos.
US2014/0088295A1 and US20150266966A1 ), an anti-CD3 x anti-Mucin 16 bispecific antibody (e.g., an anti-CD3 x anti-Mud 6 bispecific antibody), and an anti-CD3 x anti- Prostate-specific membrane antigen bispecific antibody (e.g., an anti-CD3 x anti-PSMA bispecific antibody). In some embodiments, the protein of interest is selected from the group consisting of alirocumab, sarilumab, fasinumab, nesvacumab, dupilumab, trevogrumab, evinacumab, and rinucumab. All publications mentioned throughout this disclosure are incorporated herein by reference in their entirety.
[00173] In some embodiments, the protein of interest is a recombinant protein that contains an Fc moiety and another domain, (e.g., an Fc-fusion protein). In some embodiments, an Fc-fusion protein is a receptor Fc-fusion protein, which contains one or more extracellular domain(s) of a receptor coupled to an Fc moiety. In some embodiments, the Fc moiety comprises a hinge region followed by a CH2 and CH3 domain of an IgG. In some embodiments, the receptor Fc-fusion protein contains two or more distinct receptor chains that bind to either a single ligand or multiple ligands. For example, an Fc-fusion protein is a TRAP protein, such as for example an IL-1 trap (e.g., rilonacept, which contains the IL-1 RAcP ligand binding region fused to the 11-1 R1 extracellular region fused to Fc of hlgG1 ; see U.S. Pat. No. 6,927,004, which is herein incorporated by reference in its entirety), or a VEGF trap (e.g., aflibercept or ziv-aflibercept, which contains the Ig domain 2 of the VEGF receptor Flt1 fused to the Ig domain 3 of the VEGF receptor Flk1 fused to Fc of hlgG1 ; see U.S. Pat. Nos. 7,087,41 1 and 7,279,159). In other embodiments, an Fc-fusion protein is a ScFv-Fc-fusion protein, which contains one or more of one or more antigen-binding domain(s), such as a variable heavy chain fragment and a variable light chain fragment, of an antibody coupled to an Fc moiety.
Host Cells and Transfection
[00174] The host cells used in the methods of the invention are eukaryotic host cells including, for example, Chinese hamster ovary (CHO) cells, human cells, rat cells and mouse cells. In a preferred embodiment, the invention provides a cell comprising a disrupted nucleic acid sequence fragment of SEQ ID NO:33 and at least one more of SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77.
[00175] The invention includes an engineered mammalian host cell further transfected with an expression vector comprising an exogenous gene of interest, such gene encoding the
biopharmaceutical product. While any mammalian cell may be used, in one particular embodiment the host cell is a CHO cell.
[00176] Transfected host cells include cells that have been transfected with expression vectors that comprise a sequence encoding a protein or polypeptide. Expressed proteins will preferably be secreted into the culture medium for use in the invention, depending on the nucleic acid sequence selected, but may be retained in the cell or deposited in the cell membrane. Various mammalian cell culture systems can be employed to express recombinant proteins. Other cell lines developed for specific selection or amplification schemes will also be useful with the methods and
compositions provided herein, provided that at least two genes encoding a different fatty acid hydrolase (FAH) having at least 80% homology to at least two of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 have been downregulated, knocked out or otherwise disrupted in accordance with the invention. An embodied cell line is the CHO cell line designated K1 . To achieve high volume production of recombinant proteins, the host cell line may be pre-adapted to bioreactor medium in the appropriate case.
[00177] Several transfection protocols are known in the art, and are reviewed in Kaufman (1988) Meth. Enzymology 185:537. The transfection protocol chosen will depend on the host cell type and the nature of the GOI, and can be chosen based upon routine experimentation. The basic requirements of any such protocol are first to introduce DNA encoding the protein of interest into a suitable host cell, and then to identify and isolate host cells which have incorporated the
heterologous DNA in a relatively stable, expressible manner.
[00178] One commonly used method of introducing heterologous DNA into a cell is calcium phosphate precipitation, for example, as described by Wigler ef al. (Proc. Natl. Acad. Sci. USA 77:3567, 1980). DNA introduced into a host cell by this method frequently undergoes
rearrangement, making this procedure useful for cotransfection of independent genes.
[00179] Polyethylene-induced fusion of bacterial protoplasts with mammalian cells (Schaffner ef al., (1980) Proc. Natl. Acad. Sci. USA 77:2163) is another useful method of introducing heterologous DNA. Protoplast fusion protocols frequently yield multiple copies of the plasmid DNA integrated into the mammalian host cell genome, and this technique requires the selection and amplification marker to be on the same plasmid as the GOI.
[00180] Electroporation can also be used to introduce DNA directly into the cytoplasm of a host cell, for example, as described by Potter ef al. (Proc. Natl. Acad. Sci. USA 81 :7161 , 1988) or Shigekawa ef al. (BioTechniques 6:742, 1988). Unlike protoplast fusion, electroporation does not require the selection marker and the GOI to be on the same plasmid.
[00181] Other reagents useful for introducing heterologous DNA into a mammalian cell have been described, such as Lipofectin™ Reagent and Lipofectamine™ Reagent (Gibco BRL, Gaithersburg, Md.). Both of these commercially available reagents are used to form lipid-nucleic acid complexes (or liposomes) which, when applied to cultured cells, facilitate uptake of the nucleic acid into the cells.
[00182] Methods for amplifying the GOI are also desirable for expression of the recombinant protein of interest, and typically involves the use of a selection marker (reviewed in Kaufman supra). Resistance to cytotoxic drugs is the characteristic most frequently used as a selection marker, and can be the result of either a dominant trait {e.g., can be used independent of host cell type) or a recessive trait {e.g., useful in particular host cell types that are deficient in whatever activity is being selected for). Several amplifiable markers are suitable for use in the cell lines of the invention and may be introduced by expression vectors and techniques well known in the art (e.g., as described in Sambrook, Molecular Biology: A Laboratory Manual, Cold Spring Harbor Laboratory, NY, 1989; pgs 16.9-16.14).
[00183] Useful selectable markers and other tools for gene amplification such as regulatory elements, described previously or known in the art, can also be included in the nucleic acid constructs used to transfect mammalian cells. The transfection protocol chosen and the elements selected for use therein will depend on the type of host cell used. Those of skill in the art are aware of numerous different protocols and host cells in order to adapt the invention for a particular use, and can select an appropriate system for expression of a desired protein, based on the
requirements of the cell culture system.
[00184] In specific embodiments, the invention relates to the following items:
[00185] 1 . A recombinant host cell comprising a modification in two or more genes encoding two or more fatty acid hydrolases (FAH).
[00186] 2. The recombinant host cell according to item 1 , wherein the two or more FAHs are selected from the group consisting of phospholipase B-like 2 (PLBD2) protein, lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and acid ceramidase (ACE).
[00187] 3. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00188] a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
[00189] b. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification.
[00190] 4. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00191] a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
[00192] b. a modification in a coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LI PA in a cell lacking the modification.
[00193] 5. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00194] a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
[00195] b. a modification in a coding sequence of a polynucleotide encoding the ACE, wherein the modification decreases the expression level of the ACE relative to the expression level of ACE in a cell lacking the modification.
[00196] 6. The recombinant host cell according to any of previous items, wherein the cell further comprises a modification in a coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LI PA in a cell lacking the modification.
[00197] 7. The recombinant host cell according to any of previous items, wherein the cell further comprises a modification in a coding sequence of a polynucleotide encoding the ACE, wherein the modification decreases the expression level of the ACE relative to the expression level of ACE in a cell lacking the modification.
[00198] 8. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00199] a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and
[00200] b. a modification in a coding sequence of a polynucleotide encoding the LI PA protein, wherein the modification decreases the expression level of the LI PA protein relative to the expression level of a LI PA protein in a cell lacking the modification.
[00201] 9. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00202] a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and
[00203] b. a modification in a coding sequence of a polynucleotide encoding the ACE protein, wherein the modification decreases the expression level of the ACE protein relative to the expression level of a ACE protein in a cell lacking the modification.
[00204] 10. The recombinant host cell according to any of previous items, wherein the cell further comprises a modification in a coding sequence of a polynucleotide encoding the LI PA protein, wherein the modification decreases the expression level of the LI PA protein relative to the expression level of a LI PA protein in a cell lacking the modification.
[00205] 1 1. The recombinant host cell according to any of previous items, wherein all alleles of the coding sequence of the two or more genes encoding the two or more FAHs comprise the modification.
[00206] 12. The recombinant host cell according to any of previous items, wherein the cell does not express detectable levels of PLBD2 protein, LPL, LIPA and/or ACE.
[00207] 13. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within exon 1 of the coding sequence of the
polynucleotide encoding the PLBD2 protein.
[00208] 14. The recombinant host cell according to any of previous items, wherein the PLBD2 protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:32.
[00209] 15. The recombinant host cell according to any of previous items, wherein the PLBD2 protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:35, and SEQ ID NO:36.
[00210] 16. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:47.
[00211] 17. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within exon 2, exon 3, or exon 4 of the coding sequence of the polynucleotide encoding the LPL protein. [00212] 18. The recombinant host cell according to any of previous items, wherein the LPL protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:62.
[00213] 19. The recombinant host cell according to any of previous items, wherein the LPL protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:62, SEQ ID NO:66, SEQ ID NO:67, and SEQ ID NO:68.
[00214] 20. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:86, SEQ ID NO:87 or SEQ ID NO:88.
[00215] 21. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within exon 1 or exon 2 of the coding sequence of the polynucleotide encoding the LI PA protein.
[00216] 22. The recombinant host cell according to any of previous items, wherein the LI PA protein comprises an amino acid sequence at least 70% identical to SEQ ID NO:70.
[00217] 23. The recombinant host cell according to any of previous items, wherein the LI PA protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:70, SEQ ID NO:74, SEQ ID NO:75, and SEQ ID NO:76.
[00218] 24. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:94 or SEQ ID NO:95.
[00219] 25. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within exon 1 , exon 3 or exon 4 of the coding sequence of the polynucleotide encoding the ACE protein.
[00220] 26. The recombinant host cell according to any of previous items, wherein the ACE protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:78.
[00221] 27. The recombinant host cell according to any of previous items, wherein the ACE protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:78, SEQ ID NO:82, SEQ ID NO:83, and SEQ ID NO:84.
[00222] 28. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:103, SEQ ID NO:105 or SEQ ID NO:106.
[00223] 29. The recombinant host cell according to any of previous items further comprising a polynucleotide encoding an exogenous protein of interest. [00224] 30. The recombinant host cell according to any of previous items, wherein the exogenous protein of interest is selected from the group consisting of an antibody heavy chain, an antibody light chain, an antigen-binding fragment, an antigen-binding protein, and an Fc-fusion protein.
[00225] 31. The recombinant host cell according to any of previous items, wherein the cell produces a Protein A-binding fraction having no detectable fatty acid hydrolase activity.
[00226] 32. A method of producing a recombinant protein comprising the steps of: (a) obtaining a sample comprising a recombinant protein and a plurality of host cell proteins from a host cell that is modified to produce reduced levels of fatty acid hydrolase compared to an unmodified cell; and (b) subjecting the sample to at least one purification step to remove at least one host cell protein.
[00227] 33. The method according to any of items 31 -32, wherein the plurality of host cell proteins
[00228] a. does not comprise a detectable amount of a phospholipase B-like 2 (PLBD2) protein; and
[00229] b. does not comprise a detectable amount of at least one other fatty acid hydrolase.
[00230] 34. The method according to any of items 31 -33, wherein the host cell comprises:
[00231] a. a modification in a coding sequence of a polynucleotide encoding a phospholipase B- like 2 (PLBD2) protein; and
[00232] b. a modification in a coding sequence of a polynucleotide encoding a fatty acid hydrolase selected from the group consisting of lipoprotein lipase (LPL), lysosomeal acid lipase (LI PA), acid ceramidase (ACE), or a combination thereof.
[00233] 35. The method according to any of items 31 -34, wherein the purification step is selected from the group consisting of protein A affinity (PA) chromatography, cation exchange (CEX) chromatography, and anion exchange (AEX) chromatography.
[00234] 36. The method according to any of items 31 -35, wherein the purification step does not comprise hydrophobic interaction chromatography.
[00235] 37. A process for reducing fatty acid hydrolase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein; (b) modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and/or acid ceramidase (ACE); (c) transfecting the host cell with a polynucleotide encoding a protein of interest; (d) extracting a protein fraction comprising the protein of interest from the host cell; (e) contacting the protein fraction with a chromatography media selected from the group consisting of protein A affinity (PA) media, cation exchange (CEX) media, and anion exchange (AEX) media; (f) collecting the protein of interest from the media; and (g) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer.
[00236] 38. The process according to item 37, wherein the step of modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein comprises inserting or deleting at least one nucleotide within exon 1 of a polynucleotide encoding the PLBD2 protein.
[00237] 39. The process according to any of items 37-38, wherein the polynucleotide encoding the PLBD2 protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:32.
[00238] 40. The process according to any of items 37-39, wherein the step of modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL) comprises inserting or deleting at least one nucleotide within exon 2, exon 3 or exon 4 of a polynucleotide encoding the LPL protein.
[00239] 41. The process according to any of items 37-40, wherein the polynucleotide encoding the LPL protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:62.
[00240] 42. The process according to any of items 37-41 , wherein the step of modifying the host cell to decrease or ablate the expression of lysosomal acid lipase (LI PA) comprises inserting or deleting at least one nucleotide within exon 1 or exon 2 of a polynucleotide encoding the LI PA protein.
[00241] 43. The process according to any of items 37-42, wherein the polynucleotide encoding the LIPA protein comprises a nucleic acid sequence that is at least 70% identical to SEQ ID NO:70.
[00242] 44. The process according to any of items 37-43, wherein the step of modifying the host cell to decrease or ablate the expression of acid ceramidase (ACE) comprises inserting or deleting at least one nucleotide within exon 1 , exon 3 or exon 4 of a polynucleotide encoding the ACE protein.
[00243] 45. The process according to any of items 37-44, wherein the polynucleotide encoding the ACE protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:78.
[00244] 46 . A composition comprising one or more recombinant proteins obtainable by the method according to any of items 31-36 or the process according to any of items 37-45.
[00245] 47. The composition obtainable according to item 46, wherein the composition is stable.
[00246] 48. The composition obtainable according to any of items 46-47, wherein the stable composition is characterized by one or more of:
[00247] (i) having an adverse enzyme activity of less than 50% active enzyme relative to the adverse enzyme activity of a wild-type production system.
[00248] (ii) having a shelf-life of at least about 3 months, about 4 months, about 5 months, about 6 months, about 7 months, about 8 months, about 9 months, about 10 months, about 1 1 months, about 12 months, about 18 months, about 24 months, or more.
[00249] (iii) having a SVP count of less than about 25 particles of at least 10 microns per ml. for large volume parenterals (greater than 100 ml_), less than about 3 particles of at least 25 microns per ml. for large volume parenterals (greater than 100 ml_), less than about 6,000 particles of at least 10 microns per container for small volume parenterals (100 mL or less), or less than about 600 particles of at least 25 microns per container for small volume parenterals (100 mL or less).
[00250] 49. The composition obtainable according to any of items 46-48, wherein the adverse enzyme activity is the activity of one or more of esterases, hydrolases, lipases, phospholipases, ceramidases.
[00251] 50. A composition comprising one or more recombinant proteins, wherein the composition is stable.
[00252] 51. The composition according to item 50, wherein the stable composition is characterized by one or more of:
[00253] (i) having an adverse enzyme activity of less than 50% active enzyme relative to the adverse enzyme activity of a wild-type production system.
[00254] (ii) having a shelf-life of at least about 3 months, about 4 months, about 5 months, about 6 months, about 7 months, about 8 months, about 9 months, about 10 months, about 1 1 months, about 12 months, about 18 months, about 24 months, or more.
[00255] (iii) having a SVP count of less than about 25 particles of at least 10 microns per mL for large volume parenterals (greater than 100 mL), less than about 3 particles of at least 25 microns per mL for large volume parenterals (greater than 100 mL), less than about 6,000 particles of at least 10 microns per container for small volume parenterals (100 mL or less), or less than about 600 particles of at least 25 microns per container for small volume parenterals (100 mL or less).
[00256] 52. The composition according to any of items 50-51 , wherein the adverse enzyme activity is the activity of one or more of esterases, hydrolases, lipases, phospholipases, ceramidases.
[00257] 53. The composition according to any of items 50-52, wherein the recombinant protein is selected form the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein, or any combinations thereof.
[00258] Other features of the invention will become apparent in the course of the following descriptions of exemplary embodiments which are given for illustration of the invention and are not intended to be limiting thereof.
Examples
[00259] The following examples are put forth so as to provide those of ordinary skill in the art how to make and use the methods and compositions described herein, and are not intended to limit the scope of the invention. Efforts have been made to ensure accuracy with respect to numbers used {e.g., amount, temperature, etc.) but some experimental error and deviation should be accounted for. Unless indicated otherwise, parts are parts by weight, molecular weight is average molecular weight, temperature is in degrees Centigrade, and pressure is at or near atmospheric.
Example 1. Targeted Disruption of an Esterase Gene in the Host Cell
[00260] To employ disruption of the target esterase gene, i.e., phospholipase B-like 2 gene, of a CHO cell origin, a Type II CRISPR/Cas system which requires at least 20 nucleotides (nt) of homology between a chimeric RNA (i.e., guide RNA) and its genomic target was used. Guide RNA sequences were designed for specific targeting of an exon within the CHO phospholipase B-like 2 (PLBD2) nucleic acid (SEQ ID NO:33) and are considered unique (to minimize off-target effects in the genome). Multiple small guide RNAs (sgRNA) were synthesized for use in the genome editing procedure targeting the following genomic segments of PLBD2 listed in Table 8.
Table 8
[00261] The sgRNA expression plasmid (System Biosciences, CAS940A-1 ) contains a human H1 promoter that drives expression of the small guide RNA and the tracrRNA following the sgRNA. Immortalized Chinese hamster ovary (CHO) cells were transfected with the plasmid encoding Cas9- H1 enzyme followed by one of the sgRNA sequences, for instance sgRNAI (SEQ ID NO:45) or sgRNA2 (SEQ ID NO:46), designed to target the first exon of CHO PLBD2. sgRNAI and sgRNA2 were predicted to generate a double strand break (DSB) at or around nucleotides 53 and 59 of SEQ ID NO:33, respectively. A DSB was therefore predicted to occur approx. 23 or 29 nucleotides downstream of the PLBD2 start codon. (Note that nucleotides 1-30 of SEQ ID NO:33 encode a signal peptide.) A negative control transfection was performed where the parental CHO line was transfected with the plasmid encoding Cas9-H1 enzyme without a proceeding sgRNA, or an sgRNA encoding a gene sequence not present in the CHO genome.
[00262] Following transfection, cells were cultured for 6 days in serum-free medium, and then were single cell cloned using flow cytometry. After 12 days in culture, stable clones with desirable growth properties were isolated, expanded in serum-free medium, cell pellets were collected for genotyping and clonal cell lines were banked.
[00263] Genomic DNA (gDNA) and messenger RNA (mRNA) were isolated from the clonal cell pellets and analyzed by quantitative PCR (qPCR). qPCR primers and probes were designed to overlap with the sgRNA sequence used for the double strand break targeting event, in order to detect disruption of the genomic DNA and its transcription. The relative abundance of PLBD2 gene or transcript in the candidate clones was determined using relative qPCR method, where the clones derived from the negative control transfection were used as a calibrator. See Figure 1. The qPCR primers and probes were designed to detect sequences either in the sgRNAI or sgRNA2 position in PLBD2 exon 1 . Both gDNA and RNA isolated from clone 1 failed to support qPCR amplification of PLBD2 exon 1 in either sgRNAI or sgRNA2 regions, but amplification of the housekeeping gene, GAPDH, was detected. Based on this data, clone 1 was identified as a potential knock out of PLBD2 in which both genomic alleles of PLBD2 of exonl were disrupted. It is noted that amplification of genomic DNA and mRNA was not detected in Clone 8 using primers overlapping with sgRNA2, however, sgRNAI primers/probes detected genomic DNA above control values. Clone 8, and others were further analyzed in order to understand the performance of the site- directed nuclease method.
[00264] The size of the entire PLBD2 exon 1 in clone 1 was analyzed by PCR from either gDNA or RNA derived templates and compared to that amplified from the wild type CHO cells. The length of amplicon fragments was determined using Caliper GX instrument (Figure 2). Both gDNA and mRNA amplification from clone 1 resulted in a single PCR fragment which was shorter than the one amplified from the wild type control cells.
[00265] The amplification products were sequenced, resulting in Clone 1 being identified as PLBD2 knock out, in which PLBD2 gene was found to have 1 1 bp deletion resulting in frameshift.
[00266] The inventors also unexpectedly identified Clone 8 as a PLBD2 knockout despite the fact that genomic DNA fragments were identified by qPCR primers overlapping with the sgRNAI sequence. The identification of a clone that has no detectable phospholipase activity or no detectable phospholipase protein was technically challenging and time-consuming. Site-directed nuclease techniques may provide an ease-of-use, however, careful screening and elimination of false positives is necessary and still there may be unpredictable outcomes with regard to the identity of a single clone having two disrupted alleles. Surprisingly, only 1 % of the clones screened using the techniques described above were identified as viable PLBD2 knockout clones. See Table 9.
Table 9
Example 2. Introduction and Expression of a monoclonal antibody (mAbl) in the candidate clonal cell lines
[00267] Clone 1 and the wild type control host cell line were transfected with plasmids encoding the light and heavy chains of mAb1 , a fully human IgG, in the presence of Cre recombinase to facilitate recombination mediated cassette exchange (RMCE) into EESYR locus (US Patent No. 7771997B2, issued August 10, 2010). The transfected cultures were selected for 1 1 days in serum-free medium containing 400 ug/mL hygromycin. Cells that underwent RMCE, were isolated by flow cytometry. PLBD2 knock out clone 1 and the wild type host cell line produced equivalent observed
recombinant population (data not shown). The clone 1 derived isogenic cell line expressing mAb1 was designated RS001 , and the mAb1 expressing cell line originated from the PLBD2 wild type host was designated RS0WT.
[00268] Fed-batch production of mAb1 from RS001 or RS0WT was carried out in a standard 12 day process. The conditioned medium for each production culture was sampled at Day 2, 4, 6, 9 and 12 and the Protein-A binding fraction was quantified (Figure 3). Protein titer of mAb1 from RS001 culture was comparable to that produced from RS0WT, and unexpectedly there were no observable differences in the behavior of the cells with respect to the two cultures. It cannot be predicted whether disruption of PLBD2 or any endogenous gene in a CHO host cell would have no observable deleterious effect on production of an exogenous recombinant protein, especially a therapeutic monoclonal antibody.
Example 3. Esterase Activity Detection in unmodified CHO cells
[00269] Polysorbate 20 or polysorbate 80 degradation was measured to detect putative esterase activity in the supernatants of PLBD2 mutants. Unpurified protein supernatant from CHO cells, and supernatant taken at each step or sequence of steps when subjected to sequential purification steps, was tested for stability of polysorbate. The percent intact polysorbate reported was inversely proportional to the amount of contaminant esterase activity. Unpurified protein supernatant from CHO cells, and supernatant at each step or sequence of steps when subjected to sequential purification steps, was tested in assays measuring polysorbate degradation. The relative levels of intact polysorbate reported is inversely proportional to levels of contaminant esterase activity.
[00270] Degradation of polysorbate 20 was examined to determine the etiological agent responsible for polysorbate 20 degradation in a monoclonal antibody formulation. The buffered antibody (150 mg/mL) was separated into two fractions by 10 kDa filtration: a protein fraction, and a buffer fraction. These two fractions, as well as intact buffered antibody, were spiked with 0.2% (w/v) of super refined polysorbate 20 (PS20-B) and stressed at 45°C for up to 14 days. The study showed (Table 10, part A, columns 1-2) that the protein fraction, not the buffer fraction, had an effect on the degradation of sorbitan laurate (i.e., the major component of polysorbate 20), and that the degradation of polysorbate 20 was correlated with the concentration of the antibody (Table 10, part B, columns 3-4). Table 10 (part A) Table 10 (part B)
[00271] Monoclonal antibody was produced in an unmodified CHO cell and purified by different processes according to Table 1 1 , and the esterase activity measured by percent intact polysorbate 20, as in Table 9.
Table 11
[00272] Hydrophobic interaction chromatography (HIC) was most efficient at removing residual PLBD2. In some circumstances, a reduction in the number of purification steps and lower cost could be realized. Therefore, it was contemplated that a modified CHO cell having reduced levels of expression of phospholipase reduces the purification steps, and e.g., may eliminate the need for HIC purification.
Example 4. Esterase protein abundance and activity detection in mAbl purified from modified compared to unmodified CHO cells.
[00273] mAbl was produced from RS001 and RSOWT and purified from the conditioned media using either PA alone, or PA and AEX chromatography The PA-purified mAB1 from RS001 and RSOWT were analyzed for lipase abundance using trypsin digest mass spectrometry. The trypsin digests of RS001 and RSOWT mAbl were injected into a reverse phase liquid chromatography column coupled to a triple quadrupole mass spectrometer set to monitor a specific product ion fragmented from SEQ ID NO:32. In parallel, a series of PLBD2 standards were prepared by spiking in varying amounts of recombinant PLBD2 into mAb1 with no endogenous PLBD2. The signals of the experimental and control reactions were used to quantify the abundance of PLBD2 in mAb1 from RS001 and RSOWT (Figure 4). No detectable amounts of PLBD2 protein were observed in the purified samples of Clone 8-produced mAb1 when purified with PA alone (data not shown).
Example 5. Identification of Fatty Acid Hydrolases
Preparation of the HIC Strip Fraction
[00274] A hydrophobic interaction column (Phenyl Sepharose® High Performance [GE Healthcare, Little Chalfont, Buckinghamshire, UK]) was used to generate a "HIC strip" fraction containing a protein of interest (i.e., mAb2) and associated host cell proteins. The column was first equilibrated with two column volumes (CV) of buffer containing 40 mM Tris, 200 mM citrate at pH8.0. The monoclonal antibody-containing load material from an anion exchange pool ("Q pool") was adjusted to 200 mM sodium citrate, pH8.0, then loaded onto the column at a loading amount of 20-40 grams of protein per liter of the phenyl sepharose resin. After loading, the column was washed with six CVs of 40 mM Tris, 200 mM citrate, at pH8.0. Then the column was stripped with three CVs of reverse osmosis deionized water. The stripped fraction was collected for subsequent analysis.
Trypsin digestion
[00275] A 50 μg aliquot of the mAb2-containing HIC strip sample was denatured and reduced in a solution containing 5 mM acetic acid and 5 mM tris(2-carboxyethyl)phosphine-HCI by heating at 80°C for 10 minutes. The sample was then diluted in 50 mM Tris-HCI buffer (pH 8.0) and alkylated with 1 .5 mM iodoacetamide (IAA) and digested with trypsin (modified, sequencing grade from Promega, Madison, Wl) with an enzyme to substrate ratio of 1 :20 (w/w) at 37°C in the dark for three hours. The digestion was then stopped by addition of 10% trifluoroacetic acid (TFA).
NanoLC/MS
[00276] Aliquots (3 μg) of each digested mAb2-containing HIC strip was injected to an Acclaim™ PepMap™ nanoViper C18 trap column (75 m x 2 cm) on a Thermo EASY-nLC™ system (Thermo Scientific, Waltham, MA) at a flow rate of 2 μL/minute and washed for 15 minutes. Then the peptides were eluted onto the Acclaim™ PepMap™ RSLC nanoViper C18 analytical column (75 μηη x 25 cm) which was equilibrated with 99% mobile phase A (0.1 % FA in water) prior to sample injection at a flow rate of 250 nL/minute. Peptides were separated using a linear gradient from 1 % mobile phase B (0.1 % FA in acetonitrile) to 7% mobile phase B for the first 5 minutes, followed by a second linear increase from 7% to 27% mobile phase B over the next 1 10 minutes, and another subsequent linear increase from 27%-40% in 10 minutes and a final increase to 90% in 5 minutes. The gradient was held at 90% for 20 minutes. A Thermo Q Exactive™ Plus mass spectrometer (Thermo Scientific, Waltham, MA) was used for peptide mass analyses, with high-energy collisional dissociation (HCD) employed for peptide fragmentation for MS/MS experiments.
Data analysis
[00277] The mass spectra were processed using Thermo Xcalibur™ (version 2.2.42) (Thermo Scientific, Waltham, MA). Proteome Discoverer (version 1.4) (Thermo Scientific, Waltham, MA) was also used to perform the peptide identification using both Mascot and Sequent search engines. The peptide spectra from the Proteome Discovery was manually examined to confirm the spectral assignment and protein identification.
Results
[00278] Lipoprotein lipase (UniProtein ID: P06858), lysosomal acid lipase (UniProtein ID: P38571 ) and acid ceramidase (UniProtein ID: Q13510) were identified as potential active fatty acid hydrolases by more than three unique peptides per protein in the mAb2-containing HIC strip fraction.
Example 6. Targeted Disruption of a Lipoprotein Lipase (LPL) Gene
[00279] Clone 1 , in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) were transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lipoprotein lipase (LPL) gene (SEQ ID NO:61 ): sgRNA3 (SEQ ID NO:63), sgRNA4 (SEQ ID NO:64) or sgRNA5 (SEQ ID NO:65) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette was used in some
experiments. In other experiments, two separate plasmids were used, which confered flexibility for combining Cas9 with different sgRNAs. Cre recombinase was co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016). The transfected cultures were selected for 1 1 days in serum-free medium containing 400 μg/mL hygromycin. Cells that underwent RMCE were isolated by flow cytometry. The desired knock out genotype was confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing only the LPL knock-out loci is designated as "clone 9".The resultant clone containing both the PLBD2 and LPL knock-out loci is designated as "clone 10".
[00280] The wild-type host cell, Clone 1 (PLBD2-KO), Clone 9 (LPL-KO) and Clone 10 (LPL+PLBD2 KO) were cultured in 2L bioreactors under fed-batch conditions at 36.5°C (pH 6.9-7.4) with dissolved oxygen. Cells were inoculated in the production bioreactor (i.e. transferred from a seed train culture (N-1 ) at a cell titer of 5.0 x 106 - 7.0 x 106 cells/mL) and protein production was induced by the addition of doxycycline. Cells were cultured for 14 days in chemically defined media supplemented with nutrient feeds as needed during the batch culture up to day 12. Cell viability (Figure 5), titer (Figure 6), and viable cell count (VCC) (Figure 7) were monitored throughout the batch process. Samples of cells were harvested and subjected to purification via protein A chromatography for titer determination (Hober, S., Nord, K., and Linhult, M., "Protein A
Chromatography for Antibody Purification," Journal of Chromatography B, 848 (2007) 40-47; and Lin et al., "Protein A Affinity Column for Monoclonal Antibody (MAb) Titer Analysis," Thermo
Scientific Poster Note PN20806_E 06/13S, available from https://tools.thermofisher.com/.../PN- 20806-Protein-Affinity-Column-Monoclonal- Antibody-Titer-Analysis-PN20806- EN.pdf&usg=AFQjCNFHkixB9Um020cT4nG1 UpTzXwON9Q, version avaialble Aug 23, 2017).
[00281] Additional LPL-KO clones were generated according to the methods provided herein. For example, Figure 8 depicts an alignment of LPL-KO clones 19-22 (SEQ ID NOs:159-162, respectively) compared to a partial Chinese hamster LPL sequence (SEQ ID NO:158) showing the gaps in the respective clone sequences.
[00282] To construct the LPL-KO clones (as well as any other fatty acid hydrolase knock-out clone), plasmids encoding for Cas9 nuclease, eYFP and site-specific sgRNA were stably integrated into the CHO genome using Lipofectamine-based transfection protocol followed by selection for neomycin resistant cells. Seventeen days post transfection, the YFP positive cells were enriched by flow cytometry prior to single cell-sorting. After a 21 -day expansion, mRNA was isolated and the single cell clones were analyzed by qPCR for the presence of gene disruption.
[00283] 1 , 2 or 3 different sgRNA expression cassettes (the cassettes include promoter, sgRNA, terminator) may be placed in the same plasmid. Some constructs are manufactured using at least 2 sgRNA cassettes per lipase in a single plasmid.
Example 7. Targeted Disruption of a Lysosomal Acid Lipase (LIPA) Gene
[00284] Clone 1 , in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) are transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lysosomal acid lipase (LIPA) gene (SEQ ID NO:69): sgRNA6 (SEQ ID NO:71 ), sgRNA7 (SEQ ID NO:72) or sgRNA8 (SEQ ID NO:73) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs. Cre recombinase is co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent
Application Publication No. 2016/01 15502A1 , published April 28, 2016). The transfected cultures are selected for 1 1 days in serum-free medium containing 400 μg/mL hygromycin. Cells that undergo RMCE are isolated by flow cytometry. The desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing only the LI PA knock-out or knock-down loci is designated as "clone 1 1 ".The resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 12".
Example 8. Targeted Disruption of an Acid Ceramidase (ACE) Gene
[00285] Clone 1 , in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) are transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the acid ceramidase (ACE) gene (SEQ ID NO:77): sgRNA9 (SEQ ID NO:79), sgRNAI O (SEQ ID NO:80) or sgRNA1 1 (SEQ ID NO:81 ) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs. Cre recombinase is co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016). The transfected cultures are selected for 1 1 days in serum-free medium containing 400 μg/mL hygromycin. Cells that undergo RMCE are isolated by flow cytometry. The desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing only the ACE knock-out or knock-down loci is designated as "clone 13". The resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 14".
Example 9. Targeted Disruption of a Lipoprotein Lipase (LPL) Gene and Lysosomal Acid
Lipase (LIPA)
[00286] Clone 9, in which the LPL gene and is modified, and Clone 10, in which both the LPL gene and the PLBD2 gene are modified, are transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lysosomal acid lipase (LIPA) gene (SEQ ID NO:69): sgRNA6 (SEQ ID NO:71 ), sgRNA7 (SEQ ID NO:72) and/or sgRNA8 (SEQ ID NO:73) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs. Cre recombinase is co- transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016). The transfected cultures are selected for 1 1 days in serum-free medium containing 400 μg/mL hygromycin. Cells that undergo RMCE are isolated by flow cytometry. The desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing the LIPA modified locus and the LPL modified locus designated as "clone 15".The resultant clone containing the PLBD2, LPL and LIPA modified loci is designated as "clone 16".
Example 10. Targeted Disruption of a Lipoprotein Lipase (LPL) Gene, a Lysosomal Acid
Lipase (LIPA) Gene and an Acid Ceramidase (ACE) Gene
[00287] Clone 15, in which the LPL and LIPA genes and are modified, and Clone 16, in which the PLBD2, LPL and LIPA genes are modified, are transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the acid ceramidase (ACE) gene (SEQ ID NO:77): sgRNA9 (SEQ ID NO:79), sgRNAI O (SEQ ID NO:80) or sgRNA1 1 (SEQ ID NO:81 ) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs. Cre recombinase is co- transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016). The transfected cultures are selected for 1 1 days in serum-free medium containing 400 μg/mL hygromycin. Cells that undergo RMCE are isolated by flow cytometry. The desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing the LIPA modified locus, the LPL modified locus and the ACE modified locus is designated as "clone 17". The resultant clone containing the PLBD2 modified locus, the LIPA modified locus, the LPL modified locus and the ACE modified locus is designated as "clone 18".
[00288] The present invention may be embodied in other specific embodiments. HOST CELL PROTEIN MODIFICATION
FIELD
[0001] The invention provides for cells and methods for expression and purification of recombinant proteins in eukaryotic cells. In particular, the invention includes methods and compositions for expression of proteins in eukaryotic cells, particularly Chinese hamster (Cricetulus griseus) cell lines, that employ downregulating gene expression of endogenous proteins in order to control production of such unwanted "sticky" host cell proteins. The invention includes polynucleotides and modified cells that facilitate purification of an exogenous recombinant protein of interest. The methods of the invention efficiently target host cell proteins in the Chinese hamster cellular genome in order to facilitate enhanced and stable expression of recombinant proteins expressed by the modified cells.
BACKGROUND
[0002] Cellular expression systems aim to provide a reliable and efficient source for the
manufacture of biopharmaceutical products for therapeutic use. Purification of any recombinant protein produced by either eukaryotic or prokaryotic cells in such systems is an ongoing challenge due to, for example, the plethora of host cell proteins and nucleic acid molecules that need to be eliminated from the final pharmaceutical grade product.
[0003] Certain dynamics of host cell proteins (HCPs), viewed as impure byproducts, have been surveyed during various stages of bioprocessing. Advanced liquid chromatography/mass spectrometry (LC/MS) was done to detect and monitor E. coli HCPs accompanying peptibodies produced by cell culture (Schenauer, MR., et al, 2013, Biotechnol Prog 29(4):951-7). The information obtained by HCP profiles is useful for monitoring process development and assessing quality and purity of the product in order to assess safety risks posed by any one or more HCP(s).
Changes in cell culture conditions of eukaryotic cells has been shown to impact the purity and stability of manufactured proteins, as seen by the increased quantity of HCPs of CHO cells upon downstream bioprocessing alterations (Tait, et al, 2013, Biotechnol Prog 29(3):688-696). The detrimental effect of leftover HCPs in any product may affect the overall quality or quantity, or both the quality and quantity of the product. HCPs, if present even at low levels in a therapeutic product, may induce an undesired immune response which causes concern for patient safety and efficacy of the drug product (Singh SK. 201 1. "Impact of product-related factors on immunogenicity of biotherapeutics." J Pharm Sci 100:354-387; Ipsen Press Release "Ipsen's partner Inspiration Biopharmaceuticals announces hold of phase III clinical trials evaluating IB1001 for the treatment and prevention of hemophilia B", 10 July 2012). Current protocols seek to alter the protein of interest produced by the cell (e.g.,therapeutic antibody) to eliminate differential binding or interaction with the protein of interest and the host cell protein (Zhang, Q. et al, mAbs, Published online: 11 Feb 2014). Alternative bioprocessing or purification techniques may be warranted in order to minimize the risk of excess impurities (Yuk, et al. 2015, Biotechnol. Bioeng. 9999: 1-16).
[0004] Despite the availability of numerous cell expression systems, engineered cell lines and systems that do not negatively impact the biological properties of an expressed protein of interest are particularly advantageous. Accordingly, there is a need in the art for improved methods towards preparation of quality protein samples for downstream bioprocessing and subsequently commercial use.
BRIEF SUMMARY
[0005] The use of gene editing tools to eliminate two or more contaminant host cell proteins is contemplated, and thus, engineered host cells for more efficient manufacturing processing of proteins is provided.
[0006] In one aspect, the invention provides a recombinant host cell, wherein the cell is modified to decrease the expression levels of two or more fatty acid hydrolases (FAHs) relative to the expression levels of FAH in an unmodified cell.
[0007] In another aspect, the invention provides a recombinant host cell, wherein the cell is modified to have no expression of two or more target FAHs.
[0008] In some embodiments, one of said two or more target FAHs is an esterase. In more specific embodiments, the esterase is a lipase. In more specific embodiments, the lipase is: (1 ) a phospholipase, such as a phospholipase B-like protein or a phospholipase B-like 2 protein, (2) a lipoprotein lipase or (3) a lysosomal acid lipase. In some embodiments, one of said two or more target FAHs is an amidase. In more specific embodiments, the amidase is a fatty acid acylase. In a more specific embodiment, the fatty acid acylase is an acid ceramidase.
[0009] In some embodiments, a gene of interest is exogenously added to the recombinant host cell. In other embodiments, the exogenously added gene encodes a protein of interest (POI), for example the POI is selected form the group consisting of antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein.
[0010] In further embodiments, the invention relates to a composition comprising one or more proteins of interest (POIs) obtainable by a method according to the invention. The protein may be a recombinant protein and may be e.g. selected form the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein, or any combinations thereof.
[0011] In some embodiments, the invention relates to a composition comprising one or more proteins of interest (POIs), wherein the adverse enzyme activity is < 50%, 40%, 48%, 47%, 46%, 45%, 44%, 43%, 42%, 41 %, 40%, 39%, 38%, 37%, 36%, 35%, 34%, 33%, 32%, 31 %, 30%, 29%, 28%, 26%, 25%, 24%, 23%, 22%, 21%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, 0.05%, or <0.05% active relative to the adverse enzyme activity of a wild-type production system. The phrase "adverse enzyme activity" refers to any enzyme and its action upon the resulting composition as a whole, wherein the action results in metabolism of any protein components which metabolites reduces the shelf-life of the composition or results in the formation of subvisible particles (SVPs) above prescribed regulations. The protein may be a recombinant protein and may e.g. be selected from the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein or any combinations thereof. The enzymes and their activities may be, e.g., various esterases, hydrolases, lipases, phospholipases, ceramidases and the likes, or any combinations thereof. Adverse enzyme activity may be measured using a functional assay {e.g., polysorbate fatty acid hydrolysis assay), or a structural assay {e.g., nano LC-MS of peptide fragments, or the like).
[0012] The invention provides a cell comprising a nonfunctional PLBD2 protein and one or more additional nonfunctional fatty acid hydrolases (FAH). In one embodiment, the additional
nonfunctional FAH is a nonfunctional lipoprotein lipase (LPL), lysosomal acid lipase (LI PA), acid ceramidase (ACE), platelet-activating factor acetylhydrolase IB subunit gamma (PAFAHG), phosphoinositide phospholipase C fragment (PIPLCf), phosphoinositide phospholipase C (PIPLC), liver carboxylesterase 1 (LCE), isoamyl acetate-hydrolyzing esterase 1 -like (IAH1 ), group XV phospholipase A2 (LPLA2), carboxylic ester hydrolase (CEH), and/or arylsulfatase A (ASA).
[0013] The invention provides making a cell by FAH target disruption. In some embodiments, the method comprises a site-specific nuclease for disrupting or editing the cell genome at a target site or sequence. In some embodiments, the FAH target site is (1 ) a PLBD2 target site, (2) an LPL target site, (3) an LI PA target site, (4) an ACE target site, (5) a PAFAHG site, (6) a PIPLCf or PIPLC site, (7) an LCE site, (8) an IAH1 site, (9) an LPLA2 site, (9) an CEH site, and /or (10) an ASA site.
[0014] In one embodiment, the PLBD2 target site comprises a position within SEQ ID NO:33, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:33, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:33, within exon 1 of SEQ ID NO:33, within exon 2 of SEQ ID NO:33, or within exon 3 of SEQ ID NO:33.
[0015] In one embodiment, the PLBD2 target site comprises a position within SEQ ID NO:33 or adjacent to a position within SEQ ID NO:33 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-230, 190-210, 200-220, 210-230, 220-240, 230-250, 240-260, and 250-270 of SEQ ID NO:33.
[0016] In another embodiment, the target site at a position within SEQ ID NO:33 or adjacent to a position within SEQ ID NO:33 is selected from the group consisting of nucleotides spanning positions numbered 37-56, 44-56, 33-62, 40-69, 110-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33. In this regard, the PLBD2 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:33 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:33 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0017] In one embodiment, the LPL target site comprises a position within SEQ ID NO:61 , within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:61 , 100 nucleotides downstream of the 5-prime end of SEQ ID NO:61 , within exon 1 of SEQ ID NO:61 , within exon 2 of SEQ ID NO:61 , or within exon 3 of SEQ ID NO:61.
[0018] In one embodiment, the LPL target site comprises a position within SEQ ID NO:61 or adjacent to a position within SEQ ID NO:61 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490- 510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590- 610, 600-620, 610-630, 620-640 and 630-650 of SEQ ID NO:61.
[0019] In another embodiment, the target site at a position within SEQ ID NO:61 or adjacent to a position within SEQ ID NO:61 is selected from the group consisting of nucleotides spanning positions numbered 465-484, 558-577, and 593-612 of SEQ ID NO:61. In this regard, the LPL target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:61 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:61 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0020] In one embodiment, the LIPA target site comprises a position within SEQ ID NO:69, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:69, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:69, within exon 1 of SEQ ID NO:69, within exon 2 of SEQ ID NO:69, or within exon 3 of SEQ ID NO:69.
[0021] In one embodiment, the LIPA target site comprises a position within SEQ ID NO:69 or adjacent to a position within SEQ ID NO:69 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350 and 340-360 of SEQ ID NO:69.
[0022] In another embodiment, the target site at a position within SEQ ID NO:69 or adjacent to a position within SEQ ID NO:69 is selected from the group consisting of nucleotides spanning positions numbered 180-199, 239-258, and 276-295 of SEQ ID NO:69. In this regard, the LIPA target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:69 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:69 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0023] In one embodiment, the ACE target site comprises a position within SEQ ID NO:77, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:77, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:77, within exon 1 of SEQ ID NO:77, within exon 2 of SEQ ID NO:77, within exon 3 of SEQ ID NO:77, or within exon 4 of SEQ ID NO:77.
[0024] In one embodiment, the ACE target site comprises a position within SEQ ID NO:77 or adjacent to a position within SEQ ID NO:77 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370 and 360-380 of SEQ ID NO:77.
[0025] In another embodiment, the target site at a position within SEQ ID NO:77 or adjacent to a position within SEQ ID NO:77 is selected from the group consisting of nucleotides spanning positions numbered 135-154, 237-256, and 332-351 of SEQ ID NO:77. In this regard, the ACE target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:77 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:77 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0026] In one embodiment, the PAFAHG target site comprises a position within SEQ ID NO:1 17, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:117, 100 nucleotides
downstream of the 5-prime end of SEQ ID NO: 117, within exon 1 of SEQ ID NO:1 17, within exon 2 of SEQ ID NO:117, or within exon 3 of SEQ ID NO:1 17.
[0027] In one embodiment, the PAFAHG target site comprises a position within SEQ ID NO:1 17 or adjacent to a position within SEQ ID NO:1 17 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, 760-780, and 770-787 of SEQ ID NO:1 17.
[0028] In another embodiment, the target site at a position within SEQ ID NO:1 17 or adjacent to a position within SEQ ID NO:117 is selected from the group consisting of nucleotides spanning positions numbered 101-120, 1 1 1-130, 121 -140, 131-150, 141-160, and 150-169 of SEQ ID NO:1 17. In this regard, the PAFAHG target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 17 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 17 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0029] In one embodiment, the PIPLC target site comprises a position within SEQ ID NO:1 18, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:118, 100 nucleotides
downstream of the 5-prime end of SEQ ID NO: 118, within exon 1 of SEQ ID NO:1 18, within exon 2 of SEQ ID NO:118, within exon 3 of SEQ ID NO:118, within exon 4 of SEQ ID NO:1 18, within exon 5 of SEQ ID NO:1 18, within exon 6 of SEQ ID NO:1 18, within exon 7 of SEQ ID NO:1 18, within exon 8 of SEQ ID NO: 1 18, within exon 9 of SEQ ID NO: 1 18, within exon 10 of SEQ ID NO: 1 18, within exon 1 1 of SEQ ID NO:1 18, within exon 12 of SEQ ID NO:1 18, or within exon 13 of SEQ ID NO:1 18.
[0030] In one embodiment, the PIPLC target site comprises a position within SEQ ID NO:1 18 or adjacent to a position within SEQ ID NO:1 18 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ I D NO: 118.
[0031] In another embodiment, the target site at a position within SEQ ID NO:1 18 or adjacent to a position within SEQ ID NO:118 is selected from the group consisting of nucleotides spanning positions numbered 39-188, 39-58, 49-68, 59-78, 69-88, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 149-178, and 159-188 of SEQ ID NO:118. In this regard, the PIPLC target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 18 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 18 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0032] In one embodiment, the LCE target site comprises a position within SEQ ID NO:1 19, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 1 19, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:1 19, within exon 1 of SEQ ID NO:1 19, within exon 2 of SEQ ID NO:1 19, within exon 3 of SEQ ID NO:119, within exon 4 of SEQ ID NO:1 19, or within exon 5 of SEQ ID NO:1 19.
[0033] In one embodiment, the LCE target site comprises a position within SEQ ID NO:1 19 or adjacent to a position within SEQ ID NO:1 19 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ I D NO: 119.
[0034] In another embodiment, the target site at a position within SEQ ID NO:1 19 or adjacent to a position within SEQ ID NO:119 is selected from the group consisting of nucleotides spanning positions numbered 89-140, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, and 139-158 of SEQ ID NO:1 19. In this regard, the LCE target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 19 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 19 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0035] In one embodiment, the IAH1 target site comprises a position within SEQ ID NO:120, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 120, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:120, within exon 1 of SEQ ID NO:120, within exon 2 of SEQ ID NO:120, within exon 3 of SEQ ID NO:120, or within exon 4 of SEQ ID NO:120.
[0036] In one embodiment, the IAH1 target site comprises a position within SEQ ID NO:120 or adjacent to a position within SEQ ID NO:120 selected from the group consisting of nucleotides spanning positions numbered -20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO: 120.
[0037] In another embodiment, the target site at a position within SEQ ID NO:120 or adjacent to a position within SEQ ID NO:120 is selected from the group consisting of nucleotides spanning positions numbered 104-325, 104-123, 1 14-133, 124-143, 134-153, 144-163, 154-173, 164-183, 174-193, 184-203, 194-213, 204-223, 214-233, 224-243, 234-253, 244-263, 254-273, 264-283, 274-293, 284-303, 294-313, 304-323, and 314-333 of SEQ ID NO:120. In this regard, the IAH1 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO: 120 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:120 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0038] In one embodiment, the LPLA2 target site comprises a position within SEQ ID NO:121 , within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 121 , 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 121 , within exon 1 of SEQ ID NO: 121 , within exon 2 of SEQ ID NO: 121 , within exon 3 of SEQ ID NO: 121 , or within exon 4 of SEQ ID NO: 121.
[0039] In one embodiment, the LPLA2 target site comprises a position within SEQ ID NO:121 or adjacent to a position within SEQ ID NO:121 selected from the group consisting of nucleotides spanning positions numbered -20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO:121.
[0040] In another embodiment, the target site at a position within SEQ ID NO:121 or adjacent to a position within SEQ ID NO:121 is selected from the group consisting of nucleotides spanning positions numbered 69-195, 69-88, 79-98, 89-108, 99-118, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, 169-188, and 179-198 of SEQ ID NO:121. In this regard, the LPLA2 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO: 121 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:121 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodimen g p g occurs on both alleles of the gene.
[0041] In one embodiment, the CEH target site comprises a position within SEQ ID NO:122, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 122, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 122, within exon 1 of SEQ ID NO: 122, within exon 2 of SEQ ID NO: 122, within exon 3 of SEQ ID NO: 122, within exon 4 of SEQ ID NO: 122, or within exon 5 of SEQ ID NO: 122.
[0042] In one embodiment, the CEH target site comprises a position within SEQ ID NO:122 or adjacent to a position within SEQ ID NO:122 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO: 122.
[0043] In another embodiment, the target site at a position within SEQ ID NO:122 or adjacent to a position within SEQ ID NO:122 is selected from the group consisting of nucleotides spanning positions numbered 79-186, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, and 169-188 of SEQ ID NO:122. In this regard, the CEH target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:122 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO: 122 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0044] In one embodiment, the ASA target site comprises a position within SEQ ID NO:123, within 100 nucleotides upstream of the 5 prime end of SEQ ID NO: 123 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 123, within exon 1 of SEQ ID NO: 123, within exon 2 of SEQ ID NO: 123, within exon 3 of SEQ ID NO: 123, or within exon 4 of SEQ ID NO: 123.
[0045] In one embodiment, the ASA target site comprises a position within SEQ ID NO:123 or adjacent to a position within SEQ ID NO:123 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO: 123.
[0046] In another embodiment, the target site at a position within SEQ ID NO:123 or adjacent to a position within SEQ ID NO:123 is selected from the group consisting of nucleotides spanning positions numbered 79-296, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, 169-188, 179-198 , 189-208 , 199-218, 209-228, 219-238, 229-248, 239-258, 249-268, 259-278, 269-288, and 279-298 of SEQ ID NO:123. In this regard, the ASA target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:123 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO: 123 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0047] In certain embodiments, the cell further integrates an exogenous nucleic acid sequence. In other embodiments, the cell is capable of producing an exogenous protein of interest. In still other embodiments, the altered protein resulting from a disrupted gene does not bind to the protein of interest produced by the cell.
[0048] In another aspect, an isolated Chinese hamster ovary (CHO) cell is provided that comprises an engineered nucleic acid sequence comprising a variant of the PLBD2 gene (such as a variant of SEQ ID NO:33). In one embodiment, the PLBD2 gene comprises GACAGTCACG TGGCCCGACT GAGGCACGCG , nucleotides 1-30 of SEQ ID NO:33 (SEQ ID NO: 44). In another embodiment, the PLBD2 gene is engineered to disrupt expression of the open reading frame. In other embodiments, the invention provides an isolated CHO cell comprising (a) a disrupted PLBD2 gene comprising GACAGTCACG TGGCCCGACT GAGGCACGCG (SEQ ID NO: 44, also nucleotides 1- 30 of SEQ ID NO:33), (b) a disrupted esterase gene comprising a nucleotide encoding any one of the amino acid sequences in Table 2, or (c) a protein fragment of Table 2 expressed by a disrupted PLBD2 gene; and an exogenous nucleic acid sequence comprising a gene of interest.
[0049] In one embodiment, the CHO cell that comprises an engineered nucleic acid sequence comprising a variant of the PLBD2 gene also comprises variants of one or more of the genes encoding (1 ) LPL (variant of SEQ ID NO:61 ), (2) LIPA (SEQ ID NO:69), (3) ACE (SEQ ID NO:77), (4) PAFAHG (SEQ ID NO:1 17), (5) PIPLC (SEQ ID NO:1 18), (6) LCE (SEQ ID NO:1 19), (7) IAH1 (SEQ ID NO:120), (8) LPLA2 (SEQ ID NO:121 ), (9) CEH (SEQ ID NO:122), and/or (10) ASA (SEQ ID NO:123).
[0050] In another aspect, a method of producing a protein of interest using a recombinant host cell is provided, wherein the host cell is modified to decrease the expression levels of esterase relative to the expression levels of esterase in an unmodified cell.
[0051] In another embodiment, the method comprises the modified host cell having decreased esterase expression and an exogenous nucleic acid sequence comprising a gene of interest (GOI).
[0052] In certain embodiments, the exogenous nucleic acid sequence comprises one or more genes of interest. In some embodiments, the one or more genes of interest are selected from the group consisting of a first GOI, a second GOI and a third GOI.
[0053] In another aspect, the invention provides expression systems comprising the recombinant host cell comprising modified or nonfunctional esterase.
[0054] In yet another embodiment, the cell comprises a GOI operably linked to a promoter capable of driving expression of the GOI, wherein the promoter comprises a eukaryotic promoter that can be regulated by an activator or inhibitor. In other embodiments, the eukaryotic promoter is operably linked to a prokaryotic operator, and the eukaryotic cell optionally further comprises a prokaryotic repressor protein.
[0055] In another embodiment, one or more selectable markers are expressed by the modified host cell. In some embodiments, the genes of interest and/or the one or more selectable markers are operably linked to a promoter, wherein the promoter may be the same or different. In another embodiment, the promoter comprises a eukaryotic promoter (such as, for example, a CMV promoter or an SV40 late promoter), optionally controlled by a prokaryotic operator (such as, for example, a tet operator). In other embodiments, the cell further comprises a gene encoding a prokaryotic repressor (such as, for example, a tet repressor).
[0056] In one aspect, a CHO host cell is provided, comprising recombinase recognition sites. In some embodiments, the recombinase recognition sites are selected from a LoxP site, a /_ox51 1 site, a Lox2272 site, Lox2372, Lox5171 , and a frt site.
[0057] In another embodiment, the cell further comprises a gene capable of expressing a recombinase. In some embodiments, the recombinase is a Cre recombinase.
[0058] In one embodiment, the selectable marker gene is a drug resistance gene. In another embodiment, the drug resistance gene is a neomycin resistance gene or a hygromycin resistance gene. In another embodiment, the second and third selectable marker genes encode two different fluorescent proteins. In one embodiment, the two different fluorescent proteins are selected from the group consisting of Discosoma coral (DsRed), green fluorescent protein (GFP), enhanced green fluorescent protein (eGFP), cyano fluorescent protein (CFP), enhanced cyano fluorescent protein (eCFP), yellow fluorescent protein (YFP), enhanced yellow fluorescent protein (eYFP) and far-red fluorescent protein (mKate).
[0059] In one embodiment, the first, second, and third promoters are the same. In another embodiment, the first, second, and third promoters are different from each other. In another embodiment, the first promoter is different from the second and third promoters, and the second and third promoters are the same. In more embodiments, the first promoter is an SV40 late promoter, and the second and third promoters are each a human CMV promoter. In other embodiments, the first and second promoters are operably linked to a prokaryotic operator.
[0060] In one embodiment, the host cell line has an exogenously added gene encoding a recombinase integrated into its genome, operably linked to a promoter. In another embodiment, the recombinase is Cre recombinase. In another embodiment, the host cell has a gene encoding a regulatory protein integrated into its genome, operably linked to a promoter. In more embodiments, the regulatory protein is a tet repressor protein.
[0061] In one embodiment, the first GOI and the second GOI encode a light chain, or fragment thereof, of an antibody or a heavy chain, or fragment thereof, of an antibody. In another embodiment, the first GOI encodes a light chain of an antibody and the second GOI encodes a heavy chain of an antibody.
[0062] In certain embodiments, the first, second and third GOI encode a polypeptide selected from the group consisting of a first light chain, or fragment thereof, a second light chain, or fragment thereof and a heavy chain, or fragment thereof. In yet another embodiment, the first, second and third GOI encode a polypeptide selected from the group consisting of a light chain, or fragment thereof, a first heavy chain, or fragment thereof and a second heavy chain, or fragment thereof.
[0063] In one aspect, a method is provided for making a protein of interest, comprising (a) introducing into a CHO host cell a gene of interest (GOI), wherein the GOI integrates into a specific locus such as a locus described in US Patent No. 7771997B2, issued August 10, 2010 or other stable integration and/or expression-enhancing locus; (b) culturing the cell of (a) under conditions that allow expression of the GOI; and (c) recovering the protein of interest. In one embodiment, the protein of interest is selected from the group consisting of a subunit of an immunoglobulin, or fragment thereof, and a receptor, or ligand-binding fragment thereof. In certain embodiments, the protein of interest is selected from the group consisting of an antibody light chain, or antigen-binding fragment thereof, and an antibody heavy chain, or antigen-binding fragment thereof.
[0064] In certain embodiments, the CHO host cell genome comprises further modifications, and comprises one or more recombinase recognition sites as described above, and the GOI is introduced into a specific locus through the action of a recombinase that recognizes the
recombinase recognition site.
[0065] In some embodiments, the GOI is introduced into the cell employing a targeting vector for recombinase-mediated cassette exchange (RMCE) when the CHO host cell genome comprises at least one exogenous recognition sequence within a specific locus.
[0066] In another embodiment, the GOI is introduced into the cell employing a targeting vector for homologous recombination, and wherein the targeting vector comprises a 5' homology arm homologous to a sequence present in the specific locus, a GOI, and a 3' homology arm
homologous to a sequence present in the specific locus. In another embodiment, the targeting vector further comprises two, three, four, or five or more genes of interest. In another embodiment, one or more of the genes of interest are operably linked to a promoter.
[0067] In another aspect, a method is provided for modifying a CHO cell genome to integrate an exogenous nucleic acid sequence, comprising the step of introducing into the cell a vehicle comprising an exogenous nucleic acid sequence wherein the exogenous nucleic acid integrates within a locus of the genome.
[0068] In yet another aspect, the invention provides a process for manufacturing a stable protein formulation comprising the steps of: (a) extracting a protein fraction from the modified host cell of the invention having decreased or ablated expression of esterase, (b) contacting the protein fraction comprising a protein of interest with a column selected from the group consisting of protein A affinity (PA), cation exchange (CEX) and anion exchange (AEX) chromatography, (c) collecting the protein of interest from the media, wherein a reduced level of the esterase activity is associated with the protein fraction collected at step (c), thus providing a stable protein formulation.
[0069] In yet another aspect, the invention provides a process for reducing esterase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate expression of esterase, (b) transfecting the host cell with a protein of interest, (c) extracting a protein fraction from the modified host cell, (c) contacting the protein fraction comprising the protein of interest with a column selected from the group consisting of protein A affinity (PA), cation exchange (CEX) and anion exchange (AEX) chromatography, (d) collecting the protein of interest from the media, and (e) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer, thus providing a protein formulation essentially free of detectable esterase activity. In some embodiments, the protein formulation is essentially free of PLBD2 protein or PLBD2 activity.
[0070] In yet another aspect, a method is provided for modifying a CHO cell genome to express a therapeutic agent comprising a vehicle for introducing, into the genome, an exogenous nucleic acid comprising a sequence for expression of the therapeutic agent, wherein the vehicle comprises a 5' homology arm homologous to a sequence present in the nucleotide sequence of SEQ ID NO:33, a nucleic acid encoding the therapeutic agent, and a 3' homology arm homologous to a sequence present in the nucleotide sequence of SEQ ID NO:33.
[0071] In one more aspect, the invention provides a modified CHO host cell comprising a modified CHO genome wherein the CHO genome is modified by disruption of target sequence within a nucleotide sequence at least 90% identical to SEQ ID NO: 33. In one embodiment, the modified CHO host cell further comprises another FAH target sequence disruption. In some embodiments, the another FAH target sequence is within (1 ) a nucleotide sequence at least 90% identical to SEQ ID NO:61 , (1 ) a nucleotide sequence at least 90% identical to SEQ ID NO:69 and/or (3) a nucleotide sequence at least 90% identical to SEQ ID NO:77.
[0072] In another aspect, the invention provides a modified eukaryotic host cell comprising a modified eukaryotic genome wherein the eukaryotic genome is modified at a target sequence in a coding region of the target gene by a site-specific nuclease. In some embodiments, the site-specific nuclease comprises a zinc finger nuclease (ZFN), a ZFN dimer, a transcription activator-like effector nuclease (TALEN), a TAL effector domain fusion protein, or an RNA-guided DNA endonuclease. The invention also provides methods of making such a modified eukaryotic host cell.
[0073] In any of the aspects and embodiments described above, the target sequence can be placed in the indicated orientation as in SEQ ID NO:33, 61 , 69, 77, 1 17, 118, 119, 120, 121 , 122, or 123; or in the reverse of the orientation of SEQ ID NO:33, 61 , 69, 77, 1 17, 1 18, 119, 120, 121 , 122, or 123.
[0074] Any of the aspects and embodiments of the invention can be used in conjunction with any other aspect or embodiment of the invention, unless otherwise specified or apparent from the context.
[0075] Other objects and advantages will become apparent from a review of the ensuing detailed description.
BRIEF DESCRIPTION OF THE DRAWINGS
[0076] Fig. 1 depicts the results of Taqman® quantitative polymerase chain reaction (qPCR) experiments to detect genomic (gDNA) or transcripts (mRNA) of the modified clones. Primers and probes were designed to flank the sequences predicted as subject to targeted disruption within exon 1 , either starting at nucleotide 37 (sgRNAI ) or starting at nucleotide 44 (sgRNA2) of SEQ ID NO:33. Relative amount of amplicons from clones targeted by either sgRNAI or sgRNA2 are graphed (i.e., relative to amplicons derived from the negative control transfection clones which were subject to no sgRNA or unmatched sgRNA). Clone 1 , for example, has relatively no amplified gDNA nor mRNA per qPCR of the targeted exon 1 region. Clone 1 and several other clones were selected for follow up analysis. The vertical axes represent the relative amount of template containing sgRNAI sequence (upper panel) or sgRNA2 sequence (lower panel).
[0077] Fig. 2A and Fig. 2B illustrate the results of further PCR analysis of a Clone 1 cells population compared to wild type Chinese hamster overy (CHO) cells. Fig. 2A shows a PCR amplicon from Clone 1 that is shorter in length (bp) compared to the amplicon from genomic DNA of wild type cells. Fig. 2B shows a PCR amplicon from Clone 1 that is shorter in length (bp) compared to the amplicon from mRNA of wild type cells. Sequencing confirmed an 1 1 bp deletion in the PLBD2 gene of Clone 1 . The vertical axes represent the size of the PCR fragments in base-pairs (bp).
[0078] Fig. 3 illustrates the relative protein titer of monoclonal antibody 1 (mAbl )-expressing Clone 1 cells (RS001 ) or mAb1 -expressing wild type CHO cells (RS0WT) subject to the same fed-batch culture conditions for 12 days. Samples of conditioned medium were extracted for each culture, and the Protein A binding fraction was quantified at Day 2, 4, 6, 9 and 12.
[0079] Fig. 4 shows the results of RS001 or RS0WT cells following production culture and protein purification using either Protein A (PA) alone, or PA and anion exchange (AEX) chromatography. PA-purified mAb1 from RS001 and RS0WT was analyzed for lipase abundance using trypsin digest mass spectrometry. As such, trypsin digests of RS001 - and RSOWT-produced mAb1 were injected into a reverse phase liquid chromatography column coupled to a triple quadrupole mass spectrometer set to monitor a specific PLBD2 product fragment (as in Table 2). Control reactions containing reference samples of mAb1 (with no endogenous PLBD2) spiked with varying amounts of recombinant PLBD2 were also analyzed and plotted. The signals detected in the experiments were compared to the control reactions to determine concentration of PLBD2. mAb1 produced from Clone 1 shows no detectable amounts of PLBD2 when purified with PA alone.
[0080] Fig. 5 is a line plot depicting percent cell viability as a function of time in days. Open circles (-0-) represent wildtype cells. Filled diamonds (-♦-) represent PLBD2-KO cells. Filled squares (-■-) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
[0081] Fig. 6 is a line plot depicting protein production (titer) in grams per liter as a function of time in days. Open circles (-o-) represent wildtype cells. Filled diamonds (-♦-) represent PLBD2-KO cells. Filled squares (-■-) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
[0082] Fig. 7 is a line plot depicting viable cell counts in cells per milliliter as a function of time in days. Open circles (-o-) represent wildtype cells. Filled diamonds (-♦-) represent PLBD2-KO cells. Filled squares (-■-) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
[0083] Fig. 8 shows a formatted alignment of LPL knock out constructs clone 19, clone 20, clone 21 , and clone 22, represented by SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161 , and SEQ ID NO:162, respectively. The partial wildtype LPL sequence is represented by SEQ ID NO:158.
DETAILED DESCRIPTION
[0084] Before the present methods are described, it is to be understood that this invention is not limited to particular methods, and experimental conditions described, as such methods and conditions may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present invention will be limited only by the appended claims.
[0085] As used in this specification and the appended claims, the singular forms "a", "an", and "the" include plural references unless the context clearly dictates otherwise. Thus for example, a reference to "a method" includes one or more methods, and/or steps of the type described herein and/or which will become apparent to those persons skilled in the art upon reading this disclosure.
[0086] Unless defined otherwise, or otherwise specified, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
[0087] Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, particular methods and materials are now described. All publications mentioned herein are incorporated herein by reference in their entirety.
Definitions
[0088] The phrase "exogenously added gene" or "exogenously added nucleic acid" refers to any DNA sequence or gene not present within the genome of the cell as found in nature. For example, an "exogenously added gene" within a CHO genome, can be a gene from any other species (e.g., a human gene), a chimeric gene (e.g., human/mouse), or a hamster gene not found in nature within the particular CHO locus in which the gene is inserted (i.e., a hamster gene from another locus in the hamster genome), or any other gene not found in nature to exist within a CHO locus of interest.
[0089] Percent identity, when describing an esterase, e.g., a hydrolase protein, such as SEQ ID NO:32, 34, 35, 37, 62,70, 74, 75, 76, 78, 82, 83, 84; 124, 125, 126, 127, 128, 129, and 130; or gene, such as SEQ ID NO:33, 61 , 69, 77, 1 17, 118, 119, 120, 121 , 122, and 123 includes homologous sequences that display the recited identity along regions of contiguous homology, but the presence of gaps, deletions, or insertions that have no homolog in the compared sequence are not taken into account in calculating percent identity.
[0090] A "percent identity" determination between, e.g., SEQ ID NO:32 with a species homolog would not include a comparison of sequences where the species homolog has no homologous sequence to compare in an alignment (i.e., SEQ ID NO:32 compared to a fragment thereof, or the species homolog has a gap or deletion, as the case may be). Thus, "percent identity" does not include penalties for gaps, deletions, and insertions.
[0091] "Targeted disruption" of a gene or nucleic acid sequence refers to gene targeting methods that direct cleavage or breaks (such as double stranded breaks) in genomic DNA and thus cause a modification to the coding sequence of such gene or nucleic acid sequence. Gene target sites are the sites selected for cleavage or break by a nuclease. The DNA break is normally repaired by the non-homologous end-joining (NHEJ) DNA repair pathway. During NHEJ repair, insertions or deletions (InDels) may occur, as such, a small number of nucleotides are either inserted or deleted at random at the site of the break and these InDels may shift or disrupt the open reading frame (ORF) of the target gene. Shifts in the ORF may cause significant changes in the resulting amino acid sequence downstream of the DNA break, or may introduce a premature stop codon, therefore the expressed protein, if any, is rendered nonfunctional or subject to degradation.
[0092] "Targeted insertion" refers to gene targeting methods employed to direct insertion or integration of a gene or nucleic acid sequence to a specific location on the genome, i.e., to direct the DNA to a specific site between two nucleotides in a contiguous polynucleotide chain. Targeted insertion may also be performed to introduce a small number of nucleotides or to introduce an entire gene cassette, which includes multiple genes, regulatory elements, and/or nucleic acid sequences. "Insertion" and "integration" are used interchangeably.
[0093] "Recognition site" or "recognition sequence" is a specific DNA sequence recognized by a nuclease or other enzyme to bind and direct site-specific cleavage of the DNA backbone.
Endonucleases cleave DNA within a DNA molecule. Recognition sites are also referred to in the art as recognition target sites.
[0094] Polysorbates are fatty acid esters of sorbitan or iso-sorbide (polyoxyethylene sorbitan or iso- sorbide mono- or di- esters). The polyoxyethylene serves as the hydrophilic head group and the fatty acid as the lipophilic hydrophobic tail. The effectiveness as a surfactant of the polysorbate depends upon the amphiphilic nature of the molecule with both hydrophilic head and hydrophobic tail present (in a single molecule). When a polysorbate degrades (hydrolyzes) into its component head group and fatty acid tail, it loses its effectiveness as a protein stabilizer, potentially allowing for aggregation and subsequent subvisible particle (SVP) formation is an indicator of such degradation. SVPs may attribute to immunogenicity. Regulatory authorities like the United States Food and Drug Administration (USFDA) provide limitations on the number of subvisible particles (SVPs) allowed in a pharmaceutical formulation. United States Pharmacopeia (USP) publishes standards for strength, purity and quality of drugs and drug ingredients, as well as food ingredients and dietary
supplements. For example, USP 31 monograph <788> sets the limit for number of particles allowed in parenteral formulations. USP 31 monograph <788> is available at http://www.uspnf.com/official- text/revision-bulletins/particulate-matter-injections; and as Revision Bulletin Official July 1 , 2012, <788> Particulate Matter in Injections, The United States Pharmacopeial Convention. For large volume parenterals (greater than 100 ml_), the limit is set at no more than 25 particles of at least 10 microns per ml_, and no more than 3 particles of at least 25 microns per ml_. For small volume parenterals (100 mL or less), the limit is set at no more than 6,000 particles of at least 10 microns per container, and no more than 600 particles of at least 25 microns per container.
[0095] The importance of maintaining a stable composition to minimize losses of the biologically active agent due to any contaminant or degradative process is emphasized by the International Conference on Harmonisation of Technical Requirements For Registration of Pharmaceuticals For Human Use (ICH). According to ICH Specifications (Q6A and Q6B), if a drug substance does not degrade in the specific formulation and under the specific storage conditions proposed in a new drug application, as demonstrated via appropriate analytical methodology, then degradation product testing may be reduced or eliminated upon approval by the regulatory authorities.
[0096] The term "stability" refers to the retention of an acceptable degree of physical structure (colloidal, nature), chemical structure or biological function of the biologically active agent (e.g., biotherapeutic or other protein produced in a cell-based bioprocess) over time during storage (a.k.a. "shelf-life"), during processing, or after administration and while in vivo. The biologically active agent may be stable even though it does not maintain 100% of its structure or function after storage or administration for a defined amount of time. Under certain circumstances, if about 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97% 98%, 99%, or > 99% of the biologically active agents have a native conformation, structure or function, the biologically active agent and formulation containing the biologically active agent may be regarded as "stable".
[0097] Stability can be measured, inter alia, by determining the percentage of native molecule that remains in the formulation after storage or administration for a defined amount of time at a defined temperature. The percentage of native molecule can be determined by, inter alia, size exclusion chromatography (e.g., size exclusion high performance liquid chromatography [SE-HPLC]), such that native means non-aggregated and non-degraded. In certain embodiments, at least about 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% of the native form of the biologically active agent can be detected in the formulation after a defined amount of time at a defined temperature or under physiological conditions after administration. The defined amount of time after which stability is measured can be about 14 days, about 28 days, about 1 month, about 2 months, about 3 months, about 4 months, about 5 months, about 6 months, about 7 months, about 8 months, about 9 months, about 10 months, about 1 1 months, about 12 months, about 18 months, about 24 months, or more. The temperature at which the formulation containing the biologically active agent may be kept when assessing stability can be any temperature from about -80°C to about 45°C, e.g., storage at about - 80°C, about -30°C, about -20°C, about 0°C, about 4°-8°C, about 5°C, about 25°C, about 35°C, about 37°C or other physiological temperatures, or about 45°C. For example, the biologically active agent may be deemed stable if after 3 months under physiological conditions, greater than about 75%, 80%, 85% or 90% of native molecule is detected in the soluble fraction by SE-HPLC or other size exclusion or size determination method. "Physiological temperature" includes the body temperature of any vertebrate. For example, the physiological temperature of humans is about 37°C. In some embodiments of the invention, physiological temperature is between about 25°C and about 45°C. In some embodiments, physiological temperature is about 25°C, about 26°C, about 27°C, about 28°C, about 29°C, about 30°C, about 31 °C, about 32°C, about 33°C, about 34°C, about 35°C, about 36°C, about 37°C, about 38°C, about 39°C, about 40°C, about 41 °C, about 42°C, about 43°C, about 44°C, and about 45°C.
[0098] Stability can be measured, inter alia, by determining the percentage of biologically active agent, such as a protein, that forms an aggregate (i.e., high molecular weight species) after a defined amount of time at a defined temperature, wherein stability is inversely proportional to the percent high molecular weight (HMW) species that is formed of the biologically active agent (protein). The percentage of HMW species of the biologically active agent can be determined by, inter alia, size exclusion chromatography, as described above. A pharmaceutical formulation containing the biologically active agent may also be deemed stable if after three months at physiological conditions less than about 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, or 0.1% of the biologically active agent is detected in a HMW form.
[0099] Stability can be measured, inter alia, by determining the percentage of a biologically active agent, such as a protein, that is degraded or otherwise is found as a low molecular weight (LMW) species after a defined amount of time at a defined temperature. Stability is inversely proportional to the percent LMW species that is formed in the soluble fraction. The percentage of LMW species of the biologically active agent in the soluble fraction can be determined by, inter alia, size exclusion chromatography, as described above. A pharmaceutical formulation may also be deemed stable if after three months under storage conditions less than about 15%, 14%, 13%, 12%, 1 1%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, or 0.1% of the biologically active agent is detected in a LMW form.
[00100] Pharmaceutical compositions prepared in host cells as described herein are envisioned to comply with standards set forth by multiple regulatory authorities and ICH guidelines. The composition complies if tested for subvisible particles and the test results in the average number of particles present in the units tested does not exceed 12 per milliliter equal to or greater than 10 um in size, and does not exceed 2 per milliliter equal to or greater than 25 um. Various test for microscopic particles in solution are well-known in the art, including but not limited to tests recommended in ICH Guideline Q4B Annex 3(R1 ), dated 27 September 2010 (Evaluation And Recommendation Of Pharmacopoeial Texts For Use In The lch Regions On Test For Particulate Contamination: Sub-Visible Particles General Chapter).
[00101] "Host cell proteins" or "HCPs" refer to proteins produced or encoded by the host organisms used to produce recombinant therapeutic proteins. HCPs are generally process-related impurities during biologies production. The amount of residual HCPs in drug product is generally considered a critical quality attribute (CQA), due to their potential to affect product safety and efficacy. Regulatory authorities require a product sponsor to monitor the removal of HCPs in drug product during bioprocess development. A sensitive assay e.g., immunoassay, capable of detecting a wide range of protein impurities is generally utilized. This testing can include verification at commercial scale in accordance with regional regulations and may be done at the time of submission of a marketing approval application. According to ICH Specifications (Q6A and Q6B, section 2.3), if a drug substance or drug product does not contain any impurity in the specific formulation, i.e. if efficient control or removal to acceptable levels is demonstrated by suitable studies, further testing may be reduced or eliminated upon approval by the regulatory authorities.
[00102] Pharmaceutical compositions prepared in host cells as described herein are envisioned to comply with standards set forth by multiple regulatory authorities and ICH guidelines. The composition prepared by the host cells described herein comprise less than 100 ng/mg (ppm), less than about 90 ppm, less than about 80 ppm, less than about 70 ppm, less than about 60 ppm, less than about 50 ppm, less than about 40 ppm, less than about 30 ppm, less than about 20 ppm, less than about 10 ppm, less than about 5 ppm, or 0 ppm of the target host cell protein, i.e. fatty acid hydrolase.
[00103] The term "fatty acid hydrolase" or "FAH" refers to any hydrolytic enzyme that cleaves at a carbonyl group creating a carboxylic acid product in which the carboxylic acid comprises an R- group that is lipophilic or otherwise hydrophobic. In some embodiments, the carboxylic acid product is a fatty acid. "Esterases" and "fatty acid acylases/amidases" are included as subgenera of fatty acid hydrolase. "Lipases" are a subgenus of esterases that cleave lipids (fats, waxes, sterols, glycerides and phospholipids. "Phospholipases" are a subgenus of lipases that cleave
phospholipids. Esterases cleave fatty acid esters into fatty acids and alcohols. Lipases include PLBD2, LPL and LIPA. "Ceramidase" is a subgenus of fatty acid acylase that cleaves ceramide and releases a fatty acid and sphingosine, which is an amino alcohol. Examples of ceramidases include acid ceramidase, neutral ceramidase, alkaline ceramidase 1 , alkaline ceramidase 2 and alkaline ceramidase 3.
[00104] Protein A-binding fraction refers to the fraction of cell lysate from cultured cells expressing a protein of interest which binds to a Protein A affinity format. It is well understood in the art that Protein A affinity chromatography, such as Protein A chromatography medium, such as resins, beads, columns and the like, are utilized to capture Fc-containing proteins due to their affinity to Protein A.
[00105] Phospholipase B-like 2 (PLBD2) refers to the homologs of a phospholipase gene known as NCBI RefSeq. XM_003510812.2 (SEQ ID NO:33) or protein known as NCBI RefSeq.
XP_003510860.1 (SEQ ID NO:32), and further described herein. PLBD2 is also referred to in the art as putative phospholipase B-like 2 (PLBL2), 76 kDa protein, LAMA-like protein 2, PLB homolog 2, lamina ancestor homolog 2, mannose-6-phosphate protein associated protein p76, p76, phospholipase B-like 2 32 kDa form, phospholipase B-like 2 45 kDa form, or Lysosomal 66.3 kDa protein.
[00106] Lipoprotein lipase (LPL) is a glycosylated homodimer secreted by parenchymal cells and associated with endothelial cells of the capillary lumen. Exemplary LPL proteins include Chinese hamster LPL (SEQ ID NO:62), mouse LPL (SEQ ID NO:66), ), rat LPL (SEQ ID NO:67) and human LPL (SEQ ID NO:68). Mouse LPL is 92% identical to Chinese hamster LPL. Rat LPL is 92% identical to Chinese hamster LPL. Human LPL is 88% identical to Chinese hamster LPL. In one embodiment, Chinese hamster LPL is encoded by a polynucleotide sequence of SEQ ID NO:61 .
[00107] Lysosomal acid lipase (LI PA), also known as lysosomal lipase, lipase A, lysosomal acid and cholesterol esterase is an intracellular lipase that functions in the lysosome. LIPA reversibly catalyzes cholesteryl ester bond formation and cleavage. LIPA is also a glycosylated homodimer. Exemplary LIPA proteins include Chinese hamster LIPA (SEQ ID NO:70), mouse LIPA (SEQ ID NO:74), ), rat LIPA (SEQ ID NO:75) and human LIPA (SEQ ID NO:76). Mouse LIPA is 72% identical to Chinese hamster LIPA. Rat LIPA is 75% identical to Chinese hamster LIPA. Human LIPA is 74% identical to Chinese hamster LIPA. In one embodiment, Chinese hamster LIPA is encoded by a polynucleotide sequence of SEQ ID NO:69.
[00108] Acid ceramidase (CE), also known as ASAH1 , AC, ACDase, ASAH, PHP, PHP32, SMAPME, N-acylsphingosine amidohydrolase (acid ceramidase) 1 is an acylase that cleaves ceramide to produce fatty acid and sphingosine. It is a heterodimer comprising a non-glycosylated alpha subunit and a glycosylated beta subunit. Acid ceramidase has an acid pH optimum. The lipid accumulation disease, Farber Lipogranulomatosis, is associated with a deficiency in acid ceramidase activity. Exemplary acid ceramidases (ACE) include Chinese hamster ACE (SEQ ID NO:78), mouse ACE (SEQ ID NO:82), ), rat ACE (SEQ ID NO:83) and human ACE (SEQ ID NO:84). Mouse ACE is 87% identical to Chinese hamster ACE. Rat ACE is 89% identical to Chinese hamster ACE. Human ACE is 83% identical to Chinese hamster ACE. In one embodiment, Chinese hamster ACE is encoded by a polynucleotide sequence of SEQ ID NO:77.
[00109] Platelet-activating factor acetylhydrolase IB subunit gamma (PAFAHG), also known as PAFAH1 B3, PAFAHG, and platelet activating factor acetylhydrolase 1 b catalytic subunit 3 is one of the catalytic subunits along with beta of the cytosolic tetrameric platelet-activating factor acetylhydrolase IB. PAFAHG belongs to the phospholipase A2 family and catalyzes the hydrolysis of the acyl group at position 2 of glycerol in bioactive phospholipids (see Stafforini et al., Journal of Biological Chemistry, 272:17895-17898, July 1997). Chinese hamster PAFAHG (SEQ ID NO:124) is 98% identical to both rat and mouse PAFAHG, and 96% identical to human PAFAHG. In one embodiment, Chinese hamster PAFAHG is encoded by a polynucleotide sequence of SEQ ID NO:1 17.
[00110] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00111] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00112] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00113] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00114] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00115] Additional fatty acid hydrolases that can serve as targets for deletion, either individually, or in combination with one of more additional fatty acid hydrolases are listed in Table 1. Any of the following hydrolases, or their equivalents, listed in Table A may be the target protein in the creation of a knockout cell line, where removal of the host cell protein is necessitated due to contamination in the preparation of a biopharmaceutical product.
Table 1
[00116] The term "cell" or "cell line" includes any cell that is suitable for expressing a recombinant nucleic acid sequence. Cells include those of prokaryotes and eukaryotes (single-cell or multiple- cell), bacterial cells (e.g., strains of E. coli, Bacillus spp., Streptomyces spp., etc.), mycobacteria cells, fungal cells, yeast cells (e.g.,S. cerevisiae, S. pombe, P. partoris, P. methanolica, etc.), plant cells, insect cells (e.g.,SF-9, SF-21 , baculovirus-infected insect cells, Trichoplusia ni, etc.), non- human animal cells, mammalian cells, human cells, or cell fusions such as, for example, hybridomas or quadromas. In certain embodiments, the cell is a human, monkey, ape, hamster, rat or mouse cell. In certain embodiments, the cell is eukaryotic and is selected from the following cells: CHO (e.g.,CHO K1 , DXB-1 1 CHO, Veggie-CHO), COS (e.g.,COS-7), retinal cells, Vero, CV1 , kidney (e.g.,HEK293, 293 EBNA, MSR 293, MDCK, HaK, BHK21 ), HeLa, HepG2, WI38, MRC 5, Colo25, HB 8065, HL-60, Jurkat, Daudi, A431 (epidermal), CV-1 , U937, 3T3, L cell, C127 cell, SP2/0, NS-0, MMT cell, tumor cell, and a cell line derived from an aforementioned cell. In some embodiments, the cell comprises one or more viral genes, e.g., a retinal cell that expresses a viral gene (e.g.,a PER.C6® cell).
General Description
[00117] The invention is based at least in part on a recombinant host cell and cell expression system thereof that decreases expression of two or more an endogenous host cell fatty acid hydrolases (FAHs), decreases the enzymatic function or binding ability of two or more endogenous host cell FAHs, or lacks detectable expression of two or more FAHs. The inventors discovered that the disruption of genes encoding at least two FAHs allows for the optimized and efficient production and purification of biopharmaceutical products expressed in such expression systems. The invention may be employed in several ways, such as 1 ) utilizing gene editing tools to totally knockout FAH expression, whereas no measurable full-length FAH enzyme is expressed in the cell due to disruption of the gene encoding the FAH; 2) utilizing gene editing tools to eliminate or reduce enzymatic activity, whereas the FAH protein is expressed but rendered nonfunctional due to disruptions in its gene; and 3) utilizing gene editing tools to eliminate or reduce the ability of an endogenous host cell FAH to bind exogenous recombinant protein produced by the cell. FAH activity was determined in protein fractions of certain antibody-producing cells. Several particular fatty acid hydrolases were determined as contaminants in these protein fractions, including three carboxylic esterases (a.k.a. esterases): phospholipase B-like (PLBD2), lipoprotein lipase (LPL) and lysosomal acid lipase (LIPA), and acid ceramidase, a carboxylic amidase (an acylase). Gene editing target sites were identified in hamster PLBD2, LPL, LIPA and acid ceramidase (ACE) genes that enable targeted disruption of those genes in a hamster cell {i.e., CHO) genome.
[00118] An optimized host cell comprising a combination of genetic modifications that affect the expression of genes encoding (1 ) PLBD2 and LPL; (2) PLBD2 and LIPA; (3) PLBD2 and ACE; (4) LPL and LIPA; (5) LPL and ACE; (6) LIPA and ACE; (7) PLBD2, LPL and LIPA; (8) PLBD2, LPL and ACE; (9) PLBD2, LIPA and ACE; (10) LPL, LIPA and ACE, (1 1 ) PLBD2, LPL, LIPA and ACE, (12) PLBD2 and PIPLC, (13) PLBD2 and CEH, (14) PLBD2 and PAFAHG, (15) PLBD2 and LCE, (16) PLBD2 and ASA, (17) PLBD2 and IAH1 , (18) PLBD2 and LPLA2, (19) LPL and PIPLC, (20) LPL and CEH, (21 ) LPL and PAFAHG, (22) LPL and LCE, (23) LPL and ASA, (24) LPL and IAH1 , (25) LPL and LPLA2, (26) LIPA and PIPLC, (27) LIPA and CEH, (28) LIPA and PAFAHG, (29) LIPA AND LCE, (30) LIPA and ASA, (31 ) LIPA and IAH1 , (32) LIPA and LPLA2, (33) ACE and PIPLC, (34) ACE and CEH, (35) ACE and PAFAHG, (36) ACE and LCE, (37) ACE and ASA, (38) ACE and IAH1 , (39) ACE and LPLA2, (40) PIPLC and CEH, (41 ) PIPLC and PAFAHG, (42) PIPLC and LCE, (43) PIPLC and ASA, (44) PIPLC and IAH1 , (45) PIPLC and LPLA2, (46) CEH and PAFAHG, (47) CEH and LCE, (48) CEH and ASA, (49) CEH and IAH1 , (50) CEH and LPLA2, (51 ) PAFAHG and LCE, (53) PAFAHG and ASA, (54) PAFAHG and IAH1 , (55) PAFAHG and LPLA2, (56) LCE and ASA, (57) LCE and IAH1 , (58) LCE and LPLA2, (56) ASA, and IAH1 , (58) ASA and LPLA2, or (59) IAH1 and LPLA2, and the like is/are useful for the biological production of high-quality proteins.
[00119] Such a cell is envisioned to reduce the burden of certain purification steps, thereby reducing time and cost, while increasing production yield. Also, the formulated protein is expected to have improved stability due to the reduced hydrolase burden.
[00120] The invention is also based on the specific targeting of an exogenous gene to the integration site. The methods of the invention allow efficient modification of the cell genome, thus producing a modified or recombinant host cell useful as a cell expression system for the
bioprocessing of therapeutic or other commercial protein products. To this end, the methods of the invention employ cellular genome gene editing strategies for the alteration of particular genes of interest that otherwise may diminish or contaminate the quality of recombinant protein formulations, or require multiple purification steps.
[00121] The compositions of the invention, e.g., gene editing tools, can also be included in expression constructs for example, in expression vectors for cloning and engineering new cell lines. These cell lines comprise the modifications described herein, and further modifications for optimal incorporation of expression constructs for the purpose of protein expression are envisioned.
Expression vectors comprising polynucleotides can be used to express proteins of interest transiently, or can be integrated into the cellular genome by random or targeted recombination such as, for example, homologous recombination or recombination mediated by recombinases that recognize specific recombination sites (e.g., Cre-lox-mediated recombination).
[00122] Target sites for disruption or insertion of DNA are typically identified with the maximum effect of the gene disruption or insertion in mind. For example, target sequences may be chosen near the N-terminus of the coding region of the gene of interest whereas a DNA break is introduced within the first or second exon of the gene. Introns (non-coding regions) are not typically targeted for disruption as repair of the DNA break in that region may not disrupt the target gene. The changes introduced by these modifications are permanent to the genomic DNA of the organism.
[00123] Essentially, following identification of target sites within SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77, gene editing protocols were employed to render a nonfunctional versions of two or more of those genes. In some embodiments, the one of those genes encodes PLBD2 (e.g., SEQ ID NO:33). Once the contaminant FAH host cell proteins are eliminated, protocols known in the art for introducing an expressible gene of interest (GOI), such as a multi- subunit antibody, along with any other desirable elements such as, e.g., promoters, enhancers, markers, operators, ribosome binding sites (e.g., internal ribosome entry sites), efc. are also employed.
[00124] The resulting recombinant cell line conveniently provides more efficient downstream bioprocess methods with respect to expressible exogenous genes of interest (GOIs), since purification steps for exogenous proteins of interest are eliminated due to the absence of the contaminant host cell protein. Eliminating or refining purification procedures also results in higher amounts (titer) of the recovered protein of interest.
Physical and Functional Characterization of Modified CHO Cells
[00125] Applicants have discovered enzymatic activities associated with the destabilization of polysorbates (including polysorbate 20 and polysorbate 80) and/or enzymatic activities co-purifying with highly concentrated, multimerized or aggregated protein. Those activities were found to be associated with one or more fatty acid hydrolases (FAHs). One such FAH was identified from the peptide sequences listed in Table 2. A BLAST search of those peptide sequences revealed identity with a putative phospholipase B-like 2 (PLBD2, also referred to as PLBL2). PLBD2 is highly conserved in hamster (SEQ ID NO:32), mice (SEQ ID NO:34), rat (SEQ ID NO:35), human (SEQ ID NO:36), and bovine (SEQ ID NO:37). The applicants discovered that PLBD2, which co-purifies under certain processes with some classes of proteins-of-interest manufactured in a mammalian cell line, has enzymatic activity responsible for the hydrolysis of polysorbate 20 and 80. Other FAH species, of which PLBD2 is an example, may contribute to polysorbate instability or persist as hydrophobic "sticky" proteins that bind protein multimers or aggregates during purification and ultimate formulation, depending upon the particular protein-of-interest and/or background of the host cell.
Table 2
[00126] Ester hydrolysis of polysorbate 80 was recently reported (see Labrenz, S.R., "Ester hydrolysis of polysorbate 80 in mAb drug product: evidence in support of the hypothesized risk after observation of visible particulate in mAb formulations," J. Pharma. Sci. 103(8):2268-77 (2014)). That paper reported the formation of visible particles in a formulation containing IgG. The author postulated that the colloidal IgG particles formed due to the enzymatic hydrolysis of oleate esters of polysorbate 80. Although no esterase was directly identified, the author speculates that a lipase or tweenase co-purified with the IgG, which was responsible for degrading the polysorbate 80.
Interestingly, IgGs formulated with polysorbate 20 did not form particles and the putative esterase did not hydrolyze the polysorbate 20. The author reported that the putative lipase associated with the IgG did not affect saturated C 12 fatty acid {i.e., laurate) (Id at 7.)
[00127] The applicants discovered three other FAH proteins in addition to PLBD2 that co-purify with antibodies produced in mammalian cells: lipoprotein lipase(LPL), lysosomal acid lipase (LI PA) and acid ceramidase (ACE). LPL is a triacylglycerol/diacylglycerol hydrolase of the carboxylic ester hydrolase (esterase) family (see Hide et al., "Structure and evolution of the lipase superfamily," J Lipid Res. 1992 Feb; 33(2): 167-78). LI PA is a sterol esterase and synthase that acts on esters of sterols and long-chain fatty acids (see Dubland and Francis, "Lysosomal acid lipase: at the crossroads of normal and atherogenic cholesterol metabolism," Front. Cell Dev. Biol. 2015 Feb 2; 3(3): 1-11 ). Acid ceramidase is not a carboxylic ester hydrolase, but rather an amide hydrolase that cleaves fatty acids from ceramide at the amide bond (carboxylic amide hydrolase) (see Park and Schuchman, "Acid ceramidase and human disease," Biochim. Biophys. Acta. 2006 Dec; 1758(12): 2133-8).
[00128] Phospholipases are a family of esterase enzymes that catalyze the cleavage of phospholipids. Each phospholipase subclass has different substrate specificity based on its target cleavage site. Phospholipase B (PLB) was identified as related to a group of prokaryotic and eukaryotic lipase proteins by virtue of the presence of a highly conserved amino acid sequence motif, Gly-Asp-Ser-Leu (GDSL) (Upton, C, and Buckley, JT. A new family of lipolytic enzymes? Trends Biochem Sci. 1995; 20:178-179). However, phospholipase B is also classified with known GDSL(S) hydrolases, and has little sequence homology to true lipases, differentiating itself structurally from phospholipases by having a serine-containing motif closer to the N-terminus than other lipases. Thus, phospholipase B-like proteins are also classified as N-terminal nucleophile (Ntn) hydrolases. Functionally, phospholipase B-like enzymes hydrolyze their target substrate (fatty acid esters such as diacylglycerophospholipids) to produce free fatty acids and ester-containing compounds {e.g., produces glycerophosphocholine), in a similar in manner as other
phospholipases. It has been suggested that PLB-like proteins, such as phospholipase B-like protein 1 (PLBD1 ) and phospholipase B-like protein 2 (PLBD2), also have amidase activity, similar to other Ntn hydrolases (Repo, H. et al, Proteins 2014; 82:300-31 1 ).
[00129] Lipoprotein lipases have also been demonstrated to cleave carboxylic ester bonds of polysorbate 20 and 80; and to associate with some monoclonal antibodies during production (see N. Levy, "Host cell protein impurities and protein-protein interactions in downstream purification of monoclonal antibodies," Dissertation submitted to the Faculty of the University of Delaware, Summer 2014, UMI 3642330, Published by ProQuest LLC, 2014). Cell-cultures of a CHO-K1 RNAi knock-down of LPL reveled diminished polysorbate esterase activity. The effect of such a knockdown on overall cell viability or the production of useful titers of ectopic protein has not been investigated.
[00130] Knockout of a host cell gene, such as an FAH, more particularly one or more of phospholipase B-like protein 2, lipoprotein lipase, lysosomal acid lipase and acid ceramidase may be accomplished in several ways. Rendering the FAH encoding gene nonfunctional, or reducing the functional activity of the target FAH protein may be done by introducing point mutations in the FAH genomic sequence, particularly in the exons (coding regions). In some embodiments, the nucleic acid sequences of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 were identified and sequences upstream and downstream of the target site {i.e., homologous arms) may be utilized to integrate an expression cassette comprising a mutated gene by homologous recombination. Further gene editing tools are described herein in accordance with the invention.
[00131] Cell lines devoid of multiple FAH activities, particularly PLBD2, LPL, LIPA and ACE activity, are useful for the production of therapeutic proteins to be purified and stored long term, and such cell lines solve problems associated with long term storage of pharmaceutical compositions in a formulation containing a fatty acid ester surfactant by maintaining protein stability and reducing subvisible particle (SVP) formation (see also PCT International Application No. PCT/US 15/54600 filed October 8, 2015, which is hereby incorporated in its entirety into the specification).
[00132] Assays to detect FAH activity include polysorbate degradation measurements. Unpurified protein supernatants or fractions from CHO cells, and supernatant at each step or sequence of steps when subjected to sequential purification steps, is tested for stability of polysorbate, such as polysorbate 20 or 80. The measurement of percent intact polysorbate reported is inversely proportional to the amount of contaminating FAH activity. Other measurements for detection of FAH activity or presence of FAH in a protein sample are known in the art. Detection of FAH protein (e.g., lipase, phospholipase, PLBD2, LPL, LIPA, acylase, ACE) may be done by trypsin digest mass spectrometry.
[00133] It is hypothesized that the instability of the non-ionic detergent, i.e., surfactant such as polysorbate, in a protein (e.g., antibody) formulation contributes to the formation of subvisible particles. Thus, degradation of the polysorbate incurs loss of surfactant activity, and therefore allows the protein to aggregate and form subvisible particles. Additionally or alternatively, the fatty acids released by the degrading sorbitan fatty acid esters may also contribute to subvisible particle formation as immiscible fatty acid droplets. Therefore, levels of subvisible particles≥ 10
micrometers in diameter may be counted in the protein formulation in order to detect esterase or other FAH activity.
[00134] Other assays for detecting FAH, especially esterase activity are known in the art. For example, glycerophospho[3H]choline formation from phosphatidyl[3H]choline following incubation of phosphatidyl[3H]choline and protein supernatant may be determined by thin-layer chromatography (following similar protocols according to Kanoh, H. et al. 1991 Comp Biochem Physiol 102B(2):367- 369).
[00135] SEQ ID NO:32 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of PLBD2, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:32, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:32 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
[00136] The mammalian PLBD2 sequences (nucleic acid and amino acid) are conserved among hamster, human, mouse and rat genomes. Table 3 identifies exemplary mammalian PLBD2 proteins and their degree of homology.
TABLE 3: Amino acid identity of PLBD2 homologs
[00137] In certain embodiments, the targeted disruption of SEQ ID NO:33 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 33-62, 37-56, 40-69, 44-63, 1 10-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
[00138] In another embodiment, the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 33-62, 37-56, 40-69, 44-63, 1 10-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
[00139] In another embodiment, the PLBD2 nucleic acid sequence is at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:33 or target sequence thereof.
[00140] SEQ ID NO:62 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of LPL, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:62, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:62 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
[00141] The mammalian LPL sequences (nucleic acid and amino acid) are conserved among hamster, human, mouse and rat genomes. Table 4 identifies exemplary mammalian LPL proteins and their degree of homology. TABLE 4: Amino acid identity of LPL homologs
[00142] In certain embodiments, the targeted disruption of SEQ ID NO:61 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400- 420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490- 510, 500- 520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590- 610, 600- 620, 610-630, 620-640, 630-650, 465-484, 558-577, and 593-612 of SEQ ID NO:61.
[00143] In another embodiment, the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410- 430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490- 510, 500-520, 510- 530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590- 610, 600-620, 610- 630, 620-640, 630-650, 465-484, 558-577, and 593-612 of SEQ ID NO:61.
[00144] In another embodiment, the LPL nucleic acid sequence is at least about 80% identical, at least about 81% identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:61 or target sequence thereof.
[00145] SEQ ID NO:70 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of LI PA, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:70, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:70 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
[00146] The mammalian LIPA sequences (nucleic acid and amino acid) are conserved among hamster, human, mouse and rat genomes. Table 5 identifies exemplary mammalian LIPA proteins and their degree of homology.
TABLE 5: Amino acid identity of LIPA homologs
[00147] In certain embodiments, the targeted disruption of SEQ ID NO:69 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 180-199, 239-258, and 276-295 of SEQ ID NO:69.
[00148] In another embodiment, the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 180-199, 239-258, and 276-295 of SEQ ID NO:69. [00149] In another embodiment, the LIPA nucleic acid sequence is at least about 70% identical, at least about 71% identical, at least about 72% identical, at least about 73% identical, at least about 74% identical, at least about 75% identical, at least about 76% identical, at least about 77% identical, at least about 78% identical, or at least about 79% identical, at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:69 or target sequence thereof.
[00150] SEQ ID NO:78 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of LPL, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:78, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:78 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
[00151] The mammalian acid ceramidase (ACE) sequences (nucleic acid and amino acid) are conserved among hamster, human, mouse and rat genomes. Table 6 identifies exemplary mammalian ACE proteins and their degree of homology.
TABLE 6: Amino acid identity of ACE homologs
[00152] In certain embodiments, the targeted disruption of SEQ ID NO:77 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 135-154, 237-256, and 332-351 of SEQ ID NO:77.
[00153] In another embodiment, the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 350-370, 360-380, 135-154, 237-256, and 332-351 of SEQ I D NO:77.
[00154] In another embodiment, the ACE nucleic acid sequence is at least about 80% identical, at least about 81% identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:77 or target sequence thereof.
[00155] Cell populations expressing enhanced levels of a protein of interest can be developed using the cell lines and methods provided herein. The isolated commercial protein, protein supernatant or fraction thereof, produced by the cells of the invention have no detectable esterase or esterase activity. Cell pools further modified with exogenous sequence(s) integrated within the genome of the modified cells of the invention are expected to be stable over time, and can be treated as stable cell lines for most purposes. Recombination steps can also be delayed until later in the process of development of the cell lines of the invention.
Genetically Modifying the Target Host Cell Protein
[00156] Methods for genetically engineering a host cell genome in a particular location (i.e., target host cell protein) may be achieved in several ways. Genetic editing techniques were used to modify a nucleic acid sequence in a eukaryotic cell, wherein the nucleic acid sequence is an endogenous sequence normally found in such cells and expressing a contaminant host cell protein. Clonal expansion is necessary to ensure that the cell progeny will share the identical genotypic and phenotypic characteristics of the engineered cell line. In some examples, native cells are modified by a homologous recombination technique to integrate a nonfunctional or mutated target nucleic acid sequence encoding a host cell protein, such as a variant of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and/or SEQ ID NO:77.
[00157] One such method of editing the CHO PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and ASA genomic sequences involves the use of guide RNAs and a type II Cas enzyme to specifically target an exon of PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and/or ASA. Specific guide RNAs directed to particular exons of CHO PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and ASA have been employed (Table 7) in a site-specific nuclease editing method as described herein. Other methods of targeted genome editing, for example nucleases, recombination-based methods, or RNA interference, to modify the FAH genes may be employed for the targeted disruption of the CHO genome.
[00158] In some embodiments, the engineered mammalian host cell (i.e. non-natural cell) comprises one or more disruptions within gene sequences selected from the group consisting of nucleotides 37-63 of SEQ ID NO:33, nucleotides 465-612 of SEQ ID NO:61 , nucleotides 180-295 of SEQ ID NO:69, nucleotides 135-351 of SEQ ID NO:77, nucleotides 249-388 of SEQ ID NO:1 17, nucleotides 1624-2157 of SEQ ID NO:163, nucleotides 372-1399 of SEQ ID NO:1 18, nucleotides 1 155-1600 of SEQ ID NO:1 19, nucleotides 423-615 of SEQ ID NO:120, nucleotides 753-1 141 of SEQ ID NO:121 , nucleotides 311 -581 of SEQ ID NO:164, nucleotides 1 155-1443 of SEQ ID NO:122, and nucleotides 813-1414 of SEQ ID NO:123.
Table 7
[00159] In one aspect, methods and compositions for knockout or downregulation of a nucleic acid molecule encoding an ortholog of a host cell FAH protein having at least 80% identity to SEQ ID NO:33 (PLBD2), SEQ ID NO:61 (LPL), SEQ ID NO:69 (LIPA) and/or SEQ ID NO:77 (ACE); or antibody-binding variant thereof, are via homologous recombination. A nucleic acid molecule encoding an FAH protein (or any protein of interest in general) can be targeted by homologous recombination or by using site-specific nuclease methods that specifically target sequences at the FAH-expressing site of the host cell genome. For homologous recombination, homologous polynucleotide molecules {i.e., homologous arms) line up and exchange a stretch of their sequences. A transgene can be introduced during this exchange if the transgene is flanked by homologous genomic sequences. In one example, a recombinase recognition site can also be introduced into the host cell genome at the integration sites.
[00160] Homologous recombination in eukaryotic cells can be facilitated by introducing a break in the chromosomal DNA at the integration site. Model systems have demonstrated that the frequency of homologous recombination during gene targeting increases if a double-strand break is introduced within the chromosomal target sequence. This may be accomplished by targeting certain nucleases to the specific site of integration. DNA-binding proteins that recognize DNA sequences at the target gene are known in the art. Gene targeting vectors are also employed to facilitate homologous recombination. In the absence of a gene targeting vector for homology directed repair, the cells frequently close the double-strand break by non-homologous end-joining (NHEJ) which may lead to deletion or insertion of multiple nucleotides at the cleavage site. Gene targeting vector construction and nuclease selection are within the skill of the artisan to whom this invention pertains.
[00161] In some examples, zinc finger nucleases (ZFNs), which have a modular structure and contain individual zinc finger domains, recognize a particular 3-nucleotide sequence in the target sequence. Some embodiments can utilize ZFNs with a combination of individual zinc finger domains targeting multiple target sequences. ZFN methods to target disruption of two or more FAH genes are also embodied by the invention.
[00162] Transcription activator-like (TAL) effector nucleases (TALENs) may also be employed for site-specific genome editing. TAL effector protein DNA-binding domain is typically utilized in combination with a non-specific cleavage domain of a restriction nuclease, such as Fokl. In some embodiments, a fusion protein comprising a TAL effector protein DNA-binding domain and a restriction nuclease cleavage domain is employed to recognize and cleave DNA at a target sequence within an exon of the gene encoding the target host cell protein, for example an esterase, such as a phospholipase B-like 2 protein(or other mammalian phospholipase), a lipoprotein lipase and a lysosomal acid lipase, and/or a fatty acylase, such as acid ceramidase (or other mammalian ceramidase). Targeted disruption or insertion of exogenous sequences into a specific exon of the CHO protein encoded by SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and/or SEQ ID NO:77 may be done by employing a TALE nuclease (TALEN) targeted to locations within exon 1 , exon 2, exon 3, etc. of the fatty acid hydrolase genomic DNA (see Tables 6 and 7). The TALEN target cleavage site within the gene sequences {e.g., SEQ ID NOs:33, 61 , 69 and 77) may be selected based on ZiFit.partners.org (ZiFit Targeter Version 4.2) and then TALENs are designed based on known methods (Boch J et al., 2009 Science 326:1509-1512; Bogdanove, A. J. & Voytas, D. F. 201 1 Science 333, 1843-1846; Miller, J. C. et al., 201 1 Nat Biotechnol 29, 143-148). TALEN methods to target disruption of two or more of the PLBD2 gene {e.g., exon 1 or exon 2), LPL gene {e.g., exon 2, 3 or 4), LI PA gene {e.g., exon 1 or 2) and ACE gene {e.g., exon 1 , 3 or 4) are also embodied by the invention.
[00163] RNA-guided endonucleases (RGENs) are programmable genome engineering tools that were developed from bacterial adaptive immune machinery. In this system— the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) immune response— the protein Cas9 forms a sequence-specific endonuclease when complexed with two RNAs, one of which guides target selection. RGENs consist of components (Cas9 and tracrRNA) and a target- specific CRISPR RNA (crRNA). Both the efficiency of DNA target cleavage and the location of the cleavage sites vary based on the position of a protospacer adjacent motif (PAM), an additional requirement for target recognition (Chen, H. et al, J. Biol. Chem. published online March 14, 2014, as Manuscript M1 13.539726). CRISPR-Cas9 methods to target disruption of two or more of the PLBD2 gene (e.g., exon 1 or exon 2), LPL gene (e.g., exon 2, 3 or 4), LIPA gene (e.g., exon 1 or 2) and ACE gene (e.g., exon 1 , 3 or 4) are also embodied by the invention..
[00164] Still other methods of homologous recombination are available to the skilled artisan, such as BuD-derived nucleases (BuDNs) with precise DNA-binding specificities (Stella, S. et al. Acta Cryst. 2014, D70, 2042-2052). A single residue-to-nucleotide code guides the BuDN to the specific DNA target within polynucleotide of interest {e.g., SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and or SEQ ID NO:77).
[00165] Sequence-specific endonucleases, or any homologous recombination technique, may be directed to a target sequence at any one of the exons encoding PLBD2, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003614971 .1 , at: Exon 1 within nucleotides (nt) 175367 to 175644 (SEQ ID NO:47); Exon 2 within nt 168958 to 169051 (SEQ ID NO:48); Exon 3 within nt 166451 to166609 (SEQ ID NO:49); Exon 4 within nt 164966 to 165066 (SEQ ID NO:50); Exon 5 within nt 164564 to164778 (SEQ ID NO:51 ); Exon 6 within nt 162682 to162779 (SEQ ID NO:52); Exon 7 within nt 160036 to160196 (SEQ ID NO:53); Exon 8 within nt 159733 to 159828 (SEQ ID NO:54); Exon 9 within nt 159491 to 159562 (SEQ ID NO:55); Exon 10 within nt 158726 to 158878 (SEQ ID NO:56); Exon 11 within nt 158082 to 158244 (SEQ ID NO:57); or Exon 12 at nucleotides (nt) 157747 to 157914 (SEQ ID NO:58), wherein PLBD2 exons 1 -12 are described on the minus strand gene and the complement of each sequence is also incorporated herewith.
[00166] Sequence-specific endonucleases, or any homologous recombination technique, may be directed to a target sequence at any one of the exons encoding LPL, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003613760.1 , at: Exon 1 within nucleotides (nt) 1257424 to 1257507 (SEQ ID NO:85); Exon 2 within nt 1266450 to 1266610 (SEQ ID NO:86); Exon 3 within nt 1270069 1270248 (SEQ ID NO:87); Exon 4 within nt 1271770 to 1271881 (SEQ ID NO:88); Exon 5 within nt 12283518 12283751 (SEQ ID NO:89); Exon 6 within nt 123715 1273957 (SEQ ID NO:90); Exon 7 within nt 1276672 1276792 (SEQ ID NO:91 ); Exon 8 within nt 1278328 to 1278510 (SEQ ID NO:92); or Exon 9 within nt 1279637 to 1279738 (SEQ ID NO:93), wherein LPL exons 1-9 are described on the minus strand gene and the complement of each sequence is also incorporated herewith.
[00167] Sequence-specific endonucleases, or any homologous recombination technique, may be directed to a target sequence at any one of the exons encoding LI PA, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003614200.1 , at: Exon 1 within nucleotides (nt) 985778 to 985674 (SEQ ID NO:94); Exon 2 within nt 984375 to 984258 (SEQ ID NO:95); Exon 3 within nt 970771 970573 (SEQ ID NO:96); Exon 4 within nt 969327 to 969218 (SEQ ID NO:97); Exon 5 within nt 968139 968003 (SEQ ID NO:98); Exon 6 within nt 961871 to 961725 (SEQ ID NO:99); Exon 7 within nt 960826 to 960755 (SEQ ID NO:100); Exon 8 within nt 955312 to 955241 (SEQ ID NO: 101 ); or Exon 9 within nt 954457 to 954224 (SEQ ID NO: 102), wherein LI PA exons 1-9 are described on the minus strand gene and the complement of each sequence is also incorporated herewith.
[00168] Sequence-specific endonucleases, or any homologous recombination technique, may be directed to a target sequence at any one of the exons encoding ACE, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003613654.1 , at: Exon 1 within nucleotides (nt) 1378167 to 1378244 (SEQ ID NO: 103); Exon 2 within nt 1393746 to 1393792 (SEQ ID NO: 104); Exon 3 within nt 1398208 to 1398298 (SEQ ID NO:105); Exon 4 within nt 1399171 to 1399257 (SEQ ID NO: 106); Exon 5 within nt 1402147 to 1402225 (SEQ ID NO:107); Exon 6 within nt 1404854 to 1404928 (SEQ ID NO:108); Exon 7 within nt 1405714 to 1405759 (SEQ ID NO:109); Exon 8 within nt 1406783 to 1406927 (SEQ ID NO:1 10); Exon 9 within nt 1407840 to 1407894 (SEQ ID NO:11 1 ); Exon 10 within nt 1408004 to 1408085 (SEQ ID NO: 1 12); Exon 1 1 within nt 1409791 to 1409922 (SEQ ID NO:1 13); Exon 12 within nt 1410031 to 1410154 (SEQ ID NO:1 14); Exon 13 within nt 1410568 to 1410624 (SEQ ID NO: 1 15); or Exon 14 at nucleotides (nt) 141 1555 to 141 1644 (SEQ ID NO:1 16), wherein ACE exons 1 -14 are described on the minus strand gene and the complement of each sequence is also incorporated herewith.
[00169] Precise genome modification methods are chosen based on the tools available compatible with unique target sequences within SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 so that disruption of the cell phenotype is avoided.
Proteins of Interest
[00170] Any protein of interest suitable for expression in prokaryotic or eukaryotic cells can be used in the engineered host cell systems provided. For example, the protein of interest includes, but is not limited to, an antibody or antigen-binding fragment thereof, a chimeric antibody or antigen- binding fragment thereof, an ScFv or fragment thereof, an Fc-fusion protein or fragment thereof, a growth factor or a fragment thereof, a cytokine or a fragment thereof, or an extracellular domain of a cell surface receptor or a fragment thereof. Proteins of interest may be simple polypeptides consisting of a single subunit, or complex multisubunit proteins comprising two or more subunits. The protein of interest may be a biopharmaceutical product, food additive or preservative, or any protein product subject to purification and quality standards.
[00171] In some embodiments, the protein product (protein of interest) is an antibody, a human antibody, a humanized antibody, a chimeric antibody, a monoclonal antibody, a multispecific antibody, a bispecific antibody, an antigen binding antibody fragment, a single chain antibody, a diabody, triabody or tetrabody, a Fab fragment or a F(ab')2 fragment, an IgD antibody, an IgE antibody, an IgM antibody, an IgG antibody, an lgG1 antibody, an lgG2 antibody, an lgG3 antibody, or an lgG4 antibody. In one embodiment, the antibody is an lgG1 antibody. In one embodiment, the antibody is an lgG2 antibody. In one embodiment, the antibody is an lgG4 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG4 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG1 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG1/lgG4 antibody.
[00172] In some embodiments, the antibody is selected from the group consisting of an anti- Programmed Cell Death 1 antibody (e.g. an anti-PD1 antibody as described in U.S. Pat. Appln. Pub. No. US2015/0203579A1 ), an anti-Programmed Cell Death Ligand-1 (e.g. an anti-PD-L1 antibody as described in in U.S. Pat. Appln. Pub. No. US2015/0203580A1 ), an anti-DII4 antibody, an anti-Angiopoetin-2 antibody (e.g. an anti-ANG2 antibody as described in U.S. Pat. No.
9,402,898), an anti- Angiopoetin-Like 3 antibody (e.g. an anti-AngPtl3 antibody as described in U.S. Pat. No. 9,018,356), an anti-platelet derived growth factor receptor antibody (e.g. an anti-PDGFR antibody as described in U.S. Pat. No. 9,265,827), an anti-Erb3 antibody, an anti- Prolactin Receptor antibody (e.g. anti-PRLR antibody as described in U.S. Pat. No. 9,302,015), an anti- Complement 5 antibody (e.g. an anti-C5 antibody as described in U.S. Pat. Appln. Pub. No
US2015/0313194A1 ), an anti-TNF antibody, an anti-epidermal growth factor receptor antibody (e.g. an anti-EGFR antibody as described in U.S. Pat. No. 9,132,192 or an anti-EGFRvlll antibody as described in U.S. Pat. Appln. Pub. No. US2015/0259423A1 ), an anti-Proprotein Convertase Subtilisin Kexin-9 antibody (e.g. an anti-PCSK9 antibody as described in U.S. Pat. No. 8,062,640 or U.S. Pat. Appln. Pub. No. US2014/0044730A1 ), an anti-Growth And Differentiation Factor-8 antibody (e.g. an anti-GDF8 antibody, also known as anti-myostatin antibody, as described in U.S. Pat Nos. 8,871 ,209 or 9,260,515), an anti-Glucagon Receptor (e.g. anti-GCGR antibody as described in U.S. Pat. Appln. Pub. Nos. US2015/0337045A1 or US2016/0075778A1 ), an anti-VEGF antibody, an anti-l L1 R antibody, an interleukin 4 receptor antibody (e.g an anti-IL4R antibody as described in U.S. Pat. Appln. Pub. No. US2014/0271681 A1 or U.S. Pat Nos. 8,735,095 or
8,945,559), an anti-interleukin 6 receptor antibody (e.g. an anti-IL6R antibody as described in U.S. Pat. Nos. 7,582,298, 8,043,617 or 9,173,880), an anti-IL1 antibody, an anti-IL2 antibody, an anti-IL3 antibody, an anti-l L4 antibody, an anti-IL5 antibody, an anti-l L6 antibody, an anti-l L7 antibody, an anti-interleukin 33 (e.g. anti- IL33 antibody as described in U.S. Pat. Appln. Pub. Nos.
US2014/0271658A1 or US2014/0271642A1 ), an anti-Respiratory syncytial virus antibody (e.g. anti- RSV antibody as described in U.S. Pat. Appln. Pub. No. US2014/0271653A1 ), an anti-Cluster of differentiation 3 (e.g. an anti-CD3 antibody, as described in U.S. Pat. Appln. Pub. Nos.
US2014/0088295A1 and US20150266966A1 , and in U.S. Application No. 62/222,605), an anti- Cluster of differentiation 20 (e.g. an anti-CD20 antibody as described in U.S. Pat. Appln. Pub. Nos. US2014/0088295A1 and US20150266966A1 , and in U.S. Pat. No. 7,879,984), an anti-CD19 antibody, an anti-CD28 antibody, an anti- Cluster of Differentiation-48 (e.g. anti-CD48 antibody as described in U.S. Pat. No. 9,228,014), an anti-Fel d1 antibody (e.g. as described in U.S. Pat. No. 9,079,948), an anti-Middle East Respiratory Syndrome virus (e.g. an anti-MERS antibody as described in U.S. Pat. Appln. Pub. No. US2015/0337029A1 ), an anti-Ebola virus antibody (e.g. as described in U.S. Pat. Appln. Pub. No. US2016/0215040), an anti-Zika virus antibody, an anti- Lymphocyte Activation Gene 3 antibody (e.g. an anti-LAG3 antibody, or an anti-CD223 antibody), an anti-Nerve Growth Factor antibody (e.g. an anti-NGF antibody as described in U.S. Pat. Appln. Pub. No. US2016/0017029 and U.S. Pat. Nos. 8,309,088 and 9,353,176) and an anti-Activin A antibody. In some embodiments, the bispecific antibody is selected from the group consisting of an anti-CD3 x anti-CD20 bispecific antibody (as described in U.S. Pat. Appln. Pub. Nos.
US2014/0088295A1 and US20150266966A1 ), an anti-CD3 x anti-Mucin 16 bispecific antibody (e.g., an anti-CD3 x anti-Muc16 bispecific antibody), and an anti-CD3 x anti- Prostate-specific membrane antigen bispecific antibody (e.g., an anti-CD3 x anti-PSMA bispecific antibody). In some embodiments, the protein of interest is selected from the group consisting of alirocumab, sarilumab, fasinumab, nesvacumab, dupilumab, trevogrumab, evinacumab, and rinucumab. All publications mentioned throughout this disclosure are incorporated herein by reference in their entirety.
[00173] In some embodiments, the protein of interest is a recombinant protein that contains an Fc moiety and another domain, (e.g., an Fc-fusion protein). In some embodiments, an Fc-fusion protein is a receptor Fc-fusion protein, which contains one or more extracellular domain(s) of a receptor coupled to an Fc moiety. In some embodiments, the Fc moiety comprises a hinge region followed by a CH2 and CH3 domain of an IgG. In some embodiments, the receptor Fc-fusion protein contains two or more distinct receptor chains that bind to either a single ligand or multiple ligands. For example, an Fc-fusion protein is a TRAP protein, such as for example an IL-1 trap (e.g., rilonacept, which contains the IL-1 RAcP ligand binding region fused to the 11-1 R1 extracellular region fused to Fc of hlgG1 ; see U.S. Pat. No. 6,927,004, which is herein incorporated by reference in its entirety), or a VEGF trap (e.g., aflibercept or ziv-aflibercept, which contains the Ig domain 2 of the VEGF receptor Flt1 fused to the Ig domain 3 of the VEGF receptor Flk1 fused to Fc of hlgG1 ; see U.S. Pat. Nos. 7,087,41 1 and 7,279,159). In other embodiments, an Fc-fusion protein is a ScFv-Fc-fusion protein, which contains one or more of one or more antigen-binding domain(s), such as a variable heavy chain fragment and a variable light chain fragment, of an antibody coupled to an Fc moiety.
Host Cells and Transfection
[00174] The host cells used in the methods of the invention are eukaryotic host cells including, for example, Chinese hamster ovary (CHO) cells, human cells, rat cells and mouse cells. In a preferred embodiment, the invention provides a cell comprising a disrupted nucleic acid sequence fragment of SEQ ID NO:33 and at least one more of SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77.
[00175] The invention includes an engineered mammalian host cell further transfected with an expression vector comprising an exogenous gene of interest, such gene encoding the
biopharmaceutical product. While any mammalian cell may be used, in one particular embodiment the host cell is a CHO cell.
[00176] Transfected host cells include cells that have been transfected with expression vectors that comprise a sequence encoding a protein or polypeptide. Expressed proteins will preferably be secreted into the culture medium for use in the invention, depending on the nucleic acid sequence selected, but may be retained in the cell or deposited in the cell membrane. Various mammalian cell culture systems can be employed to express recombinant proteins. Other cell lines developed for specific selection or amplification schemes will also be useful with the methods and compositions provided herein, provided that at least two genes encoding a different fatty acid hydrolase (FAH) having at least 80% homology to at least two of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 have been downregulated, knocked out or otherwise disrupted in accordance with the invention. An embodied cell line is the CHO cell line designated K1 . To achieve high volume production of recombinant proteins, the host cell line may be pre-adapted to bioreactor medium in the appropriate case.
[00177] Several transfection protocols are known in the art, and are reviewed in Kaufman (1988) Meth. Enzymology 185:537. The transfection protocol chosen will depend on the host cell type and the nature of the GOI, and can be chosen based upon routine experimentation. The basic requirements of any such protocol are first to introduce DNA encoding the protein of interest into a suitable host cell, and then to identify and isolate host cells which have incorporated the
heterologous DNA in a relatively stable, expressible manner.
[00178] One commonly used method of introducing heterologous DNA into a cell is calcium phosphate precipitation, for example, as described by Wigler ef al. (Proc. Natl. Acad. Sci. USA 77:3567, 1980). DNA introduced into a host cell by this method frequently undergoes
rearrangement, making this procedure useful for cotransfection of independent genes.
[00179] Polyethylene-induced fusion of bacterial protoplasts with mammalian cells (Schaffner et al., (1980) Proc. Natl. Acad. Sci. USA 77:2163) is another useful method of introducing heterologous DNA. Protoplast fusion protocols frequently yield multiple copies of the plasmid DNA integrated into the mammalian host cell genome, and this technique requires the selection and amplification marker to be on the same plasmid as the GOI.
[00180] Electroporation can also be used to introduce DNA directly into the cytoplasm of a host cell, for example, as described by Potter et al. (Proc. Natl. Acad. Sci. USA 81 :7161 , 1988) or Shigekawa et al. (BioTechniques 6:742, 1988). Unlike protoplast fusion, electroporation does not require the selection marker and the GOI to be on the same plasmid.
[00181] Other reagents useful for introducing heterologous DNA into a mammalian cell have been described, such as Lipofectin™ Reagent and Lipofectamine™ Reagent (Gibco BRL, Gaithersburg, Md.). Both of these commercially available reagents are used to form lipid-nucleic acid complexes (or liposomes) which, when applied to cultured cells, facilitate uptake of the nucleic acid into the cells.
[00182] Methods for amplifying the GOI are also desirable for expression of the recombinant protein of interest, and typically involves the use of a selection marker (reviewed in Kaufman supra). Resistance to cytotoxic drugs is the characteristic most frequently used as a selection marker, and can be the result of either a dominant trait (e.g., can be used independent of host cell type) or a recessive trait (e.g., useful in particular host cell types that are deficient in whatever activity is being selected for). Several amplifiable markers are suitable for use in the cell lines of the invention and may be introduced by expression vectors and techniques well known in the art (e.g., as described in Sambrook, Molecular Biology: A Laboratory Manual, Cold Spring Harbor Laboratory, NY, 1989; pgs 16.9-16.14).
[00183] Useful selectable markers and other tools for gene amplification such as regulatory elements, described previously or known in the art, can also be included in the nucleic acid constructs used to transfect mammalian cells. The transfection protocol chosen and the elements selected for use therein will depend on the type of host cell used. Those of skill in the art are aware of numerous different protocols and host cells in order to adapt the invention for a particular use, and can select an appropriate system for expression of a desired protein, based on the
requirements of the cell culture system.
[00184] In specific embodiments, the invention relates to the following items:
[00185] 1. A recombinant host cell comprising a modification in two or more genes encoding two or more fatty acid hydrolases (FAH).
[00186] 2. The recombinant host cell according to item 1 , wherein the two or more FAHs are selected from the group consisting of phospholipase B-like 2 (PLBD2) protein, lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and acid ceramidase (ACE).
[00187] 3. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00188] a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
[00189] b. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification.
[00190] 4. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00191] a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and [00192] b. a modification in a coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LI PA in a cell lacking the modification.
[00193] 5. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00194] a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
[00195] b. a modification in a coding sequence of a polynucleotide encoding the ACE, wherein the modification decreases the expression level of the ACE relative to the expression level of ACE in a cell lacking the modification.
[00196] 6. The recombinant host cell according to any of previous items, wherein the cell further comprises a modification in a coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LI PA in a cell lacking the modification.
[00197] 7. The recombinant host cell according to any of previous items, wherein the cell further comprises a modification in a coding sequence of a polynucleotide encoding the ACE, wherein the modification decreases the expression level of the ACE relative to the expression level of ACE in a cell lacking the modification.
[00198] 8. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00199] a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and
[00200] b. a modification in a coding sequence of a polynucleotide encoding the LI PA protein, wherein the modification decreases the expression level of the LI PA protein relative to the expression level of a LI PA protein in a cell lacking the modification.
[00201] 9. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00202] a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and [00203] b. a modification in a coding sequence of a polynucleotide encoding the ACE protein, wherein the modification decreases the expression level of the ACE protein relative to the expression level of a ACE protein in a cell lacking the modification.
[00204] 10. The recombinant host cell according to any of previous items, wherein the cell further comprises a modification in a coding sequence of a polynucleotide encoding the LI PA protein, wherein the modification decreases the expression level of the LI PA protein relative to the expression level of a LI PA protein in a cell lacking the modification.
[00205] 1 1. The recombinant host cell according to any of previous items, wherein all alleles of the coding sequence of the two or more genes encoding the two or more FAHs comprise the modification.
[00206] 12. The recombinant host cell according to any of previous items, wherein the cell does not express detectable levels of PLBD2 protein, LPL, LIPA and/or ACE.
[00207] 13. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within exon 1 of the coding sequence of the
polynucleotide encoding the PLBD2 protein.
[00208] 14. The recombinant host cell according to any of previous items, wherein the PLBD2 protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:32.
[00209] 15. The recombinant host cell according to any of previous items, wherein the PLBD2 protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:35, and SEQ ID NO:36.
[00210] 16. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:47.
[00211] 17. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within exon 2, exon 3, or exon 4 of the coding sequence of the polynucleotide encoding the LPL protein.
[00212] 18. The recombinant host cell according to any of previous items, wherein the LPL protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:62.
[00213] 19. The recombinant host cell according to any of previous items, wherein the LPL protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:62, SEQ ID NO:66, SEQ ID NO:67, and SEQ ID NO:68.
[00214] 20. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:86, SEQ ID NO:87 or SEQ ID NO:88.
[00215] 21. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within exon 1 or exon 2 of the coding sequence of the polynucleotide encoding the LI PA protein.
[00216] 22. The recombinant host cell according to any of previous items, wherein the LI PA protein comprises an amino acid sequence at least 70% identical to SEQ ID NO:70.
[00217] 23. The recombinant host cell according to any of previous items, wherein the LI PA protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:70, SEQ ID NO:74, SEQ ID NO:75, and SEQ ID NO:76.
[00218] 24. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:94 or SEQ ID NO:95.
[00219] 25. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within exon 1 , exon 3 or exon 4 of the coding sequence of the polynucleotide encoding the ACE protein.
[00220] 26. The recombinant host cell according to any of previous items, wherein the ACE protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:78.
[00221] 27. The recombinant host cell according to any of previous items, wherein the ACE protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:78, SEQ ID NO:82, SEQ ID NO:83, and SEQ ID NO:84.
[00222] 28. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:103, SEQ ID NO:105 or SEQ ID NO:106.
[00223] 29. The recombinant host cell according to any of previous items further comprising a polynucleotide encoding an exogenous protein of interest.
[00224] 30. The recombinant host cell according to any of previous items, wherein the exogenous protein of interest is selected from the group consisting of an antibody heavy chain, an antibody light chain, an antigen-binding fragment, an antigen-binding protein, and an Fc-fusion protein.
[00225] 31. The recombinant host cell according to any of previous items, wherein the cell produces a Protein A-binding fraction having no detectable fatty acid hydrolase activity.
[00226] 32. A method of producing a recombinant protein comprising the steps of: (a) obtaining a sample comprising a recombinant protein and a plurality of host cell proteins from a host cell that is modified to produce reduced levels of fatty acid hydrolase compared to an unmodified cell; and (b) subjecting the sample to at least one purification step to remove at least one host cell protein.
[00227] 33. The method according to any of items 31 -32, wherein the plurality of host cell proteins
[00228] a. does not comprise a detectable amount of a phospholipase B-like 2 (PLBD2) protein; and
[00229] b. does not comprise a detectable amount of at least one other fatty acid hydrolase.
[00230] 34. The method according to any of items 31 -33, wherein the host cell comprises:
[00231] a. a modification in a coding sequence of a polynucleotide encoding a phospholipase B- like 2 (PLBD2) protein; and
[00232] b. a modification in a coding sequence of a polynucleotide encoding a fatty acid hydrolase selected from the group consisting of lipoprotein lipase (LPL), lysosomeal acid lipase (LI PA), acid ceramidase (ACE), or a combination thereof.
[00233] 35. The method according to any of items 31 -34, wherein the purification step is selected from the group consisting of protein A affinity (PA) chromatography, cation exchange (CEX) chromatography, and anion exchange (AEX) chromatography.
[00234] 36. The method according to any of items 31 -35, wherein the purification step does not comprise hydrophobic interaction chromatography.
[00235] 37. A process for reducing fatty acid hydrolase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein; (b) modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and/or acid ceramidase (ACE); (c) transfecting the host cell with a polynucleotide encoding a protein of interest; (d) extracting a protein fraction comprising the protein of interest from the host cell; (e) contacting the protein fraction with a chromatography media selected from the group consisting of protein A affinity (PA) media, cation exchange (CEX) media, and anion exchange (AEX) media; (f) collecting the protein of interest from the media; and (g) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer.
[00236] 38. The process according to item 37, wherein the step of modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein comprises inserting or deleting at least one nucleotide within exon 1 of a polynucleotide encoding the PLBD2 protein.
[00237] 39. The process according to any of items 37-38, wherein the polynucleotide encoding the PLBD2 protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:32. [00238] 40. The process according to any of items 37-39, wherein the step of modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL) comprises inserting or deleting at least one nucleotide within exon 2, exon 3 or exon 4 of a polynucleotide encoding the LPL protein.
[00239] 41. The process according to any of items 37-40, wherein the polynucleotide encoding the LPL protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:62.
[00240] 42. The process according to any of items 37-41 , wherein the step of modifying the host cell to decrease or ablate the expression of lysosomal acid lipase (LI PA) comprises inserting or deleting at least one nucleotide within exon 1 or exon 2 of a polynucleotide encoding the LI PA protein.
[00241] 43. The process according to any of items 37-42, wherein the polynucleotide encoding the LIPA protein comprises a nucleic acid sequence that is at least 70% identical to SEQ ID NO:70.
[00242] 44. The process according to any of items 37-43, wherein the step of modifying the host cell to decrease or ablate the expression of acid ceramidase (ACE) comprises inserting or deleting at least one nucleotide within exon 1 , exon 3 or exon 4 of a polynucleotide encoding the ACE protein.
[00243] 45. The process according to any of items 37-44, wherein the polynucleotide encoding the ACE protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:78.
[00244] 46 . A composition comprising one or more recombinant proteins obtainable by the method according to any of items 31-36 or the process according to any of items 37-45.
[00245] 47. The composition obtainable according to item 46, wherein the composition is stable.
[00246] 48. The composition obtainable according to any of items 46-47, wherein the stable composition is characterized by one or more of:
[00247] (i) having an adverse enzyme activity of at the most [....] .
[00248] (ii) having a shelf-life of at least [XX Days- YY weeks- ZZ years]
[00249] (iii) having a SVP count of below [....]
[00250] 49. The composition obtainable according to any of items 46-48, wherein the adverse enzyme activity is the activity of one or more of esterases, hydrolases, lipases, phospholipases, ceramidases.
[00251] 50. A composition comprising one or more recombinant proteins, wherein the composition is stable. [00252] 51. The composition according to item 50, wherein the stable composition is characterized by one or more of:
[00253] (i) having an adverse enzyme activity of at the most [....] . [00254] (ii) having a shelf-life of at least [XX Days- YY weeks- ZZ years] [00255] (iii) having a SVP count of below [....]
[00256] 52. The composition according to any of items 50-51 , wherein the adverse enzyme activity is the activity of one or more of esterases, hydrolases, lipases, phospholipases, ceramidases.
[00257] 53. The composition according to any of items 50-52, wherein the recombinant protein is selected form the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein, or any combinations thereof.
[00258] Other features of the invention will become apparent in the course of the following descriptions of exemplary embodiments which are given for illustration of the invention and are not intended to be limiting thereof.
Examples
[00259] The following examples are put forth so as to provide those of ordinary skill in the art how to make and use the methods and compositions described herein, and are not intended to limit the scope of the invention. Efforts have been made to ensure accuracy with respect to numbers used {e.g., amount, temperature, etc.) but some experimental error and deviation should be accounted for. Unless indicated otherwise, parts are parts by weight, molecular weight is average molecular weight, temperature is in degrees Centigrade, and pressure is at or near atmospheric.
Example 1. Targeted Disruption of an Esterase Gene in the Host Cell
[00260] To employ disruption of the target esterase gene, i.e., phospholipase B-like 2 gene, of a CHO cell origin, a Type II CRISPR/Cas system which requires at least 20 nucleotides (nt) of homology between a chimeric RNA (i.e., guide RNA) and its genomic target was used. Guide RNA sequences were designed for specific targeting of an exon within the CHO phospholipase B-like 2 (PLBD2) nucleic acid (SEQ ID NO:33) and are considered unique (to minimize off-target effects in the genome). Multiple small guide RNAs (sgRNA) were synthesized for use in the genome editing procedure targeting the following genomic segments of PLBD2 listed in Table 8. Table 8
[00261] The sgRNA expression plasmid (System Biosciences, CAS940A-1 ) contains a human H1 promoter that drives expression of the small guide RNA and the tracrRNA following the sgRNA. Immortalized Chinese hamster ovary (CHO) cells were transfected with the plasmid encoding Cas9- H1 enzyme followed by one of the sgRNA sequences, for instance sgRNAI (SEQ ID NO:45) or sgRNA2 (SEQ ID NO:46), designed to target the first exon of CHO PLBD2. sgRNAI and sgRNA2 were predicted to generate a double strand break (DSB) at or around nucleotides 53 and 59 of SEQ ID NO:33, respectively. A DSB was therefore predicted to occur approx. 23 or 29 nucleotides downstream of the PLBD2 start codon. (Note that nucleotides 1-30 of SEQ ID NO:33 encode a signal peptide.) A negative control transfection was performed where the parental CHO line was transfected with the plasmid encoding Cas9-H1 enzyme without a proceeding sgRNA, or an sgRNA encoding a gene sequence not present in the CHO genome.
[00262] Following transfection, cells were cultured for 6 days in serum-free medium, and then were single cell cloned using flow cytometry. After 12 days in culture, stable clones with desirable growth properties were isolated, expanded in serum-free medium, cell pellets were collected for genotyping and clonal cell lines were banked.
[00263] Genomic DNA (gDNA) and messenger RNA (mRNA) were isolated from the clonal cell pellets and analyzed by quantitative PCR (qPCR). qPCR primers and probes were designed to overlap with the sgRNA sequence used for the double strand break targeting event, in order to detect disruption of the genomic DNA and its transcription. The relative abundance of PLBD2 gene or transcript in the candidate clones was determined using relative qPCR method, where the clones derived from the negative control transfection were used as a calibrator. See Figure 1. The qPCR primers and probes were designed to detect sequences either in the sgRNAI or sgRNA2 position in PLBD2 exon 1 . Both gDNA and RNA isolated from clone 1 failed to support qPCR amplification of PLBD2 exon 1 in either sgRNAI or sgRNA2 regions, but amplification of the housekeeping gene, GAPDH, was detected. Based on this data, clone 1 was identified as a potential knock out of PLBD2 in which both genomic alleles of PLBD2 of exonl were disrupted. It is noted that amplification of genomic DNA and mRNA was not detected in Clone 8 using primers overlapping with sgRNA2, however, sgRNAI primers/probes detected genomic DNA above control values. Clone 8, and others were further analyzed in order to understand the performance of the site- directed nuclease method.
[00264] The size of the entire PLBD2 exon 1 in clone 1 was analyzed by PCR from either gDNA or RNA derived templates and compared to that amplified from the wild type CHO cells. The length of amplicon fragments was determined using Caliper GX instrument (Figure 2). Both gDNA and mRNA amplification from clone 1 resulted in a single PCR fragment which was shorter than the one amplified from the wild type control cells.
[00265] The amplification products were sequenced, resulting in Clone 1 being identified as PLBD2 knock out, in which PLBD2 gene was found to have 1 1 bp deletion resulting in frameshift.
[00266] The inventors also unexpectedly identified Clone 8 as a PLBD2 knockout despite the fact that genomic DNA fragments were identified by qPCR primers overlapping with the sgRNAI sequence. The identification of a clone that has no detectable phospholipase activity or no detectable phospholipase protein was technically challenging and time-consuming. Site-directed nuclease techniques may provide an ease-of-use, however, careful screening and elimination of false positives is necessary and still there may be unpredictable outcomes with regard to the identity of a single clone having two disrupted alleles. Surprisingly, only 1 % of the clones screened using the techniques described above were identified as viable PLBD2 knockout clones. See Table 9.
Table 9
Example 2. Introduction and Expression of a monoclonal antibody (mAbl) in the candidate clonal cell lines
[00267] Clone 1 and the wild type control host cell line were transfected with plasmids encoding the light and heavy chains of mAbl , a fully human IgG, in the presence of Cre recombinase to facilitate recombination mediated cassette exchange (RMCE) into EESYR locus (US Patent No. 7771997B2, issued August 10, 2010). The transfected cultures were selected for 1 1 days in serum-free medium containing 400 ug/mL hygromycin. Cells that underwent RMCE, were isolated by flow cytometry. PLBD2 knock out clone 1 and the wild type host cell line produced equivalent observed
recombinant population (data not shown). The clone 1 derived isogenic cell line expressing mAbl was designated RS001 , and the mAbl expressing cell line originated from the PLBD2 wild type host was designated RS0WT.
[00268] Fed-batch production of mAbl from RS001 or RS0WT was carried out in a standard 12 day process. The conditioned medium for each production culture was sampled at Day 2, 4, 6, 9 and 12 and the Protein-A binding fraction was quantified (Figure 3). Protein titer of mAbl from RS001 culture was comparable to that produced from RS0WT, and unexpectedly there were no observable differences in the behavior of the cells with respect to the two cultures. It cannot be predicted whether disruption of PLBD2 or any endogenous gene in a CHO host cell would have no observable deleterious effect on production of an exogenous recombinant protein, especially a therapeutic monoclonal antibody. Example 3. Esterase Activity Detection in unmodified CHO cells
[00269] Polysorbate 20 or polysorbate 80 degradation was measured to detect putative esterase activity in the supernatants of PLBD2 mutants. Unpurified protein supernatant from CHO cells, and supernatant taken at each step or sequence of steps when subjected to sequential purification steps, was tested for stability of polysorbate. The percent intact polysorbate reported was inversely proportional to the amount of contaminant esterase activity. Unpurified protein supernatant from CHO cells, and supernatant at each step or sequence of steps when subjected to sequential purification steps, was tested in assays measuring polysorbate degradation. The relative levels of intact polysorbate reported is inversely proportional to levels of contaminant esterase activity.
[00270] Degradation of polysorbate 20 was examined to determine the etiological agent responsible for polysorbate 20 degradation in a monoclonal antibody formulation. The buffered antibody (150 mg/mL) was separated into two fractions by 10 kDa filtration: a protein fraction, and a buffer fraction. These two fractions, as well as intact buffered antibody, were spiked with 0.2% (w/v) of super refined polysorbate 20 (PS20-B) and stressed at 45°C for up to 14 days. The study showed (Table 10, part A, columns 1-2) that the protein fraction, not the buffer fraction, had an effect on the degradation of sorbitan laurate (i.e., the major component of polysorbate 20), and that the degradation of polysorbate 20 was correlated with the concentration of the antibody (Table 10, part B, columns 3-4).
Table 10 (part A) Table 10 (part B)
[00271] Monoclonal antibody was produced in an unmodified CHO cell and purified by different processes according to Table 1 1 , and the esterase activity measured by percent intact polysorbate 20, as in Table 9.
Table 11
[00272] Hydrophobic interaction chromatography (HIC) was most efficient at removing residual PLBD2. In some circumstances, a reduction in the number of purification steps and lower cost could be realized. Therefore, it was contemplated that a modified CHO cell having reduced levels of expression of phospholipase reduces the purification steps, and e.g., may eliminate the need for HIC purification.
Example 4. Esterase protein abundance and activity detection in mAbl purified from modified compared to unmodified CHO cells.
[00273] mAbl was produced from RS001 and RSOWT and purified from the conditioned media using either PA alone, or PA and AEX chromatography The PA-purified mAB1 from RS001 and RSOWT were analyzed for lipase abundance using trypsin digest mass spectrometry. The trypsin digests of RS001 and RSOWT mAbl were injected into a reverse phase liquid chromatography column coupled to a triple quadrupole mass spectrometer set to monitor a specific product ion fragmented from SEQ ID NO:32. In parallel, a series of PLBD2 standards were prepared by spiking in varying amounts of recombinant PLBD2 into mAbl with no endogenous PLBD2. The signals of the experimental and control reactions were used to quantify the abundance of PLBD2 in mAbl from RS001 and RSOWT (Figure 4). No detectable amounts of PLBD2 protein were observed in the purified samples of Clone 8-produced mAbl when purified with PA alone (data not shown).
Example 5. Identification of Fatty Acid Hydrolases
Preparation of the HIC Strip Fraction
[00274] A hydrophobic interaction column (Phenyl Sepharose® High Performance [GE Healthcare, Little Chalfont, Buckinghamshire, UK]) was used to generate a "HIC strip" fraction containing a protein of interest {i.e., mAb2) and associated host cell proteins. The column was first equilibrated with two column volumes (CV) of buffer containing 40 mM Tris, 200 mM citrate at pH8.0. The monoclonal antibody-containing load material from an anion exchange pool ("Q pool") was adjusted to 200 mM sodium citrate, pH8.0, then loaded onto the column at a loading amount of 20-40 grams of protein per liter of the phenyl sepharose resin. After loading, the column was washed with six CVs of 40 mM Tris, 200 mM citrate, at pH8.0. Then the column was stripped with three CVs of reverse osmosis deionized water. The stripped fraction was collected for subsequent analysis.
Trypsin digestion
[00275] A 50 pg aliquot of the mAb2-containing HIC strip sample was denatured and reduced in a solution containing 5 mM acetic acid and 5 mM ir/'s(2-carboxyethyl)phosphine-HCI by heating at 80°C for 10 minutes. The sample was then diluted in 50 mM Tris-HCI buffer (pH 8.0) and alkylated with 1 .5 mM iodoacetamide (IAA) and digested with trypsin (modified, sequencing grade from Promega, Madison, Wl) with an enzyme to substrate ratio of 1 :20 (w/w) at 37°C in the dark for three hours. The digestion was then stopped by addition of 10% trifluoroacetic acid (TFA).
NanoLC/MS
[00276] Aliquots (3 μg) of each digested mAb2-containing HIC strip was injected to an Acclaim™ PepMap™ nanoViper C18 trap column (75 pm x 2 cm) on a Thermo EASY-nLC™ system (Thermo Scientific, Waltham, MA) at a flow rate of 2 pL/minute and washed for 15 minutes. Then the peptides were eluted onto the Acclaim™ PepMap™ RSLC nanoViper C18 analytical column (75 pm x 25 cm) which was equilibrated with 99% mobile phase A (0.1% FA in water) prior to sample injection at a flow rate of 250 nL minute. Peptides were separated using a linear gradient from 1 % mobile phase B (0.1 % FA in acetonitrile) to 7% mobile phase B for the first 5 minutes, followed by a second linear increase from 7% to 27% mobile phase B over the next 1 10 minutes, and another subsequent linear increase from 27%-40% in 10 minutes and a final increase to 90% in 5 minutes. The gradient was held at 90% for 20 minutes. A Thermo Q Exactive™ Plus mass spectrometer (Thermo Scientific, Waltham, MA) was used for peptide mass analyses, with high-energy collisional dissociation (HCD) employed for peptide fragmentation for MS/MS experiments.
Data analysis
[00277] The mass spectra were processed using Thermo Xcalibur™ (version 2.2.42) (Thermo Scientific, Waltham, MA). Proteome Discoverer (version 1.4) (Thermo Scientific, Waltham, MA) was also used to perform the peptide identification using both Mascot and Sequent search engines. The peptide spectra from the Proteome Discovery was manually examined to confirm the spectral assignment and protein identification.
Results
[00278] Lipoprotein lipase (UniProtein ID: P06858), lysosomal acid lipase (UniProtein ID: P38571 ) and acid ceramidase (UniProtein ID: Q13510) were identified as potential active fatty acid hydrolases by more than three unique peptides per protein in the mAb2-containing HIC strip fraction.
Example 6. Targeted Disruption of a Lipoprotein Lipase (LPL) Gene
[00279] Clone 1 , in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) were transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lipoprotein lipase (LPL) gene (SEQ ID NO:61 ): sgRNA3 (SEQ ID NO:63), sgRNA4 (SEQ ID NO:64) or sgRNA5 (SEQ ID NO:65) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette was used in some
experiments. In other experiments, two separate plasmids were used, which confered flexibility for combining Cas9 with different sgRNAs. Cre recombinase was co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/0115502A1 , published April 28, 2016). The transfected cultures were selected for 1 1 days in serum-free medium containing 400 μg mL hygromycin. Cells that underwent RMCE were isolated by flow cytometry. The desired knock out genotype was confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing only the LPL knock-out or knock-down loci is designated as "clone 9". The resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 10".
[00280] The wild-type host cell, Clone 1 (PLBD2-KO), Clone 9 (LPL-KO) and Clone 10
(LPL+PLBD2 KO) were cultured in 2L bioreactors under fed-batch conditions at 36.5°C (pH 6.9-7.4) with dissolved oxygen. Cells were inoculated in the production bioreactor (i.e. transferred from a seed train culture (N-1 ) at a cell titer of 5.0 x 10s— 7.0 x 106 cells/mL) and protein production was induced by the addition of doxycycline. Cells were cultured for 14 days in chemically defined media supplemented with nutrient feeds as needed during the batch culture up to day 12. Cell viability (Figure 5), titer (Figure 6), and viable cell count (VCC) (Figure 7) were monitored throughout the batch process. Samples of cells were harvested and subjected to purification via protein A chromatography for titer determination (Hober, S., Nord, K., and Linhult, M., "Protein A
Chromatography for Antibody Purification," Journal of Chromatography B, 848 (2007) 40-47; and Lin et al., "Protein A Affinity Column for Monoclonal Antibody (MAb) Titer Analysis," Thermo Scientific Poster Note PN20806_E 06/13S, available from https://tools.thermofisher.com/.../PN- 20806-Protein-Affinity-Column-Monoclonal- Antibody-Titer-Analysis-PN20806- EN.pdf&usg=AFQjCNFHkixB9Um020cT4nG1 UpTzXwON9Q, version avaialble Aug 23, 2017).
[00281] Additional LPL-KO clones were generated according to the methods provided herein. For example, Figure 8 depicts an alignment of LPL-KO clones 19-22 (SEQ ID NOs:159-162, respectively) compared to a partial Chinese hamster LPL sequence (SEQ ID NO:158) showing the gaps in the respective clone sequences.
[00282] To construct the LPL-KO clones (as well as any other fatty acid hydrolase knock-out clone), plasmids encoding for Cas9 nuclease, eYFP and site-specific sgRNA were stably integrated into the CHO genome using Lipofectamine-based transfection protocol followed by selection for neomycin resistant cells. Seventeen days post transfection, the YFP positive cells were enriched by flow cytometry prior to single cell-sorting. After a 21 -day expansion, mRNA was isolated and the single cell clones were analyzed by qPCR for the presence of gene disruption.
[00283] 1 , 2 or 3 different sgRNA expression cassettes (the cassettes include promoter, sgRNA, terminator) may be placed in the same plasmid. Some constructs are manufactured using at least 2 sgRNA cassettes per lipase in a single plasmid.
Example 7. Targeted Disruption of a Lysosomal Acid Lipase (LIPA) Gene
[00284] Clone 1 , in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) are transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lysosomal acid lipase (LIPA) gene (SEQ ID NO:69): sgRNA6 (SEQ ID NO:71 ), sgRNA7 (SEQ ID NO:72) or sgRNA8 (SEQ ID NO:73) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs. Cre recombinase is co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent
Application Publication No. 2016/01 15502A1 , published April 28, 2016). The transfected cultures are selected for 11 days in serum-free medium containing 400 pg/mL hygromycin. Cells that undergo RMCE are isolated by flow cytometry. The desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing only the LIPA knock-out or knock-down loci is designated as "clone 1 1 ".The resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 12".
Example 8. Targeted Disruption of an Acid Ceramidase (ACE) Gene
[00285] Clone 1 , in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) are transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the acid ceramidase (ACE) gene (SEQ ID NO:77): sgRNA9 (SEQ ID NO:79), sgRNAI O (SEQ ID NO:80) or sgRNA11 (SEQ ID NO:81 ) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs. Cre recombinase is co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/0115502A1 , published April 28, 2016). The transfected cultures are selected for 1 1 days in serum-free medium containing 400 pg/mL hygromycin. Cells that undergo RMCE are isolated by flow cytometry. The desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing only the ACE knock-out or knock-down loci is designated as "clone 13". The resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 14".
Example 9. Targeted Disruption of a Lipoprotein Lipase (LPL) Gene and Lysosomal Acid
Lipase (LIPA)
[00286] Clone 9, in which the LPL gene and is modified, and Clone 10, in which both the LPL gene and the PLBD2 gene are modified, are transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lysosomal acid lipase (LIPA) gene (SEQ ID NO:69): sgRNA6 (SEQ ID NO:71 ), sgRNA7 (SEQ ID NO:72) and/or sgRNA8 (SEQ ID NO:73) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs. Cre recombinase is co- transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016). The transfected cultures are selected for 1 1 days in serum-free medium containing 400 μg/mL hygromycin. Cells that undergo RMCE are isolated by flow cytometry. The desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing the LIPA modified locus and the LPL modified locus designated as "clone 15".The resultant clone containing the PLBD2, LPL and LIPA modified loci is designated as "clone 16". Example 10. Targeted Disruption of a Lipoprotein Lipase (LPL) Gene, a Lysosomal Acid Lipase (LIPA) Gene and an Acid Ceramidase (ACE) Gene
[00287] Clone 15, in which the LPL and LIPA genes and are modified, and Clone 16, in which the PLBD2, LPL and LIPA genes are modified, are transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the acid ceramidase (ACE) gene (SEQ ID NO:77): sgRNA9 (SEQ ID NO:79), sgRNAI O (SEQ ID NO:80) or sgRNA1 1 (SEQ ID NO:81 ) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs. Cre recombinase is co- transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016). The transfected cultures are selected for 1 1 days in serum-free medium containing 400 μg/mL hygromycin. Cells that undergo RMCE are isolated by flow cytometry. The desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing the LIPA modified locus, the LPL modified locus and the ACE modified locus is designated as "clone 17".The resultant clone containing the PLBD2 modified locus, the LIPA modified locus, the LPL modified locus and the ACE modified locus is designated as "clone 18".
[00288] The present invention may be embodied in other specific embodiments.
HOST CELL PROTEIN MODIFICATION
FIELD
[0001] The invention provides for cells and methods for expression and purification of recombinant proteins in eukaryotic cells. In particular, the invention includes methods and compositions for expression of proteins in eukaryotic cells, particularly Chinese hamster (Cricetulus griseus) cell lines, that employ downregulating gene expression of endogenous proteins in order to control production of such unwanted "sticky" host cell proteins. The invention includes polynucleotides and modified cells that facilitate purification of an exogenous recombinant protein of interest. The methods of the invention efficiently target host cell proteins in the Chinese hamster cellular genome in order to facilitate enhanced and stable expression of recombinant proteins expressed by the modified cells.
BACKGROUND
[0002] Cellular expression systems aim to provide a reliable and efficient source for the
manufacture of biopharmaceutical products for therapeutic use. Purification of any recombinant protein produced by either eukaryotic or prokaryotic cells in such systems is an ongoing challenge due to, for example, the plethora of host cell proteins and nucleic acid molecules that need to be eliminated from the final pharmaceutical grade product.
[0003] Certain dynamics of host cell proteins (HCPs), viewed as impure byproducts, have been surveyed during various stages of bioprocessing. Advanced liquid chromatography/mass spectrometry (LC/MS) was done to detect and monitor E. coli HCPs accompanying peptibodies produced by cell culture (Schenauer, MR., et al, 2013, Biotechnol Prog 29(4):951-7). The information obtained by HCP profiles is useful for monitoring process development and assessing quality and purity of the product in order to assess safety risks posed by any one or more HCP(s).
Changes in cell culture conditions of eukaryotic cells has been shown to impact the purity and stability of manufactured proteins, as seen by the increased quantity of HCPs of CHO cells upon downstream bioprocessing alterations (Tait, et al, 2013, Biotechnol Prog 29(3):688-696). The detrimental effect of leftover HCPs in any product may affect the overall quality or quantity, or both the quality and quantity of the product. HCPs, if present even at low levels in a therapeutic product, may induce an undesired immune response which causes concern for patient safety and efficacy of the drug product (Singh SK. 201 1. "Impact of product-related factors on immunogenicity of biotherapeutics." J Pharm Sci 100:354-387; Ipsen Press Release "Ipsen's partner Inspiration Biopharmaceuticals announces hold of phase III clinical trials evaluating IB1001 for the treatment and prevention of hemophilia B", 10 July 2012). Current protocols seek to alter the protein of interest produced by the cell (e.g.,therapeutic antibody) to eliminate differential binding or interaction with the protein of interest and the host cell protein (Zhang, Q. et al, mAbs, Published online: 11 Feb 2014). Alternative bioprocessing or purification techniques may be warranted in order to minimize the risk of excess impurities (Yuk, et al. 2015, Biotechnol. Bioeng. 9999: 1-16).
[0004] Despite the availability of numerous cell expression systems, engineered cell lines and systems that do not negatively impact the biological properties of an expressed protein of interest are particularly advantageous. Accordingly, there is a need in the art for improved methods towards preparation of quality protein samples for downstream bioprocessing and subsequently commercial use.
BRIEF SUMMARY
[0005] The use of gene editing tools to eliminate two or more contaminant host cell proteins is contemplated, and thus, engineered host cells for more efficient manufacturing processing of proteins is provided.
[0006] In one aspect, the invention provides a recombinant host cell, wherein the cell is modified to decrease the expression levels of two or more fatty acid hydrolases (FAHs) relative to the expression levels of FAH in an unmodified cell.
[0007] In another aspect, the invention provides a recombinant host cell, wherein the cell is modified to have no expression of two or more target FAHs.
[0008] In some embodiments, one of said two or more target FAHs is an esterase. In more specific embodiments, the esterase is a lipase. In more specific embodiments, the lipase is: (1 ) a phospholipase, such as a phospholipase B-like protein or a phospholipase B-like 2 protein, (2) a lipoprotein lipase or (3) a lysosomal acid lipase. In some embodiments, one of said two or more target FAHs is an amidase. In more specific embodiments, the amidase is a fatty acid acylase. In a more specific embodiment, the fatty acid acylase is an acid ceramidase.
[0009] In some embodiments, a gene of interest is exogenously added to the recombinant host cell. In other embodiments, the exogenously added gene encodes a protein of interest (POI), for example the POI is selected form the group consisting of antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein.
[0010] In further embodiments, the invention relates to a composition comprising one or more proteins of interest (POIs) obtainable by a method according to the invention. The protein may be a recombinant protein and may be e.g. selected form the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein, or any combinations thereof.
[0011] In some embodiments, the invention relates to a composition comprising one or more proteins of interest (POIs), wherein the adverse enzyme activity is < 50%, 40%, 48%, 47%, 46%, 45%, 44%, 43%, 42%, 41 %, 40%, 39%, 38%, 37%, 36%, 35%, 34%, 33%, 32%, 31 %, 30%, 29%, 28%, 26%, 25%, 24%, 23%, 22%, 21%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, 0.05%, or <0.05% active relative to the adverse enzyme activity of a wild-type production system. The phrase "adverse enzyme activity" refers to any enzyme and its action upon the resulting composition as a whole, wherein the action results in metabolism of any protein components which metabolites reduces the shelf-life of the composition or results in the formation of subvisible particles (SVPs) above prescribed regulations. The protein may be a recombinant protein and may e.g. be selected from the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein or any combinations thereof. The enzymes and their activities may be, e.g., various esterases, hydrolases, lipases, phospholipases, ceramidases and the likes, or any combinations thereof. Adverse enzyme activity may be measured using a functional assay {e.g., polysorbate fatty acid hydrolysis assay), or a structural assay {e.g., nano LC-MS of peptide fragments, or the like).
[0012] The invention provides a cell comprising a nonfunctional PLBD2 protein and one or more additional nonfunctional fatty acid hydrolases (FAH). In one embodiment, the additional
nonfunctional FAH is a nonfunctional lipoprotein lipase (LPL), lysosomal acid lipase (LI PA), acid ceramidase (ACE), platelet-activating factor acetylhydrolase IB subunit gamma (PAFAHG), phosphoinositide phospholipase C fragment (PIPLCf), phosphoinositide phospholipase C (PIPLC), liver carboxylesterase 1 (LCE), isoamyl acetate-hydrolyzing esterase 1 -like (IAH1 ), group XV phospholipase A2 (LPLA2), carboxylic ester hydrolase (CEH), and/or arylsulfatase A (ASA).
[0013] The invention provides making a cell by FAH target disruption. In some embodiments, the method comprises a site-specific nuclease for disrupting or editing the cell genome at a target site or sequence. In some embodiments, the FAH target site is (1 ) a PLBD2 target site, (2) an LPL target site, (3) an LI PA target site, (4) an ACE target site, (5) a PAFAHG site, (6) a PIPLCf or PIPLC site, (7) an LCE site, (8) an IAH1 site, (9) an LPLA2 site, (9) an CEH site, and /or (10) an ASA site.
[0014] In one embodiment, the PLBD2 target site comprises a position within SEQ ID NO:33, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:33, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:33, within exon 1 of SEQ ID NO:33, within exon 2 of SEQ ID NO:33, or within exon 3 of SEQ ID NO:33.
[0015] In one embodiment, the PLBD2 target site comprises a position within SEQ ID NO:33 or adjacent to a position within SEQ ID NO:33 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-230, 190-210, 200-220, 210-230, 220-240, 230-250, 240-260, and 250-270 of SEQ ID NO:33.
[0016] In another embodiment, the target site at a position within SEQ ID NO:33 or adjacent to a position within SEQ ID NO:33 is selected from the group consisting of nucleotides spanning positions numbered 37-56, 44-56, 33-62, 40-69, 110-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33. In this regard, the PLBD2 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:33 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:33 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0017] In one embodiment, the LPL target site comprises a position within SEQ ID NO:61 , within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:61 , 100 nucleotides downstream of the 5-prime end of SEQ ID NO:61 , within exon 1 of SEQ ID NO:61 , within exon 2 of SEQ ID NO:61 , or within exon 3 of SEQ ID NO:61.
[0018] In one embodiment, the LPL target site comprises a position within SEQ ID NO:61 or adjacent to a position within SEQ ID NO:61 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490- 510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590- 610, 600-620, 610-630, 620-640 and 630-650 of SEQ ID NO:61.
[0019] In another embodiment, the target site at a position within SEQ ID NO:61 or adjacent to a position within SEQ ID NO:61 is selected from the group consisting of nucleotides spanning positions numbered 465-484, 558-577, and 593-612 of SEQ ID NO:61. In this regard, the LPL target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:61 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:61 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0020] In one embodiment, the LIPA target site comprises a position within SEQ ID NO:69, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:69, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:69, within exon 1 of SEQ ID NO:69, within exon 2 of SEQ ID NO:69, or within exon 3 of SEQ ID NO:69.
[0021] In one embodiment, the LIPA target site comprises a position within SEQ ID NO:69 or adjacent to a position within SEQ ID NO:69 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350 and 340-360 of SEQ ID NO:69.
[0022] In another embodiment, the target site at a position within SEQ ID NO:69 or adjacent to a position within SEQ ID NO:69 is selected from the group consisting of nucleotides spanning positions numbered 180-199, 239-258, and 276-295 of SEQ ID NO:69. In this regard, the LIPA target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:69 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:69 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0023] In one embodiment, the ACE target site comprises a position within SEQ ID NO:77, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:77, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:77, within exon 1 of SEQ ID NO:77, within exon 2 of SEQ ID NO:77, within exon 3 of SEQ ID NO:77, or within exon 4 of SEQ ID NO:77.
[0024] In one embodiment, the ACE target site comprises a position within SEQ ID NO:77 or adjacent to a position within SEQ ID NO:77 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 110-140, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370 and 360-380 of SEQ ID NO:77.
[0025] In another embodiment, the target site at a position within SEQ ID NO:77 or adjacent to a position within SEQ ID NO:77 is selected from the group consisting of nucleotides spanning positions numbered 135-154, 237-256, and 332-351 of SEQ ID NO:77. In this regard, the ACE target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:77 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:77 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0026] In one embodiment, the PAFAHG target site comprises a position within SEQ ID NO:1 17, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:117, 100 nucleotides
downstream of the 5-prime end of SEQ ID NO: 117, within exon 1 of SEQ ID NO:1 17, within exon 2 of SEQ ID NO:117, or within exon 3 of SEQ ID NO:1 17.
[0027] In one embodiment, the PAFAHG target site comprises a position within SEQ ID NO:1 17 or adjacent to a position within SEQ ID NO:1 17 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, 760-780, and 770-787 of SEQ ID NO:1 17.
[0028] In another embodiment, the target site at a position within SEQ ID NO:1 17 or adjacent to a position within SEQ ID NO:117 is selected from the group consisting of nucleotides spanning positions numbered 101-120, 1 1 1-130, 121 -140, 131-150, 141-160, and 150-169 of SEQ ID NO:1 17. In this regard, the PAFAHG target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 17 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 17 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0029] In one embodiment, the PIPLC target site comprises a position within SEQ ID NO:1 18, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO:118, 100 nucleotides
downstream of the 5-prime end of SEQ ID NO: 118, within exon 1 of SEQ ID NO:1 18, within exon 2 of SEQ ID NO:118, within exon 3 of SEQ ID NO:118, within exon 4 of SEQ ID NO:1 18, within exon 5 of SEQ ID NO:1 18, within exon 6 of SEQ ID NO:1 18, within exon 7 of SEQ ID NO:1 18, within exon 8 of SEQ ID NO: 1 18, within exon 9 of SEQ ID NO: 1 18, within exon 10 of SEQ ID NO: 1 18, within exon 1 1 of SEQ ID NO:1 18, within exon 12 of SEQ ID NO:1 18, or within exon 13 of SEQ ID NO:1 18.
[0030] In one embodiment, the PIPLC target site comprises a position within SEQ ID NO:1 18 or adjacent to a position within SEQ ID NO:1 18 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ I D NO: 118.
[0031] In another embodiment, the target site at a position within SEQ ID NO:1 18 or adjacent to a position within SEQ ID NO:118 is selected from the group consisting of nucleotides spanning positions numbered 39-188, 39-58, 49-68, 59-78, 69-88, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 149-178, and 159-188 of SEQ ID NO:118. In this regard, the PIPLC target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 18 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 18 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0032] In one embodiment, the LCE target site comprises a position within SEQ ID NO:1 19, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 1 19, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:1 19, within exon 1 of SEQ ID NO:1 19, within exon 2 of SEQ ID NO:1 19, within exon 3 of SEQ ID NO:119, within exon 4 of SEQ ID NO:1 19, or within exon 5 of SEQ ID NO:1 19.
[0033] In one embodiment, the LCE target site comprises a position within SEQ ID NO:1 19 or adjacent to a position within SEQ ID NO:1 19 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ I D NO: 119.
[0034] In another embodiment, the target site at a position within SEQ ID NO:1 19 or adjacent to a position within SEQ ID NO:119 is selected from the group consisting of nucleotides spanning positions numbered 89-140, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, and 139-158 of SEQ ID NO:1 19. In this regard, the LCE target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:1 19 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:1 19 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0035] In one embodiment, the IAH1 target site comprises a position within SEQ ID NO:120, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 120, 100 nucleotides downstream of the 5-prime end of SEQ ID NO:120, within exon 1 of SEQ ID NO:120, within exon 2 of SEQ ID NO:120, within exon 3 of SEQ ID NO:120, or within exon 4 of SEQ ID NO:120.
[0036] In one embodiment, the IAH1 target site comprises a position within SEQ ID NO:120 or adjacent to a position within SEQ ID NO:120 selected from the group consisting of nucleotides spanning positions numbered -20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO: 120.
[0037] In another embodiment, the target site at a position within SEQ ID NO:120 or adjacent to a position within SEQ ID NO:120 is selected from the group consisting of nucleotides spanning positions numbered 104-325, 104-123, 1 14-133, 124-143, 134-153, 144-163, 154-173, 164-183, 174-193, 184-203, 194-213, 204-223, 214-233, 224-243, 234-253, 244-263, 254-273, 264-283, 274-293, 284-303, 294-313, 304-323, and 314-333 of SEQ ID NO:120. In this regard, the IAH1 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO: 120 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:120 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0038] In one embodiment, the LPLA2 target site comprises a position within SEQ ID NO:121 , within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 121 , 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 121 , within exon 1 of SEQ ID NO: 121 , within exon 2 of SEQ ID NO: 121 , within exon 3 of SEQ ID NO: 121 , or within exon 4 of SEQ ID NO: 121.
[0039] In one embodiment, the LPLA2 target site comprises a position within SEQ ID NO:121 or adjacent to a position within SEQ ID NO:121 selected from the group consisting of nucleotides spanning positions numbered -20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO:121.
[0040] In another embodiment, the target site at a position within SEQ ID NO:121 or adjacent to a position within SEQ ID NO:121 is selected from the group consisting of nucleotides spanning positions numbered 69-195, 69-88, 79-98, 89-108, 99-118, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, 169-188, and 179-198 of SEQ ID NO:121. In this regard, the LPLA2 target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO: 121 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO:121 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0041] In one embodiment, the CEH target site comprises a position within SEQ ID NO:122, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 122, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 122, within exon 1 of SEQ ID NO: 122, within exon 2 of SEQ ID NO: 122, within exon 3 of SEQ ID NO: 122, within exon 4 of SEQ ID NO: 122, or within exon 5 of SEQ ID NO: 122.
[0042] In one embodiment, the CEH target site comprises a position within SEQ ID NO:122 or adjacent to a position within SEQ ID NO:122 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO: 122.
[0043] In another embodiment, the target site at a position within SEQ ID NO:122 or adjacent to a position within SEQ ID NO:122 is selected from the group consisting of nucleotides spanning positions numbered 79-186, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, and 169-188 of SEQ ID NO:122. In this regard, the CEH target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:122 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO: 122 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell {i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0044] In one embodiment, the ASA target site comprises a position within SEQ ID NO:123, within 100 nucleotides upstream of the 5-prime end of SEQ ID NO: 123, 100 nucleotides downstream of the 5-prime end of SEQ ID NO: 123, within exon 1 of SEQ ID NO: 123, within exon 2 of SEQ ID NO: 123, within exon 3 of SEQ ID NO: 123, or within exon 4 of SEQ ID NO: 123.
[0045] In one embodiment, the ASA target site comprises a position within SEQ ID NO:123 or adjacent to a position within SEQ ID NO:123 selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 100-120, 1 10-130, 120-140, 130-150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490-510, 500-520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590-610, 600-620, 610-630, 620-640, 630-650, 640-660, 650-670, 660-680, 670-690, 680-700, 690-710, 700-720, 710-730, 720-740, 730-750, 740-760, 750-770, or 760-780 of SEQ ID NO: 123.
[0046] In another embodiment, the target site at a position within SEQ ID NO:123 or adjacent to a position within SEQ ID NO:123 is selected from the group consisting of nucleotides spanning positions numbered 79-296, 79-98, 89-108, 99-1 18, 109-128, 1 19-138, 129-148, 139-158, 149-168, 159-178, 169-188, 179-198 , 189-208 , 199-218, 209-228, 219-238, 229-248, 239-258, 249-268, 259-278, 269-288, and 279-298 of SEQ ID NO:123. In this regard, the ASA target site is partially or fully within or encompassed by the nucleotide positions of SEQ ID NO:123 provided herein, and disrupting or editing the cell genome at the target site or sequence may consist of deleting or inserting one or more nucleotides within the nucleotide positions of SEQ ID NO: 123 provided herein, whereas disrupting or editing alters a subsequent transcript as compared to that transcribed from a wild-type cell (i.e., a cell free of genomic disruption or gene editing). In some embodiments, the subsequent transcript of an altered gene results in a frameshift of the translated protein. In some embodiments, the subsequent transcript of an altered gene results in an altered protein that is subject to degradation, is not detectable by a standard method such as mass spectrometry, or has no detectable activity. In some embodiments, the targeted disruption or editing occurs on both alleles of the gene.
[0047] In certain embodiments, the cell further integrates an exogenous nucleic acid sequence. In other embodiments, the cell is capable of producing an exogenous protein of interest. In still other embodiments, the altered protein resulting from a disrupted gene does not bind to the protein of interest produced by the cell.
[0048] In another aspect, an isolated Chinese hamster ovary (CHO) cell is provided that comprises an engineered nucleic acid sequence comprising a variant of the PLBD2 gene (such as a variant of SEQ ID NO:33). In one embodiment, the PLBD2 gene comprises GACAGTCACG TGGCCCGACT GAGGCACGCG , nucleotides 1-30 of SEQ ID NO:33 (SEQ ID NO: 44). In another embodiment, the PLBD2 gene is engineered to disrupt expression of the open reading frame. In other embodiments, the invention provides an isolated CHO cell comprising (a) a disrupted PLBD2 gene comprising GACAGTCACG TGGCCCGACT GAGGCACGCG (SEQ ID NO: 44, also nucleotides 1- 30 of SEQ ID NO:33), (b) a disrupted esterase gene comprising a nucleotide encoding any one of the amino acid sequences in Table 2, or (c) a protein fragment of Table 2 expressed by a disrupted PLBD2 gene; and an exogenous nucleic acid sequence comprising a gene of interest.
[0049] In one embodiment, the CHO cell that comprises an engineered nucleic acid sequence comprising a variant of the PLBD2 gene also comprises variants of one or more of the genes encoding (1 ) LPL (variant of SEQ ID NO:61 ), (2) LIPA (SEQ ID NO:69), (3) ACE (SEQ ID NO:77), (4) PAFAHG (SEQ ID NO:1 17), (5) PIPLC (SEQ ID NO:1 18), (6) LCE (SEQ ID NO:1 19), (7) IAH1 (SEQ ID NO:120), (8) LPLA2 (SEQ ID NO:121 ), (9) CEH (SEQ ID NO:122), and/or (10) ASA (SEQ ID NO:123).
[0050] In another aspect, a method of producing a protein of interest using a recombinant host cell is provided, wherein the host cell is modified to decrease the expression levels of esterase relative to the expression levels of esterase in an unmodified cell.
[0051] In another embodiment, the method comprises the modified host cell having decreased esterase expression and an exogenous nucleic acid sequence comprising a gene of interest (GOI).
[0052] In certain embodiments, the exogenous nucleic acid sequence comprises one or more genes of interest. In some embodiments, the one or more genes of interest are selected from the group consisting of a first GOI, a second GOI and a third GOI.
[0053] In another aspect, the invention provides expression systems comprising the recombinant host cell comprising modified or nonfunctional esterase.
[0054] In yet another embodiment, the cell comprises a GOI operably linked to a promoter capable of driving expression of the GOI, wherein the promoter comprises a eukaryotic promoter that can be regulated by an activator or inhibitor. In other embodiments, the eukaryotic promoter is operably linked to a prokaryotic operator, and the eukaryotic cell optionally further comprises a prokaryotic repressor protein.
[0055] In another embodiment, one or more selectable markers are expressed by the modified host cell. In some embodiments, the genes of interest and/or the one or more selectable markers are operably linked to a promoter, wherein the promoter may be the same or different. In another embodiment, the promoter comprises a eukaryotic promoter (such as, for example, a CMV promoter or an SV40 late promoter), optionally controlled by a prokaryotic operator (such as, for example, a tet operator). In other embodiments, the cell further comprises a gene encoding a prokaryotic repressor (such as, for example, a tet repressor).
[0056] In one aspect, a CHO host cell is provided, comprising recombinase recognition sites. In some embodiments, the recombinase recognition sites are selected from a LoxP site, a /_ox51 1 site, a Lox2272 site, Lox2372, Lox5171 , and a frt site.
[0057] In another embodiment, the cell further comprises a gene capable of expressing a recombinase. In some embodiments, the recombinase is a Cre recombinase.
[0058] In one embodiment, the selectable marker gene is a drug resistance gene. In another embodiment, the drug resistance gene is a neomycin resistance gene or a hygromycin resistance gene. In another embodiment, the second and third selectable marker genes encode two different fluorescent proteins. In one embodiment, the two different fluorescent proteins are selected from the group consisting of Discosoma coral (DsRed), green fluorescent protein (GFP), enhanced green fluorescent protein (eGFP), cyano fluorescent protein (CFP), enhanced cyano fluorescent protein (eCFP), yellow fluorescent protein (YFP), enhanced yellow fluorescent protein (eYFP) and far-red fluorescent protein (mKate).
[0059] In one embodiment, the first, second, and third promoters are the same. In another embodiment, the first, second, and third promoters are different from each other. In another embodiment, the first promoter is different from the second and third promoters, and the second and third promoters are the same. In more embodiments, the first promoter is an SV40 late promoter, and the second and third promoters are each a human CMV promoter. In other embodiments, the first and second promoters are operably linked to a prokaryotic operator.
[0060] In one embodiment, the host cell line has an exogenously added gene encoding a recombinase integrated into its genome, operably linked to a promoter. In another embodiment, the recombinase is Cre recombinase. In another embodiment, the host cell has a gene encoding a regulatory protein integrated into its genome, operably linked to a promoter. In more embodiments, the regulatory protein is a tet repressor protein.
[0061] In one embodiment, the first GOI and the second GOI encode a light chain, or fragment thereof, of an antibody or a heavy chain, or fragment thereof, of an antibody. In another embodiment, the first GOI encodes a light chain of an antibody and the second GOI encodes a heavy chain of an antibody.
[0062] In certain embodiments, the first, second and third GOI encode a polypeptide selected from the group consisting of a first light chain, or fragment thereof, a second light chain, or fragment thereof and a heavy chain, or fragment thereof. In yet another embodiment, the first, second and third GOI encode a polypeptide selected from the group consisting of a light chain, or fragment thereof, a first heavy chain, or fragment thereof and a second heavy chain, or fragment thereof.
[0063] In one aspect, a method is provided for making a protein of interest, comprising (a) introducing into a CHO host cell a gene of interest (GOI), wherein the GOI integrates into a specific locus such as a locus described in US Patent No. 7771997B2, issued August 10, 2010 or other stable integration and/or expression-enhancing locus; (b) culturing the cell of (a) under conditions that allow expression of the GOI; and (c) recovering the protein of interest. In one embodiment, the protein of interest is selected from the group consisting of a subunit of an immunoglobulin, or fragment thereof, and a receptor, or ligand-binding fragment thereof. In certain embodiments, the protein of interest is selected from the group consisting of an antibody light chain, or antigen-binding fragment thereof, and an antibody heavy chain, or antigen-binding fragment thereof.
[0064] In certain embodiments, the CHO host cell genome comprises further modifications, and comprises one or more recombinase recognition sites as described above, and the GOI is introduced into a specific locus through the action of a recombinase that recognizes the
recombinase recognition site.
[0065] In some embodiments, the GOI is introduced into the cell employing a targeting vector for recombinase-mediated cassette exchange (RMCE) when the CHO host cell genome comprises at least one exogenous recognition sequence within a specific locus.
[0066] In another embodiment, the GOI is introduced into the cell employing a targeting vector for homologous recombination, and wherein the targeting vector comprises a 5' homology arm homologous to a sequence present in the specific locus, a GOI, and a 3' homology arm
homologous to a sequence present in the specific locus. In another embodiment, the targeting vector further comprises two, three, four, or five or more genes of interest. In another embodiment, one or more of the genes of interest are operably linked to a promoter.
[0067] In another aspect, a method is provided for modifying a CHO cell genome to integrate an exogenous nucleic acid sequence, comprising the step of introducing into the cell a vehicle comprising an exogenous nucleic acid sequence wherein the exogenous nucleic acid integrates within a locus of the genome.
[0068] In yet another aspect, the invention provides a process for manufacturing a stable protein formulation comprising the steps of: (a) extracting a protein fraction from the modified host cell of the invention having decreased or ablated expression of esterase, (b) contacting the protein fraction comprising a protein of interest with a column selected from the group consisting of protein A affinity (PA), cation exchange (CEX) and anion exchange (AEX) chromatography, (c) collecting the protein of interest from the media, wherein a reduced level of the esterase activity is associated with the protein fraction collected at step (c), thus providing a stable protein formulation.
[0069] In yet another aspect, the invention provides a process for reducing esterase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate expression of esterase, (b) transfecting the host cell with a protein of interest, (c) extracting a protein fraction from the modified host cell, (c) contacting the protein fraction comprising the protein of interest with a column selected from the group consisting of protein A affinity (PA), cation exchange (CEX) and anion exchange (AEX) chromatography, (d) collecting the protein of interest from the media, and (e) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer, thus providing a protein formulation essentially free of detectable esterase activity. In some embodiments, the protein formulation is essentially free of PLBD2 protein or PLBD2 activity.
[0070] In yet another aspect, a method is provided for modifying a CHO cell genome to express a therapeutic agent comprising a vehicle for introducing, into the genome, an exogenous nucleic acid comprising a sequence for expression of the therapeutic agent, wherein the vehicle comprises a 5' homology arm homologous to a sequence present in the nucleotide sequence of SEQ ID NO:33, a nucleic acid encoding the therapeutic agent, and a 3' homology arm homologous to a sequence present in the nucleotide sequence of SEQ ID NO:33.
[0071] In one more aspect, the invention provides a modified CHO host cell comprising a modified CHO genome wherein the CHO genome is modified by disruption of target sequence within a nucleotide sequence at least 90% identical to SEQ ID NO: 33. In one embodiment, the modified CHO host cell further comprises another FAH target sequence disruption. In some embodiments, the another FAH target sequence is within (1 ) a nucleotide sequence at least 90% identical to SEQ ID NO:61 , (1 ) a nucleotide sequence at least 90% identical to SEQ ID NO:69 and/or (3) a nucleotide sequence at least 90% identical to SEQ ID NO:77.
[0072] In another aspect, the invention provides a modified eukaryotic host cell comprising a modified eukaryotic genome wherein the eukaryotic genome is modified at a target sequence in a coding region of the target gene by a site-specific nuclease. In some embodiments, the site-specific nuclease comprises a zinc finger nuclease (ZFN), a ZFN dimer, a transcription activator-like effector nuclease (TALEN), a TAL effector domain fusion protein, or an RNA-guided DNA endonuclease. The invention also provides methods of making such a modified eukaryotic host cell.
[0073] In any of the aspects and embodiments described above, the target sequence can be placed in the indicated orientation as in SEQ ID NO:33, 61 , 69, 77, 1 17, 118, 119, 120, 121 , 122, or 123; or in the reverse of the orientation of SEQ ID NO:33, 61 , 69, 77, 1 17, 1 18, 119, 120, 121 , 122, or 123.
[0074] Any of the aspects and embodiments of the invention can be used in conjunction with any other aspect or embodiment of the invention, unless otherwise specified or apparent from the context.
[0075] Other objects and advantages will become apparent from a review of the ensuing detailed description.
BRIEF DESCRIPTION OF THE DRAWINGS
[0076] Fig. 1 depicts the results of Taqman® quantitative polymerase chain reaction (qPCR) experiments to detect genomic (gDNA) or transcripts (mRNA) of the modified clones. Primers and probes were designed to flank the sequences predicted as subject to targeted disruption within exon 1 , either starting at nucleotide 37 (sgRNAI ) or starting at nucleotide 44 (sgRNA2) of SEQ ID NO:33. Relative amount of amplicons from clones targeted by either sgRNAI or sgRNA2 are graphed (i.e., relative to amplicons derived from the negative control transfection clones which were subject to no sgRNA or unmatched sgRNA). Clone 1 , for example, has relatively no amplified gDNA nor mRNA per qPCR of the targeted exon 1 region. Clone 1 and several other clones were selected for follow up analysis. The vertical axes represent the relative amount of template containing sgRNAI sequence (upper panel) or sgRNA2 sequence (lower panel).
[0077] Fig. 2A and Fig. 2B illustrate the results of further PCR analysis of a Clone 1 cells population compared to wild type Chinese hamster overy (CHO) cells. Fig. 2A shows a PCR amplicon from Clone 1 that is shorter in length (bp) compared to the amplicon from genomic DNA of wild type cells. Fig. 2B shows a PCR amplicon from Clone 1 that is shorter in length (bp) compared to the amplicon from mRNA of wild type cells. Sequencing confirmed an 1 1 bp deletion in the PLBD2 gene of Clone 1 . The vertical axes represent the size of the PCR fragments in base-pairs (bp).
[0078] Fig. 3 illustrates the relative protein titer of monoclonal antibody 1 (mAbl )-expressing Clone 1 cells (RS001 ) or mAb1 -expressing wild type CHO cells (RS0WT) subject to the same fed-batch culture conditions for 12 days. Samples of conditioned medium were extracted for each culture, and the Protein A binding fraction was quantified at Day 2, 4, 6, 9 and 12.
[0079] Fig. 4 shows the results of RS001 or RS0WT cells following production culture and protein purification using either Protein A (PA) alone, or PA and anion exchange (AEX) chromatography. PA-purified mAb1 from RS001 and RS0WT was analyzed for lipase abundance using trypsin digest mass spectrometry. As such, trypsin digests of RS001 - and RSOWT-produced mAb1 were injected into a reverse phase liquid chromatography column coupled to a triple quadrupole mass spectrometer set to monitor a specific PLBD2 product fragment (as in Table 2). Control reactions containing reference samples of mAb1 (with no endogenous PLBD2) spiked with varying amounts of recombinant PLBD2 were also analyzed and plotted. The signals detected in the experiments were compared to the control reactions to determine concentration of PLBD2. mAb1 produced from Clone 1 shows no detectable amounts of PLBD2 when purified with PA alone.
[0080] Fig. 5 is a line plot depicting percent cell viability as a function of time in days. Open circles (-0-) represent wildtype cells. Filled diamonds (-♦-) represent PLBD2-KO cells. Filled squares (-■-) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
[0081] Fig. 6 is a line plot depicting protein production (titer) in grams per liter as a function of time in days. Open circles (-o-) represent wildtype cells. Filled diamonds (-♦-) represent PLBD2-KO cells. Filled squares (-■-) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
[0082] Fig. 7 is a line plot depicting viable cell counts in cells per milliliter as a function of time in days. Open circles (-o-) represent wildtype cells. Filled diamonds (-♦-) represent PLBD2-KO cells. Filled squares (-■-) represent LPL-KO cells. Filled triangles (-A-) represent the double knock-out LPL-KO / PLBD2KO.
[0083] Fig. 8 shows a formatted alignment of LPL knock out constructs clone 19, clone 20, clone 21 , and clone 22, represented by SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161 , and SEQ ID NO:162, respectively. The partial wildtype LPL sequence is represented by SEQ ID NO:158.
DETAILED DESCRIPTION
[0084] Before the present methods are described, it is to be understood that this invention is not limited to particular methods, and experimental conditions described, as such methods and conditions may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present invention will be limited only by the appended claims.
[0085] As used in this specification and the appended claims, the singular forms "a", "an", and "the" include plural references unless the context clearly dictates otherwise. Thus for example, a reference to "a method" includes one or more methods, and/or steps of the type described herein and/or which will become apparent to those persons skilled in the art upon reading this disclosure.
[0086] Unless defined otherwise, or otherwise specified, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
[0087] Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, particular methods and materials are now described. All publications mentioned herein are incorporated herein by reference in their entirety.
Definitions
[0088] The phrase "exogenously added gene" or "exogenously added nucleic acid" refers to any DNA sequence or gene not present within the genome of the cell as found in nature. For example, an "exogenously added gene" within a CHO genome, can be a gene from any other species (e.g., a human gene), a chimeric gene (e.g., human/mouse), or a hamster gene not found in nature within the particular CHO locus in which the gene is inserted (i.e., a hamster gene from another locus in the hamster genome), or any other gene not found in nature to exist within a CHO locus of interest.
[0089] Percent identity, when describing an esterase, e.g., a hydrolase protein, such as SEQ ID NO:32, 34, 35, 37, 62,70, 74, 75, 76, 78, 82, 83, 84; 124, 125, 126, 127, 128, 129, and 130; or gene, such as SEQ ID NO:33, 61 , 69, 77, 1 17, 118, 119, 120, 121 , 122, and 123 includes homologous sequences that display the recited identity along regions of contiguous homology, but the presence of gaps, deletions, or insertions that have no homolog in the compared sequence are not taken into account in calculating percent identity.
[0090] A "percent identity" determination between, e.g., SEQ ID NO:32 with a species homolog would not include a comparison of sequences where the species homolog has no homologous sequence to compare in an alignment (i.e., SEQ ID NO:32 compared to a fragment thereof, or the species homolog has a gap or deletion, as the case may be). Thus, "percent identity" does not include penalties for gaps, deletions, and insertions.
[0091] "Targeted disruption" of a gene or nucleic acid sequence refers to gene targeting methods that direct cleavage or breaks (such as double stranded breaks) in genomic DNA and thus cause a modification to the coding sequence of such gene or nucleic acid sequence. Gene target sites are the sites selected for cleavage or break by a nuclease. The DNA break is normally repaired by the non-homologous end-joining (NHEJ) DNA repair pathway. During NHEJ repair, insertions or deletions (InDels) may occur, as such, a small number of nucleotides are either inserted or deleted at random at the site of the break and these InDels may shift or disrupt the open reading frame (ORF) of the target gene. Shifts in the ORF may cause significant changes in the resulting amino acid sequence downstream of the DNA break, or may introduce a premature stop codon, therefore the expressed protein, if any, is rendered nonfunctional or subject to degradation.
[0092] "Targeted insertion" refers to gene targeting methods employed to direct insertion or integration of a gene or nucleic acid sequence to a specific location on the genome, i.e., to direct the DNA to a specific site between two nucleotides in a contiguous polynucleotide chain. Targeted insertion may also be performed to introduce a small number of nucleotides or to introduce an entire gene cassette, which includes multiple genes, regulatory elements, and/or nucleic acid sequences. "Insertion" and "integration" are used interchangeably.
[0093] "Recognition site" or "recognition sequence" is a specific DNA sequence recognized by a nuclease or other enzyme to bind and direct site-specific cleavage of the DNA backbone.
Endonucleases cleave DNA within a DNA molecule. Recognition sites are also referred to in the art as recognition target sites.
[0094] Polysorbates are fatty acid esters of sorbitan or iso-sorbide (polyoxyethylene sorbitan or iso- sorbide mono- or di- esters). The polyoxyethylene serves as the hydrophilic head group and the fatty acid as the lipophilic hydrophobic tail. The effectiveness as a surfactant of the polysorbate depends upon the amphiphilic nature of the molecule with both hydrophilic head and hydrophobic tail present (in a single molecule). When a polysorbate degrades (hydrolyzes) into its component head group and fatty acid tail, it loses its effectiveness as a protein stabilizer, potentially allowing for aggregation and subsequent subvisible particle (SVP) formation is an indicator of such degradation. SVPs may attribute to immunogenicity. Regulatory authorities like the United States Food and Drug Administration (USFDA) provide limitations on the number of subvisible particles (SVPs) allowed in a pharmaceutical formulation. United States Pharmacopeia (USP) publishes standards for strength, purity and quality of drugs and drug ingredients, as well as food ingredients and dietary
supplements. For example, USP 31 monograph <788> sets the limit for number of particles allowed in parenteral formulations. USP 31 monograph <788> is available at http://www.uspnf.com/official- text/revision-bulletins/particulate-matter-injections; and as Revision Bulletin Official July 1 , 2012, <788> Particulate Matter in Injections, The United States Pharmacopeial Convention. For large volume parenterals (greater than 100 ml_), the limit is set at no more than 25 particles of at least 10 microns per ml_, and no more than 3 particles of at least 25 microns per ml_. For small volume parenterals (100 mL or less), the limit is set at no more than 6,000 particles of at least 10 microns per container, and no more than 600 particles of at least 25 microns per container.
[0095] The importance of maintaining a stable composition to minimize losses of the biologically active agent due to any contaminant or degradative process is emphasized by the International Conference on Harmonisation of Technical Requirements For Registration of Pharmaceuticals For Human Use (ICH). According to ICH Specifications (Q6A and Q6B), if a drug substance does not degrade in the specific formulation and under the specific storage conditions proposed in a new drug application, as demonstrated via appropriate analytical methodology, then degradation product testing may be reduced or eliminated upon approval by the regulatory authorities.
[0096] The term "stability" refers to the retention of an acceptable degree of physical structure (colloidal, nature), chemical structure or biological function of the biologically active agent (e.g., biotherapeutic or other protein produced in a cell-based bioprocess) over time during storage (a.k.a. "shelf-life"), during processing, or after administration and while in vivo. The biologically active agent may be stable even though it does not maintain 100% of its structure or function after storage or administration for a defined amount of time. Under certain circumstances, if about 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97% 98%, 99%, or > 99% of the biologically active agents have a native conformation, structure or function, the biologically active agent and formulation containing the biologically active agent may be regarded as "stable".
[0097] Stability can be measured, inter alia, by determining the percentage of native molecule that remains in the formulation after storage or administration for a defined amount of time at a defined temperature. The percentage of native molecule can be determined by, inter alia, size exclusion chromatography (e.g., size exclusion high performance liquid chromatography [SE-HPLC]), such that native means non-aggregated and non-degraded. In certain embodiments, at least about 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% of the native form of the biologically active agent can be detected in the formulation after a defined amount of time at a defined temperature or under physiological conditions after administration. The defined amount of time after which stability is measured can be about 14 days, about 28 days, about 1 month, about 2 months, about 3 months, about 4 months, about 5 months, about 6 months, about 7 months, about 8 months, about 9 months, about 10 months, about 1 1 months, about 12 months, about 18 months, about 24 months, or more. The temperature at which the formulation containing the biologically active agent may be kept when assessing stability can be any temperature from about -80°C to about 45°C, e.g., storage at about - 80°C, about -30°C, about -20°C, about 0°C, about 4°-8°C, about 5°C, about 25°C, about 35°C, about 37°C or other physiological temperatures, or about 45°C. For example, the biologically active agent may be deemed stable if after 3 months under physiological conditions, greater than about 75%, 80%, 85% or 90% of native molecule is detected in the soluble fraction by SE-HPLC or other size exclusion or size determination method. "Physiological temperature" includes the body temperature of any vertebrate. For example, the physiological temperature of humans is about 37°C. In some embodiments of the invention, physiological temperature is between about 25°C and about 45°C. In some embodiments, physiological temperature is about 25°C, about 26°C, about 27°C, about 28°C, about 29°C, about 30°C, about 31 °C, about 32°C, about 33°C, about 34°C, about 35°C, about 36°C, about 37°C, about 38°C, about 39°C, about 40°C, about 41 °C, about 42°C, about 43°C, about 44°C, and about 45°C.
[0098] Stability can be measured, inter alia, by determining the percentage of biologically active agent, such as a protein, that forms an aggregate (i.e., high molecular weight species) after a defined amount of time at a defined temperature, wherein stability is inversely proportional to the percent high molecular weight (HMW) species that is formed of the biologically active agent (protein). The percentage of HMW species of the biologically active agent can be determined by, inter alia, size exclusion chromatography, as described above. A pharmaceutical formulation containing the biologically active agent may also be deemed stable if after three months at physiological conditions less than about 15%, 14%, 13%, 12%, 1 1 %, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, or 0.1% of the biologically active agent is detected in a HMW form.
[0099] Stability can be measured, inter alia, by determining the percentage of a biologically active agent, such as a protein, that is degraded or otherwise is found as a low molecular weight (LMW) species after a defined amount of time at a defined temperature. Stability is inversely proportional to the percent LMW species that is formed in the soluble fraction. The percentage of LMW species of the biologically active agent in the soluble fraction can be determined by, inter alia, size exclusion chromatography, as described above. A pharmaceutical formulation may also be deemed stable if after three months under storage conditions less than about 15%, 14%, 13%, 12%, 1 1%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1 %, 0.5%, or 0.1% of the biologically active agent is detected in a LMW form.
[00100] Pharmaceutical compositions prepared in host cells as described herein are envisioned to comply with standards set forth by multiple regulatory authorities and ICH guidelines. The composition complies if tested for subvisible particles and the test results in the average number of particles present in the units tested does not exceed 12 per milliliter equal to or greater than 10 um in size, and does not exceed 2 per milliliter equal to or greater than 25 um. Various test for microscopic particles in solution are well-known in the art, including but not limited to tests recommended in ICH Guideline Q4B Annex 3(R1 ), dated 27 September 2010 (Evaluation And Recommendation Of Pharmacopoeial Texts For Use In The lch Regions On Test For Particulate Contamination: Sub-Visible Particles General Chapter).
[00101] "Host cell proteins" or "HCPs" refer to proteins produced or encoded by the host organisms used to produce recombinant therapeutic proteins. HCPs are generally process-related impurities during biologies production. The amount of residual HCPs in drug product is generally considered a critical quality attribute (CQA), due to their potential to affect product safety and efficacy. Regulatory authorities require a product sponsor to monitor the removal of HCPs in drug product during bioprocess development. A sensitive assay e.g., immunoassay, capable of detecting a wide range of protein impurities is generally utilized. This testing can include verification at commercial scale in accordance with regional regulations and may be done at the time of submission of a marketing approval application. According to ICH Specifications (Q6A and Q6B, section 2.3), if a drug substance or drug product does not contain any impurity in the specific formulation, i.e. if efficient control or removal to acceptable levels is demonstrated by suitable studies, further testing may be reduced or eliminated upon approval by the regulatory authorities.
[00102] Pharmaceutical compositions prepared in host cells as described herein are envisioned to comply with standards set forth by multiple regulatory authorities and ICH guidelines. The composition prepared by the host cells described herein comprise less than 100 ng/mg (ppm), less than about 90 ppm, less than about 80 ppm, less than about 70 ppm, less than about 60 ppm, less than about 50 ppm, less than about 40 ppm, less than about 30 ppm, less than about 20 ppm, less than about 10 ppm, less than about 5 ppm, or 0 ppm of the target host cell protein, i.e. fatty acid hydrolase.
[00103] The term "fatty acid hydrolase" or "FAH" refers to any hydrolytic enzyme that cleaves at a carbonyl group creating a carboxylic acid product in which the carboxylic acid comprises an R- group that is lipophilic or otherwise hydrophobic. In some embodiments, the carboxylic acid product is a fatty acid. "Esterases" and "fatty acid acylases/amidases" are included as subgenera of fatty acid hydrolase. "Lipases" are a subgenus of esterases that cleave lipids (fats, waxes, sterols, glycerides and phospholipids. "Phospholipases" are a subgenus of lipases that cleave
phospholipids. Esterases cleave fatty acid esters into fatty acids and alcohols. Lipases include PLBD2, LPL and LIPA. "Ceramidase" is a subgenus of fatty acid acylase that cleaves ceramide and releases a fatty acid and sphingosine, which is an amino alcohol. Examples of ceramidases include acid ceramidase, neutral ceramidase, alkaline ceramidase 1 , alkaline ceramidase 2 and alkaline ceramidase 3.
[00104] Protein A-binding fraction refers to the fraction of cell lysate from cultured cells expressing a protein of interest which binds to a Protein A affinity format. It is well understood in the art that Protein A affinity chromatography, such as Protein A chromatography medium, such as resins, beads, columns and the like, are utilized to capture Fc-containing proteins due to their affinity to Protein A.
[00105] Phospholipase B-like 2 (PLBD2) refers to the homologs of a phospholipase gene known as NCBI RefSeq. XM_003510812.2 (SEQ ID NO:33) or protein known as NCBI RefSeq.
XP_003510860.1 (SEQ ID NO:32), and further described herein. PLBD2 is also referred to in the art as putative phospholipase B-like 2 (PLBL2), 76 kDa protein, LAMA-like protein 2, PLB homolog 2, lamina ancestor homolog 2, mannose-6-phosphate protein associated protein p76, p76, phospholipase B-like 2 32 kDa form, phospholipase B-like 2 45 kDa form, or Lysosomal 66.3 kDa protein.
[00106] Lipoprotein lipase (LPL) is a glycosylated homodimer secreted by parenchymal cells and associated with endothelial cells of the capillary lumen. Exemplary LPL proteins include Chinese hamster LPL (SEQ ID NO:62), mouse LPL (SEQ ID NO:66), ), rat LPL (SEQ ID NO:67) and human LPL (SEQ ID NO:68). Mouse LPL is 92% identical to Chinese hamster LPL. Rat LPL is 92% identical to Chinese hamster LPL. Human LPL is 88% identical to Chinese hamster LPL. In one embodiment, Chinese hamster LPL is encoded by a polynucleotide sequence of SEQ ID NO:61 .
[00107] Lysosomal acid lipase (LI PA), also known as lysosomal lipase, lipase A, lysosomal acid and cholesterol esterase is an intracellular lipase that functions in the lysosome. LIPA reversibly catalyzes cholesteryl ester bond formation and cleavage. LIPA is also a glycosylated homodimer. Exemplary LIPA proteins include Chinese hamster LIPA (SEQ ID NO:70), mouse LIPA (SEQ ID NO:74), ), rat LIPA (SEQ ID NO:75) and human LIPA (SEQ ID NO:76). Mouse LIPA is 72% identical to Chinese hamster LIPA. Rat LIPA is 75% identical to Chinese hamster LIPA. Human LIPA is 74% identical to Chinese hamster LIPA. In one embodiment, Chinese hamster LIPA is encoded by a polynucleotide sequence of SEQ ID NO:69.
[00108] Acid ceramidase (CE), also known as ASAH1 , AC, ACDase, ASAH, PHP, PHP32, SMAPME, N-acylsphingosine amidohydrolase (acid ceramidase) 1 is an acylase that cleaves ceramide to produce fatty acid and sphingosine. It is a heterodimer comprising a non-glycosylated alpha subunit and a glycosylated beta subunit. Acid ceramidase has an acid pH optimum. The lipid accumulation disease, Farber Lipogranulomatosis, is associated with a deficiency in acid ceramidase activity. Exemplary acid ceramidases (ACE) include Chinese hamster ACE (SEQ ID NO:78), mouse ACE (SEQ ID NO:82), ), rat ACE (SEQ ID NO:83) and human ACE (SEQ ID NO:84). Mouse ACE is 87% identical to Chinese hamster ACE. Rat ACE is 89% identical to Chinese hamster ACE. Human ACE is 83% identical to Chinese hamster ACE. In one embodiment, Chinese hamster ACE is encoded by a polynucleotide sequence of SEQ ID NO:77.
[00109] Platelet-activating factor acetylhydrolase IB subunit gamma (PAFAHG), also known as PAFAH1 B3, PAFAHG, and platelet activating factor acetylhydrolase 1 b catalytic subunit 3 is one of the catalytic subunits along with beta of the cytosolic tetrameric platelet-activating factor acetylhydrolase IB. PAFAHG belongs to the phospholipase A2 family and catalyzes the hydrolysis of the acyl group at position 2 of glycerol in bioactive phospholipids (see Stafforini et al., Journal of Biological Chemistry, 272:17895-17898, July 1997). Chinese hamster PAFAHG (SEQ ID NO:124) is 98% identical to both rat and mouse PAFAHG, and 96% identical to human PAFAHG. In one embodiment, Chinese hamster PAFAHG is encoded by a polynucleotide sequence of SEQ ID NO:1 17.
[00110] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00111] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00112] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00113] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00114] Phosphoinositide phospholipase C (PIPLC), also known as triphosphoinositide
phosphodiesterase, phosphoinositidase C, 1-phosphatidylinositol-4,5-bisphosphate
phosphodiesterase, monophosphatidylinositol phosphodiesterase, phosphatidylinositol
phospholipase C, PI-PIPLC, and 1-phosphatidyl-D-myo-inositol-4,5-bisphosphate
inositoltrisphosphohydrolase is member of the phospholipase C superfamily of phophodiesterases that participate in phosphatidylinositol 4,5-bisphosphate (PIP2) metabolism and calcium-dependent lipid signaling. PIPLC hydrolyzes PIP2 at the inner membrane to produce inositol triphosphate (IP3) and diacylglycerol (DAG). Chinese hamster PIPLC (SEQ ID NO:125) is 96% identical to rat PIPLC, 95% identical to mouse PIPLC, and 92% identical to human PIPLC. In one embodiment, Chinese hamster PIPLC is encoded by a polynucleotide sequence of SEQ ID NO:1 18.
[00115] Additional fatty acid hydrolases that can serve as targets for deletion, either individually, or in combination with one of more additional fatty acid hydrolases are listed in Table 1. Any of the following hydrolases, or their equivalents, listed in Table A may be the target protein in the creation of a knockout cell line, where removal of the host cell protein is necessitated due to contamination in the preparation of a biopharmaceutical product.
Table 1
[00116] The term "cell" or "cell line" includes any cell that is suitable for expressing a recombinant nucleic acid sequence. Cells include those of prokaryotes and eukaryotes (single-cell or multiple- cell), bacterial cells (e.g., strains of E. coli, Bacillus spp., Streptomyces spp., etc.), mycobacteria cells, fungal cells, yeast cells (e.g.,S. cerevisiae, S. pombe, P. partoris, P. methanolica, etc.), plant cells, insect cells (e.g.,SF-9, SF-21 , baculovirus-infected insect cells, Trichoplusia ni, etc.), non- human animal cells, mammalian cells, human cells, or cell fusions such as, for example, hybridomas or quadromas. In certain embodiments, the cell is a human, monkey, ape, hamster, rat or mouse cell. In certain embodiments, the cell is eukaryotic and is selected from the following cells: CHO (e.g.,CHO K1 , DXB-1 1 CHO, Veggie-CHO), COS (e.g.,COS-7), retinal cells, Vero, CV1 , kidney (e.g.,HEK293, 293 EBNA, MSR 293, MDCK, HaK, BHK21 ), HeLa, HepG2, WI38, MRC 5, Colo25, HB 8065, HL-60, Jurkat, Daudi, A431 (epidermal), CV-1 , U937, 3T3, L cell, C127 cell, SP2/0, NS-0, MMT cell, tumor cell, and a cell line derived from an aforementioned cell. In some embodiments, the cell comprises one or more viral genes, e.g., a retinal cell that expresses a viral gene (e.g.,a PER.C6® cell).
General Description
[00117] The invention is based at least in part on a recombinant host cell and cell expression system thereof that decreases expression of two or more an endogenous host cell fatty acid hydrolases (FAHs), decreases the enzymatic function or binding ability of two or more endogenous host cell FAHs, or lacks detectable expression of two or more FAHs. The inventors discovered that the disruption of genes encoding at least two FAHs allows for the optimized and efficient production and purification of biopharmaceutical products expressed in such expression systems. The invention may be employed in several ways, such as 1 ) utilizing gene editing tools to totally knockout FAH expression, whereas no measurable full-length FAH enzyme is expressed in the cell due to disruption of the gene encoding the FAH; 2) utilizing gene editing tools to eliminate or reduce enzymatic activity, whereas the FAH protein is expressed but rendered nonfunctional due to disruptions in its gene; and 3) utilizing gene editing tools to eliminate or reduce the ability of an endogenous host cell FAH to bind exogenous recombinant protein produced by the cell. FAH activity was determined in protein fractions of certain antibody-producing cells. Several particular fatty acid hydrolases were determined as contaminants in these protein fractions, including three carboxylic esterases (a.k.a. esterases): phospholipase B-like (PLBD2), lipoprotein lipase (LPL) and lysosomal acid lipase (LIPA), and acid ceramidase, a carboxylic amidase (an acylase). Gene editing target sites were identified in hamster PLBD2, LPL, LIPA and acid ceramidase (ACE) genes that enable targeted disruption of those genes in a hamster cell {i.e., CHO) genome.
[00118] An optimized host cell comprising a combination of genetic modifications that affect the expression of genes encoding (1 ) PLBD2 and LPL; (2) PLBD2 and LIPA; (3) PLBD2 and ACE; (4) LPL and LIPA; (5) LPL and ACE; (6) LIPA and ACE; (7) PLBD2, LPL and LIPA; (8) PLBD2, LPL and ACE; (9) PLBD2, LIPA and ACE; (10) LPL, LIPA and ACE, (1 1 ) PLBD2, LPL, LIPA and ACE, (12) PLBD2 and PIPLC, (13) PLBD2 and CEH, (14) PLBD2 and PAFAHG, (15) PLBD2 and LCE, (16) PLBD2 and ASA, (17) PLBD2 and IAH1 , (18) PLBD2 and LPLA2, (19) LPL and PIPLC, (20) LPL and CEH, (21 ) LPL and PAFAHG, (22) LPL and LCE, (23) LPL and ASA, (24) LPL and IAH1 , (25) LPL and LPLA2, (26) LIPA and PIPLC, (27) LIPA and CEH, (28) LIPA and PAFAHG, (29) LIPA AND LCE, (30) LIPA and ASA, (31 ) LIPA and IAH1 , (32) LIPA and LPLA2, (33) ACE and PIPLC, (34) ACE and CEH, (35) ACE and PAFAHG, (36) ACE and LCE, (37) ACE and ASA, (38) ACE and IAH1 , (39) ACE and LPLA2, (40) PIPLC and CEH, (41 ) PIPLC and PAFAHG, (42) PIPLC and LCE, (43) PIPLC and ASA, (44) PIPLC and IAH1 , (45) PIPLC and LPLA2, (46) CEH and PAFAHG, (47) CEH and LCE, (48) CEH and ASA, (49) CEH and IAH1 , (50) CEH and LPLA2, (51 ) PAFAHG and LCE, (53) PAFAHG and ASA, (54) PAFAHG and IAH1 , (55) PAFAHG and LPLA2, (56) LCE and ASA, (57) LCE and IAH1 , (58) LCE and LPLA2, (56) ASA, and IAH1 , (58) ASA and LPLA2, or (59) IAH1 and LPLA2, and the like is/are useful for the biological production of high-quality proteins.
[00119] Such a cell is envisioned to reduce the burden of certain purification steps, thereby reducing time and cost, while increasing production yield. Also, the formulated protein is expected to have improved stability due to the reduced hydrolase burden.
[00120] The invention is also based on the specific targeting of an exogenous gene to the integration site. The methods of the invention allow efficient modification of the cell genome, thus producing a modified or recombinant host cell useful as a cell expression system for the
bioprocessing of therapeutic or other commercial protein products. To this end, the methods of the invention employ cellular genome gene editing strategies for the alteration of particular genes of interest that otherwise may diminish or contaminate the quality of recombinant protein formulations, or require multiple purification steps.
[00121] The compositions of the invention, e.g., gene editing tools, can also be included in expression constructs for example, in expression vectors for cloning and engineering new cell lines. These cell lines comprise the modifications described herein, and further modifications for optimal incorporation of expression constructs for the purpose of protein expression are envisioned.
Expression vectors comprising polynucleotides can be used to express proteins of interest transiently, or can be integrated into the cellular genome by random or targeted recombination such as, for example, homologous recombination or recombination mediated by recombinases that recognize specific recombination sites (e.g., Cre-lox-mediated recombination).
[00122] Target sites for disruption or insertion of DNA are typically identified with the maximum effect of the gene disruption or insertion in mind. For example, target sequences may be chosen near the N-terminus of the coding region of the gene of interest whereas a DNA break is introduced within the first or second exon of the gene. Introns (non-coding regions) are not typically targeted for disruption as repair of the DNA break in that region may not disrupt the target gene. The changes introduced by these modifications are permanent to the genomic DNA of the organism.
[00123] Essentially, following identification of target sites within SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77, gene editing protocols were employed to render a nonfunctional versions of two or more of those genes. In some embodiments, the one of those genes encodes PLBD2 (e.g., SEQ ID NO:33). Once the contaminant FAH host cell proteins are eliminated, protocols known in the art for introducing an expressible gene of interest (GOI), such as a multi- subunit antibody, along with any other desirable elements such as, e.g., promoters, enhancers, markers, operators, ribosome binding sites (e.g., internal ribosome entry sites), efc. are also employed.
[00124] The resulting recombinant cell line conveniently provides more efficient downstream bioprocess methods with respect to expressible exogenous genes of interest (GOIs), since purification steps for exogenous proteins of interest are eliminated due to the absence of the contaminant host cell protein. Eliminating or refining purification procedures also results in higher amounts (titer) of the recovered protein of interest.
Physical and Functional Characterization of Modified CHO Cells
[00125] Applicants have discovered enzymatic activities associated with the destabilization of polysorbates (including polysorbate 20 and polysorbate 80) and/or enzymatic activities co-purifying with highly concentrated, multimerized or aggregated protein. Those activities were found to be associated with one or more fatty acid hydrolases (FAHs). One such FAH was identified from the peptide sequences listed in Table 2. A BLAST search of those peptide sequences revealed identity with a putative phospholipase B-like 2 (PLBD2, also referred to as PLBL2). PLBD2 is highly conserved in hamster (SEQ ID NO:32), mice (SEQ ID NO:34), rat (SEQ ID NO:35), human (SEQ ID NO:36), and bovine (SEQ ID NO:37). The applicants discovered that PLBD2, which co-purifies under certain processes with some classes of proteins-of-interest manufactured in a mammalian cell line, has enzymatic activity responsible for the hydrolysis of polysorbate 20 and 80. Other FAH species, of which PLBD2 is an example, may contribute to polysorbate instability or persist as hydrophobic "sticky" proteins that bind protein multimers or aggregates during purification and ultimate formulation, depending upon the particular protein-of-interest and/or background of the host cell.
Table 2
[00126] Ester hydrolysis of polysorbate 80 was recently reported (see Labrenz, S.R., "Ester hydrolysis of polysorbate 80 in mAb drug product: evidence in support of the hypothesized risk after observation of visible particulate in mAb formulations," J. Pharma. Sci. 103(8):2268-77 (2014)). That paper reported the formation of visible particles in a formulation containing IgG. The author postulated that the colloidal IgG particles formed due to the enzymatic hydrolysis of oleate esters of polysorbate 80. Although no esterase was directly identified, the author speculates that a lipase or tweenase co-purified with the IgG, which was responsible for degrading the polysorbate 80.
Interestingly, IgGs formulated with polysorbate 20 did not form particles and the putative esterase did not hydrolyze the polysorbate 20. The author reported that the putative lipase associated with the IgG did not affect saturated C 12 fatty acid {i.e., laurate) (Id at 7.)
[00127] The applicants discovered three other FAH proteins in addition to PLBD2 that co-purify with antibodies produced in mammalian cells: lipoprotein lipase(LPL), lysosomal acid lipase (LI PA) and acid ceramidase (ACE). LPL is a triacylglycerol/diacylglycerol hydrolase of the carboxylic ester hydrolase (esterase) family (see Hide et al., "Structure and evolution of the lipase superfamily," J Lipid Res. 1992 Feb; 33(2): 167-78). LI PA is a sterol esterase and synthase that acts on esters of sterols and long-chain fatty acids (see Dubland and Francis, "Lysosomal acid lipase: at the crossroads of normal and atherogenic cholesterol metabolism," Front. Cell Dev. Biol. 2015 Feb 2; 3(3): 1-11 ). Acid ceramidase is not a carboxylic ester hydrolase, but rather an amide hydrolase that cleaves fatty acids from ceramide at the amide bond (carboxylic amide hydrolase) (see Park and Schuchman, "Acid ceramidase and human disease," Biochim. Biophys. Acta. 2006 Dec; 1758(12): 2133-8).
[00128] Phospholipases are a family of esterase enzymes that catalyze the cleavage of phospholipids. Each phospholipase subclass has different substrate specificity based on its target cleavage site. Phospholipase B (PLB) was identified as related to a group of prokaryotic and eukaryotic lipase proteins by virtue of the presence of a highly conserved amino acid sequence motif, Gly-Asp-Ser-Leu (GDSL) (Upton, C, and Buckley, JT. A new family of lipolytic enzymes? Trends Biochem Sci. 1995; 20:178-179). However, phospholipase B is also classified with known GDSL(S) hydrolases, and has little sequence homology to true lipases, differentiating itself structurally from phospholipases by having a serine-containing motif closer to the N-terminus than other lipases. Thus, phospholipase B-like proteins are also classified as N-terminal nucleophile (Ntn) hydrolases. Functionally, phospholipase B-like enzymes hydrolyze their target substrate (fatty acid esters such as diacylglycerophospholipids) to produce free fatty acids and ester-containing compounds {e.g., produces glycerophosphocholine), in a similar in manner as other
phospholipases. It has been suggested that PLB-like proteins, such as phospholipase B-like protein 1 (PLBD1 ) and phospholipase B-like protein 2 (PLBD2), also have amidase activity, similar to other Ntn hydrolases (Repo, H. et al, Proteins 2014; 82:300-31 1 ).
[00129] Lipoprotein lipases have also been demonstrated to cleave carboxylic ester bonds of polysorbate 20 and 80; and to associate with some monoclonal antibodies during production (see N. Levy, "Host cell protein impurities and protein-protein interactions in downstream purification of monoclonal antibodies," Dissertation submitted to the Faculty of the University of Delaware, Summer 2014, UMI 3642330, Published by ProQuest LLC, 2014). Cell-cultures of a CHO-K1 RNAi knock-down of LPL reveled diminished polysorbate esterase activity. The effect of such a knockdown on overall cell viability or the production of useful titers of ectopic protein has not been investigated.
[00130] Knockout of a host cell gene, such as an FAH, more particularly one or more of phospholipase B-like protein 2, lipoprotein lipase, lysosomal acid lipase and acid ceramidase may be accomplished in several ways. Rendering the FAH encoding gene nonfunctional, or reducing the functional activity of the target FAH protein may be done by introducing point mutations in the FAH genomic sequence, particularly in the exons (coding regions). In some embodiments, the nucleic acid sequences of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 were identified and sequences upstream and downstream of the target site {i.e., homologous arms) may be utilized to integrate an expression cassette comprising a mutated gene by homologous recombination. Further gene editing tools are described herein in accordance with the invention.
[00131] Cell lines devoid of multiple FAH activities, particularly PLBD2, LPL, LIPA and ACE activity, are useful for the production of therapeutic proteins to be purified and stored long term, and such cell lines solve problems associated with long term storage of pharmaceutical compositions in a formulation containing a fatty acid ester surfactant by maintaining protein stability and reducing subvisible particle (SVP) formation (see also PCT International Application No. PCT/US 15/54600 filed October 8, 2015, which is hereby incorporated in its entirety into the specification).
[00132] Assays to detect FAH activity include polysorbate degradation measurements. Unpurified protein supernatants or fractions from CHO cells, and supernatant at each step or sequence of steps when subjected to sequential purification steps, is tested for stability of polysorbate, such as polysorbate 20 or 80. The measurement of percent intact polysorbate reported is inversely proportional to the amount of contaminating FAH activity. Other measurements for detection of FAH activity or presence of FAH in a protein sample are known in the art. Detection of FAH protein (e.g., lipase, phospholipase, PLBD2, LPL, LIPA, acylase, ACE) may be done by trypsin digest mass spectrometry.
[00133] It is hypothesized that the instability of the non-ionic detergent, i.e., surfactant such as polysorbate, in a protein (e.g., antibody) formulation contributes to the formation of subvisible particles. Thus, degradation of the polysorbate incurs loss of surfactant activity, and therefore allows the protein to aggregate and form subvisible particles. Additionally or alternatively, the fatty acids released by the degrading sorbitan fatty acid esters may also contribute to subvisible particle formation as immiscible fatty acid droplets. Therefore, levels of subvisible particles≥ 10
micrometers in diameter may be counted in the protein formulation in order to detect esterase or other FAH activity.
[00134] Other assays for detecting FAH, especially esterase activity are known in the art. For example, glycerophospho[3H]choline formation from phosphatidyl[3H]choline following incubation of phosphatidyl[3H]choline and protein supernatant may be determined by thin-layer chromatography (following similar protocols according to Kanoh, H. et al. 1991 Comp Biochem Physiol 102B(2):367- 369).
[00135] SEQ ID NO:32 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of PLBD2, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:32, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:32 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
[00136] The mammalian PLBD2 sequences (nucleic acid and amino acid) are conserved among hamster, human, mouse and rat genomes. Table 3 identifies exemplary mammalian PLBD2 proteins and their degree of homology.
TABLE 3: Amino acid identity of PLBD2 homologs
[00137] In certain embodiments, the targeted disruption of SEQ ID NO:33 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 33-62, 37-56, 40-69, 44-63, 1 10-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
[00138] In another embodiment, the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 33-62, 37-56, 40-69, 44-63, 1 10-139, 198-227, 182-21 1 , and 242-271 of SEQ ID NO:33.
[00139] In another embodiment, the PLBD2 nucleic acid sequence is at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:33 or target sequence thereof.
[00140] SEQ ID NO:62 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of LPL, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:62, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:62 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
[00141] The mammalian LPL sequences (nucleic acid and amino acid) are conserved among hamster, human, mouse and rat genomes. Table 4 identifies exemplary mammalian LPL proteins and their degree of homology. TABLE 4: Amino acid identity of LPL homologs
[00142] In certain embodiments, the targeted disruption of SEQ ID NO:61 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400- 420, 410-430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490- 510, 500- 520, 510-530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590- 610, 600- 620, 610-630, 620-640, 630-650, 465-484, 558-577, and 593-612 of SEQ ID NO:61.
[00143] In another embodiment, the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 350-370, 360-380, 370-390, 380-400, 390-410, 400-420, 410- 430, 420-440, 430-450, 440-460, 450-470, 460-480, 470-490, 480-500, 490- 510, 500-520, 510- 530, 520-540, 530-550, 540-560, 550-570, 560-580, 570-590, 580-600, 590- 610, 600-620, 610- 630, 620-640, 630-650, 465-484, 558-577, and 593-612 of SEQ ID NO:61.
[00144] In another embodiment, the LPL nucleic acid sequence is at least about 80% identical, at least about 81% identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:61 or target sequence thereof.
[00145] SEQ ID NO:70 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of LI PA, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:70, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:70 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
[00146] The mammalian LIPA sequences (nucleic acid and amino acid) are conserved among hamster, human, mouse and rat genomes. Table 5 identifies exemplary mammalian LIPA proteins and their degree of homology.
TABLE 5: Amino acid identity of LIPA homologs
[00147] In certain embodiments, the targeted disruption of SEQ ID NO:69 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 180-199, 239-258, and 276-295 of SEQ ID NO:69.
[00148] In another embodiment, the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 180-199, 239-258, and 276-295 of SEQ ID NO:69. [00149] In another embodiment, the LIPA nucleic acid sequence is at least about 70% identical, at least about 71% identical, at least about 72% identical, at least about 73% identical, at least about 74% identical, at least about 75% identical, at least about 76% identical, at least about 77% identical, at least about 78% identical, or at least about 79% identical, at least about 80% identical, at least about 81 % identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:69 or target sequence thereof.
[00150] SEQ ID NO:78 disclosed herein was identified from proteins expressed in CHO cells. Other mammalian species (such as, for example, humans, rats, mice), were found to have high homology to the identified FAH. Homologous sequences may also be found in cell lines derived from other tissue types of Cricetulus griseus, or other homologous species, and can be identified and isolated by techniques that are well-known in the art. For example, one may identify other homologous sequences by cross-species hybridization or PCR-based techniques. In addition, variants of LPL, can then be tested for FAH activity as described herein. DNAs that encode proteins at least about 80% identical in amino acid identity to SEQ ID NO:78, or variants thereof, and having FAH activity are expected to exhibit their FAH activity on biopharmaceutical compositions and are candidates for targeted disruption in the engineered cell line. Accordingly, homologs of SEQ ID NO:78 or variants thereof, and the cells expressing such homologs are also encompassed by embodiments of the invention.
[00151] The mammalian acid ceramidase (ACE) sequences (nucleic acid and amino acid) are conserved among hamster, human, mouse and rat genomes. Table 6 identifies exemplary mammalian ACE proteins and their degree of homology.
TABLE 6: Amino acid identity of ACE homologs
[00152] In certain embodiments, the targeted disruption of SEQ ID NO:77 is directed to the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130- 150, 140-160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200- 220, 210-230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300- 320, 310-330, 320-340, 330-350, 340-360, 350-370, 360-380, 135-154, 237-256, and 332-351 of SEQ ID NO:77.
[00153] In another embodiment, the target sequence is wholly or partially within the region selected from the group consisting of nucleotides spanning positions numbered 1-20, 10-30, 20-40, 30-50, 30-60, 30-70, 40-60, 40-70, 50-70, 60-80, 70-90, 80-100, 90-1 10, 1 10-140, 120-140, 130-150, 140- 160, 150-170, 160-180, 160-180, 170-190, 180-200, 180-220, 190-210, 190-230, 200-220, 210- 230, 220-240, 230-250, 240-260, 250-270, 260-280, 270-290, 280-300, 290- 310, 300-320, 310- 330, 320-340, 330-350, 340-360, 350-370, 360-380, 135-154, 237-256, and 332-351 of SEQ I D NO:77.
[00154] In another embodiment, the ACE nucleic acid sequence is at least about 80% identical, at least about 81% identical, at least about 82% identical, at least about 83% identical, at least about 84% identical, at least about 85% identical, at least about 86% identical, at least about 87% identical, at least about 88% identical, or at least about 89% identical, at least about 90% identical, at least about 91 % identical, at least about 92% identical, at least about 93% identical, at least about 94% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, or at least about 99% identical to the sequence of SEQ ID NO:77 or target sequence thereof.
[00155] Cell populations expressing enhanced levels of a protein of interest can be developed using the cell lines and methods provided herein. The isolated commercial protein, protein supernatant or fraction thereof, produced by the cells of the invention have no detectable esterase or esterase activity. Cell pools further modified with exogenous sequence(s) integrated within the genome of the modified cells of the invention are expected to be stable over time, and can be treated as stable cell lines for most purposes. Recombination steps can also be delayed until later in the process of development of the cell lines of the invention.
Genetically Modifying the Target Host Cell Protein
[00156] Methods for genetically engineering a host cell genome in a particular location (i.e., target host cell protein) may be achieved in several ways. Genetic editing techniques were used to modify a nucleic acid sequence in a eukaryotic cell, wherein the nucleic acid sequence is an endogenous sequence normally found in such cells and expressing a contaminant host cell protein. Clonal expansion is necessary to ensure that the cell progeny will share the identical genotypic and phenotypic characteristics of the engineered cell line. In some examples, native cells are modified by a homologous recombination technique to integrate a nonfunctional or mutated target nucleic acid sequence encoding a host cell protein, such as a variant of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and/or SEQ ID NO:77.
[00157] One such method of editing the CHO PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and ASA genomic sequences involves the use of guide RNAs and a type II Cas enzyme to specifically target an exon of PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and/or ASA. Specific guide RNAs directed to particular exons of CHO PLBD2, LPL, LIPA, ACE, PAFAHG, PIPLC, LCE, IAH1 , LPLA2, CEH, and ASA have been employed (Table 7) in a site-specific nuclease editing method as described herein. Other methods of targeted genome editing, for example nucleases, recombination-based methods, or RNA interference, to modify the FAH genes may be employed for the targeted disruption of the CHO genome.
[00158] In some embodiments, the engineered mammalian host cell (i.e. non-natural cell) comprises one or more disruptions within gene sequences selected from the group consisting of nucleotides 37-63 of SEQ ID NO:33, nucleotides 465-612 of SEQ ID NO:61 , nucleotides 180-295 of SEQ ID NO:69, nucleotides 135-351 of SEQ ID NO:77, nucleotides 249-388 of SEQ ID NO:1 17, nucleotides 1624-2157 of SEQ ID NO:163, nucleotides 372-1399 of SEQ ID NO:1 18, nucleotides 1 155-1600 of SEQ ID NO:1 19, nucleotides 423-615 of SEQ ID NO:120, nucleotides 753-1 141 of SEQ ID NO:121 , nucleotides 311 -581 of SEQ ID NO:164, nucleotides 1 155-1443 of SEQ ID NO:122, and nucleotides 813-1414 of SEQ ID NO:123.
Table 7
[00159] In one aspect, methods and compositions for knockout or downregulation of a nucleic acid molecule encoding an ortholog of a host cell FAH protein having at least 80% identity to SEQ ID NO:33 (PLBD2), SEQ ID NO:61 (LPL), SEQ ID NO:69 (LIPA) and/or SEQ ID NO:77 (ACE); or antibody-binding variant thereof, are via homologous recombination. A nucleic acid molecule encoding an FAH protein (or any protein of interest in general) can be targeted by homologous recombination or by using site-specific nuclease methods that specifically target sequences at the FAH-expressing site of the host cell genome. For homologous recombination, homologous polynucleotide molecules {i.e., homologous arms) line up and exchange a stretch of their sequences. A transgene can be introduced during this exchange if the transgene is flanked by homologous genomic sequences. In one example, a recombinase recognition site can also be introduced into the host cell genome at the integration sites.
[00160] Homologous recombination in eukaryotic cells can be facilitated by introducing a break in the chromosomal DNA at the integration site. Model systems have demonstrated that the frequency of homologous recombination during gene targeting increases if a double-strand break is introduced within the chromosomal target sequence. This may be accomplished by targeting certain nucleases to the specific site of integration. DNA-binding proteins that recognize DNA sequences at the target gene are known in the art. Gene targeting vectors are also employed to facilitate homologous recombination. In the absence of a gene targeting vector for homology directed repair, the cells frequently close the double-strand break by non-homologous end-joining (NHEJ) which may lead to deletion or insertion of multiple nucleotides at the cleavage site. Gene targeting vector construction and nuclease selection are within the skill of the artisan to whom this invention pertains.
[00161] In some examples, zinc finger nucleases (ZFNs), which have a modular structure and contain individual zinc finger domains, recognize a particular 3-nucleotide sequence in the target sequence. Some embodiments can utilize ZFNs with a combination of individual zinc finger domains targeting multiple target sequences. ZFN methods to target disruption of two or more FAH genes are also embodied by the invention.
[00162] Transcription activator-like (TAL) effector nucleases (TALENs) may also be employed for site-specific genome editing. TAL effector protein DNA-binding domain is typically utilized in combination with a non-specific cleavage domain of a restriction nuclease, such as Fokl. In some embodiments, a fusion protein comprising a TAL effector protein DNA-binding domain and a restriction nuclease cleavage domain is employed to recognize and cleave DNA at a target sequence within an exon of the gene encoding the target host cell protein, for example an esterase, such as a phospholipase B-like 2 protein(or other mammalian phospholipase), a lipoprotein lipase and a lysosomal acid lipase, and/or a fatty acylase, such as acid ceramidase (or other mammalian ceramidase). Targeted disruption or insertion of exogenous sequences into a specific exon of the CHO protein encoded by SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and/or SEQ ID NO:77 may be done by employing a TALE nuclease (TALEN) targeted to locations within exon 1 , exon 2, exon 3, etc. of the fatty acid hydrolase genomic DNA (see Tables 6 and 7). The TALEN target cleavage site within the gene sequences {e.g., SEQ ID NOs:33, 61 , 69 and 77) may be selected based on ZiFit.partners.org (ZiFit Targeter Version 4.2) and then TALENs are designed based on known methods (Boch J et al., 2009 Science 326:1509-1512; Bogdanove, A. J. & Voytas, D. F. 201 1 Science 333, 1843-1846; Miller, J. C. et al., 201 1 Nat Biotechnol 29, 143-148). TALEN methods to target disruption of two or more of the PLBD2 gene {e.g., exon 1 or exon 2), LPL gene {e.g., exon 2, 3 or 4), LI PA gene {e.g., exon 1 or 2) and ACE gene {e.g., exon 1 , 3 or 4) are also embodied by the invention.
[00163] RNA-guided endonucleases (RGENs) are programmable genome engineering tools that were developed from bacterial adaptive immune machinery. In this system— the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) immune response— the protein Cas9 forms a sequence-specific endonuclease when complexed with two RNAs, one of which guides target selection. RGENs consist of components (Cas9 and tracrRNA) and a target- specific CRISPR RNA (crRNA). Both the efficiency of DNA target cleavage and the location of the cleavage sites vary based on the position of a protospacer adjacent motif (PAM), an additional requirement for target recognition (Chen, H. et al, J. Biol. Chem. published online March 14, 2014, as Manuscript M1 13.539726). CRISPR-Cas9 methods to target disruption of two or more of the PLBD2 gene (e.g., exon 1 or exon 2), LPL gene (e.g., exon 2, 3 or 4), LIPA gene (e.g., exon 1 or 2) and ACE gene (e.g., exon 1 , 3 or 4) are also embodied by the invention..
[00164] Still other methods of homologous recombination are available to the skilled artisan, such as BuD-derived nucleases (BuDNs) with precise DNA-binding specificities (Stella, S. et al. Acta Cryst. 2014, D70, 2042-2052). A single residue-to-nucleotide code guides the BuDN to the specific DNA target within polynucleotide of interest {e.g., SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and or SEQ ID NO:77).
[00165] Sequence-specific endonucleases, or any homologous recombination technique, may be directed to a target sequence at any one of the exons encoding PLBD2, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003614971 .1 , at: Exon 1 within nucleotides (nt) 175367 to 175644 (SEQ ID NO:47); Exon 2 within nt 168958 to 169051 (SEQ ID NO:48); Exon 3 within nt 166451 to166609 (SEQ ID NO:49); Exon 4 within nt 164966 to 165066 (SEQ ID NO:50); Exon 5 within nt 164564 to164778 (SEQ ID NO:51 ); Exon 6 within nt 162682 to162779 (SEQ ID NO:52); Exon 7 within nt 160036 to160196 (SEQ ID NO:53); Exon 8 within nt 159733 to 159828 (SEQ ID NO:54); Exon 9 within nt 159491 to 159562 (SEQ ID NO:55); Exon 10 within nt 158726 to 158878 (SEQ ID NO:56); Exon 11 within nt 158082 to 158244 (SEQ ID NO:57); or Exon 12 at nucleotides (nt) 157747 to 157914 (SEQ ID NO:58), wherein PLBD2 exons 1 -12 are described on the minus strand gene and the complement of each sequence is also incorporated herewith.
[00166] Sequence-specific endonucleases, or any homologous recombination technique, may be directed to a target sequence at any one of the exons encoding LPL, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003613760.1 , at: Exon 1 within nucleotides (nt) 1257424 to 1257507 (SEQ ID NO:85); Exon 2 within nt 1266450 to 1266610 (SEQ ID NO:86); Exon 3 within nt 1270069 1270248 (SEQ ID NO:87); Exon 4 within nt 1271770 to 1271881 (SEQ ID NO:88); Exon 5 within nt 12283518 12283751 (SEQ ID NO:89); Exon 6 within nt 123715 1273957 (SEQ ID NO:90); Exon 7 within nt 1276672 1276792 (SEQ ID NO:91 ); Exon 8 within nt 1278328 to 1278510 (SEQ ID NO:92); or Exon 9 within nt 1279637 to 1279738 (SEQ ID NO:93), wherein LPL exons 1-9 are described on the minus strand gene and the complement of each sequence is also incorporated herewith.
[00167] Sequence-specific endonucleases, or any homologous recombination technique, may be directed to a target sequence at any one of the exons encoding LI PA, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003614200.1 , at: Exon 1 within nucleotides (nt) 985778 to 985674 (SEQ ID NO:94); Exon 2 within nt 984375 to 984258 (SEQ ID NO:95); Exon 3 within nt 970771 970573 (SEQ ID NO:96); Exon 4 within nt 969327 to 969218 (SEQ ID NO:97); Exon 5 within nt 968139 968003 (SEQ ID NO:98); Exon 6 within nt 961871 to 961725 (SEQ ID NO:99); Exon 7 within nt 960826 to 960755 (SEQ ID NO:100); Exon 8 within nt 955312 to 955241 (SEQ ID NO: 101 ); or Exon 9 within nt 954457 to 954224 (SEQ ID NO: 102), wherein LI PA exons 1-9 are described on the minus strand gene and the complement of each sequence is also incorporated herewith.
[00168] Sequence-specific endonucleases, or any homologous recombination technique, may be directed to a target sequence at any one of the exons encoding ACE, for example in the CHO-K1 genome, NCBI Reference Sequence: NW_003613654.1 , at: Exon 1 within nucleotides (nt) 1378167 to 1378244 (SEQ ID NO: 103); Exon 2 within nt 1393746 to 1393792 (SEQ ID NO: 104); Exon 3 within nt 1398208 to 1398298 (SEQ ID NO:105); Exon 4 within nt 1399171 to 1399257 (SEQ ID NO: 106); Exon 5 within nt 1402147 to 1402225 (SEQ ID NO:107); Exon 6 within nt 1404854 to 1404928 (SEQ ID NO:108); Exon 7 within nt 1405714 to 1405759 (SEQ ID NO:109); Exon 8 within nt 1406783 to 1406927 (SEQ ID NO:1 10); Exon 9 within nt 1407840 to 1407894 (SEQ ID NO:11 1 ); Exon 10 within nt 1408004 to 1408085 (SEQ ID NO: 1 12); Exon 1 1 within nt 1409791 to 1409922 (SEQ ID NO:1 13); Exon 12 within nt 1410031 to 1410154 (SEQ ID NO:1 14); Exon 13 within nt 1410568 to 1410624 (SEQ ID NO: 1 15); or Exon 14 at nucleotides (nt) 141 1555 to 141 1644 (SEQ ID NO:1 16), wherein ACE exons 1 -14 are described on the minus strand gene and the complement of each sequence is also incorporated herewith.
[00169] Precise genome modification methods are chosen based on the tools available compatible with unique target sequences within SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 so that disruption of the cell phenotype is avoided.
Proteins of Interest
[00170] Any protein of interest suitable for expression in prokaryotic or eukaryotic cells can be used in the engineered host cell systems provided. For example, the protein of interest includes, but is not limited to, an antibody or antigen-binding fragment thereof, a chimeric antibody or antigen- binding fragment thereof, an ScFv or fragment thereof, an Fc-fusion protein or fragment thereof, a growth factor or a fragment thereof, a cytokine or a fragment thereof, or an extracellular domain of a cell surface receptor or a fragment thereof. Proteins of interest may be simple polypeptides consisting of a single subunit, or complex multisubunit proteins comprising two or more subunits. The protein of interest may be a biopharmaceutical product, food additive or preservative, or any protein product subject to purification and quality standards.
[00171] In some embodiments, the protein product (protein of interest) is an antibody, a human antibody, a humanized antibody, a chimeric antibody, a monoclonal antibody, a multispecific antibody, a bispecific antibody, an antigen binding antibody fragment, a single chain antibody, a diabody, triabody or tetrabody, a Fab fragment or a F(ab')2 fragment, an IgD antibody, an IgE antibody, an IgM antibody, an IgG antibody, an lgG1 antibody, an lgG2 antibody, an lgG3 antibody, or an lgG4 antibody. In one embodiment, the antibody is an lgG1 antibody. In one embodiment, the antibody is an lgG2 antibody. In one embodiment, the antibody is an lgG4 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG4 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG1 antibody. In one embodiment, the antibody is a chimeric lgG2/lgG1/lgG4 antibody.
[00172] In some embodiments, the antibody is selected from the group consisting of an anti- Programmed Cell Death 1 antibody (e.g. an anti-PD1 antibody as described in U.S. Pat. Appln. Pub. No. US2015/0203579A1 ), an anti-Programmed Cell Death Ligand-1 (e.g. an anti-PD-L1 antibody as described in in U.S. Pat. Appln. Pub. No. US2015/0203580A1 ), an anti-DII4 antibody, an anti-Angiopoetin-2 antibody (e.g. an anti-ANG2 antibody as described in U.S. Pat. No.
9,402,898), an anti- Angiopoetin-Like 3 antibody (e.g. an anti-AngPtl3 antibody as described in U.S. Pat. No. 9,018,356), an anti-platelet derived growth factor receptor antibody (e.g. an anti-PDGFR antibody as described in U.S. Pat. No. 9,265,827), an anti-Erb3 antibody, an anti- Prolactin Receptor antibody (e.g. anti-PRLR antibody as described in U.S. Pat. No. 9,302,015), an anti- Complement 5 antibody (e.g. an anti-C5 antibody as described in U.S. Pat. Appln. Pub. No
US2015/0313194A1 ), an anti-TNF antibody, an anti-epidermal growth factor receptor antibody (e.g. an anti-EGFR antibody as described in U.S. Pat. No. 9,132,192 or an anti-EGFRvlll antibody as described in U.S. Pat. Appln. Pub. No. US2015/0259423A1 ), an anti-Proprotein Convertase Subtilisin Kexin-9 antibody (e.g. an anti-PCSK9 antibody as described in U.S. Pat. No. 8,062,640 or U.S. Pat. Appln. Pub. No. US2014/0044730A1 ), an anti-Growth And Differentiation Factor-8 antibody (e.g. an anti-GDF8 antibody, also known as anti-myostatin antibody, as described in U.S. Pat Nos. 8,871 ,209 or 9,260,515), an anti-Glucagon Receptor (e.g. anti-GCGR antibody as described in U.S. Pat. Appln. Pub. Nos. US2015/0337045A1 or US2016/0075778A1 ), an anti-VEGF antibody, an anti-l L1 R antibody, an interleukin 4 receptor antibody (e.g an anti-IL4R antibody as described in U.S. Pat. Appln. Pub. No. US2014/0271681 A1 or U.S. Pat Nos. 8,735,095 or
8,945,559), an anti-interleukin 6 receptor antibody (e.g. an anti-IL6R antibody as described in U.S. Pat. Nos. 7,582,298, 8,043,617 or 9,173,880), an anti-IL1 antibody, an anti-IL2 antibody, an anti-IL3 antibody, an anti-l L4 antibody, an anti-IL5 antibody, an anti-l L6 antibody, an anti-l L7 antibody, an anti-interleukin 33 (e.g. anti- IL33 antibody as described in U.S. Pat. Appln. Pub. Nos.
US2014/0271658A1 or US2014/0271642A1 ), an anti-Respiratory syncytial virus antibody (e.g. anti- RSV antibody as described in U.S. Pat. Appln. Pub. No. US2014/0271653A1 ), an anti-Cluster of differentiation 3 (e.g. an anti-CD3 antibody, as described in U.S. Pat. Appln. Pub. Nos.
US2014/0088295A1 and US20150266966A1 , and in U.S. Application No. 62/222,605), an anti- Cluster of differentiation 20 (e.g. an anti-CD20 antibody as described in U.S. Pat. Appln. Pub. Nos. US2014/0088295A1 and US20150266966A1 , and in U.S. Pat. No. 7,879,984), an anti-CD19 antibody, an anti-CD28 antibody, an anti- Cluster of Differentiation-48 (e.g. anti-CD48 antibody as described in U.S. Pat. No. 9,228,014), an anti-Fel d1 antibody (e.g. as described in U.S. Pat. No. 9,079,948), an anti-Middle East Respiratory Syndrome virus (e.g. an anti-MERS antibody as described in U.S. Pat. Appln. Pub. No. US2015/0337029A1 ), an anti-Ebola virus antibody (e.g. as described in U.S. Pat. Appln. Pub. No. US2016/0215040), an anti-Zika virus antibody, an anti- Lymphocyte Activation Gene 3 antibody (e.g. an anti-LAG3 antibody, or an anti-CD223 antibody), an anti-Nerve Growth Factor antibody (e.g. an anti-NGF antibody as described in U.S. Pat. Appln. Pub. No. US2016/0017029 and U.S. Pat. Nos. 8,309,088 and 9,353,176) and an anti-Activin A antibody. In some embodiments, the bispecific antibody is selected from the group consisting of an anti-CD3 x anti-CD20 bispecific antibody (as described in U.S. Pat. Appln. Pub. Nos.
US2014/0088295A1 and US20150266966A1 ), an anti-CD3 x anti-Mucin 16 bispecific antibody (e.g., an anti-CD3 x anti-Muc16 bispecific antibody), and an anti-CD3 x anti- Prostate-specific membrane antigen bispecific antibody (e.g., an anti-CD3 x anti-PSMA bispecific antibody). In some embodiments, the protein of interest is selected from the group consisting of alirocumab, sarilumab, fasinumab, nesvacumab, dupilumab, trevogrumab, evinacumab, and rinucumab. All publications mentioned throughout this disclosure are incorporated herein by reference in their entirety.
[00173] In some embodiments, the protein of interest is a recombinant protein that contains an Fc moiety and another domain, (e.g., an Fc-fusion protein). In some embodiments, an Fc-fusion protein is a receptor Fc-fusion protein, which contains one or more extracellular domain(s) of a receptor coupled to an Fc moiety. In some embodiments, the Fc moiety comprises a hinge region followed by a CH2 and CH3 domain of an IgG. In some embodiments, the receptor Fc-fusion protein contains two or more distinct receptor chains that bind to either a single ligand or multiple ligands. For example, an Fc-fusion protein is a TRAP protein, such as for example an IL-1 trap (e.g., rilonacept, which contains the IL-1 RAcP ligand binding region fused to the 11-1 R1 extracellular region fused to Fc of hlgG1 ; see U.S. Pat. No. 6,927,004, which is herein incorporated by reference in its entirety), or a VEGF trap (e.g., aflibercept or ziv-aflibercept, which contains the Ig domain 2 of the VEGF receptor Flt1 fused to the Ig domain 3 of the VEGF receptor Flk1 fused to Fc of hlgG1 ; see U.S. Pat. Nos. 7,087,41 1 and 7,279,159). In other embodiments, an Fc-fusion protein is a ScFv-Fc-fusion protein, which contains one or more of one or more antigen-binding domain(s), such as a variable heavy chain fragment and a variable light chain fragment, of an antibody coupled to an Fc moiety.
Host Cells and Transfection
[00174] The host cells used in the methods of the invention are eukaryotic host cells including, for example, Chinese hamster ovary (CHO) cells, human cells, rat cells and mouse cells. In a preferred embodiment, the invention provides a cell comprising a disrupted nucleic acid sequence fragment of SEQ ID NO:33 and at least one more of SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77.
[00175] The invention includes an engineered mammalian host cell further transfected with an expression vector comprising an exogenous gene of interest, such gene encoding the
biopharmaceutical product. While any mammalian cell may be used, in one particular embodiment the host cell is a CHO cell.
[00176] Transfected host cells include cells that have been transfected with expression vectors that comprise a sequence encoding a protein or polypeptide. Expressed proteins will preferably be secreted into the culture medium for use in the invention, depending on the nucleic acid sequence selected, but may be retained in the cell or deposited in the cell membrane. Various mammalian cell culture systems can be employed to express recombinant proteins. Other cell lines developed for specific selection or amplification schemes will also be useful with the methods and compositions provided herein, provided that at least two genes encoding a different fatty acid hydrolase (FAH) having at least 80% homology to at least two of SEQ ID NO:33, SEQ ID NO:61 , SEQ ID NO:69 and SEQ ID NO:77 have been downregulated, knocked out or otherwise disrupted in accordance with the invention. An embodied cell line is the CHO cell line designated K1 . To achieve high volume production of recombinant proteins, the host cell line may be pre-adapted to bioreactor medium in the appropriate case.
[00177] Several transfection protocols are known in the art, and are reviewed in Kaufman (1988) Meth. Enzymology 185:537. The transfection protocol chosen will depend on the host cell type and the nature of the GOI, and can be chosen based upon routine experimentation. The basic requirements of any such protocol are first to introduce DNA encoding the protein of interest into a suitable host cell, and then to identify and isolate host cells which have incorporated the
heterologous DNA in a relatively stable, expressible manner.
[00178] One commonly used method of introducing heterologous DNA into a cell is calcium phosphate precipitation, for example, as described by Wigler ef al. (Proc. Natl. Acad. Sci. USA 77:3567, 1980). DNA introduced into a host cell by this method frequently undergoes
rearrangement, making this procedure useful for cotransfection of independent genes.
[00179] Polyethylene-induced fusion of bacterial protoplasts with mammalian cells (Schaffner et al., (1980) Proc. Natl. Acad. Sci. USA 77:2163) is another useful method of introducing heterologous DNA. Protoplast fusion protocols frequently yield multiple copies of the plasmid DNA integrated into the mammalian host cell genome, and this technique requires the selection and amplification marker to be on the same plasmid as the GOI.
[00180] Electroporation can also be used to introduce DNA directly into the cytoplasm of a host cell, for example, as described by Potter et al. (Proc. Natl. Acad. Sci. USA 81 :7161 , 1988) or Shigekawa et al. (BioTechniques 6:742, 1988). Unlike protoplast fusion, electroporation does not require the selection marker and the GOI to be on the same plasmid.
[00181] Other reagents useful for introducing heterologous DNA into a mammalian cell have been described, such as Lipofectin™ Reagent and Lipofectamine™ Reagent (Gibco BRL, Gaithersburg, Md.). Both of these commercially available reagents are used to form lipid-nucleic acid complexes (or liposomes) which, when applied to cultured cells, facilitate uptake of the nucleic acid into the cells.
[00182] Methods for amplifying the GOI are also desirable for expression of the recombinant protein of interest, and typically involves the use of a selection marker (reviewed in Kaufman supra). Resistance to cytotoxic drugs is the characteristic most frequently used as a selection marker, and can be the result of either a dominant trait (e.g., can be used independent of host cell type) or a recessive trait (e.g., useful in particular host cell types that are deficient in whatever activity is being selected for). Several amplifiable markers are suitable for use in the cell lines of the invention and may be introduced by expression vectors and techniques well known in the art (e.g., as described in Sambrook, Molecular Biology: A Laboratory Manual, Cold Spring Harbor Laboratory, NY, 1989; pgs 16.9-16.14).
[00183] Useful selectable markers and other tools for gene amplification such as regulatory elements, described previously or known in the art, can also be included in the nucleic acid constructs used to transfect mammalian cells. The transfection protocol chosen and the elements selected for use therein will depend on the type of host cell used. Those of skill in the art are aware of numerous different protocols and host cells in order to adapt the invention for a particular use, and can select an appropriate system for expression of a desired protein, based on the
requirements of the cell culture system.
[00184] In specific embodiments, the invention relates to the following items:
[00185] 1. A recombinant host cell comprising a modification in two or more genes encoding two or more fatty acid hydrolases (FAH).
[00186] 2. The recombinant host cell according to item 1 , wherein the two or more FAHs are selected from the group consisting of phospholipase B-like 2 (PLBD2) protein, lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and acid ceramidase (ACE).
[00187] 3. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00188] a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
[00189] b. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification.
[00190] 4. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00191] a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and [00192] b. a modification in a coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LI PA in a cell lacking the modification.
[00193] 5. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00194] a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
[00195] b. a modification in a coding sequence of a polynucleotide encoding the ACE, wherein the modification decreases the expression level of the ACE relative to the expression level of ACE in a cell lacking the modification.
[00196] 6. The recombinant host cell according to any of previous items, wherein the cell further comprises a modification in a coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LI PA in a cell lacking the modification.
[00197] 7. The recombinant host cell according to any of previous items, wherein the cell further comprises a modification in a coding sequence of a polynucleotide encoding the ACE, wherein the modification decreases the expression level of the ACE relative to the expression level of ACE in a cell lacking the modification.
[00198] 8. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00199] a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and
[00200] b. a modification in a coding sequence of a polynucleotide encoding the LI PA protein, wherein the modification decreases the expression level of the LI PA protein relative to the expression level of a LI PA protein in a cell lacking the modification.
[00201] 9. The recombinant host cell according to any of previous items, wherein the cell comprises:
[00202] a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and [00203] b. a modification in a coding sequence of a polynucleotide encoding the ACE protein, wherein the modification decreases the expression level of the ACE protein relative to the expression level of a ACE protein in a cell lacking the modification.
[00204] 10. The recombinant host cell according to any of previous items, wherein the cell further comprises a modification in a coding sequence of a polynucleotide encoding the LI PA protein, wherein the modification decreases the expression level of the LI PA protein relative to the expression level of a LI PA protein in a cell lacking the modification.
[00205] 1 1. The recombinant host cell according to any of previous items, wherein all alleles of the coding sequence of the two or more genes encoding the two or more FAHs comprise the modification.
[00206] 12. The recombinant host cell according to any of previous items, wherein the cell does not express detectable levels of PLBD2 protein, LPL, LIPA and/or ACE.
[00207] 13. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within exon 1 of the coding sequence of the
polynucleotide encoding the PLBD2 protein.
[00208] 14. The recombinant host cell according to any of previous items, wherein the PLBD2 protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:32.
[00209] 15. The recombinant host cell according to any of previous items, wherein the PLBD2 protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:35, and SEQ ID NO:36.
[00210] 16. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:47.
[00211] 17. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within exon 2, exon 3, or exon 4 of the coding sequence of the polynucleotide encoding the LPL protein.
[00212] 18. The recombinant host cell according to any of previous items, wherein the LPL protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:62.
[00213] 19. The recombinant host cell according to any of previous items, wherein the LPL protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:62, SEQ ID NO:66, SEQ ID NO:67, and SEQ ID NO:68.
[00214] 20. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:86, SEQ ID NO:87 or SEQ ID NO:88.
[00215] 21. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within exon 1 or exon 2 of the coding sequence of the polynucleotide encoding the LI PA protein.
[00216] 22. The recombinant host cell according to any of previous items, wherein the LI PA protein comprises an amino acid sequence at least 70% identical to SEQ ID NO:70.
[00217] 23. The recombinant host cell according to any of previous items, wherein the LI PA protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:70, SEQ ID NO:74, SEQ ID NO:75, and SEQ ID NO:76.
[00218] 24. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:94 or SEQ ID NO:95.
[00219] 25. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within exon 1 , exon 3 or exon 4 of the coding sequence of the polynucleotide encoding the ACE protein.
[00220] 26. The recombinant host cell according to any of previous items, wherein the ACE protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:78.
[00221] 27. The recombinant host cell according to any of previous items, wherein the ACE protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:78, SEQ ID NO:82, SEQ ID NO:83, and SEQ ID NO:84.
[00222] 28. The recombinant host cell according to any of previous items, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:103, SEQ ID NO:105 or SEQ ID NO:106.
[00223] 29. The recombinant host cell according to any of previous items further comprising a polynucleotide encoding an exogenous protein of interest.
[00224] 30. The recombinant host cell according to any of previous items, wherein the exogenous protein of interest is selected from the group consisting of an antibody heavy chain, an antibody light chain, an antigen-binding fragment, an antigen-binding protein, and an Fc-fusion protein.
[00225] 31. The recombinant host cell according to any of previous items, wherein the cell produces a Protein A-binding fraction having no detectable fatty acid hydrolase activity.
[00226] 32. A method of producing a recombinant protein comprising the steps of: (a) obtaining a sample comprising a recombinant protein and a plurality of host cell proteins from a host cell that is modified to produce reduced levels of fatty acid hydrolase compared to an unmodified cell; and (b) subjecting the sample to at least one purification step to remove at least one host cell protein.
[00227] 33. The method according to any of items 31 -32, wherein the plurality of host cell proteins
[00228] a. does not comprise a detectable amount of a phospholipase B-like 2 (PLBD2) protein; and
[00229] b. does not comprise a detectable amount of at least one other fatty acid hydrolase.
[00230] 34. The method according to any of items 31 -33, wherein the host cell comprises:
[00231] a. a modification in a coding sequence of a polynucleotide encoding a phospholipase B- like 2 (PLBD2) protein; and
[00232] b. a modification in a coding sequence of a polynucleotide encoding a fatty acid hydrolase selected from the group consisting of lipoprotein lipase (LPL), lysosomeal acid lipase (LI PA), acid ceramidase (ACE), or a combination thereof.
[00233] 35. The method according to any of items 31 -34, wherein the purification step is selected from the group consisting of protein A affinity (PA) chromatography, cation exchange (CEX) chromatography, and anion exchange (AEX) chromatography.
[00234] 36. The method according to any of items 31 -35, wherein the purification step does not comprise hydrophobic interaction chromatography.
[00235] 37. A process for reducing fatty acid hydrolase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein; (b) modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and/or acid ceramidase (ACE); (c) transfecting the host cell with a polynucleotide encoding a protein of interest; (d) extracting a protein fraction comprising the protein of interest from the host cell; (e) contacting the protein fraction with a chromatography media selected from the group consisting of protein A affinity (PA) media, cation exchange (CEX) media, and anion exchange (AEX) media; (f) collecting the protein of interest from the media; and (g) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer.
[00236] 38. The process according to item 37, wherein the step of modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein comprises inserting or deleting at least one nucleotide within exon 1 of a polynucleotide encoding the PLBD2 protein.
[00237] 39. The process according to any of items 37-38, wherein the polynucleotide encoding the PLBD2 protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:32. [00238] 40. The process according to any of items 37-39, wherein the step of modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL) comprises inserting or deleting at least one nucleotide within exon 2, exon 3 or exon 4 of a polynucleotide encoding the LPL protein.
[00239] 41. The process according to any of items 37-40, wherein the polynucleotide encoding the LPL protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:62.
[00240] 42. The process according to any of items 37-41 , wherein the step of modifying the host cell to decrease or ablate the expression of lysosomal acid lipase (LI PA) comprises inserting or deleting at least one nucleotide within exon 1 or exon 2 of a polynucleotide encoding the LI PA protein.
[00241] 43. The process according to any of items 37-42, wherein the polynucleotide encoding the LIPA protein comprises a nucleic acid sequence that is at least 70% identical to SEQ ID NO:70.
[00242] 44. The process according to any of items 37-43, wherein the step of modifying the host cell to decrease or ablate the expression of acid ceramidase (ACE) comprises inserting or deleting at least one nucleotide within exon 1 , exon 3 or exon 4 of a polynucleotide encoding the ACE protein.
[00243] 45. The process according to any of items 37-44, wherein the polynucleotide encoding the ACE protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:78.
[00244] 46 . A composition comprising one or more recombinant proteins obtainable by the method according to any of items 31-36 or the process according to any of items 37-45.
[00245] 47. The composition obtainable according to item 46, wherein the composition is stable.
[00246] 48. The composition obtainable according to any of items 46-47, wherein the stable composition is characterized by one or more of:
[00247] (i) having an adverse enzyme activity of at the most [....] .
[00248] (ii) having a shelf-life of at least [XX Days- YY weeks- ZZ years]
[00249] (iii) having a SVP count of below [....]
[00250] 49. The composition obtainable according to any of items 46-48, wherein the adverse enzyme activity is the activity of one or more of esterases, hydrolases, lipases, phospholipases, ceramidases.
[00251] 50. A composition comprising one or more recombinant proteins, wherein the composition is stable. [00252] 51. The composition according to item 50, wherein the stable composition is characterized by one or more of:
[00253] (i) having an adverse enzyme activity of at the most [....] . [00254] (ii) having a shelf-life of at least [XX Days- YY weeks- ZZ years] [00255] (iii) having a SVP count of below [....]
[00256] 52. The composition according to any of items 50-51 , wherein the adverse enzyme activity is the activity of one or more of esterases, hydrolases, lipases, phospholipases, ceramidases.
[00257] 53. The composition according to any of items 50-52, wherein the recombinant protein is selected form the group comprising antibody heavy chain, antibody light chain, antigen-binding fragment, and Fc-fusion protein, or any combinations thereof.
[00258] Other features of the invention will become apparent in the course of the following descriptions of exemplary embodiments which are given for illustration of the invention and are not intended to be limiting thereof.
Examples
[00259] The following examples are put forth so as to provide those of ordinary skill in the art how to make and use the methods and compositions described herein, and are not intended to limit the scope of the invention. Efforts have been made to ensure accuracy with respect to numbers used {e.g., amount, temperature, etc.) but some experimental error and deviation should be accounted for. Unless indicated otherwise, parts are parts by weight, molecular weight is average molecular weight, temperature is in degrees Centigrade, and pressure is at or near atmospheric.
Example 1. Targeted Disruption of an Esterase Gene in the Host Cell
[00260] To employ disruption of the target esterase gene, i.e., phospholipase B-like 2 gene, of a CHO cell origin, a Type II CRISPR/Cas system which requires at least 20 nucleotides (nt) of homology between a chimeric RNA (i.e., guide RNA) and its genomic target was used. Guide RNA sequences were designed for specific targeting of an exon within the CHO phospholipase B-like 2 (PLBD2) nucleic acid (SEQ ID NO:33) and are considered unique (to minimize off-target effects in the genome). Multiple small guide RNAs (sgRNA) were synthesized for use in the genome editing procedure targeting the following genomic segments of PLBD2 listed in Table 8. Table 8
[00261] The sgRNA expression plasmid (System Biosciences, CAS940A-1 ) contains a human H1 promoter that drives expression of the small guide RNA and the tracrRNA following the sgRNA. Immortalized Chinese hamster ovary (CHO) cells were transfected with the plasmid encoding Cas9- H1 enzyme followed by one of the sgRNA sequences, for instance sgRNAI (SEQ ID NO:45) or sgRNA2 (SEQ ID NO:46), designed to target the first exon of CHO PLBD2. sgRNAI and sgRNA2 were predicted to generate a double strand break (DSB) at or around nucleotides 53 and 59 of SEQ ID NO:33, respectively. A DSB was therefore predicted to occur approx. 23 or 29 nucleotides downstream of the PLBD2 start codon. (Note that nucleotides 1-30 of SEQ ID NO:33 encode a signal peptide.) A negative control transfection was performed where the parental CHO line was transfected with the plasmid encoding Cas9-H1 enzyme without a proceeding sgRNA, or an sgRNA encoding a gene sequence not present in the CHO genome.
[00262] Following transfection, cells were cultured for 6 days in serum-free medium, and then were single cell cloned using flow cytometry. After 12 days in culture, stable clones with desirable growth properties were isolated, expanded in serum-free medium, cell pellets were collected for genotyping and clonal cell lines were banked.
[00263] Genomic DNA (gDNA) and messenger RNA (mRNA) were isolated from the clonal cell pellets and analyzed by quantitative PCR (qPCR). qPCR primers and probes were designed to overlap with the sgRNA sequence used for the double strand break targeting event, in order to detect disruption of the genomic DNA and its transcription. The relative abundance of PLBD2 gene or transcript in the candidate clones was determined using relative qPCR method, where the clones derived from the negative control transfection were used as a calibrator. See Figure 1. The qPCR primers and probes were designed to detect sequences either in the sgRNAI or sgRNA2 position in PLBD2 exon 1 . Both gDNA and RNA isolated from clone 1 failed to support qPCR amplification of PLBD2 exon 1 in either sgRNAI or sgRNA2 regions, but amplification of the housekeeping gene, GAPDH, was detected. Based on this data, clone 1 was identified as a potential knock out of PLBD2 in which both genomic alleles of PLBD2 of exonl were disrupted. It is noted that amplification of genomic DNA and mRNA was not detected in Clone 8 using primers overlapping with sgRNA2, however, sgRNAI primers/probes detected genomic DNA above control values. Clone 8, and others were further analyzed in order to understand the performance of the site- directed nuclease method.
[00264] The size of the entire PLBD2 exon 1 in clone 1 was analyzed by PCR from either gDNA or RNA derived templates and compared to that amplified from the wild type CHO cells. The length of amplicon fragments was determined using Caliper GX instrument (Figure 2). Both gDNA and mRNA amplification from clone 1 resulted in a single PCR fragment which was shorter than the one amplified from the wild type control cells.
[00265] The amplification products were sequenced, resulting in Clone 1 being identified as PLBD2 knock out, in which PLBD2 gene was found to have 1 1 bp deletion resulting in frameshift.
[00266] The inventors also unexpectedly identified Clone 8 as a PLBD2 knockout despite the fact that genomic DNA fragments were identified by qPCR primers overlapping with the sgRNAI sequence. The identification of a clone that has no detectable phospholipase activity or no detectable phospholipase protein was technically challenging and time-consuming. Site-directed nuclease techniques may provide an ease-of-use, however, careful screening and elimination of false positives is necessary and still there may be unpredictable outcomes with regard to the identity of a single clone having two disrupted alleles. Surprisingly, only 1 % of the clones screened using the techniques described above were identified as viable PLBD2 knockout clones. See Table 9.
Table 9
Example 2. Introduction and Expression of a monoclonal antibody (mAbl) in the candidate clonal cell lines
[00267] Clone 1 and the wild type control host cell line were transfected with plasmids encoding the light and heavy chains of mAbl , a fully human IgG, in the presence of Cre recombinase to facilitate recombination mediated cassette exchange (RMCE) into EESYR locus (US Patent No. 7771997B2, issued August 10, 2010). The transfected cultures were selected for 1 1 days in serum-free medium containing 400 ug/mL hygromycin. Cells that underwent RMCE, were isolated by flow cytometry. PLBD2 knock out clone 1 and the wild type host cell line produced equivalent observed
recombinant population (data not shown). The clone 1 derived isogenic cell line expressing mAbl was designated RS001 , and the mAbl expressing cell line originated from the PLBD2 wild type host was designated RS0WT.
[00268] Fed-batch production of mAbl from RS001 or RS0WT was carried out in a standard 12 day process. The conditioned medium for each production culture was sampled at Day 2, 4, 6, 9 and 12 and the Protein-A binding fraction was quantified (Figure 3). Protein titer of mAbl from RS001 culture was comparable to that produced from RS0WT, and unexpectedly there were no observable differences in the behavior of the cells with respect to the two cultures. It cannot be predicted whether disruption of PLBD2 or any endogenous gene in a CHO host cell would have no observable deleterious effect on production of an exogenous recombinant protein, especially a therapeutic monoclonal antibody. Example 3. Esterase Activity Detection in unmodified CHO cells
[00269] Polysorbate 20 or polysorbate 80 degradation was measured to detect putative esterase activity in the supernatants of PLBD2 mutants. Unpurified protein supernatant from CHO cells, and supernatant taken at each step or sequence of steps when subjected to sequential purification steps, was tested for stability of polysorbate. The percent intact polysorbate reported was inversely proportional to the amount of contaminant esterase activity. Unpurified protein supernatant from CHO cells, and supernatant at each step or sequence of steps when subjected to sequential purification steps, was tested in assays measuring polysorbate degradation. The relative levels of intact polysorbate reported is inversely proportional to levels of contaminant esterase activity.
[00270] Degradation of polysorbate 20 was examined to determine the etiological agent responsible for polysorbate 20 degradation in a monoclonal antibody formulation. The buffered antibody (150 mg/mL) was separated into two fractions by 10 kDa filtration: a protein fraction, and a buffer fraction. These two fractions, as well as intact buffered antibody, were spiked with 0.2% (w/v) of super refined polysorbate 20 (PS20-B) and stressed at 45°C for up to 14 days. The study showed (Table 10, part A, columns 1-2) that the protein fraction, not the buffer fraction, had an effect on the degradation of sorbitan laurate (i.e., the major component of polysorbate 20), and that the degradation of polysorbate 20 was correlated with the concentration of the antibody (Table 10, part B, columns 3-4).
Table 10 (part A) Table 10 (part B)
[00271] Monoclonal antibody was produced in an unmodified CHO cell and purified by different processes according to Table 1 1 , and the esterase activity measured by percent intact polysorbate 20, as in Table 9.
Table 11
[00272] Hydrophobic interaction chromatography (HIC) was most efficient at removing residual PLBD2. In some circumstances, a reduction in the number of purification steps and lower cost could be realized. Therefore, it was contemplated that a modified CHO cell having reduced levels of expression of phospholipase reduces the purification steps, and e.g., may eliminate the need for HIC purification.
Example 4. Esterase protein abundance and activity detection in mAbl purified from modified compared to unmodified CHO cells.
[00273] mAbl was produced from RS001 and RSOWT and purified from the conditioned media using either PA alone, or PA and AEX chromatography The PA-purified mAB1 from RS001 and RSOWT were analyzed for lipase abundance using trypsin digest mass spectrometry. The trypsin digests of RS001 and RSOWT mAbl were injected into a reverse phase liquid chromatography column coupled to a triple quadrupole mass spectrometer set to monitor a specific product ion fragmented from SEQ ID NO:32. In parallel, a series of PLBD2 standards were prepared by spiking in varying amounts of recombinant PLBD2 into mAbl with no endogenous PLBD2. The signals of the experimental and control reactions were used to quantify the abundance of PLBD2 in mAbl from RS001 and RSOWT (Figure 4). No detectable amounts of PLBD2 protein were observed in the purified samples of Clone 8-produced mAbl when purified with PA alone (data not shown).
Example 5. Identification of Fatty Acid Hydrolases
Preparation of the HIC Strip Fraction
[00274] A hydrophobic interaction column (Phenyl Sepharose® High Performance [GE Healthcare, Little Chalfont, Buckinghamshire, UK]) was used to generate a "HIC strip" fraction containing a protein of interest {i.e., mAb2) and associated host cell proteins. The column was first equilibrated with two column volumes (CV) of buffer containing 40 mM Tris, 200 mM citrate at pH8.0. The monoclonal antibody-containing load material from an anion exchange pool ("Q pool") was adjusted to 200 mM sodium citrate, pH8.0, then loaded onto the column at a loading amount of 20-40 grams of protein per liter of the phenyl sepharose resin. After loading, the column was washed with six CVs of 40 mM Tris, 200 mM citrate, at pH8.0. Then the column was stripped with three CVs of reverse osmosis deionized water. The stripped fraction was collected for subsequent analysis.
Trypsin digestion
[00275] A 50 pg aliquot of the mAb2-containing HIC strip sample was denatured and reduced in a solution containing 5 mM acetic acid and 5 mM ir/'s(2-carboxyethyl)phosphine-HCI by heating at 80°C for 10 minutes. The sample was then diluted in 50 mM Tris-HCI buffer (pH 8.0) and alkylated with 1 .5 mM iodoacetamide (IAA) and digested with trypsin (modified, sequencing grade from Promega, Madison, Wl) with an enzyme to substrate ratio of 1 :20 (w/w) at 37°C in the dark for three hours. The digestion was then stopped by addition of 10% trifluoroacetic acid (TFA).
NanoLC/MS
[00276] Aliquots (3 μg) of each digested mAb2-containing HIC strip was injected to an Acclaim™ PepMap™ nanoViper C18 trap column (75 pm x 2 cm) on a Thermo EASY-nLC™ system (Thermo Scientific, Waltham, MA) at a flow rate of 2 pL/minute and washed for 15 minutes. Then the peptides were eluted onto the Acclaim™ PepMap™ RSLC nanoViper C18 analytical column (75 pm x 25 cm) which was equilibrated with 99% mobile phase A (0.1% FA in water) prior to sample injection at a flow rate of 250 nL minute. Peptides were separated using a linear gradient from 1 % mobile phase B (0.1 % FA in acetonitrile) to 7% mobile phase B for the first 5 minutes, followed by a second linear increase from 7% to 27% mobile phase B over the next 1 10 minutes, and another subsequent linear increase from 27%-40% in 10 minutes and a final increase to 90% in 5 minutes. The gradient was held at 90% for 20 minutes. A Thermo Q Exactive™ Plus mass spectrometer (Thermo Scientific, Waltham, MA) was used for peptide mass analyses, with high-energy collisional dissociation (HCD) employed for peptide fragmentation for MS/MS experiments.
Data analysis
[00277] The mass spectra were processed using Thermo Xcalibur™ (version 2.2.42) (Thermo Scientific, Waltham, MA). Proteome Discoverer (version 1.4) (Thermo Scientific, Waltham, MA) was also used to perform the peptide identification using both Mascot and Sequent search engines. The peptide spectra from the Proteome Discovery was manually examined to confirm the spectral assignment and protein identification.
Results
[00278] Lipoprotein lipase (UniProtein ID: P06858), lysosomal acid lipase (UniProtein ID: P38571 ) and acid ceramidase (UniProtein ID: Q13510) were identified as potential active fatty acid hydrolases by more than three unique peptides per protein in the mAb2-containing HIC strip fraction.
Example 6. Targeted Disruption of a Lipoprotein Lipase (LPL) Gene
[00279] Clone 1 , in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) were transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lipoprotein lipase (LPL) gene (SEQ ID NO:61 ): sgRNA3 (SEQ ID NO:63), sgRNA4 (SEQ ID NO:64) or sgRNA5 (SEQ ID NO:65) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette was used in some
experiments. In other experiments, two separate plasmids were used, which confered flexibility for combining Cas9 with different sgRNAs. Cre recombinase was co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/0115502A1 , published April 28, 2016). The transfected cultures were selected for 1 1 days in serum-free medium containing 400 μg mL hygromycin. Cells that underwent RMCE were isolated by flow cytometry. The desired knock out genotype was confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing only the LPL knock-out or knock-down loci is designated as "clone 9". The resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 10".
[00280] The wild-type host cell, Clone 1 (PLBD2-KO), Clone 9 (LPL-KO) and Clone 10
(LPL+PLBD2 KO) were cultured in 2L bioreactors under fed-batch conditions at 36.5°C (pH 6.9-7.4) with dissolved oxygen. Cells were inoculated in the production bioreactor (i.e. transferred from a seed train culture (N-1 ) at a cell titer of 5.0 x 10s— 7.0 x 106 cells/mL) and protein production was induced by the addition of doxycycline. Cells were cultured for 14 days in chemically defined media supplemented with nutrient feeds as needed during the batch culture up to day 12. Cell viability (Figure 5), titer (Figure 6), and viable cell count (VCC) (Figure 7) were monitored throughout the batch process. Samples of cells were harvested and subjected to purification via protein A chromatography for titer determination (Hober, S., Nord, K., and Linhult, M., "Protein A
Chromatography for Antibody Purification," Journal of Chromatography B, 848 (2007) 40-47; and Lin et al., "Protein A Affinity Column for Monoclonal Antibody (MAb) Titer Analysis," Thermo Scientific Poster Note PN20806_E 06/13S, available from https://tools.thermofisher.com/.../PN- 20806-Protein-Affinity-Column-Monoclonal- Antibody-Titer-Analysis-PN20806- EN.pdf&usg=AFQjCNFHkixB9Um020cT4nG1 UpTzXwON9Q, version avaialble Aug 23, 2017).
[00281] Additional LPL-KO clones were generated according to the methods provided herein. For example, Figure 8 depicts an alignment of LPL-KO clones 19-22 (SEQ ID NOs:159-162, respectively) compared to a partial Chinese hamster LPL sequence (SEQ ID NO:158) showing the gaps in the respective clone sequences.
[00282] To construct the LPL-KO clones (as well as any other fatty acid hydrolase knock-out clone), plasmids encoding for Cas9 nuclease, eYFP and site-specific sgRNA were stably integrated into the CHO genome using Lipofectamine-based transfection protocol followed by selection for neomycin resistant cells. Seventeen days post transfection, the YFP positive cells were enriched by flow cytometry prior to single cell-sorting. After a 21 -day expansion, mRNA was isolated and the single cell clones were analyzed by qPCR for the presence of gene disruption.
[00283] 1 , 2 or 3 different sgRNA expression cassettes (the cassettes include promoter, sgRNA, terminator) may be placed in the same plasmid. Some constructs are manufactured using at least 2 sgRNA cassettes per lipase in a single plasmid.
Example 7. Targeted Disruption of a Lysosomal Acid Lipase (LIPA) Gene
[00284] Clone 1 , in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) are transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lysosomal acid lipase (LIPA) gene (SEQ ID NO:69): sgRNA6 (SEQ ID NO:71 ), sgRNA7 (SEQ ID NO:72) or sgRNA8 (SEQ ID NO:73) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs. Cre recombinase is co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent
Application Publication No. 2016/01 15502A1 , published April 28, 2016). The transfected cultures are selected for 11 days in serum-free medium containing 400 pg/mL hygromycin. Cells that undergo RMCE are isolated by flow cytometry. The desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing only the LIPA knock-out or knock-down loci is designated as "clone 1 1 ".The resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 12".
Example 8. Targeted Disruption of an Acid Ceramidase (ACE) Gene
[00285] Clone 1 , in which PLBD2 gene was knocked out, and a wild type host cell line (EESYR®) are transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the acid ceramidase (ACE) gene (SEQ ID NO:77): sgRNA9 (SEQ ID NO:79), sgRNAI O (SEQ ID NO:80) or sgRNA11 (SEQ ID NO:81 ) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs. Cre recombinase is co-transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/0115502A1 , published April 28, 2016). The transfected cultures are selected for 1 1 days in serum-free medium containing 400 pg/mL hygromycin. Cells that undergo RMCE are isolated by flow cytometry. The desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing only the ACE knock-out or knock-down loci is designated as "clone 13". The resultant clone containing both the PLBD2 and LPL knock-out or knock-down loci is designated as "clone 14".
Example 9. Targeted Disruption of a Lipoprotein Lipase (LPL) Gene and Lysosomal Acid
Lipase (LIPA)
[00286] Clone 9, in which the LPL gene and is modified, and Clone 10, in which both the LPL gene and the PLBD2 gene are modified, are transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the lysosomal acid lipase (LIPA) gene (SEQ ID NO:69): sgRNA6 (SEQ ID NO:71 ), sgRNA7 (SEQ ID NO:72) and/or sgRNA8 (SEQ ID NO:73) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs. Cre recombinase is co- transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016). The transfected cultures are selected for 1 1 days in serum-free medium containing 400 μg/mL hygromycin. Cells that undergo RMCE are isolated by flow cytometry. The desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing the LIPA modified locus and the LPL modified locus designated as "clone 15".The resultant clone containing the PLBD2, LPL and LIPA modified loci is designated as "clone 16". Example 10. Targeted Disruption of a Lipoprotein Lipase (LPL) Gene, a Lysosomal Acid Lipase (LIPA) Gene and an Acid Ceramidase (ACE) Gene
[00287] Clone 15, in which the LPL and LIPA genes and are modified, and Clone 16, in which the PLBD2, LPL and LIPA genes are modified, are transfected with a pair of plasmids. One of the plasmids encodes the Cas9 protein and the other plasmid encodes the guide RNA designed to target the acid ceramidase (ACE) gene (SEQ ID NO:77): sgRNA9 (SEQ ID NO:79), sgRNAI O (SEQ ID NO:80) or sgRNA1 1 (SEQ ID NO:81 ) (Table 7). Tandem RNA polymerase III promoters such as H1 and/or U6 in any order can be used. A single plasmid encoding both the Cas9 and the sgRNA expression cassette is used in some experiments. In other experiments, two separate plasmids are used, which confers flexibility for combining Cas9 with different sgRNAs. Cre recombinase is co- transfected along with the Cas9 and the sgRNA plasmid to facilitate recombination mediated cassette exchange (RMCE) into the EESYR® locus (US Patent No. 7771997B2, issued August 10, 2010) or the EESYR® II locus (US Patent Application Publication No. 2016/01 15502A1 , published April 28, 2016). The transfected cultures are selected for 1 1 days in serum-free medium containing 400 μg/mL hygromycin. Cells that undergo RMCE are isolated by flow cytometry. The desired knock out genotype is confirmed by standard molecular biology methods such as qPCR and sequencing. The resultant clone containing the LIPA modified locus, the LPL modified locus and the ACE modified locus is designated as "clone 17".The resultant clone containing the PLBD2 modified locus, the LIPA modified locus, the LPL modified locus and the ACE modified locus is designated as "clone 18".
[00288] The present invention may be embodied in other specific embodiments.
Claims
1 . A recombinant host cell comprising a modification in two or more genes encoding two or more fatty acid hydrolases (FAH).
2. The recombinant host cell of claim 1 , wherein the two or more FAHs are selected from the
group consisting of phospholipase B-like 2 (PLBD2) protein, lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and acid ceramidase (ACE).
3. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification.
4. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LIPA in a cell lacking the modification.
5. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the ACE, wherein the modification decreases the expression level of the ACE relative to the expression level of ACE in a cell lacking the modification.
6. The recombinant host cell of claim 3, wherein the cell further comprises a modification in a
coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LI PA in a cell lacking the modification.
7. The recombinant host cell of claim 3 or 6, wherein the cell further comprises a modification in a coding sequence of a polynucleotide encoding the ACE, wherein the modification decreases the expression level of the ACE relative to the expression level of ACE in a cell lacking the modification.
8. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the LI PA protein, wherein the modification decreases the expression level of the LI PA protein relative to the expression level of a LIPA protein in a cell lacking the modification.
9. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the ACE protein, wherein the modification decreases the expression level of the ACE protein relative to the expression level of a ACE protein in a cell lacking the modification.
10. The recombinant host cell of claim 9, wherein the cell further comprises a modification in a
coding sequence of a polynucleotide encoding the LIPA protein, wherein the modification decreases the expression level of the LI PA protein relative to the expression level of a LI PA protein in a cell lacking the modification.
1 1. The recombinant host cell of any one of claimsl -10, wherein all alleles of the coding sequence of the two or more genes encoding the two or more FAHs comprise the modification.
12. The recombinant host cell of claims 1 -1 1 , wherein the cell does not express detectable levels of PLBD2 protein, LPL, LIPA and/or ACE.
13. The recombinant host cell of any one of claims 1-7, 1 1 and 12, wherein the modification
comprises a nucleotide insertion or deletion within exon 1 of the coding sequence of the polynucleotide encoding the PLBD2 protein.
14. The recombinant host cell of claim 13, wherein the PLBD2 protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:32.
15. The recombinant host cell of claim 14, wherein the PLBD2 protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:35, and SEQ ID NO:36.
16. The recombinant host cell of claim 13 or 14, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:47.
17. The recombinant host cell of any one of claims 1-3 and 6-12, wherein the modification
comprises a nucleotide insertion or deletion within exon 2, exon 3, or exon 4 of the coding sequence of the polynucleotide encoding the LPL protein.
18. The recombinant host cell of claim 17, wherein the LPL protein comprises an amino acid
sequence at least 80% identical to SEQ ID NO:62.
19. The recombinant host cell of claim 18, wherein the LPL protein comprises an amino acid
sequence selected from the group consisting of SEQ ID NO:62, SEQ ID NO:66, SEQ ID NO:67, and SEQ ID NO:68.
20. The recombinant host cell of any one of claims 17-19, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:86, SEQ ID NO:87 or SEQ ID NO:88.
21. The recombinant host cell of any one of claims 1 , 2, 4, 6-8 and 10-12, wherein the modification comprises a nucleotide insertion or deletion within exon 1 or exon 2 of the coding sequence of the polynucleotide encoding the LIPA protein.
22. The recombinant host cell of claim 21 , wherein the LIPA protein comprises an amino acid
sequence at least 70% identical to SEQ ID NO:70.
23. The recombinant host cell of claim 22, wherein the LI PA protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:70, SEQ ID NO:74, SEQ ID NO:75, and SEQ ID NO:76.
24. The recombinant host cell of claim any one of claims 21-23, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:94 or SEQ ID NO:95.
25. The recombinant host cell of any one of claims 1 , 2, 5, 7 and 9-12, wherein the modification comprises a nucleotide insertion or deletion within exon 1 , exon 3 or exon 4 of the coding sequence of the polynucleotide encoding the ACE protein.
26. The recombinant host cell of claim 21 , wherein the ACE protein comprises an amino acid
sequence at least 80% identical to SEQ ID NO:78.
27. The recombinant host cell of claim 22, wherein the ACE protein comprises an amino acid
sequence selected from the group consisting of SEQ ID NO:78, SEQ ID NO:82, SEQ ID NO:83, and SEQ ID NO:84.
28. The recombinant host cell of claim any one of claims 25-27, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:103, SEQ ID NO:105 or SEQ ID NO:106.
29. The recombinant host cell of any one of claims 1-28 further comprising a polynucleotide
encoding an exogenous protein of interest.
30. The recombinant host cell of claim 29, wherein the exogenous protein of interest is selected from the group consisting of an antibody heavy chain, an antibody light chain, an antigen- binding fragment, an antigen-binding protein, and an Fc-fusion protein.
31. The recombinant host cell of any one of claims 1-9, wherein the cell produces a Protein A- binding fraction having no detectable fatty acid hydrolase activity.
32. A method of producing a recombinant protein (a) obtaining a sample comprising a recombinant protein and a plurality of host cell proteins from a host cell that is modified to produce reduced
levels of fatty acid hydrolase compared to an unmodified cell; and (b) subjecting the sample to at least one purification step to remove at least one host cell protein.
33. The method of claim 32, wherein the plurality of host cell proteins
a. does not comprise a detectable amount of a phospholipase B-like 2 (PLBD2) protein; and
b. does not comprise a detectable amount of at least one other fatty acid hydrolase.
34. The method of claim 32 or 33, wherein the host cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding a phospholipase B-like 2 (PLBD2) protein; and
b. a modification in a coding sequence of a polynucleotide encoding a fatty acid hydrolase selected from the group consisting of lipoprotein lipase (LPL), lysosomeal acid lipase (LI PA), acid ceramidase (ACE), or a combination thereof.
35. The method of any one of claims 32-34, wherein the purification step is selected from the group consisting of protein A affinity (PA) chromatography, cation exchange (CEX) chromatography, and anion exchange (AEX) chromatography.
36. The method of any one of claims 32-35, wherein the purification step does not comprise
hydrophobic interaction chromatography.
37. A process for reducing fatty acid hydrolase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein; (b) modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and/or acid ceramidase (ACE); (c) transfecting the host cell with a polynucleotide encoding a protein of interest; (d) extracting a protein fraction comprising the protein of interest from the host cell; (e) contacting the protein fraction with a chromatography media selected from the group consisting of protein A affinity (PA) media, cation exchange (CEX) media, and anion exchange (AEX) media; (f) collecting the protein of interest from the media; and (g) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer.
38. The process of claim 37, wherein the step of modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein comprises inserting or deleting at least one nucleotide within exon 1 of a polynucleotide encoding the PLBD2 protein.
39. The process of claim 37 or 38, wherein the polynucleotide encoding the PLBD2 protein
comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:32.
40. The process of any one of claims 37-39, wherein the step of modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL) comprises inserting or deleting at least one nucleotide within exon 2, exon 3 or exon 4 of a polynucleotide encoding the LPL protein.
41. The process of any one of claims 37-40, wherein the polynucleotide encoding the LPL protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:62.
42. The process of any one of claims 37-41 , wherein the step of modifying the host cell to decrease or ablate the expression of lysosomal acid lipase (LI PA) comprises inserting or deleting at least one nucleotide within exon 1 or exon 2 of a polynucleotide encoding the LIPA protein.
43. The process of any one of claims 37-42, wherein the polynucleotide encoding the LIPA protein comprises a nucleic acid sequence that is at least 70% identical to SEQ ID NO:70.
44. The process of any one of claims 37-43, wherein the step of modifying the host cell to decrease or ablate the expression of acid ceramidase (ACE) comprises inserting or deleting at least one nucleotide within exon 1 , exon 3 or exon 4 of a polynucleotide encoding the ACE protein.
45. The process of any one of claims 37-44, wherein the polynucleotide encoding the ACE protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:78.
WE CLAIM:
1 . A recombinant host cell comprising a modification in two or more genes encoding two or more fatty acid hydrolases (FAH).
2. The recombinant host cell of claim 1 , wherein the two or more FAHs are selected from the
group consisting of phospholipase B-like 2 (PLBD2) protein, lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and acid ceramidase (ACE).
3. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification.
4. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LIPA in a cell lacking the modification.
5. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the ACE, wherein the modification decreases the expression level of the ACE relative to the expression level of ACE in a cell lacking the modification.
6. The recombinant host cell of claim 3, wherein the cell further comprises a modification in a
coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LI PA in a cell lacking the modification.
7. The recombinant host cell of claim 3 or 6, wherein the cell further comprises a modification in a coding sequence of a polynucleotide encoding the ACE, wherein the modification decreases the expression level of the ACE relative to the expression level of ACE in a cell lacking the modification.
8. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the LI PA protein, wherein the modification decreases the expression level of the LI PA protein relative to the expression level of a LIPA protein in a cell lacking the modification.
9. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the ACE protein, wherein the modification decreases the expression level of the ACE protein relative to the expression level of a ACE protein in a cell lacking the modification.
10. The recombinant host cell of claim 9, wherein the cell further comprises a modification in a
coding sequence of a polynucleotide encoding the LIPA protein, wherein the modification decreases the expression level of the LI PA protein relative to the expression level of a LI PA protein in a cell lacking the modification.
1 1. The recombinant host cell of any one of claimsl -10, wherein all alleles of the coding sequence of the two or more genes encoding the two or more FAHs comprise the modification.
12. The recombinant host cell of claims 1 -1 1 , wherein the cell does not express detectable levels of PLBD2 protein, LPL, LIPA and/or ACE.
13. The recombinant host cell of any one of claims 1-7, 1 1 and 12, wherein the modification
comprises a nucleotide insertion or deletion within exon 1 of the coding sequence of the polynucleotide encoding the PLBD2 protein.
14. The recombinant host cell of claim 13, wherein the PLBD2 protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:32.
15. The recombinant host cell of claim 14, wherein the PLBD2 protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:35, and SEQ ID NO:36.
16. The recombinant host cell of claim 13 or 14, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:47.
17. The recombinant host cell of any one of claims 1-3 and 6-12, wherein the modification
comprises a nucleotide insertion or deletion within exon 2, exon 3, or exon 4 of the coding sequence of the polynucleotide encoding the LPL protein.
18. The recombinant host cell of claim 17, wherein the LPL protein comprises an amino acid
sequence at least 80% identical to SEQ ID NO:62.
19. The recombinant host cell of claim 18, wherein the LPL protein comprises an amino acid
sequence selected from the group consisting of SEQ ID NO:62, SEQ ID NO:66, SEQ ID NO:67, and SEQ ID NO:68.
20. The recombinant host cell of any one of claims 17-19, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:86, SEQ ID NO:87 or SEQ ID NO:88.
21. The recombinant host cell of any one of claims 1 , 2, 4, 6-8 and 10-12, wherein the modification comprises a nucleotide insertion or deletion within exon 1 or exon 2 of the coding sequence of the polynucleotide encoding the LIPA protein.
22. The recombinant host cell of claim 21 , wherein the LIPA protein comprises an amino acid
sequence at least 70% identical to SEQ ID NO:70.
23. The recombinant host cell of claim 22, wherein the LI PA protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:70, SEQ ID NO:74, SEQ ID NO:75, and SEQ ID NO:76.
24. The recombinant host cell of claim any one of claims 21-23, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:94 or SEQ ID NO:95.
25. The recombinant host cell of any one of claims 1 , 2, 5, 7 and 9-12, wherein the modification comprises a nucleotide insertion or deletion within exon 1 , exon 3 or exon 4 of the coding sequence of the polynucleotide encoding the ACE protein.
26. The recombinant host cell of claim 21 , wherein the ACE protein comprises an amino acid
sequence at least 80% identical to SEQ ID NO:78.
27. The recombinant host cell of claim 22, wherein the ACE protein comprises an amino acid
sequence selected from the group consisting of SEQ ID NO:78, SEQ ID NO:82, SEQ ID NO:83, and SEQ ID NO:84.
28. The recombinant host cell of claim any one of claims 25-27, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:103, SEQ ID NO:105 or SEQ ID NO:106.
29. The recombinant host cell of any one of claims 1-28 further comprising a polynucleotide
encoding an exogenous protein of interest.
30. The recombinant host cell of claim 29, wherein the exogenous protein of interest is selected from the group consisting of an antibody heavy chain, an antibody light chain, an antigen- binding fragment, an antigen-binding protein, and an Fc-fusion protein.
31. The recombinant host cell of any one of claims 1-9, wherein the cell produces a Protein A- binding fraction having no detectable fatty acid hydrolase activity.
32. A method of producing a recombinant protein (a) obtaining a sample comprising a recombinant protein and a plurality of host cell proteins from a host cell that is modified to produce reduced
levels of fatty acid hydrolase compared to an unmodified cell; and (b) subjecting the sample to at least one purification step to remove at least one host cell protein.
33. The method of claim 32, wherein the plurality of host cell proteins
a. does not comprise a detectable amount of a phospholipase B-like 2 (PLBD2) protein; and
b. does not comprise a detectable amount of at least one other fatty acid hydrolase.
34. The method of claim 32 or 33, wherein the host cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding a phospholipase B-like 2 (PLBD2) protein; and
b. a modification in a coding sequence of a polynucleotide encoding a fatty acid hydrolase selected from the group consisting of lipoprotein lipase (LPL), lysosomeal acid lipase (LI PA), acid ceramidase (ACE), or a combination thereof.
35. The method of any one of claims 32-34, wherein the purification step is selected from the group consisting of protein A affinity (PA) chromatography, cation exchange (CEX) chromatography, and anion exchange (AEX) chromatography.
36. The method of any one of claims 32-35, wherein the purification step does not comprise
hydrophobic interaction chromatography.
37. A process for reducing fatty acid hydrolase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein; (b) modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and/or acid ceramidase (ACE); (c) transfecting the host cell with a polynucleotide encoding a protein of interest; (d) extracting a protein fraction comprising the protein of interest from the host cell; (e) contacting the protein fraction with a chromatography media selected from the group consisting of protein A affinity (PA) media, cation exchange (CEX) media, and anion exchange (AEX) media; (f) collecting the protein of interest from the media; and (g) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer.
38. The process of claim 37, wherein the step of modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein comprises inserting or deleting at least one nucleotide within exon 1 of a polynucleotide encoding the PLBD2 protein.
39. The process of claim 37 or 38, wherein the polynucleotide encoding the PLBD2 protein
comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:32.
40. The process of any one of claims 37-39, wherein the step of modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL) comprises inserting or deleting at least one nucleotide within exon 2, exon 3 or exon 4 of a polynucleotide encoding the LPL protein.
41. The process of any one of claims 37-40, wherein the polynucleotide encoding the LPL protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:62.
42. The process of any one of claims 37-41 , wherein the step of modifying the host cell to decrease or ablate the expression of lysosomal acid lipase (LI PA) comprises inserting or deleting at least one nucleotide within exon 1 or exon 2 of a polynucleotide encoding the LIPA protein.
43. The process of any one of claims 37-42, wherein the polynucleotide encoding the LIPA protein comprises a nucleic acid sequence that is at least 70% identical to SEQ ID NO:70.
44. The process of any one of claims 37-43, wherein the step of modifying the host cell to decrease or ablate the expression of acid ceramidase (ACE) comprises inserting or deleting at least one nucleotide within exon 1 , exon 3 or exon 4 of a polynucleotide encoding the ACE protein.
45. The process of any one of claims 37-44, wherein the polynucleotide encoding the ACE protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:78.
WE CLAIM:
1 . A recombinant host cell comprising a modification in two or more genes encoding two or more fatty acid hydrolases (FAH).
2. The recombinant host cell of claim 1 , wherein the two or more FAHs are selected from the
group consisting of phospholipase B-like 2 (PLBD2) protein, lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and acid ceramidase (ACE).
3. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification.
4. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LIPA in a cell lacking the modification.
5. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the PLBD2 protein, wherein the modification decreases the expression level of the PLBD2 protein relative to the expression level of a PLBD2 protein in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the ACE, wherein the modification decreases the expression level of the ACE relative to the expression level of ACE in a cell lacking the modification.
6. The recombinant host cell of claim 3, wherein the cell further comprises a modification in a
coding sequence of a polynucleotide encoding the LI PA, wherein the modification decreases the expression level of the LI PA relative to the expression level of LI PA in a cell lacking the modification.
7. The recombinant host cell of claim 3 or 6, wherein the cell further comprises a modification in a coding sequence of a polynucleotide encoding the ACE, wherein the modification decreases the expression level of the ACE relative to the expression level of ACE in a cell lacking the modification.
8. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the LI PA protein, wherein the modification decreases the expression level of the LI PA protein relative to the expression level of a LIPA protein in a cell lacking the modification.
9. The recombinant host cell of claim 1 or 2, wherein the cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding the LPL, wherein the modification decreases the expression level of the LPL relative to the expression level of LPL in a cell lacking the modification; and
b. a modification in a coding sequence of a polynucleotide encoding the ACE protein, wherein the modification decreases the expression level of the ACE protein relative to the expression level of a ACE protein in a cell lacking the modification.
10. The recombinant host cell of claim 9, wherein the cell further comprises a modification in a
coding sequence of a polynucleotide encoding the LIPA protein, wherein the modification decreases the expression level of the LI PA protein relative to the expression level of a LI PA protein in a cell lacking the modification.
1 1. The recombinant host cell of any one of claimsl -10, wherein all alleles of the coding sequence of the two or more genes encoding the two or more FAHs comprise the modification.
12. The recombinant host cell of claims 1 -1 1 , wherein the cell does not express detectable levels of PLBD2 protein, LPL, LIPA and/or ACE.
13. The recombinant host cell of any one of claims 1-7, 1 1 and 12, wherein the modification
comprises a nucleotide insertion or deletion within exon 1 of the coding sequence of the polynucleotide encoding the PLBD2 protein.
14. The recombinant host cell of claim 13, wherein the PLBD2 protein comprises an amino acid sequence at least 80% identical to SEQ ID NO:32.
15. The recombinant host cell of claim 14, wherein the PLBD2 protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:35, and SEQ ID NO:36.
16. The recombinant host cell of claim 13 or 14, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:47.
17. The recombinant host cell of any one of claims 1-3 and 6-12, wherein the modification
comprises a nucleotide insertion or deletion within exon 2, exon 3, or exon 4 of the coding sequence of the polynucleotide encoding the LPL protein.
18. The recombinant host cell of claim 17, wherein the LPL protein comprises an amino acid
sequence at least 80% identical to SEQ ID NO:62.
19. The recombinant host cell of claim 18, wherein the LPL protein comprises an amino acid
sequence selected from the group consisting of SEQ ID NO:62, SEQ ID NO:66, SEQ ID NO:67, and SEQ ID NO:68.
20. The recombinant host cell of any one of claims 17-19, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:86, SEQ ID NO:87 or SEQ ID NO:88.
21. The recombinant host cell of any one of claims 1 , 2, 4, 6-8 and 10-12, wherein the modification comprises a nucleotide insertion or deletion within exon 1 or exon 2 of the coding sequence of the polynucleotide encoding the LIPA protein.
22. The recombinant host cell of claim 21 , wherein the LIPA protein comprises an amino acid
sequence at least 70% identical to SEQ ID NO:70.
23. The recombinant host cell of claim 22, wherein the LI PA protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:70, SEQ ID NO:74, SEQ ID NO:75, and SEQ ID NO:76.
24. The recombinant host cell of claim any one of claims 21-23, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:94 or SEQ ID NO:95.
25. The recombinant host cell of any one of claims 1 , 2, 5, 7 and 9-12, wherein the modification comprises a nucleotide insertion or deletion within exon 1 , exon 3 or exon 4 of the coding sequence of the polynucleotide encoding the ACE protein.
26. The recombinant host cell of claim 21 , wherein the ACE protein comprises an amino acid
sequence at least 80% identical to SEQ ID NO:78.
27. The recombinant host cell of claim 22, wherein the ACE protein comprises an amino acid
sequence selected from the group consisting of SEQ ID NO:78, SEQ ID NO:82, SEQ ID NO:83, and SEQ ID NO:84.
28. The recombinant host cell of claim any one of claims 25-27, wherein the modification comprises a nucleotide insertion or deletion within SEQ ID NO:103, SEQ ID NO:105 or SEQ ID NO:106.
29. The recombinant host cell of any one of claims 1-28 further comprising a polynucleotide
encoding an exogenous protein of interest.
30. The recombinant host cell of claim 29, wherein the exogenous protein of interest is selected from the group consisting of an antibody heavy chain, an antibody light chain, an antigen- binding fragment, an antigen-binding protein, and an Fc-fusion protein.
31. The recombinant host cell of any one of claims 1-9, wherein the cell produces a Protein A- binding fraction having no detectable fatty acid hydrolase activity.
32. A method of producing a recombinant protein (a) obtaining a sample comprising a recombinant protein and a plurality of host cell proteins from a host cell that is modified to produce reduced
levels of fatty acid hydrolase compared to an unmodified cell; and (b) subjecting the sample to at least one purification step to remove at least one host cell protein.
33. The method of claim 32, wherein the plurality of host cell proteins
a. does not comprise a detectable amount of a phospholipase B-like 2 (PLBD2) protein; and
b. does not comprise a detectable amount of at least one other fatty acid hydrolase.
34. The method of claim 32 or 33, wherein the host cell comprises:
a. a modification in a coding sequence of a polynucleotide encoding a phospholipase B-like 2 (PLBD2) protein; and
b. a modification in a coding sequence of a polynucleotide encoding a fatty acid hydrolase selected from the group consisting of lipoprotein lipase (LPL), lysosomeal acid lipase (LI PA), acid ceramidase (ACE), or a combination thereof.
35. The method of any one of claims 32-34, wherein the purification step is selected from the group consisting of protein A affinity (PA) chromatography, cation exchange (CEX) chromatography, and anion exchange (AEX) chromatography.
36. The method of any one of claims 32-35, wherein the purification step does not comprise
hydrophobic interaction chromatography.
37. A process for reducing fatty acid hydrolase activity in a protein formulation comprising the steps of: (a) modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein; (b) modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL), lysosomal acid lipase (LI PA) and/or acid ceramidase (ACE); (c) transfecting the host cell with a polynucleotide encoding a protein of interest; (d) extracting a protein fraction comprising the protein of interest from the host cell; (e) contacting the protein fraction with a chromatography media selected from the group consisting of protein A affinity (PA) media, cation exchange (CEX) media, and anion exchange (AEX) media; (f) collecting the protein of interest from the media; and (g) combining the protein of interest with a fatty acid ester, and optionally a buffer and thermal stabilizer.
38. The process of claim 37, wherein the step of modifying a host cell to decrease or ablate the expression of phospholipase B-like 2 (PLBD2) protein comprises inserting or deleting at least one nucleotide within exon 1 of a polynucleotide encoding the PLBD2 protein.
39. The process of claim 37 or 38, wherein the polynucleotide encoding the PLBD2 protein
comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:32.
40. The process of any one of claims 37-39, wherein the step of modifying the host cell to decrease or ablate the expression of lipoprotein lipase (LPL) comprises inserting or deleting at least one nucleotide within exon 2, exon 3 or exon 4 of a polynucleotide encoding the LPL protein.
41. The process of any one of claims 37-40, wherein the polynucleotide encoding the LPL protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:62.
42. The process of any one of claims 37-41 , wherein the step of modifying the host cell to decrease or ablate the expression of lysosomal acid lipase (LI PA) comprises inserting or deleting at least one nucleotide within exon 1 or exon 2 of a polynucleotide encoding the LIPA protein.
43. The process of any one of claims 37-42, wherein the polynucleotide encoding the LIPA protein comprises a nucleic acid sequence that is at least 70% identical to SEQ ID NO:70.
44. The process of any one of claims 37-43, wherein the step of modifying the host cell to decrease or ablate the expression of acid ceramidase (ACE) comprises inserting or deleting at least one nucleotide within exon 1 , exon 3 or exon 4 of a polynucleotide encoding the ACE protein.
45. The process of any one of claims 37-44, wherein the polynucleotide encoding the ACE protein comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:78.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201662379226P | 2016-08-24 | 2016-08-24 | |
PCT/US2017/048498 WO2018039499A1 (en) | 2016-08-24 | 2017-08-24 | Host cell protein modification |
Publications (1)
Publication Number | Publication Date |
---|---|
EP3504328A1 true EP3504328A1 (en) | 2019-07-03 |
Family
ID=60327366
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP17798018.2A Withdrawn EP3504328A1 (en) | 2016-08-24 | 2017-08-24 | Host cell protein modification |
Country Status (2)
Country | Link |
---|---|
EP (1) | EP3504328A1 (en) |
WO (1) | WO2018039499A1 (en) |
Families Citing this family (10)
* Cited by examiner, † Cited by third partyPublication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SG11202009503PA (en) * | 2018-05-04 | 2020-11-27 | Sigma Aldrich Co Llc | Engineered cells with modified host cell protein profiles |
WO2020081789A1 (en) | 2018-10-17 | 2020-04-23 | Perfect Day, Inc. | Recombinant components and compositions for use in food products |
US20200393455A1 (en) * | 2019-05-21 | 2020-12-17 | Regeneron Pharmaceuticals, Inc. | Methods for identifying and quantitating host cell protein |
KR20220034169A (en) * | 2019-07-10 | 2022-03-17 | 리제너론 파아마슈티컬스, 인크. | Methods and compositions comprising reduced levels of host cell proteins |
MX2022004311A (en) * | 2019-10-15 | 2022-05-10 | Lilly Co Eli | MAMMAL CELL STRANCES WITH LIPASE/ESTERASE DEFICIENCY GENETICALLY MODIFIED IN A RECOMBINANT MANNER. |
CN115516313A (en) * | 2020-02-27 | 2022-12-23 | 瑞泽恩制药公司 | Activity-based mass spectrometry of host cell proteins |
TW202202620A (en) | 2020-03-26 | 2022-01-16 | 美商建南德克公司 | Modified mammalian cells |
JP2023539278A (en) * | 2020-08-31 | 2023-09-13 | ジェネンテック, インコーポレイテッド | High-throughput fluorescence-based esterase activity assay to assess polysorbate degradation risk during biopharmaceutical development |
EP4211475A4 (en) * | 2020-09-08 | 2024-10-23 | Regeneron Pharmaceuticals, Inc. | METHOD FOR DETECTING HOST CELL PROTEINS IN THERAPEUTIC ANTIBODIES BY COMBINING TRYPSIN DIGESTION, CHROMATOGRAPHY GRADIENTS AND BOXCAR MASS SPECTROMETRY |
CN119677854A (en) * | 2022-08-10 | 2025-03-21 | 勃林格殷格翰国际公司 | Artificial miRNA targeting multiple hydrolases |
Family Cites Families (38)
* Cited by examiner, † Cited by third partyPublication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7087411B2 (en) | 1999-06-08 | 2006-08-08 | Regeneron Pharmaceuticals, Inc. | Fusion protein capable of binding VEGF |
JP3727317B2 (en) | 2002-03-08 | 2005-12-14 | エイエスエムエル ネザランドズ ベスローテン フエンノートシャップ | Mask for use in lithography, method of making a mask, lithographic apparatus, and device manufacturing method |
EP2374818B1 (en) | 2006-06-02 | 2012-12-19 | Regeneron Pharmaceuticals, Inc. | High affinity antibodies to human IL-6 receptor |
US7608693B2 (en) | 2006-10-02 | 2009-10-27 | Regeneron Pharmaceuticals, Inc. | High affinity human antibodies to human IL-4 receptor |
US7771997B2 (en) | 2007-06-04 | 2010-08-10 | Regeneron Pharmaceuticals, Inc. | Enhanced expression and stability regions |
WO2009018411A1 (en) | 2007-07-31 | 2009-02-05 | Regeneron Pharmaceuticals, Inc. | Human antibodies to human cd20 and method of using thereof |
US8309088B2 (en) | 2007-08-10 | 2012-11-13 | Regeneron Pharmaceuticals, Inc. | Method of treating osteoarthritis with an antibody to NGF |
JO3672B1 (en) | 2008-12-15 | 2020-08-27 | Regeneron Pharma | High Affinity Human Antibodies to PCSK9 |
JO3417B1 (en) | 2010-01-08 | 2019-10-20 | Regeneron Pharma | Stabilized formulations containing anti-interleukin-6 receptor (il-6r) antibodies |
JO3340B1 (en) | 2010-05-26 | 2019-03-13 | Regeneron Pharma | Antibodies to human gdf8 |
JOP20190250A1 (en) | 2010-07-14 | 2017-06-16 | Regeneron Pharma | Stable formulas containing anti-NGF antibodies |
AR083044A1 (en) | 2010-09-27 | 2013-01-30 | Regeneron Pharma | ANTI-CD48 ANTIBODIES AND USES OF THE SAME |
PL3354280T3 (en) | 2010-10-06 | 2021-02-08 | Regeneron Pharmaceuticals, Inc. | Stabilized formulations containing anti-interleukin-4 receptor (il-4r) antibodies |
JO3756B1 (en) | 2010-11-23 | 2021-01-31 | Regeneron Pharma | Human antibodies to the glucagon receptor |
AR087329A1 (en) | 2011-06-17 | 2014-03-19 | Regeneron Pharma | HUMAN ANTIBODIES AGAINST PROTEIN 3 OF HUMAN ANGIOPOIETIN TYPE |
WO2013074557A1 (en) | 2011-11-14 | 2013-05-23 | Regeneron Pharmaceuticals, Inc. | Compositions and methods for increasing muscle mass and muscle strength by specifically antagonizing gdf8 and/or activin a |
EP2807190B1 (en) | 2012-01-23 | 2018-12-26 | Regeneron Pharmaceuticals, Inc. | Stabilized formulations containing anti-ang2 antibodies |
JO3820B1 (en) | 2012-05-03 | 2021-01-31 | Regeneron Pharma | Human antibodies to FEL D1 and methods for their use |
TW201843172A (en) | 2012-06-25 | 2018-12-16 | 美商再生元醫藥公司 | anti-EGFR antibody and use thereof |
JP6309521B2 (en) | 2012-08-13 | 2018-04-11 | リジェネロン・ファーマシューティカルズ・インコーポレイテッドRegeneron Pharmaceuticals, Inc. | Anti-PCSK9 antibody with pH-dependent binding properties |
JOP20200236A1 (en) | 2012-09-21 | 2017-06-16 | Regeneron Pharma | Anti-cd3 antibodies, bispecific antigen-binding molecules that bind cd3 and cd20, and uses thereof |
JO3405B1 (en) | 2013-01-09 | 2019-10-20 | Regeneron Pharma | ANTI-PDGFR-beta ANTIBODIES AND USES THEREOF |
JO3532B1 (en) | 2013-03-13 | 2020-07-05 | Regeneron Pharma | Anti-il-33 antibodies and uses thereof |
TWI659968B (en) | 2013-03-14 | 2019-05-21 | 再生元醫藥公司 | Human antibodies to respiratory syncytial virus f protein and methods of use thereof |
KR20150127591A (en) | 2013-03-15 | 2015-11-17 | 리제너론 파아마슈티컬스, 인크. | Il-33 antagonists and uses thereof |
TWI641620B (en) | 2013-08-21 | 2018-11-21 | 再生元醫藥公司 | Anti-prlr antibodies and uses thereof |
US9932591B2 (en) * | 2013-12-18 | 2018-04-03 | University Of Delaware | Reduction of lipase activity in product formulations |
TWI680138B (en) | 2014-01-23 | 2019-12-21 | 美商再生元醫藥公司 | Human antibodies to pd-l1 |
TWI681969B (en) | 2014-01-23 | 2020-01-11 | 美商再生元醫藥公司 | Human antibodies to pd-1 |
MY178160A (en) | 2014-03-11 | 2020-10-06 | Regeneron Pharma | Anti-egfrviii antibodies and uses thereof |
TWI754319B (en) | 2014-03-19 | 2022-02-01 | 美商再生元醫藥公司 | Methods and antibody compositions for tumor treatment |
CN106659146B (en) | 2014-05-05 | 2020-06-23 | 再生元制药公司 | Humanized C5 and C3 animals |
JO3701B1 (en) | 2014-05-23 | 2021-01-31 | Regeneron Pharma | Human antibodies to middle east respiratory syndrome – coronavirus spike protein |
EA201790377A1 (en) | 2014-09-16 | 2017-08-31 | Регенерон Фармасьютикалз, Инк. | ANTIBODIES TO GLUCAGON AND THEIR APPLICATIONS |
TW201628649A (en) * | 2014-10-09 | 2016-08-16 | 再生元醫藥公司 | Method for reducing microscopic particles in a pharmaceutical formulation |
CA2965495C (en) | 2014-10-23 | 2022-05-24 | Regeneron Pharmaceuticals, Inc. | Cho integration sites and uses thereof |
TWI710573B (en) | 2015-01-26 | 2020-11-21 | 美商再生元醫藥公司 | Human antibodies to ebola virus glycoprotein |
TW201702380A (en) * | 2015-02-27 | 2017-01-16 | 再生元醫藥公司 | Host cell protein modification |
-
2017
- 2017-08-24 EP EP17798018.2A patent/EP3504328A1/en not_active Withdrawn
- 2017-08-24 WO PCT/US2017/048498 patent/WO2018039499A1/en unknown
Also Published As
Publication number | Publication date |
---|---|
WO2018039499A9 (en) | 2018-05-11 |
WO2018039499A1 (en) | 2018-03-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3504328A1 (en) | 2019-07-03 | Host cell protein modification |
EP3262168B1 (en) | 2019-08-21 | Host cell protein modification |
US20210207093A1 (en) | 2021-07-08 | Genetically-modified cells comprising a modified human t cell receptor alpha constant region gene |
EP3359660B1 (en) | 2019-12-04 | Engineered meganucleases with recognition sequences found in the human t cell receptor alpha constant region gene |
AU2018239433B2 (en) | 2024-02-01 | A genetically modified mouse expressing human APOE4p and mouse Trem2 p.R47H and methods of use thereof |
US20220251172A1 (en) | 2022-08-11 | Recombinantly engineered, lipase/esterase-deficient mammalian cell lines |
US20210008199A1 (en) | 2021-01-14 | Methods and compositions comprising reduced level of host cell proteins |
US20210371455A1 (en) | 2021-12-02 | Methods and compositions comprising reduced level of host cell proteins |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
2019-05-31 | PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
2019-07-03 | 17P | Request for examination filed |
Effective date: 20190322 |
2019-07-03 | AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
2019-07-03 | AX | Request for extension of the european patent |
Extension state: BA ME |
2019-07-19 | STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN |
2019-08-21 | 18W | Application withdrawn |
Effective date: 20190717 |