WO2024013514A2 - Gene edited livestock animals having coronavirus resistance - Google Patents
Gene edited livestock animals having coronavirus resistance Download PDFInfo
- Publication number
- WO2024013514A2 WO2024013514A2 PCT/GB2023/051854 GB2023051854W WO2024013514A2 WO 2024013514 A2 WO2024013514 A2 WO 2024013514A2 GB 2023051854 W GB2023051854 W GB 2023051854W WO 2024013514 A2 WO2024013514 A2 WO 2024013514A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- nos
- sequence
- gene
- dna
- Prior art date
Links
- 241001465754 Metazoa Species 0.000 title claims abstract description 122
- 241000711573 Coronaviridae Species 0.000 title claims abstract description 40
- 244000144972 livestock Species 0.000 title claims abstract description 36
- 108090000623 proteins and genes Proteins 0.000 title claims description 168
- 210000004027 cell Anatomy 0.000 claims abstract description 105
- 238000000034 method Methods 0.000 claims abstract description 62
- 108010049990 CD13 Antigens Proteins 0.000 claims abstract description 59
- 150000007523 nucleic acids Chemical class 0.000 claims description 96
- 102000039446 nucleic acids Human genes 0.000 claims description 87
- 108020004707 nucleic acids Proteins 0.000 claims description 87
- 108020005004 Guide RNA Proteins 0.000 claims description 75
- 108091033409 CRISPR Proteins 0.000 claims description 47
- 230000027455 binding Effects 0.000 claims description 40
- 210000000287 oocyte Anatomy 0.000 claims description 37
- 108020004485 Nonsense Codon Proteins 0.000 claims description 32
- 101150058497 ANPEP gene Proteins 0.000 claims description 29
- 241000711484 Transmissible gastroenteritis virus Species 0.000 claims description 19
- 210000002950 fibroblast Anatomy 0.000 claims description 16
- 241001135549 Porcine epidemic diarrhea virus Species 0.000 claims description 13
- 241000711493 Porcine respiratory coronavirus Species 0.000 claims description 12
- 230000002829 reductive effect Effects 0.000 claims description 11
- 230000000694 effects Effects 0.000 abstract description 40
- 230000002759 chromosomal effect Effects 0.000 abstract description 32
- 230000001965 increasing effect Effects 0.000 abstract description 8
- 210000004102 animal cell Anatomy 0.000 abstract description 4
- 108090000765 processed proteins & peptides Proteins 0.000 description 132
- 102000004196 processed proteins & peptides Human genes 0.000 description 123
- 229920001184 polypeptide Polymers 0.000 description 121
- 102000004169 proteins and genes Human genes 0.000 description 81
- 235000018102 proteins Nutrition 0.000 description 80
- 108020004414 DNA Proteins 0.000 description 70
- 239000002773 nucleotide Substances 0.000 description 61
- 125000003729 nucleotide group Chemical group 0.000 description 59
- 238000003776 cleavage reaction Methods 0.000 description 53
- 230000007017 scission Effects 0.000 description 53
- 230000004568 DNA-binding Effects 0.000 description 48
- 101710163270 Nuclease Proteins 0.000 description 36
- 102100022749 Aminopeptidase N Human genes 0.000 description 33
- 102000040430 polynucleotide Human genes 0.000 description 31
- 108091033319 polynucleotide Proteins 0.000 description 31
- 239000002157 polynucleotide Substances 0.000 description 31
- QZDDFQLIQRYMBV-UHFFFAOYSA-N 2-[3-nitro-2-(2-nitrophenyl)-4-oxochromen-8-yl]acetic acid Chemical compound OC(=O)CC1=CC=CC(C(C=2[N+]([O-])=O)=O)=C1OC=2C1=CC=CC=C1[N+]([O-])=O QZDDFQLIQRYMBV-UHFFFAOYSA-N 0.000 description 30
- 235000001014 amino acid Nutrition 0.000 description 30
- 229940024606 amino acid Drugs 0.000 description 29
- 150000001413 amino acids Chemical class 0.000 description 29
- 108091028043 Nucleic acid sequence Proteins 0.000 description 27
- 108010042407 Endonucleases Proteins 0.000 description 26
- 241000282887 Suidae Species 0.000 description 26
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 26
- 239000011701 zinc Substances 0.000 description 26
- 229910052725 zinc Inorganic materials 0.000 description 26
- 108020004705 Codon Proteins 0.000 description 24
- 239000000523 sample Substances 0.000 description 23
- 210000001519 tissue Anatomy 0.000 description 23
- 102100031780 Endonuclease Human genes 0.000 description 22
- 238000006467 substitution reaction Methods 0.000 description 20
- 230000009368 gene silencing by RNA Effects 0.000 description 19
- 238000012217 deletion Methods 0.000 description 18
- 230000037430 deletion Effects 0.000 description 18
- 230000008685 targeting Effects 0.000 description 18
- 125000003275 alpha amino acid group Chemical group 0.000 description 17
- 239000013598 vector Substances 0.000 description 17
- 108091030071 RNAI Proteins 0.000 description 15
- 210000002257 embryonic structure Anatomy 0.000 description 15
- 230000006780 non-homologous end joining Effects 0.000 description 15
- 238000013518 transcription Methods 0.000 description 15
- 230000035897 transcription Effects 0.000 description 15
- 238000003780 insertion Methods 0.000 description 14
- 230000037431 insertion Effects 0.000 description 14
- 239000002609 medium Substances 0.000 description 14
- 241000894007 species Species 0.000 description 14
- 239000012634 fragment Substances 0.000 description 13
- 238000012163 sequencing technique Methods 0.000 description 13
- 241000282898 Sus scrofa Species 0.000 description 12
- 108020001507 fusion proteins Proteins 0.000 description 12
- 102000037865 fusion proteins Human genes 0.000 description 12
- 238000000338 in vitro Methods 0.000 description 12
- 230000001105 regulatory effect Effects 0.000 description 12
- 229940088598 enzyme Drugs 0.000 description 11
- 108020004999 messenger RNA Proteins 0.000 description 11
- 108091079001 CRISPR RNA Proteins 0.000 description 10
- 108091026890 Coding region Proteins 0.000 description 10
- 102000004190 Enzymes Human genes 0.000 description 10
- 108090000790 Enzymes Proteins 0.000 description 10
- 241000193996 Streptococcus pyogenes Species 0.000 description 10
- 230000004927 fusion Effects 0.000 description 10
- 230000002068 genetic effect Effects 0.000 description 10
- 238000010362 genome editing Methods 0.000 description 10
- 230000001404 mediated effect Effects 0.000 description 10
- 108091008146 restriction endonucleases Proteins 0.000 description 10
- 239000004055 small Interfering RNA Substances 0.000 description 10
- 125000006850 spacer group Chemical group 0.000 description 10
- 108020004459 Small interfering RNA Proteins 0.000 description 9
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 9
- 235000013601 eggs Nutrition 0.000 description 9
- 208000015181 infectious disease Diseases 0.000 description 9
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 8
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 8
- 102100039556 Galectin-4 Human genes 0.000 description 8
- 101000608765 Homo sapiens Galectin-4 Proteins 0.000 description 8
- 125000000539 amino acid group Chemical group 0.000 description 8
- 201000010099 disease Diseases 0.000 description 8
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 238000001727 in vivo Methods 0.000 description 8
- 238000004519 manufacturing process Methods 0.000 description 8
- 230000014616 translation Effects 0.000 description 8
- 108091093088 Amplicon Proteins 0.000 description 7
- 102000053602 DNA Human genes 0.000 description 7
- 241000700605 Viruses Species 0.000 description 7
- 210000002459 blastocyst Anatomy 0.000 description 7
- 230000001939 inductive effect Effects 0.000 description 7
- 230000010354 integration Effects 0.000 description 7
- 210000001161 mammalian embryo Anatomy 0.000 description 7
- 239000000203 mixture Substances 0.000 description 7
- 244000052769 pathogen Species 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 208000024891 symptom Diseases 0.000 description 7
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 6
- 108010091086 Recombinases Proteins 0.000 description 6
- 102000018120 Recombinases Human genes 0.000 description 6
- 238000007792 addition Methods 0.000 description 6
- 210000000805 cytoplasm Anatomy 0.000 description 6
- 230000001605 fetal effect Effects 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 210000001672 ovary Anatomy 0.000 description 6
- 210000003101 oviduct Anatomy 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 238000003753 real-time PCR Methods 0.000 description 6
- 230000009467 reduction Effects 0.000 description 6
- DAEPDZWVDSPTHF-UHFFFAOYSA-M sodium pyruvate Chemical compound [Na+].CC(=O)C([O-])=O DAEPDZWVDSPTHF-UHFFFAOYSA-M 0.000 description 6
- 230000003612 virological effect Effects 0.000 description 6
- 238000010453 CRISPR/Cas method Methods 0.000 description 5
- 239000012591 Dulbecco’s Phosphate Buffered Saline Substances 0.000 description 5
- 102000004389 Ribonucleoproteins Human genes 0.000 description 5
- 108010081734 Ribonucleoproteins Proteins 0.000 description 5
- 108091027967 Small hairpin RNA Proteins 0.000 description 5
- 241000194025 Streptococcus oralis Species 0.000 description 5
- 238000010459 TALEN Methods 0.000 description 5
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 5
- 108010073062 Transcription Activator-Like Effectors Proteins 0.000 description 5
- 101710185494 Zinc finger protein Proteins 0.000 description 5
- 102100023597 Zinc finger protein 816 Human genes 0.000 description 5
- 238000003556 assay Methods 0.000 description 5
- 229940098773 bovine serum albumin Drugs 0.000 description 5
- 230000001413 cellular effect Effects 0.000 description 5
- 210000000349 chromosome Anatomy 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 230000005782 double-strand break Effects 0.000 description 5
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 210000004602 germ cell Anatomy 0.000 description 5
- 244000144980 herd Species 0.000 description 5
- 230000035800 maturation Effects 0.000 description 5
- 238000000520 microinjection Methods 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 230000036961 partial effect Effects 0.000 description 5
- 230000004952 protein activity Effects 0.000 description 5
- 230000008439 repair process Effects 0.000 description 5
- 230000009870 specific binding Effects 0.000 description 5
- 238000012546 transfer Methods 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 108700028369 Alleles Proteins 0.000 description 4
- 102000011022 Chorionic Gonadotropin Human genes 0.000 description 4
- 108010062540 Chorionic Gonadotropin Proteins 0.000 description 4
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 4
- 102000004533 Endonucleases Human genes 0.000 description 4
- 108010016790 RNA-Induced Silencing Complex Proteins 0.000 description 4
- 102000000574 RNA-Induced Silencing Complex Human genes 0.000 description 4
- 238000000137 annealing Methods 0.000 description 4
- 230000000692 anti-sense effect Effects 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 210000001771 cumulus cell Anatomy 0.000 description 4
- 238000006471 dimerization reaction Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000012173 estrus Effects 0.000 description 4
- 229940084986 human chorionic gonadotropin Drugs 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 229930182817 methionine Natural products 0.000 description 4
- 230000001717 pathogenic effect Effects 0.000 description 4
- 230000008488 polyadenylation Effects 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 210000000582 semen Anatomy 0.000 description 4
- 210000002966 serum Anatomy 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- 235000000346 sugar Nutrition 0.000 description 4
- 241001430294 unidentified retrovirus Species 0.000 description 4
- 238000010200 validation analysis Methods 0.000 description 4
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 4
- PFFIDZXUXFLSSR-UHFFFAOYSA-N 1-methyl-N-[2-(4-methylpentan-2-yl)-3-thienyl]-3-(trifluoromethyl)pyrazole-4-carboxamide Chemical compound S1C=CC(NC(=O)C=2C(=NN(C)C=2)C(F)(F)F)=C1C(C)CC(C)C PFFIDZXUXFLSSR-UHFFFAOYSA-N 0.000 description 3
- 241000004176 Alphacoronavirus Species 0.000 description 3
- 241000283690 Bos taurus Species 0.000 description 3
- 108091035707 Consensus sequence Proteins 0.000 description 3
- -1 Csm2 Proteins 0.000 description 3
- 241000701022 Cytomegalovirus Species 0.000 description 3
- 230000007018 DNA scission Effects 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- 241000713666 Lentivirus Species 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- 102000003960 Ligases Human genes 0.000 description 3
- 108090000364 Ligases Proteins 0.000 description 3
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 3
- 241000194020 Streptococcus thermophilus Species 0.000 description 3
- 108091027544 Subgenomic mRNA Proteins 0.000 description 3
- 108091023040 Transcription factor Proteins 0.000 description 3
- 102000040945 Transcription factor Human genes 0.000 description 3
- 230000003213 activating effect Effects 0.000 description 3
- 235000004279 alanine Nutrition 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 238000009395 breeding Methods 0.000 description 3
- 230000001488 breeding effect Effects 0.000 description 3
- 230000015556 catabolic process Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 238000006731 degradation reaction Methods 0.000 description 3
- 239000012636 effector Substances 0.000 description 3
- 210000001671 embryonic stem cell Anatomy 0.000 description 3
- 230000007613 environmental effect Effects 0.000 description 3
- 210000003608 fece Anatomy 0.000 description 3
- 210000001733 follicular fluid Anatomy 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 230000002452 interceptive effect Effects 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 230000035935 pregnancy Effects 0.000 description 3
- 230000004853 protein function Effects 0.000 description 3
- 108020003175 receptors Proteins 0.000 description 3
- 102000005962 receptors Human genes 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 229940054269 sodium pyruvate Drugs 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 238000010374 somatic cell nuclear transfer Methods 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 108700020534 tetracycline resistance-encoding transposon repressor Proteins 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- 239000013603 viral vector Substances 0.000 description 3
- 241000272517 Anseriformes Species 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- 108010077544 Chromatin Proteins 0.000 description 2
- 241000611330 Chryseobacterium Species 0.000 description 2
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 2
- 101710096438 DNA-binding protein Proteins 0.000 description 2
- 241000702421 Dependoparvovirus Species 0.000 description 2
- 206010012735 Diarrhoea Diseases 0.000 description 2
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- 108060002716 Exonuclease Proteins 0.000 description 2
- 108010046276 FLP recombinase Proteins 0.000 description 2
- 101000834253 Gallus gallus Actin, cytoplasmic 1 Proteins 0.000 description 2
- 239000007995 HEPES buffer Substances 0.000 description 2
- 239000012981 Hank's balanced salt solution Substances 0.000 description 2
- 101000687346 Homo sapiens PR domain zinc finger protein 2 Proteins 0.000 description 2
- 108010003272 Hyaluronate lyase Proteins 0.000 description 2
- 102000001974 Hyaluronidases Human genes 0.000 description 2
- 102100034343 Integrase Human genes 0.000 description 2
- 108010061833 Integrases Proteins 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- 102000009151 Luteinizing Hormone Human genes 0.000 description 2
- 108010073521 Luteinizing Hormone Proteins 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 2
- 108700011259 MicroRNAs Proteins 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 102100024885 PR domain zinc finger protein 2 Human genes 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 108091093037 Peptide nucleic acid Proteins 0.000 description 2
- 108091000080 Phosphotransferase Proteins 0.000 description 2
- 241001135989 Porcine reproductive and respiratory syndrome virus Species 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 101710198474 Spike protein Proteins 0.000 description 2
- 241000193985 Streptococcus agalactiae Species 0.000 description 2
- 241001291896 Streptococcus constellatus Species 0.000 description 2
- 241000194042 Streptococcus dysgalactiae Species 0.000 description 2
- 241000194045 Streptococcus macacae Species 0.000 description 2
- 241000193991 Streptococcus parasanguinis Species 0.000 description 2
- 241001400864 Streptococcus pseudoporcinus Species 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 108010064672 Tre-Recombinase Proteins 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- 239000012190 activator Substances 0.000 description 2
- 210000004504 adult stem cell Anatomy 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 210000000227 basophil cell of anterior lobe of hypophysis Anatomy 0.000 description 2
- RYYVLZVUVIJVGH-UHFFFAOYSA-N caffeine Chemical compound CN1C(=O)N(C)C(=O)C2=C1N=CN2C RYYVLZVUVIJVGH-UHFFFAOYSA-N 0.000 description 2
- 150000001768 cations Chemical class 0.000 description 2
- 239000006285 cell suspension Substances 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 210000003483 chromatin Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 230000018044 dehydration Effects 0.000 description 2
- 238000006297 dehydration reaction Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 238000002224 dissection Methods 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000009510 drug design Methods 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 210000002308 embryonic cell Anatomy 0.000 description 2
- 102000013165 exonuclease Human genes 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 230000004720 fertilization Effects 0.000 description 2
- 230000003325 follicular Effects 0.000 description 2
- 230000037433 frameshift Effects 0.000 description 2
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 2
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 229960002773 hyaluronidase Drugs 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 230000009027 insemination Effects 0.000 description 2
- 238000005304 joining Methods 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 229940040129 luteinizing hormone Drugs 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 108091070501 miRNA Proteins 0.000 description 2
- 239000002679 microRNA Substances 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 108091027963 non-coding RNA Proteins 0.000 description 2
- 102000042567 non-coding RNA Human genes 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 238000002823 phage display Methods 0.000 description 2
- 102000020233 phosphotransferase Human genes 0.000 description 2
- 230000002028 premature Effects 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 235000019419 proteases Nutrition 0.000 description 2
- 238000010807 real-time PCR kit Methods 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 238000012340 reverse transcriptase PCR Methods 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 210000000130 stem cell Anatomy 0.000 description 2
- 108020003113 steroid hormone receptors Proteins 0.000 description 2
- 102000005969 steroid hormone receptors Human genes 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000001360 synchronised effect Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 210000001550 testis Anatomy 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 238000011282 treatment Methods 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical class OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 238000002604 ultrasonography Methods 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 238000003260 vortexing Methods 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- ALNDFFUAQIVVPG-NGJCXOISSA-N (2r,3r,4r)-3,4,5-trihydroxy-2-methoxypentanal Chemical compound CO[C@@H](C=O)[C@H](O)[C@H](O)CO ALNDFFUAQIVVPG-NGJCXOISSA-N 0.000 description 1
- MXHRCPNRJAMMIM-SHYZEUOFSA-N 2'-deoxyuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-SHYZEUOFSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-SHYZEUOFSA-N 2'‐deoxycytidine Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-SHYZEUOFSA-N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical group OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- LUCHPKXVUGJYGU-XLPZGREQSA-N 5-methyl-2'-deoxycytidine Chemical compound O=C1N=C(N)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 LUCHPKXVUGJYGU-XLPZGREQSA-N 0.000 description 1
- 230000005730 ADP ribosylation Effects 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 102100027211 Albumin Human genes 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- VWAUPFMBXBWEQY-ANULTFPQSA-N Altrenogest Chemical compound C1CC(=O)C=C2CC[C@@H]([C@H]3[C@@](C)([C@](CC3)(O)CC=C)C=C3)C3=C21 VWAUPFMBXBWEQY-ANULTFPQSA-N 0.000 description 1
- 240000007304 Amorphophallus muelleri Species 0.000 description 1
- 208000031295 Animal disease Diseases 0.000 description 1
- 235000002198 Annona diversifolia Nutrition 0.000 description 1
- 241001109946 Aquimarina Species 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 208000031504 Asymptomatic Infections Diseases 0.000 description 1
- 241000193818 Atopobium Species 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000606125 Bacteroides Species 0.000 description 1
- 241001567982 Bacteroides graminisolvens Species 0.000 description 1
- 101150018129 CSF2 gene Proteins 0.000 description 1
- 101150069031 CSN2 gene Proteins 0.000 description 1
- 241000282832 Camelidae Species 0.000 description 1
- 241000711506 Canine coronavirus Species 0.000 description 1
- 241000190890 Capnocytophaga Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 108010001857 Cell Surface Receptors Proteins 0.000 description 1
- 241000282994 Cervidae Species 0.000 description 1
- 241001426140 Chryseobacterium tenax Species 0.000 description 1
- 240000005721 Cirsium palustre Species 0.000 description 1
- 102000029816 Collagenase Human genes 0.000 description 1
- 108060005980 Collagenase Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 108091028732 Concatemer Proteins 0.000 description 1
- 206010010741 Conjunctivitis Diseases 0.000 description 1
- 241001657523 Coriobacteriaceae Species 0.000 description 1
- 241001162417 Coriobacteriaceae bacterium Species 0.000 description 1
- 235000005956 Cosmos caudatus Nutrition 0.000 description 1
- 206010011224 Cough Diseases 0.000 description 1
- 108010051219 Cre recombinase Proteins 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- CKTSBUTUHBMZGZ-UHFFFAOYSA-N Deoxycytidine Natural products O=C1N=C(N)C=CN1C1OC(CO)C(O)C1 CKTSBUTUHBMZGZ-UHFFFAOYSA-N 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 208000035240 Disease Resistance Diseases 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 239000006145 Eagle's minimal essential medium Substances 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 102000009024 Epidermal Growth Factor Human genes 0.000 description 1
- 102400001368 Epidermal growth factor Human genes 0.000 description 1
- 101800003838 Epidermal growth factor Proteins 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241000283074 Equus asinus Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 101100417563 Escherichia coli (strain K12) rpoH gene Proteins 0.000 description 1
- 108091029865 Exogenous DNA Proteins 0.000 description 1
- 241000725579 Feline coronavirus Species 0.000 description 1
- 241001617393 Finegoldia Species 0.000 description 1
- 241000589565 Flavobacterium Species 0.000 description 1
- 241000405386 Flavobacterium frigidarium Species 0.000 description 1
- 241000721361 Flavobacterium soli Species 0.000 description 1
- 241000589601 Francisella Species 0.000 description 1
- 241000589599 Francisella tularensis subsp. novicida Species 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- 102000006771 Gonadotropins Human genes 0.000 description 1
- 108010086677 Gonadotropins Proteins 0.000 description 1
- 241000606790 Haemophilus Species 0.000 description 1
- 241001059853 Haemophilus pittmaniae Species 0.000 description 1
- 241000819598 Haemophilus sputorum Species 0.000 description 1
- 108010056307 Hin recombinase Proteins 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 101000757160 Homo sapiens Aminopeptidase N Proteins 0.000 description 1
- 101000976075 Homo sapiens Insulin Proteins 0.000 description 1
- 101000615488 Homo sapiens Methyl-CpG-binding domain protein 2 Proteins 0.000 description 1
- 244000309467 Human Coronavirus Species 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 1
- 206010022678 Intestinal infections Diseases 0.000 description 1
- LPHGQDQBBGAPDZ-UHFFFAOYSA-N Isocaffeine Natural products CN1C(=O)N(C)C(=O)C2=C1N(C)C=N2 LPHGQDQBBGAPDZ-UHFFFAOYSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- 229930182816 L-glutamine Natural products 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 240000006024 Lactobacillus plantarum Species 0.000 description 1
- 241000282838 Lama Species 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 206010024264 Lethargy Diseases 0.000 description 1
- 241000186781 Listeria Species 0.000 description 1
- 241000186805 Listeria innocua Species 0.000 description 1
- 241000619533 Mesonia Species 0.000 description 1
- 241001505339 Mesonia mobilis Species 0.000 description 1
- 108010006035 Metalloproteases Proteins 0.000 description 1
- 102000005741 Metalloproteases Human genes 0.000 description 1
- 102100021299 Methyl-CpG-binding domain protein 2 Human genes 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- 108010059724 Micrococcal Nuclease Proteins 0.000 description 1
- 241000713869 Moloney murine leukemia virus Species 0.000 description 1
- 206010068052 Mosaicism Diseases 0.000 description 1
- 108010086093 Mung Bean Nuclease Proteins 0.000 description 1
- 101100494762 Mus musculus Nedd9 gene Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 241000588650 Neisseria meningitidis Species 0.000 description 1
- 101100385413 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) csm-3 gene Proteins 0.000 description 1
- 241001292005 Nidovirales Species 0.000 description 1
- 108091007494 Nucleic acid- binding domains Proteins 0.000 description 1
- 241000272458 Numididae Species 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000169855 Olivibacter Species 0.000 description 1
- 241000170398 Olivibacter sitiensis Species 0.000 description 1
- 241000927544 Olsenella Species 0.000 description 1
- 241000990071 Olsenella profusa Species 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 240000007019 Oxalis corniculata Species 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 240000000968 Parkia biglobosa Species 0.000 description 1
- 241000606860 Pasteurella Species 0.000 description 1
- 241000606601 Pasteurella bettyae Species 0.000 description 1
- 241001112692 Peptostreptococcaceae Species 0.000 description 1
- 241000286209 Phasianidae Species 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- RVGRUAULSDPKGF-UHFFFAOYSA-N Poloxamer Chemical compound C1CO1.CC1CO1 RVGRUAULSDPKGF-UHFFFAOYSA-N 0.000 description 1
- 241000605894 Porphyromonas Species 0.000 description 1
- 241000134844 Porphyromonas catoniae Species 0.000 description 1
- 241000605861 Prevotella Species 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 206010037660 Pyrexia Diseases 0.000 description 1
- 108020005067 RNA Splice Sites Proteins 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108010055016 Rec A Recombinases Proteins 0.000 description 1
- 102000001218 Rec A Recombinases Human genes 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 206010057190 Respiratory tract infections Diseases 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 101001025539 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) Homothallic switching endonuclease Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 241000202917 Spiroplasma Species 0.000 description 1
- 241000202907 Spiroplasma apis Species 0.000 description 1
- 241001606419 Spiroplasma syrphidicola Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- 241000194008 Streptococcus anginosus Species 0.000 description 1
- 241000194019 Streptococcus mutans Species 0.000 description 1
- 101000910035 Streptococcus pyogenes serotype M1 CRISPR-associated endonuclease Cas9/Csn1 Proteins 0.000 description 1
- 241000194024 Streptococcus salivarius Species 0.000 description 1
- 240000007349 Streptococcus sp. BS-21 Species 0.000 description 1
- 241001266658 Streptococcus sp. BS35b Species 0.000 description 1
- 241000302474 Streptococcus sp. CM6 Species 0.000 description 1
- 241000059353 Streptococcus sp. SR4 Species 0.000 description 1
- 241001493546 Suina Species 0.000 description 1
- 101100491049 Sus scrofa ANPEP gene Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- 108010020764 Transposases Proteins 0.000 description 1
- 102000008579 Transposases Human genes 0.000 description 1
- 241000589886 Treponema Species 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- GLNADSQYFUSGOU-GPTZEZBUSA-J Trypan blue Chemical compound [Na+].[Na+].[Na+].[Na+].C1=C(S([O-])(=O)=O)C=C2C=C(S([O-])(=O)=O)C(/N=N/C3=CC=C(C=C3C)C=3C=C(C(=CC=3)\N=N\C=3C(=CC4=CC(=CC(N)=C4C=3O)S([O-])(=O)=O)S([O-])(=O)=O)C)=C(O)C2=C1N GLNADSQYFUSGOU-GPTZEZBUSA-J 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 102000006275 Ubiquitin-Protein Ligases Human genes 0.000 description 1
- 108010083111 Ubiquitin-Protein Ligases Proteins 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 241001148134 Veillonella Species 0.000 description 1
- 108010051583 Ventricular Myosins Proteins 0.000 description 1
- 241001416177 Vicugna pacos Species 0.000 description 1
- PTFCDOFLOPIGGS-UHFFFAOYSA-N Zinc dication Chemical compound [Zn+2] PTFCDOFLOPIGGS-UHFFFAOYSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000006154 adenylylation Effects 0.000 description 1
- 101150084233 ago2 gene Proteins 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 229960000971 altrenogest Drugs 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 244000037640 animal pathogen Species 0.000 description 1
- 208000022531 anorexia Diseases 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 101150010487 are gene Proteins 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 239000012298 atmosphere Substances 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 235000015278 beef Nutrition 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 210000001109 blastomere Anatomy 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 229960001948 caffeine Drugs 0.000 description 1
- VJEONQKOZGKCAK-UHFFFAOYSA-N caffeine Natural products CN1C(=O)N(C)C(=O)C2=C1C=CN2C VJEONQKOZGKCAK-UHFFFAOYSA-N 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 101150038500 cas9 gene Proteins 0.000 description 1
- 230000007073 chemical hydrolysis Effects 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 235000013330 chicken meat Nutrition 0.000 description 1
- 229960002424 collagenase Drugs 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 230000002153 concerted effect Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 101150055601 cops2 gene Proteins 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 101150036876 cre gene Proteins 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 235000013365 dairy product Nutrition 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 206010061428 decreased appetite Diseases 0.000 description 1
- 238000012350 deep sequencing Methods 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000006114 demyristoylation Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- MXHRCPNRJAMMIM-UHFFFAOYSA-N desoxyuridine Natural products C1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-UHFFFAOYSA-N 0.000 description 1
- 238000011847 diagnostic investigation Methods 0.000 description 1
- 239000005546 dideoxynucleotide Substances 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 230000000447 dimerizing effect Effects 0.000 description 1
- 208000028659 discharge Diseases 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-N dithiophosphoric acid Chemical group OP(O)(S)=S NAGJZTKCGNOGPW-UHFFFAOYSA-N 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- 230000002500 effect on skin Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000008519 endogenous mechanism Effects 0.000 description 1
- 230000002616 endonucleolytic effect Effects 0.000 description 1
- 238000012407 engineering method Methods 0.000 description 1
- 210000001842 enterocyte Anatomy 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 230000007071 enzymatic hydrolysis Effects 0.000 description 1
- 238000006047 enzymatic hydrolysis reaction Methods 0.000 description 1
- 229940116977 epidermal growth factor Drugs 0.000 description 1
- 239000003797 essential amino acid Substances 0.000 description 1
- 235000020776 essential amino acid Nutrition 0.000 description 1
- 150000002148 esters Chemical group 0.000 description 1
- 230000001036 exonucleolytic effect Effects 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 230000002550 fecal effect Effects 0.000 description 1
- 230000035558 fertility Effects 0.000 description 1
- 239000012091 fetal bovine serum Substances 0.000 description 1
- 210000000604 fetal stem cell Anatomy 0.000 description 1
- 238000011010 flushing procedure Methods 0.000 description 1
- 235000021393 food security Nutrition 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 230000006251 gamma-carboxylation Effects 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 238000010363 gene targeting Methods 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 238000002695 general anesthesia Methods 0.000 description 1
- 229960002518 gentamicin Drugs 0.000 description 1
- 210000001654 germ layer Anatomy 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 239000002622 gonadotropin Substances 0.000 description 1
- 210000002503 granulosa cell Anatomy 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 210000005003 heart tissue Anatomy 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 101150099475 hin gene Proteins 0.000 description 1
- 108091008039 hormone receptors Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 238000007901 in situ hybridization Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 239000012678 infectious agent Substances 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 239000012212 insulator Substances 0.000 description 1
- PBGKTOXHQIOBKM-FHFVDXKLSA-N insulin (human) Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@H]1CSSC[C@H]2C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3C=CC(O)=CC=3)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3NC=NC=3)NC(=O)[C@H](CO)NC(=O)CNC1=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O)=O)CSSC[C@@H](C(N2)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](NC(=O)CN)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C=CC=CC=1)C(C)C)C1=CN=CN1 PBGKTOXHQIOBKM-FHFVDXKLSA-N 0.000 description 1
- 210000004153 islets of langerhan Anatomy 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 238000002350 laparotomy Methods 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 210000005229 liver cell Anatomy 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 210000004216 mammary stem cell Anatomy 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000021121 meiosis Effects 0.000 description 1
- 210000004379 membrane Anatomy 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 102000006240 membrane receptors Human genes 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 239000007758 minimum essential medium Substances 0.000 description 1
- 125000004573 morpholin-4-yl group Chemical group N1(CCOCC1)* 0.000 description 1
- 230000036438 mutation frequency Effects 0.000 description 1
- 230000007498 myristoylation Effects 0.000 description 1
- 238000010449 nuclear transplantation Methods 0.000 description 1
- 230000030648 nucleus localization Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000016087 ovulation Effects 0.000 description 1
- 238000009304 pastoral farming Methods 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 239000002953 phosphate buffered saline Substances 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 150000008298 phosphoramidates Chemical class 0.000 description 1
- SXADIBFZNXBEGI-UHFFFAOYSA-N phosphoramidous acid Chemical compound NP(O)O SXADIBFZNXBEGI-UHFFFAOYSA-N 0.000 description 1
- 210000002826 placenta Anatomy 0.000 description 1
- 210000004508 polar body Anatomy 0.000 description 1
- 229960000502 poloxamer Drugs 0.000 description 1
- 229920001983 poloxamer Polymers 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 230000000270 postfertilization Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000011045 prefiltration Methods 0.000 description 1
- 238000003825 pressing Methods 0.000 description 1
- 230000002294 pubertal effect Effects 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 235000021251 pulses Nutrition 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 230000029058 respiratory gaseous exchange Effects 0.000 description 1
- 208000023504 respiratory system disease Diseases 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 239000013049 sediment Substances 0.000 description 1
- 238000005204 segregation Methods 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 238000009612 semen analysis Methods 0.000 description 1
- 230000001568 sexual effect Effects 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 206010041232 sneezing Diseases 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 238000010396 two-hybrid screening Methods 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- VBEQCZHXXJYVRD-GACYYNSASA-N uroanthelone Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(C)C)[C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CS)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O)C(C)C)[C@@H](C)CC)C1=CC=C(O)C=C1 VBEQCZHXXJYVRD-GACYYNSASA-N 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 230000004580 weight loss Effects 0.000 description 1
- 208000016261 weight loss Diseases 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K67/00—Rearing or breeding animals, not otherwise provided for; New or modified breeds of animals
- A01K67/027—New or modified breeds of vertebrates
- A01K67/0275—Genetically modified vertebrates, e.g. transgenic
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/485—Exopeptidases (3.4.11-3.4.19)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/11—Aminopeptidases (3.4.11)
- C12Y304/11002—Membrane alanyl aminopeptidase (3.4.11.2), i.e. aminopeptidase N
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/07—Animals genetically altered by homologous recombination
- A01K2217/075—Animals genetically altered by homologous recombination inducing loss of function, i.e. knock out
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/108—Swine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/02—Animal zootechnically ameliorated
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
Definitions
- the present teachings relate to gene edited livestock animals and the editing of amino peptidase N (ANPEP) to provide coronavirus resistance.
- ANPEP amino peptidase N
- TGEV transmissible gastroenteritis virus
- PEDV porcine epidemic diarrhea virus
- PEDV and TGEV belong to the genus Alphacoronavirus in the family Coronaviridae (Lin, C.-M., et al., Journal of Virology, 2015, 89, 3332-3342).
- Porcine respiratory coronavirus (PRCV) is also an Alphacoronavirus and is closely related to TGEV.
- PRCV generally causes subclinical infection or mild respiratory disease, but severe cases have been described and there is evidence that it may worsen the severity of disease when pigs are dually infected with both PRCV and another virus such as porcine respiratory and reproductive syndrome virus (PRRSV) (Killoren, K. E. et al.
- PRCV-positive status of a herd may have economic implications, because some countries will not import animals that are PRCV-positive.
- Coronaviruses are enveloped, single stranded, positive sense RNA viruses, placed in the order, Nidovirales.
- the characteristic hallmark of nidoviruses is the synthesis of a nested set of subgenomic mRNAs.
- the unique structural feature of coronaviruses is the “corona” formed by the spike proteins protruding from the surface of the virion. Even though the viral spike protein is the primary receptor protein for all coronaviruses, the corresponding cell surface receptors vary (Li, F., Journal of Virology, 2015, 89, 1954- 1964). Delmas et al. were the first to characterize porcine aminopeptidase N (ANPEP, APN, or CD13) as a candidate receptor for TGEV (Delmas, B., et al., Nature, 1992, 357 417-420).
- the present teachings provide for and include livestock animals that have gene edits in the ANPEP gene confer resistance to coronaviruses.
- the present teachings can include a livestock animal or progeny thereof with a genome comprising an edited ANPEP gene comprising a premature stop codon.
- an ANPEP protein encoded by the edited ANPEP gene can show reduced binding to a coronavirus relative to an unedited ANPEP protein.
- the coronavirus can be transmissible gastroenteritis virus (TGEV), porcine epidemic diarrhea virus (PEDV), or Porcine respiratory coronavirus (PRCV).
- TGEV gastroenteritis virus
- PEDV porcine epidemic diarrhea virus
- PRCV Porcine respiratory coronavirus
- the coronavirus can be TGEV.
- the protein produced by the edited ANPEP gene shows no binding to TGEV.
- the premature stop codon can be in exon 2, 3, 4, 16, or 18.
- the premature stop codon can be in exon 2. In various configurations, the premature stop codon can be in exon 3. In various configurations, the premature stop codon can be in exon 4. In various configurations, the premature stop codon can be in exon 16. In various configurations, the premature stop codon can be in exon 18.
- the animal can be a pig.
- the edited ANPEP gene can comprise SEQ ID NO: 252, 253, 254, 255, 256, or 257.
- the edited ANPEP gene can comprise SEQ ID NO: 252.
- the edited ANPEP gene can comprise SEQ ID NO: 253.
- the edited ANPEP gene can comprise SEQ ID NO: 254.
- the edited ANPEP gene can comprise SEQ ID NO: 255.
- the edited ANPEP gene can comprise SEQ ID NO: 256.
- the edited ANPEP gene can comprise SEQ ID NO: 257.
- a cell isolated from the animal can have an ANPEP gene as described herein.
- the present teachings can include an isolated cell line obtained from the livestock animal as described herein.
- the present teachings can include an isolated fibroblast line obtained from the livestock animal as described herein.
- a method of producing a gene edited livestock animal comprising: introducing into an isolated cell: a Cas9 protein or a nucleic acid encoding a Cas9 protein; and a pair of gRNAs that each create a double stranded break in an ANPEP gene such that a premature stop codon is formed when the double stranded breaks are repaired.
- the method can further comprise producing an animal from the isolated cell.
- the pair of gRNAs can be SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 102 and 10, SEQ ID NOs: 58 and 52, SEQ ID NOs: 58 and 26, SEQ ID NOs: 58 and 10, SEQ ID NOs: 32 and 26, SEQ ID NOs: 32 and 10, SEQ ID NOs: 120 and 111, SEQ ID NOs: 119 and 113, SEQ ID NOs: 115 and 111, SEQ ID NOs 137 and 130, SEQ ID NOs: 133 and 130, SEQ ID NOs: 137 and 128, SEQ ID NOs: 133 and 128, SEQ ID NOs: 189 and 183, SEQ ID NOs: 231 and 208, SEQ ID NOs: 228 and 213, SEQ ID NOs: 216 and 213, or SEQ ID NOs: 216 and 208.
- the pair of gRNAs can be SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 58 and 52, SEQ ID Nos: 120 and 111, SEQ ID NOs: 231 and 208, or SEQ ID NOs: 228 and 213.
- the pair of gRNAs can be SEQ ID NOs: 102 and 52.
- the pair of gRNAs can be SEQ ID NOs: 102 and 26.
- the pair of gRNAs can be SEQ ID NOs: 58 and 52.
- the pair of gRNAs can be SEQ ID Nos: 120 and 111.
- the pair of gRNAs can be SEQ ID NOs: 231 and 208.
- the pair of gRNAs can be SEQ ID NOs: 228 and 213.
- the edited ANPEP gene can comprise SEQ ID NO: 252, 253, 254, 255, 256, or 257.
- the isolated cell can be an oocyte, a zygote, or a fibroblast cell. In various configurations, the isolated cell can be a zygote. In various configurations, the isolated cell can be a fibroblast cell. In various configurations, the isolated cell can be an oocyte. In some configurations, the producing an animal can comprise fertilizing the oocyte. In some configurations, the producing an animal can further comprise implanting the fertilized oocyte into a surrogate mother. In various configurations, the livestock animal can be resistant to coronavirus. In various configurations, the livestock animal can be a pig.
- the present teachings can include at least one gRNA having a sequence selected from the group consisting of SEQ ID NOs: 10, 26, 32, 52, 58, 102, 111, 113, 115, 119, 120, 128, 130, 133, 137, 183, 189, 208, 213, 216, 228, and 231.
- the at least one gRNA is a pair of gRNAs comprising, consisting of, or selected from the group consisting of SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 102 and 10, SEQ ID NOs: 58 and 52, SEQ ID NOs: 58 and 26, SEQ ID NOs: 58 and 10, SEQ ID NOs: 32 and 26, SEQ ID NOs: 32 and 10, SEQ ID NOs: 120 and 111, SEQ ID NOs: 119 and 113, SEQ ID NOs: 115 and 111, SEQ ID NOs 137 and 130, SEQ ID NOs: 133 and 130, SEQ ID NOs: 137 and 128, SEQ ID NOs 133 and 128, SEQ ID NOs: 189 and 183, SEQ ID NOs: 231 and 208, SEQ ID NOs: 228 and 213, SEQ ID NOs: 216 and 213, and/or SEQ ID NOs: 216 and 208.
- the at least one gRNA is a pair of gRNAs comprising, consisting of, or selected from the group consisting of SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 58 and 52, SEQ ID Nos: 120 and 111, SEQ ID NOs: 231 and 208, and/or SEQ ID NOs. 228 and 213.
- the at least one gRNA is a pair of gRNAs comprising SEQ ID NOs: 102 and 52.
- the at least one gRNA is a pair of gRNAs comprising SEQ ID NOs: 102 and 26.
- the at least one gRNA is a pair of gRNAs comprising SEQ ID NOs: 58 and 52. In various configurations, the at least one gRNA is a pair of gRNAs comprising SEQ ID Nos: 120 and 111. In various configurations, the at least one gRNA is a pair of gRNAs comprising SEQ ID NOs: 231 and 208. In various configurations, the at least one gRNA is a pair of gRNAs comprising SEQ ID NOs. 228 and 213.
- the present disclosure relates to livestock animals and progeny thereof comprising at least one edited chromosomal sequence that reduces expression or activity of an ANPEP protein.
- the disclosure further relates to animal cells comprising at least one edited chromosomal sequence that reduces expression or activity of an ANPEP protein.
- the animals and cells have chromosomal gene edits (e.g., insertions, deletions, or substitutions) that inactivate or otherwise modulate ANPEP expression or activity.
- ANPEP plays important roles in coronavirus resistance.
- the animals and cells can be created using any number of protocols, including those that make use of gene editing.
- nucleic acids are written left to right in 5' to 3' orientation; amino acid sequences are written left to right in amino to carboxy orientation, respectively.
- Amino acids may be referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly accepted singleletter codes.
- range such as from 1 to 6 should be considered to have specifically disclosed sub-ranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual numbers within that range, for example, 1, 2, 3, 4, 5, and 6, and decimals and fractions, for example, 1.2, 3.8, I’A, and 4 3 /4 This applies regardless of the breadth of the range.
- the term “cell” as used herein can refer to an oocyte, a sperm cell, a zygote, an embryonic cell, a germ layer cell, or a differentiated cell.
- Whole genome edits are typically performed in oocytes, sperm, zygotic, or embryonic cells.
- tissue specific edits may be made in tissue cells or tissue progenitor cells.
- Cas refers to a “CRISPR associated” protein.
- Cas proteins include Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cash, Cas7, Cas8, CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, Csf4, homologs thereof, or modified versions thereof.
- Cas9 (formerly referred to as Cas5, Csnl, or Csxl2) herein refers to a Cas endonuclease of a type II CRISPR system that forms a complex with a crNucleotide and a tracrNucleotide, or with a single guide polynucleotide, for specifically recognizing and cleaving all or part of a DNA target sequence.
- Cleavage refers to the breakage of the covalent backbone of a DNA molecule. Cleavage can be initiated by a variety of methods including, but not limited to, enzymatic or chemical hydrolysis of a phosphodi ester bond. Both single-stranded cleavage and double-stranded cleavage are possible, and double-stranded cleavage can occur as a result of two distinct single-stranded cleavage events. DNA cleavage can result in the production of either blunt ends or staggered ends. In certain embodiments, fusion polypeptides are used for targeted double-stranded DNA cleavage.
- CRISPR system refers collectively to transcripts and other elements involved in the expression of or directing the activity of CRISPR-associated (“Cas") genes, including sequences encoding a Cas gene, a tracr (trans-activating CRISPR) sequence (e.g. tracrRNA or an active partial tracrRNA), a tracr-mate sequence (encompassing a "direct repeat” and a tracrRNA-processed partial direct repeat in the context of an endogenous CRISPR system), a guide sequence (also referred to as a "spacer” in the context of an endogenous CRISPR system), or other sequences and transcripts from a CRISPR locus.
- a tracr trans-activating CRISPR
- tracr-mate sequence encompassing a "direct repeat” and a tracrRNA-processed partial direct repeat in the context of an endogenous CRISPR system
- guide sequence also referred to as a "spacer” in the context of an endogenous CRISPR system
- nucleic acid sequences refers to those nucleic acids which encode identical or conservatively edited variants of the amino acid sequences. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide. Such nucleic acid variations are "silent variations" and represent one species of conservatively edited variation. Every nucleic acid sequence herein that encodes a polypeptide also, by reference to the genetic code, describes every possible silent variation of the nucleic acid.
- each codon in a nucleic acid can be edited to yield a functionally identical molecule. Accordingly, each silent variation of a nucleic acid which encodes a polypeptide of the present teachings is implicit in each described polypeptide sequence and is within the scope of the present disclosure.
- amino acid sequences one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters, adds, or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a "conservatively edited variant" where the alteration results in the substitution of an amino acid with a chemically similar amino acid.
- any number of amino acid residues selected from the group of integers consisting of from 1 to 15 can be so altered.
- 1, 2, 3, 4, 5, 7, or 10 alterations can be made.
- Conservatively edited variants typically provide similar biological activity as the unedited polypeptide sequence from which they are derived.
- substrate specificity, enzyme activity, or ligand/receptor binding is generally at least 30%, 40%, 50%, 60%, 70%, 80%, or 90% of the native protein for its native substrate.
- Conservative substitution tables providing functionally similar amino acids are well known in the art.
- the following six groups each contain amino acids that are conservative substitutions for one another: [1] Alanine (A), Serine (S), Threonine (T); [2] Aspartic acid (D), Glutamic acid (E); [3] Asparagine (N), Glutamine (Q); [4] Arginine (R), Lysine (K); [5] Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and [6] Phenylalanine (F), Tyrosine (Y), Tryptophan (W). See also, Creighton (1984) Proteins W. H. Freeman and Company.
- Exogenous refers to a nucleic acid sequence originating outside an organism that has been introduced into the organism. This can refer to sequences which naturally occur in a sexual compatible species, sequences which are synthetic, or sequences from another species.
- full-length sequence in reference to a specified polynucleotide or its encoded protein means having the entire amino acid sequence of a native (nonsynthetic), endogenous, biologically active form of the specified protein.
- Methods to determine whether a sequence is full-length are well known in the art including such exemplary techniques as northern or western blots, primer extension, SI protection, and ribonuclease protection.
- Comparison to known full-length homologous (orthologous and/or paralogous) sequences can also be used to identify full-length sequences of the present disclosure. Additionally, consensus sequences typically present at the 5' and 3' untranslated regions of mRNA aid in the identification of a polynucleotide as full-length.
- the consensus sequence ANNNN/H/G’G where the underlined codon represents the N- terminal methionine, aids in determining whether the polynucleotide has a complete 5' end.
- Consensus sequences at the 3' end such as polyadenylation sequences, aid in determining whether the polynucleotide has a complete 3' end.
- gene editing refers to the use of naturally occurring or artificially engineered nucleases, also referred to as “molecular scissors.”
- the nucleases create specific double-stranded break (DSBs) at desired locations in the genome, which in some cases harnesses the cell’s endogenous mechanisms to repair the induced break by natural processes of homologous recombination (HR) and/or nonhomologous end-joining (NHEJ).
- HR homologous recombination
- NHEJ nonhomologous end-joining
- Gene editing effectors include Zinc Finger Nucleases (ZFNs), Transcription Activator-Like Effector Nucleases (TALENs), the Clustered Regularly Interspaced Short Palindromic Repeats/CAS (CRISPR/Cas) system, and meganucleases re-engineered as homing endonucleases.
- ZFNs Zinc Finger Nucleases
- TALENs Transcription Activator-Like Effector Nucleases
- CRISPR/Cas Clustered Regularly Interspaced Short Palindromic Repeats/CAS
- meganucleases re-engineered as homing endonucleases.
- the terms also include the use of genetic editing procedures and techniques, including, for example, where the change is relatively small and/or does not introduce DNA from a foreign species.
- the term “gene edit”, “gene edited”, or “genetically edited” refers to an organism where human intervention, such as but without limitation by using a gene editing effector, has created a genetic difference in its genome when compared to a wild type genome of the same organism. These differences can include but are not limited to nucleotide substitutions, excision of a start codon, or small deletions that do not introduce frame shift mutations into the genome but may excise an exon or form a premature stop codon when the ends of the deletion are ligated together.
- a gene edit does not introduce DNA from another species into an organism.
- a “gene edited animal” refers to an animal with one or more cells comprising a gene edit.
- heterologous in reference to a nucleic acid is a nucleic acid that originates from a foreign species, or, if from the same species, is substantially changed from its native form in composition and/or genomic locus by deliberate human intervention.
- a promoter operably linked to a heterologous structural gene is from a species different from that from which the structural gene was derived, or, if from the same species, one or both are substantially changed from their original form.
- a heterologous protein may originate from a foreign species or, if from the same species, is substantially changed from its original form by deliberate human intervention.
- homosing DNA technology covers any mechanisms that allow a specified molecule to be targeted to a specified DNA sequence including Zinc Finger (ZF) proteins, Transcription Activator-Like Effectors (TALEs) meganucleases, and CRISPR systems (e.g., CRISPR/Cas9 systems).
- ZF Zinc Finger
- TALEs Transcription Activator-Like Effectors
- CRISPR systems e.g., CRISPR/Cas9 systems.
- increased resistance and “reduced susceptibility” herein mean, but are not limited to, a statistically significant reduction of the incidence and/or severity of clinical signs or clinical symptoms which are associated with infection by pathogen.
- increased resistance or “reduced susceptibility” can refer to a statistically significant reduction of the incidence and/or severity of clinical signs or clinical symptoms which are associated with infection by a coronavirus in an animal comprising an edited chromosomal sequence as compared to a control animal having an unedited chromosomal sequence.
- statically significant reduction of clinical symptoms means, but is not limited to, the frequency in the incidence of at least one clinical symptom in the edited group of subjects is at least 10%, preferably at least 20%, more preferably at least 30%, even more preferably at least 50%, and even more preferably at least 70% lower than in the non-edited control group after the challenge with the infectious agent.
- livestock animal includes any animals traditionally raised in livestock farming, for example an ungulate (e.g., an artiodactyl), an avian animal (e.g., chickens, turkeys, ducks, geese, guinea fowl, or squabs), an equine animal (e.g, horses or donkeys).
- Ungulates include, but are not limited to porcine animals e.g, pigs), bovine animals (e.g., beef or dairy cattle, buffalo), ovine animals, caprine animals, camels, llamas, alpacas, and deer.
- the term does not include rats, mice, or other rodents.
- nucleic acid includes reference to a deoxyribonucleotide or ribonucleotide polymer in either single-or double-stranded form, and unless otherwise limited, encompasses conservatively edited variants and known analogues having the essential nature of natural nucleotides in that they hybridize to single-stranded nucleic acids in a manner similar to naturally occurring nucleotides (e. g., peptide nucleic acids).
- mutation includes alterations in the nucleotide sequence of a polynucleotide, such as for example a gene or coding DNA sequence (CDS), compared to the wild-type sequence.
- CDS coding DNA sequence
- the term includes, without limitation, substitutions, insertions, frameshifts, deletions, inversions, translocations, duplications, splice-donor site mutations, point-mutations and the like.
- polypeptide polypeptide
- peptide protein
- proteins proteins
- amino acid residues amino acid residues
- the essential nature of such analogues of naturally occurring amino acids is that, when incorporated into a protein, the protein is specifically reactive to antibodies elicited to the same protein but consisting entirely of naturally occurring amino acids.
- polypeptide is also inclusive of modifications including, but not limited to, glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, hydroxylation and ADP- ribosylation. It will be appreciated, as is well known and as noted above, that polypeptides are not always entirely linear. For instance, polypeptides may be branched as a result of ubiquitization, and they may be circular, with or without branching, generally as a result of post-translation events, including natural processing event and events brought about by human manipulation which do not occur naturally.
- Circular, branched and branched circular polypeptides may be synthesized by non-translation natural process and by entirely synthetic methods, as well. Further, the present teachings contemplate the use of both the methionine-containing and the methionine-less amino terminal variants of the protein of the present disclosure.
- promoter includes reference to a region of DNA upstream from the start of transcription and involved in recognition and binding of RNA polymerase and other proteins to initiate transcription. Examples of promoters under developmental control include promoters that preferentially initiate transcription in certain tissues, such as testes, ovaries, or placenta. Such promoters are referred to as "tissue-preferred”. Promoters which initiate transcription only in certain tissue are referred to as "tissue-specific”.
- a “cell type” specific promoter primarily drives expression in certain cell types in one or more organs, for example, germ cells in testes or ovaries.
- An "inducible” or “repressible” promoter is a promoter which is under environmental control. Examples of environmental conditions that may affect transcription by inducible promoters include stress, and temperature. Tissue-specific, tissue-preferred, cell-type specific and inducible promoters constitute the class of "non-constitutive" promoters.
- a “constitutive” promoter is a promoter which is active under most environmental conditions.
- amino acid residue or “amino acid residue” or “amino acid” are used interchangeably herein to refer to an amino acid that is incorporated into a protein, polypeptide, or peptide (collectively “protein”).
- the amino acid may be a naturally occurring amino acid and, unless otherwise limited, may encompass non-natural analogs of natural amino acids that can function in a similar manner as naturally occurring amino acids.
- “Resistance" of an animal to a disease is a characteristic of an animal, wherein the animal avoids the disease symptoms that are the outcome of animal-pathogen interactions, such as interactions between a porcine animal and coronavirus. That is, pathogens are prevented from causing animal diseases and the associated disease symptoms, or alternatively, a reduction of the incidence and/or severity of clinical signs or reduction of clinical symptoms.
- pathogens are prevented from causing animal diseases and the associated disease symptoms, or alternatively, a reduction of the incidence and/or severity of clinical signs or reduction of clinical symptoms.
- a "TALE DNA binding domain” or “TALE” is a polypeptide comprising one or more TALE repeat domains/units. The repeat domains are involved in binding of the TALE to its cognate target DNA sequence.
- a single “repeat unit” (also referred to as a “repeat”) is typically 33-35 amino acids in length and exhibits at least some sequence homology with other TALE repeat sequences within a naturally occurring TALE protein.
- Zinc finger and TALE binding domains can be "engineered” to bind to a predetermined nucleotide sequence, for example via engineering (altering one or more amino acids) of the recognition helix region of naturally occurring zinc finger or TALE proteins.
- engineered DNA binding proteins are proteins that are non- naturally occurring.
- methods for engineering DNA-binding proteins are design and selection.
- a designed DNA binding protein is a protein not occurring in nature whose design/ composition results principally from rational criteria. Rational criteria for design include application of substitution rules and computerized algorithms for processing information in a database storing information of existing ZFP and/or TALE designs and binding data. See, for example, U.S. Pat. Nos. 6,140,081; 6,453,242; and 6,534,261; see also WO 98/53058; WO 98/53059; WO 98/53060; WO 02/016536 and WO 03/016496 and U.S. Publication No. 20110301073.
- a “zinc finger DNA binding protein” (or binding domain) is a protein, or a domain within a larger protein, that binds DNA in a sequence-specific manner through one or more zinc fingers, which are regions of amino acid sequence within the binding domain whose structure is stabilized through coordination of a zinc ion.
- the term zinc finger DNA binding protein is often abbreviated as zinc finger protein or ZFP.
- a "selected" zinc finger protein or TALE is a protein not found in nature whose production results primarily from an empirical process such as phage display, interaction trap or hybrid selection. See e.g., U.S. Pat. No. 5,789,538; U.S. Pat. No. 5,925,523; U.S. Pat. No. 6,007,988; U.S. Pat. No. 6,013,453; U.S. Pat. No. 6,200,759; WO 95/19431; WO 96/06166; WO 98/53057; WO 98/54311; WO 00/27878; WO 01/60970 WO 01/88197, WO 02/099084 and U.S. Publication No. 20110301073.
- Wild type means those animals and blastocysts, embryos or cells derived therefrom, which have not been gene edited and are usually inbred and outbred strains developed from naturally occurring strains.
- sequence relationships between a polynucleotide/polypeptide of the present teachings with a reference polynucleotide/polypeptide (a)"reference sequence", (b)” comparison window", (c) "sequence identity”, and (d)"percentage of sequence identity”.
- reference sequence is a defined sequence used as a basis for sequence comparison with a polynucleotide/polypeptide of the present teachings.
- a reference sequence may be a subset or the entirety of a specified sequence; for example, as a segment of a full-length cDNA or gene sequence, or the complete cDNA or gene sequence.
- comparison window includes reference to a contiguous and specified segment of a polynucleotide/polypeptide sequence, wherein the polynucleotide/polypeptide sequence may be compared to a reference sequence and wherein the portion of the polynucleotide/polypeptide sequence in the comparison window may comprise additions or deletions (/. ⁇ ., gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences.
- the comparison window is at least 20 contiguous nucleotides/amino acids residues in length, and optionally can be 30, 40, 50, 100, or longer.
- Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith, T.F. and Waterman, M.S., Adv. Appl. Math., 1981, 2, 482-489; by the homology alignment algorithm of Needleman, S.B. and Wunsch, C.D. (J. Mol. Biol., 1970, 48, 443-453); by the search for similarity method of Pearson, W.R. and Lipman, D.J. (Proc. Natl. Acad.
- the BLAST family of programs that can be used for database similarity searches includes: BLASTN for nucleotide query sequences against nucleotide database sequences; BLASTX for nucleotide query sequences against protein database sequences; BLASTP for protein query sequences against protein database sequences; TBLASTN for protein query sequences against nucleotide database sequences; and TBLASTX for nucleotide query sequences against nucleotide database sequences.
- the BLAST algorithm In addition to calculating percent sequence identity, the BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin, S & Altschul, S.F., Proc. Nat'l. Acad. Sci. USA, 1993, 90, 5873-5877).
- Anumber of low- complexity filter programs can be employed to reduce such low-complexity alignments.
- the SEG Wang, J.C. and Federhen, S., Comput. Chem., 1993, 17, 149-163
- XNU Caverie, J.M. and States, D.J., Comput. Chem., 1993, 17, 191-201
- nucleotide and protein identity/ similarity values provided herein are calculated using GAP (GCG Version 10) under default values.
- GAP Global Alignment Program
- GAP uses the algorithm of Needleman, S.B. and Wunsch, C.D. (J. Mol. Biol., 1970, 48: 443-453), to find the alignment of two complete sequences that maximizes the number of matches and minimizes the number of gaps.
- GAP represents one member of the family of best alignments. There may be many members of this family, but no other member has a better quality.
- GAP displays four figures of merit for alignments: Quality, Ratio, Identity, and Similarity.
- the Quality is the metric maximized in order to align the sequences. Ratio is the quality divided by the number of bases in the shorter segment. Percent Identity is the percent of the symbols that actually match. Percent Similarity is the percent of the symbols that are similar. Symbols that are across from gaps are ignored. A similarity is scored when the scoring matrix value for a pair of symbols is greater than or equal to 0.50, the similarity threshold.
- the scoring matrix used in Version 10 of the Wisconsin Genetics Software Package is BLOSUM62 (see Henikoff, S. & Henikoff, J.G., Proc. Natl. Acad. Sci. USA, 1989, 89, 10915-10919).
- sequence identity in the context of two nucleic acid or polypeptide sequences includes reference to the residues in the two sequences which are the same when aligned for maximum correspondence over a specified comparison window.
- sequence identity When percentage of sequence identity is used in reference to proteins it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g. charge or hydrophobicity) and therefore do not change the functional properties of the molecule. Where sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Sequences which differ by such conservative substitutions are said to have "sequence similarity" or “similarity”.
- Means for making this adjustment are well-known to those of skill in the art. Typically this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1.
- the scoring of conservative substitutions may be calculated according to the algorithm of Meyers. E.W. and Miller, W., Computer Applic. Biol. Sci., 1988, 4: 11-17, for example as implemented in the program PC/GENE (Intelligenetics, Mountain View, California, USA).
- percentage of sequence identity means the value determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (/. ⁇ ., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
- porcine amino peptidase N has been reported to be involved in transmissible gastroenteritis virus (TGEV) infection.
- Porcine ANPEP is a 963 amino acid, type II membrane metallopeptidase, that is responsible for removing N-terminal amino acids from protein substrates during digestion.
- Sus scrofa ANPEP is a membrane alanyl aminopeptidase described at ENSEMBL (useast.ensembl.org/) accession number ENS SSCG00000001849, located on the reverse strand of the 7th chromosome.
- ENSEMBL has six transcripts. The gene was listed in ENSSSCG00000001849.5 (SEQ ID NO: 258) and the full transcript was designated as ANPEP -205, and the locations of the exons on the 7th chromosome are listed below in Table 1 (these data are from Ensembl release 109).
- the present inventors designed edits to introduce a premature stop codon in order to make a truncated protein.
- gRNAs were engineered to create double stranded breaks in the gene that, when repaired via non-homologous end joining (NHEJ), create a stop codon.
- NHEJ non-homologous end joining
- the present inventors developed two potential strategies: one strategy called for introduction of an early stop codon in Exon 2, 3, or 4.
- the gRNAs listed in SEQ ID NOs: 1-106 and 147-175 are directed to Exon 2.
- the gRNAs listed in SEQ ID NOs: 107-126 and 176 are directed to Exon 3.
- the gRNAs listed in SEQ ID NOs: 127-147 and 177-182 are directed to Exon 4.
- the second strategy was to introduce an exogenous stop codon late in the gene — in exon 15, 16, 17, or 18.
- the gRNAs listed in SEQ ID NOs: 183-190 and 240 are directed to Exon 16.
- the gRNAS listed in SEQ ID NOs: 191-201 and 241 are directed to Exon 17.
- the gRNAs listed in SEQ ID NOs: 202-239 and 242-251 are directed to Exon 18.
- gRNAs listed in SEQ ID NOs: 1-146 and 183-239 were designed for use with spCas9.
- SEQ ID NOs: 147-182 and 240-251 were designed for use with SthC3Cas9.
- the present disclosure provides a livestock animal or animal cell, including sperm or egg cells, with improved resistance to coronavirus, specifically transmissible gastroenteritis virus (TEGV).
- the livestock animals or cells comprise altered expression or activity of an ANPEP protein.
- the livestock animals or cells can comprise at least one edited chromosomal sequence that reduces expression or activity of the ANPEP protein.
- the chromosomal sequence may be (1) inactivated, (2) edited, or (3) comprise an integrated sequence.
- An inactivated chromosomal sequence is altered such that ANPEP protein function is impaired, reduced or eliminated.
- reduced ANPEP protein function or activity refers to a reduction in protein function or activity relative to the function of a wild type ANPEP protein.
- a gene edited animal comprising an inactivated chromosomal sequence may be termed a “knock-out”.
- a genetically edited animal comprising an integrated sequence may be termed a “knock in”.
- a gene edited animal comprising an edited chromosomal sequence may comprise a targeted point edit(s) or other edit such that an altered protein product is produced.
- the process can comprise using a CRISPR system (e.g., a CRISPR/Cas9 system) to edit the genomic sequence.
- a CRISPR system e.g., a CRISPR/Cas9 system
- Cas9 Cas9
- an mRNA that encodes Cas9 can be delivered to a cell, or a gene that provides for expression of an mRNA that encodes Cas9 can be delivered to a cell.
- target specific crRNA and a tracrRNA can be delivered directly to a cell or target specific sgRNA(s) can be delivered to a cell (these RNAs can alternatively be produced by a gene constructed to express these RNAs).
- the process of editing chromosomal sequences using a CRISPR system is rapid, precise, and highly efficient.
- an ANPEP locus is used as a target site for the site-specific editing. This can include insertion of an exogenous nucleic acid (e.g., a nucleic acid comprising a nucleotide sequence encoding a polypeptide of interest) or deletions of nucleic acids from the locus. In particular embodiments, insertions and/or deletions result in an edited locus. For example, integration of the premature nucleic acid and/or deletion of part of the genomic nucleic acid may edit the locus so as to produce a disrupted (/. ⁇ ., inactivated) ANPEP gene.
- the ANPEP protein encoded by the edited nucleic acid can have reduced coronavirus binding relative to an unedited ANPEP protein. In some configurations, the edited ANPEP protein may not bind to a coronavirus.
- any of the animals or cells can be an animal or cell that has been gene edited using a Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/Cas system.
- CRISPR/Cas system can suitably comprise any of the guide RNAs (gRNAs) described herein.
- the edited chromosomal sequence can be an edited chromosomal sequence that has been produced via homology directed repair (HDR).
- the edited chromosomal sequence can be an edited chromosomal sequence that has been produced via non-homologous end-joining (NHEJ).
- the edited chromosomal sequence reduces the susceptibility of the animal, progeny, or cell to infection by a coronavirus, as compared to the susceptibility of a livestock animal, progeny, or cell that does not comprise the edited chromosomal sequence.
- coronavirus nucleic acid is detected at lower titers in the nasal secretions, feces, or serum relative to a control.
- coronavirus antigen is detected at lower concentrations in the tissues of the animal e.g., in lung tissue) relative to a control.
- the edited chromosomal sequence preferably substantially eliminates susceptibility of the animal, progeny, or cell to the coronavirus.
- the edited chromosomal sequence more preferably completely eliminates susceptibility of the animal, progeny, or cell to the coronavirus, such that animals do not show any clinical signs of disease following exposure to the coronavirus.
- porcine animals having the edited chromosomal sequence do not show any clinical signs of coronavirus (e.g., fever, lethargy, anorexia, weight loss, nasal and ocular discharge, cough, sneezing, conjunctivitis, diarrhea, and/or breathing difficulties) following exposure to the coronavirus.
- coronavirus nucleic acid cannot be detected in the nasal secretions, feces, or serum
- coronavirus antigen cannot be detected in the tissues of the animal (e.g., in lung tissue)
- serum is negative for coronavirusspecific antibody.
- cells having the edited chromosomal sequence that are exposed to the pathogen do not become infected with the pathogen.
- exemplary coronaviruses include TEGV, PEDV, and PRCV.
- the edited chromosomal sequence can comprise an insertion, a deletion, a substitution, or a combination of any thereof.
- the insertion, the deletion, the substitution, or the combination of any thereof can result in a miscoding in the allele of the gene encoding the ANPEP protein.
- the miscoding can result in a premature stop codon in the allele of the gene encoding the ANPEP protein.
- editing strategies are designed to introduce premature stop codons in conserved exonic sequences of ANPEP.
- premature stop codons are introduced either early in the coding sequence or in coding regions containing protease active site residues.
- these editing strategies include the use of two guides to direct the creation of double stranded breaks and then, through non-homologous end joining (NHEJ) create a premature stop codon when the cut ends are ligated.
- NHEJ non-homologous end joining
- the present inventors have employed this strategy to create premature stop codons early in the ANPEP gene, such as in Exons 2, 3, and 4.
- Pairs of guides that can be used to introduce a premature stop codon in Exon 2 include SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 102 and 10, SEQ ID NOs: 58 and 52, SEQ ID NOs: 58 and 26, SEQ ID NOs: 58 and 10, SEQ ID NOs: 32 and 26, and SEQ ID NOs: 32 and 10.
- Pairs of guides that can be used to introduce a premature stop codon in Exon 3 include SEQ ID NOs: 120 and 111, SEQ ID NOs: 119 and 113, and SEQ ID NOs: 115 and 111.
- Pairs of guides that can be used to introduce a premature stop codon in Exon 4 include SEQ ID NOs 137 and 130, SEQ ID NOs: 133 and 130, SEQ ID NOs: 137 and 128, and SEQ ID NOs 133 and 128.
- pairs of guides were designed to introduce a premature stop codon within Exon 15 or 17.
- An exemplary pair of guides that can be used to introduce a premature stop codon in Exon 15 is SEQ ID NOs: 189 and 183.
- Pairs of guides that can be used to introduce a premature stop codon in Exon 17 include SEQ ID NOs: 231 and 208, SEQ ID NOs: 228 and 213, SEQ ID NOs: 216 and 213, and SEQ ID NOs: 216 and 208.
- the edited sequences produced by select guide pairs are shown in Table 2. (The sequences recite 50 base pairs on either side of the join.)
- the edited chromosomal sequence preferably causes ANPEP protein production or activity to be reduced, as compared to ANPEP protein production or activity in an animal, progeny, or cell that lacks the edited chromosomal sequence.
- the edited sequence encodes an ANPEP protein that has reduced or no binding to coronaviruses or other pathogens.
- the edited chromosomal sequence can result in production of substantially no functional ANPEP protein by the animal, progeny, or cell.
- substantially no functional ANPEP protein it is meant that the level of ANPEP protein in the animal, progeny, or cell is undetectable, or if detectable, is at least about 90% lower, at least about 95% lower, at least about 98%, lower, and even more preferably at least about 99% lower than the level observed in an animal, progeny, or cell that does not comprise the edited chromosomal sequences.
- the animal, progeny, or cell does not produce ANPEP protein.
- a gene may be so edited as to delete a partial sequence in protein that responsible for an undesirable trait.
- a domain of a protein may be removed that binds to a pathogen. This allows the editing of genes that are essential for host survival while still rendering the host resistant to the disease.
- the ANPEP gene in the animal, progeny, or cell can comprise any combination of any of the edited chromosomal sequences described herein.
- gRNAs Guide RNAs
- the gRNAs have a nucleic acid sequence that is complementary to a sequence of a gene encoding an ANPEP protein and can be used to introduce a chromosomal edit into a gene encoding an ANPEP protein.
- gRNA sequences complementary to a sequence of a gene encoding an ANPEP protein are provided in SEQ ID NOs: 1-251. Although the sequences are listed with DNA nucleotides, a person of ordinary skill would understand that the sequences are in fact RNA sequences and would be readily able to convert the DNA sequences into RNA sequences. In some embodiments, the gRNA sequences target regions early in the ANPEP coding sequence or in coding regions containing protease active site residues.
- the gRNA can comprise a nucleotide sequence comprising one or more of SEQ ID NOs: 1-251.
- the gRNA can have a length of 100 nucleotides or fewer, 90 nucleotides or fewer, 80 nucleotides or fewer, 70 nucleotides or fewer, 60 nucleotides or fewer, 50 nucleotides or fewer, 40 nucleotides or fewer, 30 nucleotides or fewer, or 20 nucleotides or fewer.
- the gRNA can have a length of 20 nucleotides.
- guide RNAs within ANPEP are selected for their ability to generate in-frame stop codons when paired.
- site-specific integration may be accomplished by utilizing factors that are capable of recognizing and binding to particular nucleotide sequences, for example, in the genome of a host organism.
- factors that are capable of recognizing and binding to particular nucleotide sequences, for example, in the genome of a host organism.
- many proteins comprise polypeptide domains that are capable of recognizing and binding to DNA in a site-specific manner.
- a DNA sequence that is recognized by a DNA-binding polypeptide may be referred to as a “target” sequence.
- Polypeptide domains that are capable of recognizing and binding to DNA in a site-specific manner generally fold correctly and function independently to bind DNA in a site-specific manner, even when expressed in a polypeptide other than the protein from which the domain was originally isolated.
- target sequences for recognition and binding by DNA-binding polypeptides are generally able to be recognized and bound by such polypeptides, even when present in large DNA structures (e.g., a chromosome), particularly when the site where the target sequence is located is one known to be accessible to soluble cellular proteins (e.g., a gene).
- DNA-binding polypeptides identified from proteins that exist in nature typically bind to a discrete nucleotide sequence or motif (e.g., a consensus recognition sequence), methods exist and are known in the art for modifying many such DNA-binding polypeptides to recognize a different nucleotide sequence or motif.
- DNA-binding polypeptides include, for example and without limitation: zinc finger DNA-binding domains; leucine zippers; UPA DNA-binding domains; GAL4; TAL; LexA; a Tet repressor; LacR; and a steroid hormone receptor.
- a DNA-binding polypeptide is a zinc finger.
- Individual zinc finger motifs can be designed to target and bind specifically to any of a large range of DNA sites.
- Canonical Cys2His2 (as well as non-canonical CyssHis) zinc finger polypeptides bind DNA by inserting an a-helix into the major groove of the target DNA double helix.
- Recognition of DNA by a zinc finger is modular; each finger contacts primarily three consecutive base pairs in the target, and a few key residues in the polypeptide mediate recognition.
- the DNA-binding specificity of the targeting endonuclease may be further increased (and hence the specificity of any gene regulatory effects conferred thereby may also be increased). See, e.g., Umov, F.D., et al., Nature, 2005, 435, 646-651.
- one or more zinc finger DNA-binding polypeptides may be engineered and utilized such that a targeting endonuclease introduced into a host cell interacts with a DNA sequence that is unique within the genome of the host cell.
- the zinc finger protein is non-naturally occurring in that it is engineered to bind to a target site of choice.
- a target site of choice See, for example, Beerli, R.R., et al., Nature Biotechnol., 2002, 20, 135-141; Pabo, C.O., el a Ann. Rev. Biochem., 2001, 70, 313-340; Isalan, M., et al.. Nature Biotechnol., 2001, 19, 656-660; Segal, D.J., et al.. Curr. Opin. Biotechnol., 2001, 12, 632-637; Choo, Y., et al., 2000, Curr. Opin. Struct.
- An engineered zinc finger binding domain can have a novel binding specificity, compared to a naturally occurring zinc finger protein.
- Engineering methods include, but are not limited to, rational design and various types of selection. Rational design includes, for example, using databases comprising triplet (or quadruplet) nucleotide sequences and individual zinc finger amino acid sequences, in which each triplet or quadruplet nucleotide sequence is associated with one or more amino acid sequences of zinc fingers which bind the particular triplet or quadruplet sequence. See, for example, U.S. Pat. Nos. 6,453,242 and 6,534,261, incorporated by reference herein in their entireties.
- Exemplary selection methods including phage display and two-hybrid systems, are disclosed in U.S. Pat. Nos. 5,789,538; 5,925,523; 6,007,988; 6,013,453; 6,410,248; 6,140,466; 6,200,759; and 6,242,568; as well as WO 98/37186; WO 98/53057; WO 00/27878; WO 01/88197 and GB 2,338,237.
- enhancement of binding specificity for zinc finger binding domains has been described, for example, in WO 02/077227.
- zinc finger domains and/or multi-fingered zinc finger proteins may be linked together using any suitable linker sequences, including for example, linkers of 5 or more amino acids in length. See, also, U.S. Pat. Nos. 6,479,626; 6,903,185; and 7,153,949 for exemplary linker sequences 6 or more amino acids in length.
- the proteins described herein may include any combination of suitable linkers between the individual zinc fingers of the protein.
- a DNA-binding polypeptide is a DNA-binding domain from GAL4.
- GAL4 is a modular transactivator in Saccharomyces cerevisiae, but it also operates as a transactivator in many other organisms. See, e.g., Sadowski, I., et al., Nature, 1988, 335, 563-564.
- the expression of genes encoding enzymes of the galactose metabolic pathway in S. cerevisiae is stringently regulated by the available carbon source. Johnston, M., Microbiol. Rev., 1987, 51, 458-476. Transcriptional control of these metabolic enzymes is mediated by the interaction between the positive regulatory protein, GAIA, and a 17 bp symmetrical DNA sequence to which GAL4 specifically binds (the UAS).
- Native GAIA consists of 881 amino acid residues, with a molecular weight of 99 kDa.
- GAL4 comprises functionally autonomous domains, the combined activities of which account for activity of GAL4 in vivo (Ma, J. and Ptashne, M., Cell, 1987, 48, 847-853; Brent, R. and Ptashne, M., Cell, 1985 43, 729-736).
- the N-terminal 65 amino acids of GAL4 comprise the GAIA DNA-binding domain (Keegan, L., et al., Science, 1986, 231, 699-704; Johnston, M., Nature, 1987, 328, 353-355).
- Sequence-specific binding requires the presence of a divalent cation coordinated by 6 Cys residues present in the DNA binding domain.
- the coordinated cation-containing domain interacts with and recognizes a conserved CCG triplet at each end of the 17 bp UAS via direct contacts with the major groove of the DNA helix (Marmorstein, M., et al., Nature, 1992, 356, 408-414).
- the DNA- binding function of the protein positions C-terminal transcriptional activating domains in the vicinity of the promoter, such that the activating domains can direct transcription.
- Additional DNA-binding polypeptides that may be utilized in certain embodiments include, for example and without limitation, a binding sequence from a AVRB S3 -inducible gene; a consensus binding sequence from a AVRB S3 -inducible gene or synthetic binding sequence engineered therefrom (e.g., UPA DNA-binding domain); TAL; LexA (see, e.g., Brent, R. and Ptashne, M., Cell, 1985 43, 729-736); LacR (see, e.g., Labow, M.A., et aL, Mol. Cell. Biol., 1990, 10, 3343-3356; Bairn, S.B., et al., Proc. Natl. Acad.
- a binding sequence from a AVRB S3 -inducible gene e.g., a consensus binding sequence from a AVRB S3 -inducible gene or synthetic binding sequence engineered therefrom (e.g., UPA DNA-
- the DNA-binding domain of one or more of the nucleases used in the methods and compositions described herein comprises a naturally occurring or engineered (non-naturally occurring) TAL effector DNA binding domain. See, e.g., U.S. Patent Publication No. 20110301073, incorporated by reference in its entirety herein.
- the nuclease comprises a CRISPR system (e.g., CRISPR/Cas9 system).
- CRISPR clustered regularly interspaced short palindromic repeats
- Cas CRISPR- associated locus, which encodes proteins
- CRISPR loci in microbial hosts contain a combination of Cas genes as well as non-coding RNA elements capable of programming the specificity of the CRISPR-mediated nucleic acid cleavage.
- the Type II CRISPR system is one of the most well characterized systems and carries out targeted DNA double-strand break in four sequential steps.
- the mature crRNA:tracrRNA complex directs Cas9 to the target DNA via Watson- Crick base-pairing between the spacer on the crRNA and the protospacer on the target DNA next to the protospacer adjacent motif (PAM), an additional requirement for target recognition.
- PAM protospacer adjacent motif
- Cas9 mediates cleavage of target DNA to create a double-stranded break within the protospacer.
- Activity of the CRISPR/Cas system comprises of three steps: (i) insertion of exogenous DNA sequences into the CRISPR array to prevent future attacks, in a process called 'adaptation', (ii) expression of the relevant proteins, as well as expression and processing of the array, followed by (iii) RNA-mediated interference with the exogenous nucleic acid.
- Cas protein may be a "functional derivative" of a naturally occurring Cas protein.
- a “functional derivative” of a native sequence polypeptide is a compound having a qualitative biological property in common with a native sequence polypeptide.
- “Functional derivatives” include, but are not limited to, fragments of a native sequence and derivatives of a native sequence polypeptide and its fragments, provided that they have a biological activity in common with a corresponding native sequence polypeptide.
- a biological activity contemplated herein is the ability of the functional derivative to hydrolyze a DNA substrate into fragments.
- the term “derivative” encompasses both amino acid sequence variants of polypeptide, covalent modifications, and fusions thereof.
- Suitable derivatives of a Cas polypeptide or a fragment thereof include but are not limited to mutants, fusions, covalent modifications of Cas protein or a fragment thereof.
- Cas protein which includes Cas protein or a fragment thereof, as well as derivatives of Cas protein or a fragment thereof, may be obtainable from a cell or synthesized chemically or by a combination of these two procedures.
- the cell may be a cell that naturally produces Cas protein, or a cell that naturally produces Cas protein and is genetically engineered to produce the endogenous Cas protein at a higher expression level or to produce a Cas protein from an exogenously introduced nucleic acid, which nucleic acid encodes a Cas that is same or different from the endogenous Cas.
- the cell does not naturally produce Cas protein and is genetically engineered to produce a Cas protein.
- Cas9 protein comprises a RuvC nuclease domain and an HNH (H-N-H) nuclease domain, each of which can cleave a single DNA strand at a target sequence (the concerted action of both domains leads to DNA double-strand cleavage, whereas activity of one domain leads to a nick).
- the RuvC domain comprises subdomains I, II and III, where domain I is located near the N-terminus of Cas9 and subdomains II and III are located in the middle of the protein, flanking the HNH domain (Hsu, P.D., et al.. Cell, 2014, 157, 1262-1278).
- a type II CRISPR system includes a DNA cleavage system utilizing a Cas9 endonuclease in complex with at least one polynucleotide component.
- a Cas9 can be in complex with a CRISPR RNA (crRNA) and a trans-activating CRISPR RNA (tracrRNA).
- a Cas9 can be in complex with a single guide RNA that combines a crRNA and a tracrRNA into a single molecule.
- the amino acid sequence of a Cas9 protein described herein, as well as certain other Cas proteins herein, may be derived from a Streptococcus (e.g., S. pyogenes, S. pneumoniae, S.
- thermophilus S. agalactiae, S. parasanguinis, S. oralis, S. salivarius, S. macacae, S. dysgalactiae, S. anginosus, S. constellatus, S.pseudoporcinus, S. mutans), Listeria (e.g., L. innocua), Spiroplasma (e.g., S. apis, S. syrphidicola), Peptostreptococcaceae, Atopobium, Porphyromonas (e.g., P. catoniae), Prevotella (e.g., P.
- Veillonella Treponema (e.g., T socranskii, T denticola), Capnocytophaga, Finegoldia (e.g., F. magno), Coriobacteriaceae (e.g., C. bacterium), Olsenella (e.g., O. profusa), Haemophilus (e.g., H. sputorum, H. pittmaniae), Pasteurella (e.g., P. bettyae), Olivibacter (e.g., O. sitiensis), Epilithonimonas (e.g., E. tenax), Mesonia (e.g., M. mobilis), Lactobacillus (e.g., L.
- Treponema e.g., T socranskii, T denticola
- Capnocytophaga Finegoldia (e.g., F. magno)
- Coriobacteriaceae e.g., C.
- a Cas9 protein can be any of the Cas9 proteins disclosed in Chylinski et al. (Chylinski, K., et al., RNA Biology, 2013, 10, 726-737 and US patent application 62/162377, filed May 15, 2015) which is incorporated herein by reference.
- sequence of a Cas9 protein herein can comprise, for example, any of the Cas9 amino acid sequences disclosed in GenBank Accession Nos. G3ECR1 (S. thermophilus), WP_026709422, WP_027202655, WP_027318179, WP_027347504, WP_027376815, WP_027414302, WP_027821588, WP_027886314, WP_027963583, WP 028123848, WP 028298935, Q03J16 (S. thermophilus), EGP66723, EGS38969, EGV05092, EH165578 (S. pseudoporcinus), EIC75614 (S.
- EID22027 S. constellatus
- EU69711 EJP22331 (S. oralis)
- EJP26004 S.. anginosus
- EJP30321 EPZ44001 (S. pyogenes), EPZ46028 (S. pyogenes), EQL78043 (S. pyogenes), EQL78548 (S. pyogenes), ERL10511, ERL12345, ERL19088 (S. pyogenes), ESA57807 (S. pyogenes), ESA59254 (S. pyogenes), ESU85303 (S.
- SR4 ETS92439, EUB27844, (Streptococcus sp. BS21), AFJ08616, EUC82735 (Streptococcus sp. CM6), EWC92088, EWC94390, EJP25691, YP_008027038, YP_008868573, AGM26527, AHK22391, AHB36273, Q927P4, G3ECR1, or Q99ZW2 (S. pyogenes), which are incorporated by reference.
- a variant of any of these Cas9 protein sequences may be used, but should have specific binding activity, and optionally endonucleolytic activity, toward DNA when associated with an RNA component herein.
- Such a variant Cas9 protein should have specific binding activity, and optionally cleavage or nicking activity, toward DNA when associated with an RNA component herein.
- a Cas protein herein such as a Cas9 can comprise a heterologous nuclear localization sequence (NLS).
- a heterologous NLS amino acid sequence herein may be of sufficient strength to drive accumulation of a Cas protein in a detectable amount in the nucleus of a yeast cell herein, for example.
- An NLS may comprise one (monopartite) or more (e.g., bipartite) short sequences (e.g., 2 to 20 residues) of basic, positively charged residues (e.g., lysine and/or arginine), and can be located anywhere in a Cas amino acid sequence but such that it is exposed on the protein surface.
- An NLS may be operably linked to the N-terminus or C-terminus of a Cas protein herein, for example.
- Two or more NLS sequences can be linked to a Cas protein, for example, such as on both the N- and C-termini of a Cas protein.
- suitable NLS sequences herein include those disclosed in U.S. Patent No. 7309576, which is incorporated herein by reference.
- the Cas endonuclease can comprise a modified form of the Cas9 polypeptide.
- the modified form of the Cas9 polypeptide can include an amino acid change (e.g., deletion, insertion, or substitution) that reduces the naturally occurring nuclease activity of the Cas9 protein.
- the modified form of the Cas9 protein has less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, or less than 1% of the nuclease activity of the corresponding wild-type Cas9 polypeptide (US patent application US20140068797 Al, published on March 6, 2014).
- the modified form of the Cas9 polypeptide has no substantial nuclease activity and is referred to as catalytically “inactivated Cas9” or “deactivated cas9 (dCas9).”
- Catalytically inactivated Cas9 variants include Cas9 variants that contain mutations in the HNH and RuvC nuclease domains. These catalytically inactivated Cas9 variants are capable of interacting with sgRNA and binding to the target site in vivo but cannot cleave either strand of the target DNA.
- a catalytically inactive Cas9 can be fused to a heterologous sequence (US patent application US20140068797 Al, published on March 6, 2014).
- Suitable fusion partners include, but are not limited to, a polypeptide that provides an activity that indirectly increases transcription by acting directly on the target DNA or on a polypeptide (e.g., a histone or other DNA-binding protein) associated with the target DNA.
- Additional suitable fusion partners include, but are not limited to, a polypeptide that provides for methyltransferase activity, demethylase activity, acetyltransferase activity, deacetylase activity, kinase activity, phosphatase activity, ubiquitin ligase activity, deubiquitinating activity, adenylation activity, deadenylation activity, SUMOylating activity, deSUMOylating activity, ribosylation activity, deribosylation activity, myristoylation activity, or demyristoylation activity.
- fusion partners include, but are not limited to, a polypeptide that directly provides for increased transcription of the target nucleic acid (e.g., a transcription activator or a fragment thereof, a protein or fragment thereof that recruits a transcription activator, a small molecule/drug-responsive transcription regulator, etc.).
- a catalytically inactive Cas9 can also be fused to a Fok I nuclease to generate double-strand breaks (Guilinger, J.P., et al., Nature Biotechnology, 2014, 32, 577-582).
- a DNA-binding polypeptide specifically recognizes and binds to a target nucleotide sequence comprised within a genomic nucleic acid of a host organism. Any number of discrete instances of the target nucleotide sequence may be found in the host genome in some examples.
- the target nucleotide sequence may be rare within the genome of the organism (e.g., fewer than about 10, about 9, about 8, about 7, about 6, about 5, about 4, about 3, about 2, or about 1 copy(ies) of the target sequence may exist in the genome).
- the target nucleotide sequence may be located at a unique site within the genome of the organism.
- Target nucleotide sequences may be, for example and without limitation, randomly dispersed throughout the genome with respect to one another; located in different linkage groups in the genome; located in the same linkage group; located on different chromosomes; located on the same chromosome; located in the genome at sites that are expressed under similar conditions in the organism (e.g., under the control of the same, or substantially functionally identical, regulatory factors); and located closely to one another in the genome (e.g., target sequences may be comprised within nucleic acids integrated as concatemers at genomic loci).
- a DNA-binding polypeptide that specifically recognizes and binds to a target nucleotide sequence may be comprised within a chimeric polypeptide, so as to confer specific binding to the target sequence upon the chimeric polypeptide.
- a chimeric polypeptide may comprise, for example and without limitation, nuclease, recombinase, and/or ligase polypeptides, as these polypeptides are described above.
- Chimeric polypeptides comprising a DNA-binding polypeptide and a nuclease, recombinase, and/or ligase polypeptide may also comprise other functional polypeptide motifs and/or domains, such as for example and without limitation: a spacer sequence positioned between the functional polypeptides in the chimeric protein; a leader peptide; a peptide that targets the fusion protein to an organelle (e.g., the nucleus); polypeptides that are cleaved by a cellular enzyme; peptide tags (e.g., Myc, His, etc.); and other amino acid sequences that do not interfere with the function of the chimeric polypeptide.
- a spacer sequence positioned between the functional polypeptides in the chimeric protein
- a leader peptide e that targets the fusion protein to an organelle (e.g., the nucleus)
- polypeptides that are cleaved by a cellular enzyme
- Functional polypeptides in a chimeric polypeptide may be operatively linked.
- functional polypeptides of a chimeric polypeptide may be operatively linked by their expression from a single polynucleotide encoding at least the functional polypeptides ligated to each other in-frame, so as to create a chimeric gene encoding a chimeric protein.
- the functional polypeptides of a chimeric polypeptide may be operatively linked by other means, such as by cross-linkage of independently expressed polypeptides.
- a DNA-binding polypeptide, or guide RNA that specifically recognizes and binds to a target nucleotide sequence may be comprised within a natural isolated protein (or mutant thereof), wherein the natural isolated protein or mutant thereof also comprises a nuclease polypeptide (and may also comprise a recombinase and/or ligase polypeptide).
- isolated proteins include TALENs, recombinases (e.g., Cre, Hin, Tre, and FLP recombinase), CRISPR systems (e.g., CRISPR/Cas9 systems), and meganucleases.
- targeting endonuclease refers to natural or engineered isolated proteins and mutants thereof that comprise a DNA-binding polypeptide or guide RNA and a nuclease polypeptide, as well as to chimeric polypeptides comprising a DNA- binding polypeptide or guide RNA and a nuclease.
- Any targeting endonuclease comprising a DNA-binding polypeptide or guide RNA that specifically recognizes and binds to a target nucleotide sequence comprised within an ANPEP locus (e.g., either because the target sequence is comprised within the native sequence at the locus, or because the target sequence has been introduced into the locus, for example, by recombination) may be utilized in certain embodiments.
- chimeric polypeptides that may be useful in particular embodiments of the present teachings include, without limitation, combinations of the following polypeptides: zinc finger DNA-binding polypeptides; a Fok I nuclease polypeptide; TALE domains; leucine zippers; transcription factor DNA-binding motifs; and DNA recognition and/or cleavage domains isolated from, for example and without limitation, a TALEN, a recombinase (e.g., Cre, Hin, RecA, Tre, and FLP recombinases), a CRISPR system (e.g., CRISPR/Cas9 system), a meganuclease; and others known to those in the art.
- TALEN a recombinase
- Cre e.g., Cre, Hin, RecA, Tre, and FLP recombinases
- CRISPR system e.g., CRISPR/Cas9 system
- meganuclease and others
- Chimeric polypeptides may be engineered by methods known to those of skill in the art to alter the recognition sequence of a DNA- binding polypeptide comprised within the chimeric polypeptide, so as to target the chimeric polypeptide to a particular nucleotide sequence of interest.
- the chimeric polypeptide comprises a DNA-binding domain (e.g., zinc finger, TAL-effector domain, etc.) and a nuclease (cleavage) domain.
- the cleavage domain may be heterologous to the DNA-binding domain, for example a zinc finger DNA-binding domain and a cleavage domain from a nuclease or a TALEN DNA- binding domain and a cleavage domain, or meganuclease DNA-binding domain and cleavage domain from a different nuclease.
- Heterologous cleavage domains can be obtained from any endonuclease or exonuclease.
- Exemplary endonucleases from which a cleavage domain can be derived include, but are not limited to, restriction endonucleases and homing endonucleases. See, for example, 2002-2003 Catalogue, New England Biolabs, Beverly, Mass.; and Belfort, M. and Roberts, J., Nucleic Acids Res., 1997, 25, 3379-3388. Additional enzymes which cleave DNA are known (e.g., 51 Nuclease; mung bean nuclease; pancreatic DNase I; micrococcal nuclease; yeast HO endonuclease; see also Linn et al. (eds.) Nucleases, Cold Spring Harbor Laboratory Press, 1993). One or more of these enzymes (or functional fragments thereof) can be used as a source of cleavage domains and cleavage half-domains.
- a cleavage half-domain can be derived from any nuclease or portion thereof, as set forth above, that requires dimerization for cleavage activity.
- two fusion proteins are required for cleavage if the fusion proteins comprise cleavage half- domains.
- a single protein comprising two cleavage half-domains can be used.
- the two cleavage half-domains can be derived from the same endonuclease (or functional fragments thereof), or each cleavage half-domain can be derived from a different endonuclease (or functional fragments thereof).
- the target sites for the two fusion proteins are preferably disposed, with respect to each other, such that binding of the two fusion proteins to their respective target sites places the cleavage half-domains in a spatial orientation to each other that allows the cleavage half-domains to form a functional cleavage domain, e.g., by dimerizing.
- the near edges of the target sites are separated by 5-8 nucleotides or by 15-18 nucleotides.
- any integral number of nucleotides, or nucleotide pairs can intervene between two target sites (e.g., from 2 to 50 nucleotide pairs or more).
- the site of cleavage lies between the target sites.
- Restriction endonucleases are present in many species and are capable of sequence-specific binding to DNA (at a recognition site), and cleaving DNA at or near the site of binding, for example, such that one or more exogenous sequences (donors/transgenes) are integrated at or near the binding (target) sites.
- Certain restriction enzymes e.g., Type IIS
- Fok I catalyzes double-stranded cleavage of DNA, at 9 nucleotides from its recognition site on one strand and 13 nucleotides from its recognition site on the other.
- fusion proteins comprise the cleavage domain (or cleavage half-domain) from at least one Type IIS restriction enzyme and one or more zinc finger binding domains, which may or may not be engineered.
- Fok I An exemplary Type IIS restriction enzyme, whose cleavage domain is separable from the binding domain, is Fok I.
- This particular enzyme is active as a dimer (Bitinaite, J., et al., Proc. Natl. Acad. Sci. USA, 1998, 95, 10570-10575). Accordingly, for the purposes of the present disclosure, the portion of the Fok I enzyme used in the disclosed fusion proteins is considered a cleavage half-domain.
- two fusion proteins each comprising a Fok I cleavage half-domain, can be used to reconstitute a catalytically active cleavage domain.
- a single polypeptide molecule containing a DNA binding domain and two Fok I cleavage half-domains can also be used.
- a cleavage domain or cleavage half-domain can be any portion of a protein that retains cleavage activity, or that retains the ability to multimerize (e.g., dimerize) to form a functional cleavage domain.
- Type IIS restriction enzymes are described in U.S. Patent Publication No. 20070134796, incorporated herein in its entirety. Additional restriction enzymes also contain separable binding and cleavage domains, and these are contemplated by the present disclosure. See, for example, Roberts, R.J., et al., Nucleic Acids Res., 2003, 31, 418-420.
- the cleavage domain comprises one or more engineered cleavage half-domain (also referred to as dimerization domain mutants) that minimize or prevent homodimerization, as described, for example, in U.S. Patent Publication Nos. 20050064474; 20060188987 and 20080131962, the disclosures of all of which are incorporated by reference in their entireties herein.
- engineered cleavage half-domain also referred to as dimerization domain mutants
- nucleases may be assembled in vivo at the nucleic acid target site using so-called "split-enzyme” technology (see e.g. U.S. Patent Publication No. 20090068164).
- split-enzyme e.g. U.S. Patent Publication No. 20090068164.
- Components of such split enzymes may be expressed either on separate expression constructs, or can be linked in one open reading frame where the individual components are separated, for example, by a self-cleaving 2A peptide or IRES sequence.
- Components may be individual zinc finger binding domains or domains of a meganuclease nucleic acid binding domain.
- a chimeric polypeptide is a custom-designed zinc finger nuclease (ZFN) that may be designed to deliver a targeted site-specific double-strand DNA break into which an exogenous nucleic acid, or donor DNA, may be integrated (See US Patent publication 20100257638, incorporated by reference herein).
- ZFNs are chimeric polypeptides containing a non-specific cleavage domain from a restriction endonuclease (for example, Fok I) and a zinc finger DNA-binding domain polypeptide. See, e.g., Huang, B., et al., J. Protein Chem., 1996, 15, 481-489; Kim, J-S., et al. Proc. Natl.
- the ZFNs comprise non-canonical zinc finger DNA binding domains (see US Patent publication 20080182332, incorporated by reference herein).
- the Fok I restriction endonuclease must dimerize via the nuclease domain in order to cleave DNA and introduce a double-strand break.
- ZFNs containing a nuclease domain from such an endonuclease also require dimerization of the nuclease domain in order to cleave target DNA (Mani, M. et al., Biochem. Biophys. Res. Commun., 2005, 334, 1191-1197; Smith, J., et al., Nucleic Acids Res. 2000, 28, 3361-3369). Dimerization of the ZFN can be facilitated by two adjacent, oppositely oriented DNA- binding sites (Id).
- Embodiments of the present teachings may include one or more nucleic acids selected from the group consisting of an exogenous nucleic acid for site-specific integration in an ANPEP locus, for example and without limitation, an ORF; a nucleic acid comprising a nucleotide sequence encoding a targeting endonuclease; and a vector comprising at least one of either or both of the foregoing.
- nucleic acids for use in some embodiments include nucleotide sequences encoding a polypeptide, structural nucleotide sequences, and/or DNA-binding polypeptide recognition and binding sites.
- donor sequence an exogenous sequence
- a donor sequence can contain a non-homologous sequence flanked by two regions of homology to allow for efficient HDR at the location of interest.
- donor sequences can comprise a vector molecule containing sequences that are not homologous to the region of interest in cellular chromatin.
- a donor molecule can contain several, discontinuous regions of homology to cellular chromatin. For example, for targeted insertion of sequences not normally present in a region of interest, said sequences can be present in a donor nucleic acid molecule and flanked by regions of homology to sequence in the region of interest.
- the donor polynucleotide can be DNA or RNA, single- stranded or double-stranded and can be introduced into a cell in linear or circular form. See e.g., U.S. Patent Publication Nos. 20100047805, 20110281361, 20110207221 and U.S. application Ser. No. 13/889,162. If introduced in linear form, the ends of the donor sequence can be protected (e.g. from exonucleolytic degradation) by methods known to those of skill in the art.
- one or more dideoxynucleotide residues are added to the 3' terminus of a linear molecule and/or self-complementary oligonucleotides are ligated to one or both ends (see, e.g. Chang, X-B. and Wilson, J.H., Proc. Natl. Acad. Sci. USA, 1987, 84, 4959-4963; Nehls, M., et al., Science, 1996, 272, 886-889).
- Additional methods for protecting exogenous polynucleotides from degradation include, but are not limited to, addition of terminal amino group(s) and the use of modified intemucleotide linkages such as, for example, phosphorothioates, phosphoramidates, and O-methyl ribose or deoxyribose residues.
- a polynucleotide can be introduced into a cell as part of a vector molecule having additional sequences such as, for example, replication origins or promoters.
- donor polynucleotides can be introduced as naked nucleic acid, as nucleic acid complexed with an agent such as a liposome or poloxamer, or can be delivered by viruses; exemplary viruses include adenovirus, AAV, herpesvirus, retrovirus, lentivirus and integrase defective lenti virus (IDLV).
- the donor is generally integrated so that its expression is driven by the endogenous promoter at the integration site, namely the promoter that drives expression of the endogenous gene into which the donor is integrated.
- the donor may comprise a promoter and/or enhancer, for example a constitutive promoter or an inducible or tissue-specific promoter.
- exogenous sequences may also include transcriptional or translational regulatory sequences, for example, promoters, enhancers, insulators, internal ribosome entry sites, sequences encoding 2A peptides and/or polyadenylation signals.
- nucleic Acid Molecules Comprising a Nucleotide Sequence Encoding a Targeting Endonuclease
- a nucleotide sequence encoding a targeting endonuclease may be engineered by manipulation (e.g., ligation) of native nucleotide sequences encoding polypeptides comprised within the targeting endonuclease.
- the nucleotide sequence of a gene encoding a protein comprising a DNA-binding polypeptide may be inspected to identify the nucleotide sequence of the gene that corresponds to the DNA-binding polypeptide, and that nucleotide sequence may be used as an element of a nucleotide sequence encoding a targeting endonuclease comprising the DNA-binding polypeptide.
- the amino acid sequence of a targeting endonuclease may be used to deduce a nucleotide sequence encoding the targeting endonuclease, for example, according to the degeneracy of the genetic code.
- nucleic acid molecules comprising a nucleotide sequence encoding a targeting endonuclease
- the last codon of a first polynucleotide sequence encoding a nuclease polypeptide, and the first codon of a second polynucleotide sequence encoding a DNA-binding polypeptide may be separated by any number of nucleotide triplets, e.g., without coding for an intron or a "STOP.”
- the last codon of a nucleotide sequence encoding a first polynucleotide sequence encoding a DNA-binding polypeptide, and the first codon of a second polynucleotide sequence encoding a nuclease polypeptide may be separated by any number of nucleotide triplets.
- the last codon of the last of a first polynucleotide sequence encoding a nuclease polypeptide, and a second polynucleotide sequence encoding a DNA-binding polypeptide may be fused in phase-register with the first codon of a further polynucleotide coding sequence directly contiguous thereto, or separated therefrom by no more than a short peptide sequence, such as that encoded by a synthetic nucleotide linker (e.g., a nucleotide linker that may have been used to achieve the fusion).
- a synthetic nucleotide linker e.g., a nucleotide linker that may have been used to achieve the fusion.
- polynucleotide sequences include, for example and without limitation, tags, targeting peptides, and enzymatic cleavage sites.
- first codon of the most 5' (in the nucleic acid sequence) of the first and second polynucleotide sequences may be fused in phase-register with the last codon of a further polynucleotide coding sequence directly contiguous thereto, or separated therefrom by no more than a short peptide sequence.
- a sequence separating polynucleotide sequences encoding functional polypeptides in a targeting endonuclease may, for example, consist of any sequence, such that the amino acid sequence encoded is not likely to significantly alter the translation of the targeting endonuclease. Due to the autonomous nature of known nuclease polypeptides and known DNA-binding polypeptides, intervening sequences generally will not interfere with the respective functions of these structures.
- Various other techniques known in the art can be used to inactivate genes to make knock-out animals and/or to introduce nucleic acid constructs into animals to produce founder animals, in which the knockout or nucleic acid construct is integrated into the genome.
- Such techniques include, without limitation, pronuclear microinjection (U.S. Pat. No. 4,873,191), retrovirus mediated gene transfer into germ lines (Van der Putten H., et al., Proc. Natl. Acad. Sci. USA, 1985, 82, 6148-1652), gene targeting into embryonic stem cells (Thompson, S., et al., Cell, 1989, 56, 313-321), electroporation of embryos (Lo, W ., Mol. Cell.
- Pronuclear microinjection, sperm mediated gene transfer, and somatic cell nuclear transfer are particularly useful techniques.
- An animal that is gene edited is an animal wherein all of its cells have the genetic edit, including its germ line cells.
- the animals may be bred and progeny that are gene edited may be selected.
- Cloning for instance, may be used to make a mosaic animal if its cells are edited at the blastocyst state, or gene editing can take place when a single cell is edited. Animals that are edited so that they exhibit reduced susceptibility to infection by a coronavirus can be homozygous or heterozygous for the edit, depending on the specific approach that is used.
- homozygosity would normally be required. However, the instant application provides for and includes mosaic animals and heterozygous animals that may be used to breed homozygous animals. Further, if a particular gene is inactivated by an RNA interference or dominant negative strategy, then heterozygosity is often adequate.
- a nucleic acid construct or mRNA is introduced into a fertilized egg; 1 or 2 cell fertilized eggs are used as the pronuclei containing the genetic material from the sperm head and the egg are visible within the protoplasm.
- Pronuclear staged fertilized eggs can be obtained in vitro or in vivo (i.e., surgically recovered from the oviduct of donor animals).
- In vitro fertilized eggs can be produced as follows. For example, swine ovaries can be collected at an abattoir, and maintained at 22-28° C during transport.
- Ovaries can be washed and isolated for follicular aspiration, and follicles ranging from 4-8 mm can be aspirated into 50 mL conical centrifuge tubes using 18-gauge needles and under vacuum to suck the oocyte from the follicle on the ovary.
- Follicular fluid and aspirated oocytes can be rinsed through pre-filters with commercial TL-HEPES (Minitube, Verona, WI).
- Oocytes surrounded by a compact cumulus mass can be selected and placed into TCM-199 OOCYTE MATURATION MEDIUM (Minitube, Verona, WI) supplemented with 0.1 mg/mL cysteine, 10 ng/mL epidermal growth factor, 10% porcine follicular fluid, 50 pM 2-mercaptoethanol, 0.5 mg/ml cAMP, 10 lU/mL each of pregnant mare serum gonadotropin (PMSG) and human chorionic gonadotropin (hCG) for approximately 22 hours in humidified air at 38.7° C and 5% CO2.
- PMSG pregnant mare serum gonadotropin
- hCG human chorionic gonadotropin
- the oocytes can be moved to fresh TCM-199 maturation medium, which will not contain cAMP, PMSG or hCG and then the oocytes can be incubated for an additional 22 hours. Matured oocytes can be stripped of their cumulus cells by vortexing in 0.1% hyaluronidase for 1 minute.
- mature oocytes can be fertilized in 500 pl Minitube PORCPRO IVF MEDIUM SYSTEM (Minitube, Verona, Wis.) in Minitube 5-well fertilization dishes.
- IVF in vitro fertilization
- boar semen can be washed and resuspended in PORCPRO IVF Medium to 4xl0 5 sperm.
- Sperm concentrations can be analyzed by computer assisted semen analysis (SPERM VISION®, Minitube, Verona, WI).
- Final in vitro insemination can be performed in a 10 pl volume at a final concentration of approximately 40 motile sperm/oocyte, depending on the boar.
- the oocytes can be incubated at 38.7° C in 5.0% CO2 atmosphere for 6 hours. Six hours postinsemination, presumptive zygotes can be washed twice in NCSU-23 and moved to 0.5 mL of the same medium. This system can produce 20-30% blastocysts routinely across most boars with a 10-30% polyspermic insemination rate.
- Linearized nucleic acid constructs or mRNA can be injected into one of the pronuclei or into the cytoplasm. Then the injected eggs can be transferred to a recipient female (e.g., into the oviducts of a recipient female) and allowed to develop in the recipient female to produce the genetically edited animals.
- a recipient female e.g., into the oviducts of a recipient female
- in vitro fertilized embryos can be centrifuged at 15,000 x g for 5 minutes to sediment lipids allowing visualization of the pronucleus.
- the embryos can be injected with using an EPPENDORF® FEMTOJET® injector (EPPENDORF®, Hamburg, Germany) and can be cultured until blastocyst formation. Rates of embryo cleavage and blastocyst formation and quality can be recorded.
- Embryos can be surgically transferred into uteri of asynchronous recipients. Typically, 20-200 embryos can be deposited into the ampulla-isthmus junction of the oviduct using a catheter. After surgery, real-time ultrasound examination of pregnancy can be performed.
- a gene edited cell e.g., a gene edited pig cell
- a gene edited cell such as an embryonic blastomere, fetal fibroblast, adult ear fibroblast, or granulosa cell that includes a nucleic acid construct described above
- Oocytes can be enucleated by partial zona dissection near the polar body and then pressing out cytoplasm at the dissection area.
- an injection pipette with a sharp beveled tip is used to inject the gene edited cell into an enucleated oocyte arrested at meiosis 2.
- oocytes arrested at meiosis-2 are termed eggs.
- the embryo After producing a porcine or bovine embryo (e.g., by fusing and activating the oocyte), the embryo is transferred to the oviducts of a recipient female, about 20 to 24 hours after activation. See, for example, Cibelli, J.B., et al., Science, 1998, 280, 1256-1258 and U.S. Pat. No. 6,548,741.
- recipient females can be checked for pregnancy approximately 20-21 days after transfer of the embryos.
- Standard breeding techniques can be used to create animals that are homozygous for the desired edit from the initial heterozygous or mosaic founder animals. Homozygosity may not be required, however. Gene edited animals described herein can be bred with other animals of interest. Regardless of whether or not heterozygous animals are resistant to coronavirus, these animals have utility at least in that they can be used in breeding, for example to breed homozygous animals.
- many different zygotes can be edited using methods of the present teachings. These animals can then be interbred to create a herd comprising an elite germplasm that includes the edit.
- a nucleic acid of interest and a selectable marker can be provided on separate transposons and provided to either embryos or cells in unequal amount, where the amount of transposon containing the selectable marker far exceeds (5- 10 fold excess) the transposon containing the nucleic acid of interest.
- Gene edited cells or animals expressing the nucleic acid of interest can be isolated based on presence and expression of the selectable marker.
- the transposons will integrate into the genome in a precise and unlinked way (independent transposition events), the nucleic acid of interest and the selectable marker are not genetically linked and can easily be separated by genetic segregation through standard breeding. Thus, gene edited animals can be produced that are not constrained to retain selectable markers in subsequent generations.
- ILLUMINA® sequencing ILLUMINA®, Inc., San Diego, CA
- Southern blot analysis to determine whether or not an edit has taken place.
- RNA expression of a nucleic acid sequence encoding a polypeptide in the tissues of gene edited animals can be assessed using techniques that include, for example, northern blot analysis of tissue samples obtained from the animal, in situ hybridization analysis, western analysis, immunoassays such as enzyme-linked immunosorbent assays, and reversetranscriptase PCR (RT-PCR).
- techniques that include, for example, northern blot analysis of tissue samples obtained from the animal, in situ hybridization analysis, western analysis, immunoassays such as enzyme-linked immunosorbent assays, and reversetranscriptase PCR (RT-PCR).
- RNAi interfering RNA
- dsRNA double-stranded RNA
- RISC RNA- induced silencing complex
- RISC metabolizes dsRNA to small 21-23 -nucleotide small interfering RNAs (siRNAs).
- RISC contains a double-stranded RNAse (dsRNase, e.g., Dicer) and ssRNase (e.g., Argonaut 2 or Ago2).
- RISC utilizes antisense strand as a guide to find a cleavable target.
- siRNAs and microRNAs miRNAs
- a method of inactivating a gene in a gene edited animal comprises inducing RNA interference against a target gene and/or nucleic acid such that expression of the target gene and/or nucleic acid is reduced.
- the exogenous nucleic acid sequence can induce RNA interference against a nucleic acid encoding a polypeptide.
- double-stranded small interfering RNA (siRNA) or small hairpin RNA (shRNA) homologous to a target DNA can be used to reduce expression of that DNA.
- Constructs for siRNA can be produced as described, for example, in Fire, A., et al., Nature, 1998, 391, 806-811; Romano, N. and Masino, G., Mol. Microbiol., 1992, 6, 3343-3353; Cogoni, C., et al., EMBO J., 1996, 15, 3153-3161; Cogoni, C.
- shRNAs are transcribed as a single-stranded RNA molecule containing complementary regions, which can anneal and form short hairpins.
- the probability of finding a single, individual functional siRNA or miRNA directed to a specific gene is high.
- the predictability of a specific sequence of siRNA, for instance, is about 50% but a number of interfering RNAs may be made with good confidence that at least one of them will be effective.
- Embodiments include an in vitro cell, an in vivo cell, and a gene edited animal such as a livestock animal that expresses an RNAi directed against an ANPEP gene.
- the RNAi may be, for instance, selected from the group consisting of siRNA, shRNA, dsRNA, and miRNA.
- nucleic acids may be introduced into cells for knockout purposes, for inactivation of a gene, to obtain expression of a gene, or for other purposes.
- nucleic acid includes DNA, RNA, and nucleic acid analogs, and nucleic acids that are double-stranded or single-stranded (i.e., a sense or an antisense single strand).
- Nucleic acid analogs can be modified at the base moiety, sugar moiety, or phosphate backbone to improve, for example, stability, hybridization, or solubility of the nucleic acid.
- Modifications at the base moiety include deoxyuridine for deoxythymidine, and 5-methyl-2'-deoxycytidine and 5-bromo-2'-doxycytidine for deoxycytidine.
- Modifications of the sugar moiety include modification of the 2' hydroxyl of the ribose sugar to form 2'-O-methyl or 2'-O-allyl sugars.
- the deoxyribose phosphate backbone can be modified to produce morpholino nucleic acids, in which each base moiety is linked to a six membered, morpholino ring, or peptide nucleic acids, in which the deoxyphosphate backbone is replaced by a pseudopeptide backbone and the four bases are retained. See, Summerton, J.
- deoxyphosphate backbone can be replaced with, for example, a phosphorothioate or phosphorodithioate backbone, a phosphoroamidite, or an alkyl phosphotri ester backbone.
- the target nucleic acid sequence can be operably linked to a regulatory region such as a promoter.
- Regulatory regions can be porcine regulatory regions or can be from other species.
- operably linked refers to positioning of a regulatory region relative to a nucleic acid sequence in such a way as to permit or facilitate transcription of the target nucleic acid.
- promoter can be operably linked to a target nucleic acid sequence.
- promoters include, without limitation, tissue-specific promoters, constitutive promoters, inducible promoters, and promoters responsive or unresponsive to a particular stimulus. Suitable tissue-specific promoters can result in preferential expression of a nucleic acid transcript in beta cells and include, for example, the human insulin promoter. Other tissue-specific promoters can result in preferential expression in, for example, hepatocytes or heart tissue and can include the albumin or alpha-myosin heavy chain promoters, respectively.
- a promoter that facilitates the expression of a nucleic acid molecule without significant tissue or temporal specificity can be used (z.e., a constitutive promoter).
- a beta-actin promoter such as the chicken beta-actin gene promoter, ubiquitin promoter, miniCAGs promoter, glyceraldehyde-3 -phosphate dehydrogenase (GAPDH) promoter, or 3 -phosphoglycerate kinase (PGK) promoter can be used, as well as viral promoters such as the herpes simplex virus thymidine kinase (HSV- TK) promoter, the SV40 promoter, or a cytomegalovirus (CMV) promoter.
- HSV- TK herpes simplex virus thymidine kinase
- CMV cytomegalovirus
- a fusion of the chicken beta actin gene promoter and the CMV enhancer is used as a promoter. See, for example, Xu, L., et al.. Hum. Gene Then, 2001, 12, 563-573; and Kiwaki, K., et al., Hum. Gene Then, 1996, 7, 821-830.
- Additional regulatory regions that may be useful in nucleic acid constructs, include, but are not limited to, polyadenylation sequences, translation control sequences (e.g., an internal ribosome entry segment, IRES), enhancers, inducible elements, or introns. Such regulatory regions may not be necessary, although they may increase expression by affecting transcription, stability of the mRNA, translational efficiency, or the like. Such regulatory regions can be included in a nucleic acid construct as desired to obtain optimal expression of the nucleic acids in the cell(s). Sufficient expression, however, can sometimes be obtained without such additional elements.
- Nucleic acid constructs can be introduced into embryonic, fetal, or adult animal cells of any type, including, for example, germ cells such as an oocyte or an egg, a progenitor cell, an adult or embryonic stem cell, a primordial germ cell, a kidney cell such as a PK-15 cell, an islet cell, a beta cell, a liver cell, or a fibroblast such as a dermal fibroblast, using a variety of techniques.
- Non-limiting examples of techniques include the use of transposon systems, recombinant viruses that can infect cells, or liposomes or other non-viral methods such as electroporation, microinjection, or calcium phosphate precipitation, that are capable of delivering nucleic acids to cells.
- Nucleic acids can be incorporated into vectors.
- a vector is a broad term that includes any specific DNA segment that is designed to move from a carrier into a target DNA.
- a vector may be referred to as an expression vector, or a vector system, which is a set of components needed to bring about DNA insertion into a genome or other targeted DNA sequence such as an episome, plasmid, or even virus/phage DNA segment.
- Vector systems such as viral vectors (e.g., retroviruses, adeno-associated virus and integrating phage viruses), and non-viral vectors (e.g., transposons) used for gene delivery in animals have two basic components: 1) a vector comprised of DNA (or RNA that is reverse transcribed into a cDNA) and 2) a transposase, recombinase, or other integrase enzyme that recognizes both the vector and a DNA target sequence and inserts the vector into the target DNA sequence.
- Vectors most often contain one or more expression cassettes that comprise one or more expression control sequences, wherein an expression control sequence is a DNA sequence that controls and regulates the transcription and/or translation of another DNA sequence or mRNA, respectively.
- Plasmids and viral vectors are known.
- Mammalian expression plasmids typically have an origin of replication, a suitable promoter and optional enhancer, necessary ribosome binding sites, a polyadenylation site, splice donor and acceptor sites, transcriptional termination sequences, and 5' flanking non-transcribed sequences.
- vectors include: plasmids (which may also be a carrier of another type of vector), adenovirus, adeno-associated virus (AAV), lentivirus (e.g., modified HIV-1, SIV or FIV), retrovirus (e.g., ASV, ALV or MoMLV), and transposons (e.g., Sleeping Beauty, P-elements, Tol-2, Frog Prince, piggyBac).
- plasmids which may also be a carrier of another type of vector
- adenovirus e.g., adeno-associated virus (AAV)
- lentivirus e.g., modified HIV-1, SIV or FIV
- retrovirus e.g., ASV, ALV or MoMLV
- transposons e.g., Sleeping Beauty, P-elements, Tol-2, Frog Prince, piggyBac.
- nucleic acid refers to both RNA and DNA, including, for example, cDNA, genomic DNA, synthetic (e.g., chemically synthesized) DNA, as well as naturally occurring and chemically modified nucleic acids, e.g, synthetic bases or alternative backbones.
- a nucleic acid molecule can be double-stranded or single-stranded (i.e., a sense or an antisense single strand).
- founder animals may be produced by cloning and other methods described herein.
- the founders can be homozygous for a gene edit, as in the case where a zygote or a primary cell undergoes a homozygous edit. Similarly, founders can also be made that are heterozygous. All of a founder's cells genome may carry the gene edit or founders may be mosaic for an edit. Mosaicism depends on the stage at which the gene editing reagents were introduced, the half life of the reagents, and at what stage editing occurred. For example, viable gene editing reagents may still be in a two-cell stage embryo that was injected as a zygote. Progeny of mosaic animals may be tested to identify progeny that have the desired edit.
- An animal may include a trait chosen from a trait in the group consisting of a production trait, a type trait, a workability trait, a fertility trait, a mothering trait, and a disease resistance trait. Further traits include expression of a recombinant gene product. Animals with a desired trait or traits may be edited to increase their resistance to infection with a coronavirus.
- Other potential porcine lines include lines can be PICTM Line 15, PICTM Line 17, PICTM Line 27, PICTM Line 65, PICTM Line 14, PICTM Line 62, PIC337, PIC800, PIC280, PIC327, PIC408, PICTM 399, PIC410, PIC415, PIC359, PIC380, PIC837, PIC260, PIC265, PIC210, PICTM Line 2, PICTM Line 3, PICTM Line 4, PICTM Line 5, PICTM Line 18, PICTM Line 19, PICTM Line 92, PIC95, PICTM CAMBOROUGH® (Pig Improvement Company, Limited, Basingstoke, UK), PIC 1070, PICTM CAMBOROUGH® 40, PICTM CAMBOROUGH® 22, PIC 1050, PICTM CAMBOROUGH® 29, PICTM CAMBOROUGH® 48, or PICTM CAMBOROUGH® x54.
- PICTM CAMBOROUGH® 40 PICTM CA
- PICTM Line 65 is sold under the trade name PIC337.
- PICTM line 62 is sold under the tradename PIC408.
- hybrid pigs made by crossing PICTM lines 15 and 17 are sold under the tradenames PIC800 or PIC280.
- PICTM Line 27 is sold under the tradename PIC327.
- hybrids created from crossing PICTM Line 65 and PICTM Line 62 are sold under the tradenames PIC399, PIC410, or PIC415.
- hybrids created from crossing PICTM Line 65 and Pic Line 27 are sold under the tradename PIC359.
- hybrids prepared from crossing PICTM Line 800 pigs (which is a hybrid of PICTM Line 15 and PICTM Line 17) to PICTM Line 65 pigs are sold under the tradenames PIC380 or PIC837.
- PICTM Line 14 is sold under the trade name PIC260.
- hybrids created from crossing PICTM Line 14 and PICTM Line 65 are sold under the tradename PIC265.
- hybrids created by crossing PICTM Line 2 and PICTM Line 3 are sold under the tradenames PIC210, PICTM CAMBOROUGH®, and PIC 1050.
- hybrids of PICTM Line 3 and PICTM Line 92 are sold under the tradename PIC95.
- hybrids made from crossing PICTM Line 19 and PICTM Line 3 are sold under the tradename PIC 1070.
- hybrids created by crossing PICTM Line 18 and PICTM Line 3 are sold under the tradename PICTM CAMBOROUGH® 40.
- hybrids created from crossing PICTM Line 19 and PIC 1050 (which is itself a hybrid of PICTM lines 2 and 3) are sold under the tradename PICTM CAMBOROUGH® 22.
- hybrids created from crossing PICTM Line 2 and PIC 1070 (which is itself a hybrid of PICTM lines 19 and 3) are sold under the tradename PICTM CAMBOROUGH® 29.
- hybrids created from crossing PICTM Line 18 and PIC 1050 (which is itself a hybrid of PICTM lines 2 and 3) are sold under the tradename PICTM CAMBOROUGH® 48. In various aspects, hybrids created from crossing PICTM Line 4 and PICTM Line 5 are sold under the tradename PICTM CAMBOROUGH® x54.
- PCR primers flanking the region of interest and a probe that specifically anneals to the region of interest can be designed.
- the probe can be labelled with both a fluorophore and a quencher.
- the primers and probe hybridize in a sequence-dependent manner to the complementary DNA strand of the region of interest. Because the probe is intact, the fluorophore and quencher are in close proximity and the quencher absorbs fluorescence emitted by the fluorophore.
- the polymerase extends from the primers and begins DNA synthesis. When the polymerase reaches the probe, the exonuclease activity of the polymerase cleaves the hybridized probe.
- a commercial real-time PCR kit can be used to probe various animals for the desired edit.
- This example demonstrates the use of gRNAs, in combination with Cas protein activity, to generate double-stranded breaks in DNA, which upon host-cell mediated nonhom ologous end-joining, results in the introduction of a premature stop codons in coding regions of ANPEP.
- premature stop codons in ANPEP mRNA would result in premature termination of protein translation and ultimately truncated and non-functional proteins, and/or initiate nonsense-mediated mRNA decay resulting in elimination of ANPEP mRNA transcripts.
- Premature stop codons can be introduced via the homology directed repair (HDR) pathway by inclusion of a single- or double-stranded DNA template in editing experiments.
- HDR homology directed repair
- exogenously added DNA can integrate randomly in the genome. Therefore, it would be advantageous to identify single gRNAs or gRNA pairs that promote the formation of in-frame stop codons, without the introduction of non-wild type amino acids.
- one or two gRNAs can be used to direct nuclease cut sites, which are then repaired by NHEJ to affect the desired edit.
- PFFs porcine fetal fibroblasts
- the gRNAs listed in Table 3 below were generated by in vitro transcription and complexed with SpyCas9 in water, using 3.2 pg of Cas9 protein and 2.2 pg of gRNA in a total volume of 2.23 pl. Resulting ribonucleoprotein (RNP) complexes were then combined 1 : 1 in a total volume of 2.23 pl to generate gRNA pairs, as indicated in Table 3, and nucleofected into PFFs using a Lonza electroporator.
- RNP ribonucleoprotein
- PFF cells were harvested using TRYPLE EXPRESSTM (ThermoFisher, recombinant Trypsin): the culture medium was removed from cells, cells were washed once with Hank’s Balanced Salt Solution (HBSS) or Dulbecco’s Phosphate-Buffered Saline (DPBS), and incubated for 3-5 minutes at 38.5° C in the presence of TrypLE. Cells were then harvested with complete medium. Cells were pelleted via centrifugation (300 g x 5 minutes at room temperature), supernatant was discarded, and then the cells were resuspended in 10 mL PBS to obtain single cell suspensions to allow cell counting using trypan blue staining.
- TRYPLE EXPRESSTM ThermoFisher, recombinant Trypsin
- nucleofection buffer P3 7.5xl0 6 cells/ml. 20 pl of the cell suspension was added to each well of a nucleofection cuvette containing the RNP mixture and then mixed gently to resuspend the cells. The RNP/cell mixture was then nucleofected with program CM138 (provided by the manufacturer).
- EFM Embryonic Fibroblast Medium
- DMEM Modified Eagle’s Medium
- MEM NEAA minimum essential medium non-essential amino acids
- genomic DNA was prepared from transfected and control PFF cells: 15 pl of QUICKEXTRACTTM DNA Extraction Solution (Lucigen) was added to pelleted cells, and the cells were then lysed by incubating for 10 minutes at 37° C, for 8 minutes at 65° C, and for 5 minutes at 95° C. Lysate was held at 4° C until used for DNA sequencing.
- RNA/Cas endonuclease system To evaluate NHEJ repair outcomes at ANPEP target sites mediated by the guide RNA/Cas endonuclease system, a region of approximately 250 bp of genomic DNA surrounding the target site was amplified by PCR and then the PCR product was examined by amplicon deep sequencing for the presence and nature of repairs. After transfection in triplicate, PFF genomic DNA was extracted and the region surrounding the intended target site was PCR amplified with Q5 Polymerase (NEB) adding sequences necessary for amplicon-specific barcodes and ILLUMINA® sequencing using tailed primers through two rounds of PCR, respectively. The resulting PCR amplification products were deep sequenced on an ILLUMINA® MISEQ® Personal Sequencer (ILLUMINA®, Inc., San Diego, CA).
- NEB Q5 Polymerase
- the resulting reads were examined for the presence and nature of repairs at the expected sites of cleavage by comparison to control experiments where the Cas9 protein and guide RNA were omitted from the transfection or by comparison to the reference genome.
- the total number of mutant reads amplicon sequences containing insertions or deletions when compared to the DNA sequences from control treatments or reference genome
- total read number wild-type plus mutant reads
- porcine embryos were additionally tested for their ability to introduce a premature stop codon in the ANPEP coding regions in porcine embryos.
- the subset of guides to be tested in porcine embryos was chosen based on their efficacy in generating premature stop codons in porcine fibroblasts. Edited porcine embryos were generated as described below. Briefly, oocytes recovered from slaughterhouse ovaries were in vitro fertilized. The sgRNP solution was injected into the cytoplasm of presumptive zygotes at 16-17 hours post-fertilization by using a single pulse from a FEMTOJET® 4i microinjector (EPPENDORF®; Hamburg, Germany).
- Microinjection was performed in TL-Hepes (ABT360, LLC) supplemented with 3 mg/ml BSA (Proliant) on the heated stage of an inverted microscope equipped with Narishige micromanipulators (Narishige International USA, Amityville, NY). Following injections, presumptive zygotes were cultured for 7 days in PZM5 (Cosmo Bio, Co LTD, Tokyo, Japan) in an incubator environment of 5% CO2, 5% O2, 90% N2. Mutation frequency of blastocysts was determined by ILLUMINA® sequencing as described for fetal fibroblasts above. The frequencies of the desired end-to-end NHEJ repairs resulting in premature stop codons in ANPEP are shown in Table 3.
- porcine ANPEP gene nucleotide sequences can be edited through the stimulation of double-strand breaks mediated by transfecting Cas9 protein with single or paired guide RNAs to effect a de novo in-frame stop codon.
- This example illustrates generation of pigs having a premature stop codon in ANPEP.
- Porcine oocytes will be isolated, fertilized, and then the resulting zygotes will be edited as described in Example 1 to generate gene edited pigs.
- ANPEP guide RNA pairs selected from Table 3 will be used to form RNP complexes as described in The present teachings including descriptions provided in the Examples that are not intended to limit the scope of any claim or embodiment. The following non-limiting examples are provided to further illustrate the present teachings. Those of skill in the art, in light of the present disclosure, will appreciate that many changes can be made in the specific embodiments that are disclosed and still obtain a like or similar result without departing from the spirit and scope of the present teachings.
- Example 1 will be microinjected into the cytoplasm of in vivo or in vitro fertilized PICTM porcine one-cell zygotes or Mil oocytes. These zygotes will then be incubated to generate edited multicellular embryos and transferred to surrogate gilts via standard methods to birth gene edited pigs.
- PICTM pubertal gilts will be subjected to estrus synchronization by treatment with 0.22% altrenogest solution (20-36 mg/animal) for 14 days. Follicular growth will be induced by the administration of PMSG 36 hours following the last dose of Matrix, and ovulation will be induced by the administration of hCG 82 hours after PMSG administration.
- In vitro fertilized embryos for gene editing will be derived from non-fertilized PICTM oocytes. Immature oocytes from estrus synchronized PICTM gilts will be collected from medium size (3-6 mm) follicles. Oocytes with evenly dark cytoplasm and intact surrounding cumulus cells will be selected for maturation.
- Cumulus oocyte complexes will be placed in a well containing 500 pl of maturation medium, TCM-199 (Invitrogen) with 3.05 mM glucose, 0.91 mM sodium pyruvate, 0.57 mM cysteine, 10 ng/ml EGF, 0.5 pg/ml luteinizing hormone (LH), 0.5 pg/ml FSH, 10 ng/ml gentamicin (Sigma), and 10% follicular fluid for 42-44 h at 38.5° C and 5% CO2, in humidified air.
- TCM-199 Invitrogen
- IVF medium modified Tris-buffered medium containing 113.1 mM NaCl, 3 mM KC1, 7.5 mM CaCh, 11 mM glucose, 20 mM Tris, 2 mM caffeine, 5 mM sodium pyruvate, and 2 mg/ml bovine serum albumin (BSA)
- BSA bovine serum albumin
- DPBS Dulbecco’s Phosphate Buffered Saline
- spermatozoa will be washed in DPBS three times.
- spermatozoa will be re-suspended in mTBM medium and added to oocytes at a final concentration of 1 x 10 5 spermatozoa/ml, and co-incubated for 4-5 h at 38.5° C and 5% CO2.
- Presumptive zygotes will be microinjected 5 hours post IVF and transferred to a surrogate female after 18-42 hours (1-4 cell stage). Each surrogate receives 20-60 injected embryos. Pregnancies will be confirmed by lack of return to estrus (21 days) and ultrasound at 28 days post embryo transfer.
- This example illustrates molecular characterization of pigs by sequencing.
- a tissue sample will be taken from an animal with a genome that was edited according to the examples herein.
- Tail, ear notch, or blood samples are suitable tissue types.
- the tissue sample will be frozen at -20°C within 1 hour of sampling to preserve integrity of the DNA in the tissue sample.
- DNA will be extracted from tissue samples after proteinase K digestion in lysis buffer. Characterization will be performed on two different sequence platforms, short sequence reads using the ILLUMINA® platform (ILLUMINA®, Inc., San Diego, CA) and long sequence reads on an Oxford NANOPORETM platform (Oxford NANOPORETM Technologies, Oxford, UK).
- two-step PCR will be used to amplify and sequence the region of interest.
- the first step is a locus-specific PCR which amplifies the locus of interest from the DNA sample using a combined locus-specific primer with a vendorspecific primer.
- the second step attaches the sequencing index and adaptor sequences to the amplicon from the first step so that sequencing can occur.
- the locus-specific primers for the first step PCR are chosen so that they amplify a region ⁇ 300bp such that ILLUMINA® paired-end sequencing reads could span the amplified fragment. Multiple amplicons are preferred to provide redundancy should deletions or naturally occurring point mutations prevent primers from correctly binding. Sequence data for the amplicon will be generated using an ILLUMINA® sequencing platform (MISEQ®, ILLUMINA®, Inc., San Diego, CA). Sequence reads are analyzed to characterize the outcome of the editing process.
- the first step is a locus-specific PCR which amplifies the locus of interest from the DNA sample using a combined locus-specific primer with a vendor- specific adapter.
- the second step PCR attaches the sequencing index to the amplicon from the first-step PCR so that the DNA is ready for preparing a sequencing library.
- the step 2 PCR products undergo a set of chemical reactions from a vendor kit to polish the ends of the DNA and ligate on the adapter containing the motor protein to allow access to the pores for DNA strand-based sequencing.
- the locus specific primers for the first step PCR range will be designed to amplify different regions of the ANPEP gene and will amplify regions different in length. Normalized DNA is then mixed with vendor supplied loading buffer and is loaded onto the NANOPORETM flowcell.
- This example illustrates viral challenge of ANPEP edited pigs.
- PICTM pigs will be edited with guides as described in Example 2. Edits will be confirmed as described in Example 3. Edited pigs will then be crossbred to create pigs that are homozygous for the edit.
- PEDV viral challenge of homozygous edited pigs will be performed as described in Whitworth, K.M., et al., Transgenic Research, 28, 21-32. Briefly, homozygous edited pigs will be inoculated with PEDV. Fecal swabs will be collected at intervals beginning prior to inoculation. Realtime PCR to determine the presence of virus in the feces will be performed using standard methods (see, e.g. Whitworth, K.M., et al., Transgenic research, 28, 21-32). It is expected that the viral amounts for edited pigs will be significantly lower than control pigs without the edit.
- This example illustrates an assay for real time PCR verification of desired edits.
- rtPCR real time PCR
- two sets of primers and two probes will be designed.
- One set of primers will flank the spacer sequence of the first guide RNA.
- a probe, labeled with a fluorescent moiety will be designed to anneal to the unedited version of the spacer sequence.
- the other set of primers will be designed to flank the desired edit sequence.
- a probe, labeled with a different fluorescent moiety will be designed to anneal to nucleotides spanning the joining region of the edit.
- two sets of primers and two probes will also be designed.
- One set of primers will flank the second spacer sequence.
- a probe, labeled with a fluorescent moiety will be designed to anneal to the unedited version of the spacer sequence.
- the other set of primers will be designed to flank the desired edit sequence — these may be the same primers and probe used for the first assay.
- a probe, labeled with a different fluorescent moiety will be designed to anneal to nucleotides spanning the joining region of the edit.
- real time PCR will be performed using a commercial kit using DNA isolated from pigs of confirmed status from sequencing, and fluorescence will be charted.
- Validation will occur if, as expected, the homozygotes are close to the y axis (representing the fluorescent moiety for the probe annealing to the desired edit), the heterozygotes group near the center of the chart, and the wild type pigs group close to the X axis (representing the fluorescent moiety for the probe annealing to the spacer sequence).
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Environmental Sciences (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Veterinary Medicine (AREA)
- Plant Pathology (AREA)
- Biodiversity & Conservation Biology (AREA)
- Animal Husbandry (AREA)
- Biophysics (AREA)
- Animal Behavior & Ethology (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Livestock animals and progeny thereof comprising at least one edited chromosomal sequence that reduces expression or activity of an ANPEP protein are provided. Livestock animal cells that contain such edited chromosomal sequences are also provided. The livestock animals, progeny, and cells have increased resistance to coronavirus. Methods for producing coronavirus resistant livestock animals are also provided.
Description
GENE EDITED LIVESTOCK ANIMALS HAVING CORONAVIRUS RESISTANCE
CROSS REFERENCE TO RELATED APPLICATION
This application claims benefit of and priority to US Provisional application 63/368,603, filed on July 15, 2022. This application is hereby incorporated by reference in its entirety.
SEQUENCE LISTING
The Sequence Listing, including the file named TD-14-2022-WOl.xml is hereby incorporated by reference in its entirety.
TECHNICAL FIELD
The present teachings relate to gene edited livestock animals and the editing of amino peptidase N (ANPEP) to provide coronavirus resistance.
BACKGROUND
Respiratory and enteric infections caused by coronaviruses have important impacts to both human and animal health. Infection of immunologically naive newborn pigs with transmissible gastroenteritis virus (TGEV) or porcine epidemic diarrhea virus (PEDV) can incur losses approaching 100% mortality; the result of dehydration caused by the virus- mediated destruction of enterocytes resulting in a malabsorptive diarrhea and dehydration (Madson, D. M., et al.. Veterinary Pathology, 2016, 53, 44-52; Saif, L. J., et al.. Coronaviruses, Diseases of Swine, Chapter 35, 10th Edition, 2012, Wiley-Blackwell.). TGEV first appeared in the US in the 1940s (Doyle, L. P., et al., Journal of the American Veterinary Medical Association, 1946, 108, 257-259). The more recent emergence of porcine epidemic diarrhea virus (PEDV) in 2013 was responsible for the death of nearly seven million pigs in the US, an estimated 10% loss in pig production (Stevenson, G. W., et al., Journal of Veterinary Diagnostic Investigation, 2013, 25, 649-654). TGEV can also cause 100% neonatal mortality. In older pigs, infection with TGEV or PEDV results in only mild clinical signs followed by complete recovery.
Along with the human, canine, and feline coronaviruses, PEDV and TGEV belong to the genus Alphacoronavirus in the family Coronaviridae (Lin, C.-M., et al., Journal of Virology, 2015, 89, 3332-3342). Porcine respiratory coronavirus (PRCV) is also an Alphacoronavirus and is closely related to TGEV. PRCV generally causes subclinical infection or mild respiratory disease, but severe cases have been described and there is evidence that it may worsen the severity of disease when pigs are dually infected with both PRCV and another virus such as porcine respiratory and reproductive syndrome virus (PRRSV) (Killoren, K. E. et al. , Porcine Respiratory Coronavirus, Swine Health Information Center and Center for Food Security and Public Health, 2016, www.cfsph.iastate.edu; Van Reeth, K., et al., Veterinary Microbiology, 1996, 48, 325- 335). Moreover, PRCV-positive status of a herd may have economic implications, because some countries will not import animals that are PRCV-positive.
Coronaviruses are enveloped, single stranded, positive sense RNA viruses, placed in the order, Nidovirales. The characteristic hallmark of nidoviruses is the synthesis of a nested set of subgenomic mRNAs. The unique structural feature of coronaviruses is the “corona” formed by the spike proteins protruding from the surface of the virion. Even though the viral spike protein is the primary receptor protein for all coronaviruses, the corresponding cell surface receptors vary (Li, F., Journal of Virology, 2015, 89, 1954- 1964). Delmas et al. were the first to characterize porcine aminopeptidase N (ANPEP, APN, or CD13) as a candidate receptor for TGEV (Delmas, B., et al., Nature, 1992, 357 417-420).
SUMMARY
The present teachings provide for and include livestock animals that have gene edits in the ANPEP gene confer resistance to coronaviruses.
In some embodiments, the present teachings can include a livestock animal or progeny thereof with a genome comprising an edited ANPEP gene comprising a premature stop codon. In some configurations, an ANPEP protein encoded by the edited ANPEP gene can show reduced binding to a coronavirus relative to an unedited ANPEP protein. In some configurations, the coronavirus can be transmissible gastroenteritis virus (TGEV), porcine epidemic diarrhea virus (PEDV), or Porcine respiratory coronavirus (PRCV). In some configurations, the coronavirus can be TGEV. In various configurations, the protein produced by the edited ANPEP gene shows no binding to TGEV.
In various configurations, the premature stop codon can be in exon 2, 3, 4, 16, or 18. In various configurations, the premature stop codon can be in exon 2. In various configurations, the premature stop codon can be in exon 3. In various configurations, the premature stop codon can be in exon 4. In various configurations, the premature stop codon can be in exon 16. In various configurations, the premature stop codon can be in exon 18.
In various configurations, the animal can be a pig. In some configurations, the edited ANPEP gene can comprise SEQ ID NO: 252, 253, 254, 255, 256, or 257. In various configurations, the edited ANPEP gene can comprise SEQ ID NO: 252. In various configurations, the edited ANPEP gene can comprise SEQ ID NO: 253. In various configurations, the edited ANPEP gene can comprise SEQ ID NO: 254. In various configurations, the edited ANPEP gene can comprise SEQ ID NO: 255. In various configurations, the edited ANPEP gene can comprise SEQ ID NO: 256. In various configurations, the edited ANPEP gene can comprise SEQ ID NO: 257.
In some embodiments, a cell isolated from the animal can have an ANPEP gene as described herein. In various embodiments, the present teachings can include an isolated cell line obtained from the livestock animal as described herein.
In some embodiments, the present teachings can include an isolated fibroblast line obtained from the livestock animal as described herein.
In some embodiments, a method of producing a gene edited livestock animal comprising: introducing into an isolated cell: a Cas9 protein or a nucleic acid encoding a Cas9 protein; and a pair of gRNAs that each create a double stranded break in an ANPEP gene such that a premature stop codon is formed when the double stranded breaks are repaired. In some configurations, the method can further comprise producing an animal from the isolated cell. In various configurations, the pair of gRNAs can be SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 102 and 10, SEQ ID NOs: 58 and 52, SEQ ID NOs: 58 and 26, SEQ ID NOs: 58 and 10, SEQ ID NOs: 32 and 26, SEQ ID NOs: 32 and 10, SEQ ID NOs: 120 and 111, SEQ ID NOs: 119 and 113, SEQ ID NOs: 115 and 111, SEQ ID NOs 137 and 130, SEQ ID NOs: 133 and 130, SEQ ID NOs: 137 and 128, SEQ ID NOs: 133 and 128, SEQ ID NOs: 189 and 183, SEQ ID NOs: 231 and 208, SEQ ID NOs: 228 and 213, SEQ ID NOs: 216 and 213, or SEQ ID NOs: 216 and 208. In various configurations, the pair of gRNAs can be SEQ ID NOs: 102 and 52, SEQ ID NOs:
102 and 26, SEQ ID NOs: 58 and 52, SEQ ID Nos: 120 and 111, SEQ ID NOs: 231 and 208, or SEQ ID NOs: 228 and 213. In various configurations, the pair of gRNAs can be SEQ ID NOs: 102 and 52. In various configurations, the pair of gRNAs can be SEQ ID NOs: 102 and 26. In various configurations, the pair of gRNAs can be SEQ ID NOs: 58 and 52. In various configurations, the pair of gRNAs can be SEQ ID Nos: 120 and 111. In various configurations, the pair of gRNAs can be SEQ ID NOs: 231 and 208. In various configurations, the pair of gRNAs can be SEQ ID NOs: 228 and 213.
In various configurations, the edited ANPEP gene can comprise SEQ ID NO: 252, 253, 254, 255, 256, or 257.
In various configurations, the isolated cell can be an oocyte, a zygote, or a fibroblast cell. In various configurations, the isolated cell can be a zygote. In various configurations, the isolated cell can be a fibroblast cell. In various configurations, the isolated cell can be an oocyte. In some configurations, the producing an animal can comprise fertilizing the oocyte. In some configurations, the producing an animal can further comprise implanting the fertilized oocyte into a surrogate mother. In various configurations, the livestock animal can be resistant to coronavirus. In various configurations, the livestock animal can be a pig.
In various embodiments, the present teachings can include at least one gRNA having a sequence selected from the group consisting of SEQ ID NOs: 10, 26, 32, 52, 58, 102, 111, 113, 115, 119, 120, 128, 130, 133, 137, 183, 189, 208, 213, 216, 228, and 231. In some configurations, the at least one gRNA is a pair of gRNAs comprising, consisting of, or selected from the group consisting of SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 102 and 10, SEQ ID NOs: 58 and 52, SEQ ID NOs: 58 and 26, SEQ ID NOs: 58 and 10, SEQ ID NOs: 32 and 26, SEQ ID NOs: 32 and 10, SEQ ID NOs: 120 and 111, SEQ ID NOs: 119 and 113, SEQ ID NOs: 115 and 111, SEQ ID NOs 137 and 130, SEQ ID NOs: 133 and 130, SEQ ID NOs: 137 and 128, SEQ ID NOs 133 and 128, SEQ ID NOs: 189 and 183, SEQ ID NOs: 231 and 208, SEQ ID NOs: 228 and 213, SEQ ID NOs: 216 and 213, and/or SEQ ID NOs: 216 and 208. In various configurations, the at least one gRNA is a pair of gRNAs comprising, consisting of, or selected from the group consisting of SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 58 and 52, SEQ ID Nos: 120 and 111, SEQ ID NOs: 231 and 208, and/or SEQ ID NOs. 228 and 213. In various configurations the at least one gRNA is a pair of gRNAs comprising SEQ
ID NOs: 102 and 52. In various configurations, the at least one gRNA is a pair of gRNAs comprising SEQ ID NOs: 102 and 26. In various configurations, the at least one gRNA is a pair of gRNAs comprising SEQ ID NOs: 58 and 52. In various configurations, the at least one gRNA is a pair of gRNAs comprising SEQ ID Nos: 120 and 111. In various configurations, the at least one gRNA is a pair of gRNAs comprising SEQ ID NOs: 231 and 208. In various configurations, the at least one gRNA is a pair of gRNAs comprising SEQ ID NOs. 228 and 213.
DETAILED DESCRIPTION
The present disclosure relates to livestock animals and progeny thereof comprising at least one edited chromosomal sequence that reduces expression or activity of an ANPEP protein. The disclosure further relates to animal cells comprising at least one edited chromosomal sequence that reduces expression or activity of an ANPEP protein. The animals and cells have chromosomal gene edits (e.g., insertions, deletions, or substitutions) that inactivate or otherwise modulate ANPEP expression or activity. ANPEP plays important roles in coronavirus resistance. The animals and cells can be created using any number of protocols, including those that make use of gene editing.
Definitions
So that the present disclosure may be more readily understood, certain terms are first defined. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which embodiments of the present teachings pertain. Many methods and materials similar, modified, or equivalent to those described herein can be used in the practice of the embodiments of the present teachings without undue experimentation, the preferred materials and methods are described herein. In describing and claiming the embodiments of the present teachings, the following terminology will be used in accordance with the definitions set out below.
It is to be understood that all terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting in any manner or scope. For example, as used in this specification and the appended claims, the singular forms “a”, “an”, and “the” can include plural referents unless the content clearly indicates otherwise. Similarly, the word “or” is intended to include “and” unless the context clearly
indicate otherwise. The word “or” means any one member of a particular list and also includes any combination of members of that list.
Units, prefixes, and symbols may be denoted in their SI accepted form. Unless otherwise indicated, nucleic acids are written left to right in 5' to 3' orientation; amino acid sequences are written left to right in amino to carboxy orientation, respectively. Amino acids may be referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly accepted singleletter codes.
Numeric ranges recited within the specification are inclusive of the numbers defining the range and include each integer within the defined range. Throughout this disclosure, various aspects of the present teachings are presented in a range format. It should be understood that the description in range format is merely for convenience and brevity and should not be construed as an inflexible limitation on the scope of the present teachings. Accordingly, the description of a range should be considered to have specifically disclosed all the possible sub-ranges, fractions, and individual numerical values within that range. For example, description of a range such as from 1 to 6 should be considered to have specifically disclosed sub-ranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual numbers within that range, for example, 1, 2, 3, 4, 5, and 6, and decimals and fractions, for example, 1.2, 3.8, I’A, and 43/4 This applies regardless of the breadth of the range.
The term “about” as used herein, refers to variation in the numerical quantity that can occur, for example, through typical measuring techniques and equipment, with respect to any quantifiable variable, including, but not limited to, mass, volume, time, and temperature. Further, given solid and liquid handling procedures used in the real world, there is certain inadvertent error and variation that is likely through differences in the manufacture, source, or purity of the ingredients used to make the compositions or carry out the methods and the like. The term “about” also encompasses these variations. Whether or not modified by the term “about,” the claims include equivalents to the quantities.
Unless otherwise specified, the term “cell” as used herein can refer to an oocyte, a sperm cell, a zygote, an embryonic cell, a germ layer cell, or a differentiated cell. Whole genome edits are typically performed in oocytes, sperm, zygotic, or embryonic cells. For
some disease traits, tissue specific edits may be made in tissue cells or tissue progenitor cells.
The term “Cas” refers to a “CRISPR associated” protein. Non-limiting examples of Cas proteins include Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cash, Cas7, Cas8, CaslO, Csyl, Csy2, Csy3, Csel, Cse2, Cscl, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, Csxl5, Csfl, Csf2, Csf3, Csf4, homologs thereof, or modified versions thereof.
"Cas9" (formerly referred to as Cas5, Csnl, or Csxl2) herein refers to a Cas endonuclease of a type II CRISPR system that forms a complex with a crNucleotide and a tracrNucleotide, or with a single guide polynucleotide, for specifically recognizing and cleaving all or part of a DNA target sequence.
"Cleavage" refers to the breakage of the covalent backbone of a DNA molecule. Cleavage can be initiated by a variety of methods including, but not limited to, enzymatic or chemical hydrolysis of a phosphodi ester bond. Both single-stranded cleavage and double-stranded cleavage are possible, and double-stranded cleavage can occur as a result of two distinct single-stranded cleavage events. DNA cleavage can result in the production of either blunt ends or staggered ends. In certain embodiments, fusion polypeptides are used for targeted double-stranded DNA cleavage.
The term "CRISPR system" refers collectively to transcripts and other elements involved in the expression of or directing the activity of CRISPR-associated ("Cas") genes, including sequences encoding a Cas gene, a tracr (trans-activating CRISPR) sequence (e.g. tracrRNA or an active partial tracrRNA), a tracr-mate sequence (encompassing a "direct repeat" and a tracrRNA-processed partial direct repeat in the context of an endogenous CRISPR system), a guide sequence (also referred to as a "spacer" in the context of an endogenous CRISPR system), or other sequences and transcripts from a CRISPR locus.
The term "conservatively edited variants" applies to both amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, "conservatively edited variants" refers to those nucleic acids which encode identical or conservatively edited variants of the amino acid sequences. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every
position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide. Such nucleic acid variations are "silent variations" and represent one species of conservatively edited variation. Every nucleic acid sequence herein that encodes a polypeptide also, by reference to the genetic code, describes every possible silent variation of the nucleic acid.
One of ordinary skill will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine; and UGG, which is ordinarily the only codon for tryptophan) can be edited to yield a functionally identical molecule. Accordingly, each silent variation of a nucleic acid which encodes a polypeptide of the present teachings is implicit in each described polypeptide sequence and is within the scope of the present disclosure.
As to amino acid sequences, one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters, adds, or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a "conservatively edited variant" where the alteration results in the substitution of an amino acid with a chemically similar amino acid. Thus, any number of amino acid residues selected from the group of integers consisting of from 1 to 15 can be so altered. Thus, for example, 1, 2, 3, 4, 5, 7, or 10 alterations can be made.
Conservatively edited variants typically provide similar biological activity as the unedited polypeptide sequence from which they are derived. For example, substrate specificity, enzyme activity, or ligand/receptor binding is generally at least 30%, 40%, 50%, 60%, 70%, 80%, or 90% of the native protein for its native substrate. Conservative substitution tables providing functionally similar amino acids are well known in the art.
The following six groups each contain amino acids that are conservative substitutions for one another: [1] Alanine (A), Serine (S), Threonine (T); [2] Aspartic acid (D), Glutamic acid (E); [3] Asparagine (N), Glutamine (Q); [4] Arginine (R), Lysine (K); [5] Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and [6] Phenylalanine (F), Tyrosine (Y), Tryptophan (W). See also, Creighton (1984) Proteins W. H. Freeman and Company.
“Exogenous” refers to a nucleic acid sequence originating outside an organism that has been introduced into the organism. This can refer to sequences which naturally occur
in a sexual compatible species, sequences which are synthetic, or sequences from another species.
As used herein "full-length sequence" in reference to a specified polynucleotide or its encoded protein means having the entire amino acid sequence of a native (nonsynthetic), endogenous, biologically active form of the specified protein. Methods to determine whether a sequence is full-length are well known in the art including such exemplary techniques as northern or western blots, primer extension, SI protection, and ribonuclease protection. Comparison to known full-length homologous (orthologous and/or paralogous) sequences can also be used to identify full-length sequences of the present disclosure. Additionally, consensus sequences typically present at the 5' and 3' untranslated regions of mRNA aid in the identification of a polynucleotide as full-length. For example, the consensus sequence ANNNN/H/G’G, where the underlined codon represents the N- terminal methionine, aids in determining whether the polynucleotide has a complete 5' end. Consensus sequences at the 3' end, such as polyadenylation sequences, aid in determining whether the polynucleotide has a complete 3' end.
As used herein, "gene editing," "gene edited" "genetically edited" and "gene editing effectors" refer to the use of naturally occurring or artificially engineered nucleases, also referred to as "molecular scissors." The nucleases create specific double-stranded break (DSBs) at desired locations in the genome, which in some cases harnesses the cell’s endogenous mechanisms to repair the induced break by natural processes of homologous recombination (HR) and/or nonhomologous end-joining (NHEJ). Gene editing effectors include Zinc Finger Nucleases (ZFNs), Transcription Activator-Like Effector Nucleases (TALENs), the Clustered Regularly Interspaced Short Palindromic Repeats/CAS (CRISPR/Cas) system, and meganucleases re-engineered as homing endonucleases. The terms also include the use of genetic editing procedures and techniques, including, for example, where the change is relatively small and/or does not introduce DNA from a foreign species.
As used herein, the term “gene edit”, “gene edited”, or “genetically edited” refers to an organism where human intervention, such as but without limitation by using a gene editing effector, has created a genetic difference in its genome when compared to a wild type genome of the same organism. These differences can include but are not limited to nucleotide substitutions, excision of a start codon, or small deletions that do not introduce
frame shift mutations into the genome but may excise an exon or form a premature stop codon when the ends of the deletion are ligated together. A gene edit does not introduce DNA from another species into an organism.
A “gene edited animal” refers to an animal with one or more cells comprising a gene edit.
As used herein, "heterologous" in reference to a nucleic acid is a nucleic acid that originates from a foreign species, or, if from the same species, is substantially changed from its native form in composition and/or genomic locus by deliberate human intervention. For example, a promoter operably linked to a heterologous structural gene is from a species different from that from which the structural gene was derived, or, if from the same species, one or both are substantially changed from their original form. A heterologous protein may originate from a foreign species or, if from the same species, is substantially changed from its original form by deliberate human intervention.
As used herein “homing DNA technology” or “homing technology” covers any mechanisms that allow a specified molecule to be targeted to a specified DNA sequence including Zinc Finger (ZF) proteins, Transcription Activator-Like Effectors (TALEs) meganucleases, and CRISPR systems (e.g., CRISPR/Cas9 systems).
The terms "increased resistance" and "reduced susceptibility" herein mean, but are not limited to, a statistically significant reduction of the incidence and/or severity of clinical signs or clinical symptoms which are associated with infection by pathogen. For example, "increased resistance" or "reduced susceptibility" can refer to a statistically significant reduction of the incidence and/or severity of clinical signs or clinical symptoms which are associated with infection by a coronavirus in an animal comprising an edited chromosomal sequence as compared to a control animal having an unedited chromosomal sequence. The term "statistically significant reduction of clinical symptoms" means, but is not limited to, the frequency in the incidence of at least one clinical symptom in the edited group of subjects is at least 10%, preferably at least 20%, more preferably at least 30%, even more preferably at least 50%, and even more preferably at least 70% lower than in the non-edited control group after the challenge with the infectious agent.
The term “livestock animal” includes any animals traditionally raised in livestock farming, for example an ungulate (e.g., an artiodactyl), an avian animal (e.g., chickens, turkeys, ducks, geese, guinea fowl, or squabs), an equine animal (e.g, horses or donkeys).
Ungulates include, but are not limited to porcine animals e.g, pigs), bovine animals (e.g., beef or dairy cattle, buffalo), ovine animals, caprine animals, camels, llamas, alpacas, and deer. The term does not include rats, mice, or other rodents.
As used herein, "nucleic acid" includes reference to a deoxyribonucleotide or ribonucleotide polymer in either single-or double-stranded form, and unless otherwise limited, encompasses conservatively edited variants and known analogues having the essential nature of natural nucleotides in that they hybridize to single-stranded nucleic acids in a manner similar to naturally occurring nucleotides (e. g., peptide nucleic acids).
The term “mutation” includes alterations in the nucleotide sequence of a polynucleotide, such as for example a gene or coding DNA sequence (CDS), compared to the wild-type sequence. The term includes, without limitation, substitutions, insertions, frameshifts, deletions, inversions, translocations, duplications, splice-donor site mutations, point-mutations and the like.
The terms "polypeptide", "peptide" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. The terms also may apply to conservatively edited variants and to amino acid polymers in which one or more amino acid residues is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. The essential nature of such analogues of naturally occurring amino acids is that, when incorporated into a protein, the protein is specifically reactive to antibodies elicited to the same protein but consisting entirely of naturally occurring amino acids. The terms "polypeptide", "peptide" and "protein" are also inclusive of modifications including, but not limited to, glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, hydroxylation and ADP- ribosylation. It will be appreciated, as is well known and as noted above, that polypeptides are not always entirely linear. For instance, polypeptides may be branched as a result of ubiquitization, and they may be circular, with or without branching, generally as a result of post-translation events, including natural processing event and events brought about by human manipulation which do not occur naturally. Circular, branched and branched circular polypeptides may be synthesized by non-translation natural process and by entirely synthetic methods, as well. Further, the present teachings contemplate the use of both the methionine-containing and the methionine-less amino terminal variants of the protein of the present disclosure.
As used herein "promoter" includes reference to a region of DNA upstream from the start of transcription and involved in recognition and binding of RNA polymerase and other proteins to initiate transcription. Examples of promoters under developmental control include promoters that preferentially initiate transcription in certain tissues, such as testes, ovaries, or placenta. Such promoters are referred to as "tissue-preferred". Promoters which initiate transcription only in certain tissue are referred to as "tissue-specific". A "cell type" specific promoter primarily drives expression in certain cell types in one or more organs, for example, germ cells in testes or ovaries. An "inducible" or "repressible" promoter is a promoter which is under environmental control. Examples of environmental conditions that may affect transcription by inducible promoters include stress, and temperature. Tissue-specific, tissue-preferred, cell-type specific and inducible promoters constitute the class of "non-constitutive" promoters. A "constitutive" promoter is a promoter which is active under most environmental conditions.
The terms "residue" or "amino acid residue" or "amino acid" are used interchangeably herein to refer to an amino acid that is incorporated into a protein, polypeptide, or peptide (collectively "protein"). The amino acid may be a naturally occurring amino acid and, unless otherwise limited, may encompass non-natural analogs of natural amino acids that can function in a similar manner as naturally occurring amino acids.
"Resistance" of an animal to a disease is a characteristic of an animal, wherein the animal avoids the disease symptoms that are the outcome of animal-pathogen interactions, such as interactions between a porcine animal and coronavirus. That is, pathogens are prevented from causing animal diseases and the associated disease symptoms, or alternatively, a reduction of the incidence and/or severity of clinical signs or reduction of clinical symptoms. One of skill in the art will appreciate that the methods disclosed herein can be used with other compositions and methods available in the art for protecting animals from pathogens.
A "TALE DNA binding domain" or "TALE" is a polypeptide comprising one or more TALE repeat domains/units. The repeat domains are involved in binding of the TALE to its cognate target DNA sequence. A single "repeat unit" (also referred to as a "repeat") is typically 33-35 amino acids in length and exhibits at least some sequence homology with other TALE repeat sequences within a naturally occurring TALE protein.
Zinc finger and TALE binding domains can be "engineered" to bind to a predetermined nucleotide sequence, for example via engineering (altering one or more amino acids) of the recognition helix region of naturally occurring zinc finger or TALE proteins. Therefore, engineered DNA binding proteins (zinc fingers or TALEs) are proteins that are non- naturally occurring. Non-limiting examples of methods for engineering DNA-binding proteins are design and selection. A designed DNA binding protein is a protein not occurring in nature whose design/ composition results principally from rational criteria. Rational criteria for design include application of substitution rules and computerized algorithms for processing information in a database storing information of existing ZFP and/or TALE designs and binding data. See, for example, U.S. Pat. Nos. 6,140,081; 6,453,242; and 6,534,261; see also WO 98/53058; WO 98/53059; WO 98/53060; WO 02/016536 and WO 03/016496 and U.S. Publication No. 20110301073.
A "zinc finger DNA binding protein" (or binding domain) is a protein, or a domain within a larger protein, that binds DNA in a sequence-specific manner through one or more zinc fingers, which are regions of amino acid sequence within the binding domain whose structure is stabilized through coordination of a zinc ion. The term zinc finger DNA binding protein is often abbreviated as zinc finger protein or ZFP.
A "selected" zinc finger protein or TALE is a protein not found in nature whose production results primarily from an empirical process such as phage display, interaction trap or hybrid selection. See e.g., U.S. Pat. No. 5,789,538; U.S. Pat. No. 5,925,523; U.S. Pat. No. 6,007,988; U.S. Pat. No. 6,013,453; U.S. Pat. No. 6,200,759; WO 95/19431; WO 96/06166; WO 98/53057; WO 98/54311; WO 00/27878; WO 01/60970 WO 01/88197, WO 02/099084 and U.S. Publication No. 20110301073.
"Wild type" means those animals and blastocysts, embryos or cells derived therefrom, which have not been gene edited and are usually inbred and outbred strains developed from naturally occurring strains.
The following terms are used to describe the sequence relationships between a polynucleotide/polypeptide of the present teachings with a reference polynucleotide/polypeptide: (a)"reference sequence", (b)" comparison window", (c) "sequence identity", and (d)"percentage of sequence identity".
(a) As used herein, "reference sequence" is a defined sequence used as a basis for sequence comparison with a polynucleotide/polypeptide of the present teachings. A
reference sequence may be a subset or the entirety of a specified sequence; for example, as a segment of a full-length cDNA or gene sequence, or the complete cDNA or gene sequence.
(b) As used herein, "comparison window" includes reference to a contiguous and specified segment of a polynucleotide/polypeptide sequence, wherein the polynucleotide/polypeptide sequence may be compared to a reference sequence and wherein the portion of the polynucleotide/polypeptide sequence in the comparison window may comprise additions or deletions (/.< ., gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. Generally, the comparison window is at least 20 contiguous nucleotides/amino acids residues in length, and optionally can be 30, 40, 50, 100, or longer. Those of skill in the art understand that to avoid a high similarity to a reference sequence due to inclusion of gaps in the polynucleotide/polypeptide sequence, a gap penalty is typically introduced and is subtracted from the number of matches.
Methods of alignment of sequences for comparison are well-known in the art. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith, T.F. and Waterman, M.S., Adv. Appl. Math., 1981, 2, 482-489; by the homology alignment algorithm of Needleman, S.B. and Wunsch, C.D. (J. Mol. Biol., 1970, 48, 443-453); by the search for similarity method of Pearson, W.R. and Lipman, D.J. (Proc. Natl. Acad. Sci., 1988, 85, 2444-2448); and by computerized implementations of these algorithms, including, but not limited to: CLUSTAL in the PC/Gene program by Intelligenetics, Mountain View, California; GAP, BESTFIT, BLAST, FASTA, and TFASTA, and related programs in the GCG Wisconsin Genetics Software Package, Version 10 (available from Accelrys Inc., 9685 Scranton Road, San Diego, California, USA). The CLUSTAL program is well described in Higgins, D.G. and Sharp, P.M.,Gene, 1988, 73, 237-244; Higgins, D.G. and Sharp, P.M., CABIOS, 1989, 5, 151-153; Corpet, F., Nucleic Acids Research, 1988, 16, 10881-10890; Huang, X., et al.. Computer Applications in the Biosciences, 1992, 8, 155-165, and Pearson, W.R, Methods in Molecular Biology, 1994, 24, 307-331.
The BLAST family of programs that can be used for database similarity searches includes: BLASTN for nucleotide query sequences against nucleotide database sequences; BLASTX for nucleotide query sequences against protein database sequences; BLASTP for
protein query sequences against protein database sequences; TBLASTN for protein query sequences against nucleotide database sequences; and TBLASTX for nucleotide query sequences against nucleotide database sequences. See, Current Protocols in Molecular Biology, Chapter 19, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York (1995); Altschul, S.F., et al., J. Mol. Biol., 1990, 215: 403-410; and Altschul, S.E et al., Nucleic Acids Res., 1997, 25, 3389-3402. Software for performing BLAST analyses is publicly available, for example through the National Center for Biotechnology Information (ncbi.nlm.nih.gov). This algorithm has been thoroughly described in a number of publications. See, e.g., Altschul, S.F., et al., NUCLEIC ACIDS RES., 1997, 25, 3389-3402; National Center for Biotechnology Information, THE NCBI HANDBOOK [INTERNET], Chapter 16: The BLAST Sequence Analysis Tool (McEntyre J, Ostell J, eds., 2002), available at www(dot)ncbi(dot)nlm(dot)nih(dot)gov/books/NBK21097/pdf/chl6.pdf. The BLASTP program for amino acid sequences has also been thoroughly described (see Henikoff, S., and Henikoff, J.G., Proc. Natl. Acad. Sci. USA, 1992, 89, 10915-10919).
In addition to calculating percent sequence identity, the BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin, S & Altschul, S.F., Proc. Nat'l. Acad. Sci. USA, 1993, 90, 5873-5877). Anumber of low- complexity filter programs can be employed to reduce such low-complexity alignments. For example, the SEG (Wooten, J.C. and Federhen, S., Comput. Chem., 1993, 17, 149-163) and XNU (Claverie, J.M. and States, D.J., Comput. Chem., 1993, 17, 191-201) low- complexity filters can be employed alone or in combination.
Unless otherwise stated, nucleotide and protein identity/ similarity values provided herein are calculated using GAP (GCG Version 10) under default values. GAP (Global Alignment Program) can also be used to compare a polynucleotide or polypeptide of the present teachings with a reference sequence. GAP uses the algorithm of Needleman, S.B. and Wunsch, C.D. (J. Mol. Biol., 1970, 48: 443-453), to find the alignment of two complete sequences that maximizes the number of matches and minimizes the number of gaps. GAP represents one member of the family of best alignments. There may be many members of this family, but no other member has a better quality. GAP displays four figures of merit for alignments: Quality, Ratio, Identity, and Similarity. The Quality is the metric maximized in order to align the sequences. Ratio is the quality divided by the number of bases in the shorter segment. Percent Identity is the percent of the symbols that
actually match. Percent Similarity is the percent of the symbols that are similar. Symbols that are across from gaps are ignored. A similarity is scored when the scoring matrix value for a pair of symbols is greater than or equal to 0.50, the similarity threshold. The scoring matrix used in Version 10 of the Wisconsin Genetics Software Package is BLOSUM62 (see Henikoff, S. & Henikoff, J.G., Proc. Natl. Acad. Sci. USA, 1989, 89, 10915-10919).
Multiple alignment of the sequences can be performed using the CLUSTAL method of alignment (Higgins, D.G. and Sharp, P.M., CAB IOS, 1989, 5, 151-153) with the default parameters (GAPPEN ALT Y= 10, GAP LENGTH PENALTY=10). Default parameters for pairwise alignments using the CLUSTAL method include KTUPLE 1, GAP PENALTY=3, WIND0W=5 and DIAGONALS SAVED=5.
(c) As used herein, "sequence identity" or "identity" in the context of two nucleic acid or polypeptide sequences includes reference to the residues in the two sequences which are the same when aligned for maximum correspondence over a specified comparison window. When percentage of sequence identity is used in reference to proteins it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g. charge or hydrophobicity) and therefore do not change the functional properties of the molecule. Where sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Sequences which differ by such conservative substitutions are said to have "sequence similarity" or "similarity". Means for making this adjustment are well-known to those of skill in the art. Typically this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1. The scoring of conservative substitutions may be calculated according to the algorithm of Meyers. E.W. and Miller, W., Computer Applic. Biol. Sci., 1988, 4: 11-17, for example as implemented in the program PC/GENE (Intelligenetics, Mountain View, California, USA).
(d) As used herein, "percentage of sequence identity" means the value determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions
or deletions (/.< ., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
The porcine amino peptidase N (ANPEP) has been reported to be involved in transmissible gastroenteritis virus (TGEV) infection. Porcine ANPEP is a 963 amino acid, type II membrane metallopeptidase, that is responsible for removing N-terminal amino acids from protein substrates during digestion.
Gene description
Sus scrofa ANPEP is a membrane alanyl aminopeptidase described at ENSEMBL (useast.ensembl.org/) accession number ENS SSCG00000001849, located on the reverse strand of the 7th chromosome. ENSEMBL has six transcripts. The gene was listed in ENSSSCG00000001849.5 (SEQ ID NO: 258) and the full transcript was designated as ANPEP -205, and the locations of the exons on the 7th chromosome are listed below in Table 1 (these data are from Ensembl release 109).
The present inventors designed edits to introduce a premature stop codon in order to make a truncated protein. gRNAs were engineered to create double stranded breaks in the gene that, when repaired via non-homologous end joining (NHEJ), create a stop codon. The present inventors developed two potential strategies: one strategy called for introduction of an early stop codon in Exon 2, 3, or 4. The gRNAs listed in SEQ ID NOs: 1-106 and 147-175 are directed to Exon 2. The gRNAs listed in SEQ ID NOs: 107-126 and 176 are directed to Exon 3. The gRNAs listed in SEQ ID NOs: 127-147 and 177-182 are directed to Exon 4. The second strategy was to introduce an exogenous stop codon late in the gene — in exon 15, 16, 17, or 18. The gRNAs listed in SEQ ID NOs: 183-190 and 240 are directed to Exon 16. The gRNAS listed in SEQ ID NOs: 191-201 and 241 are directed to Exon 17. The gRNAs listed in SEQ ID NOs: 202-239 and 242-251 are directed to Exon 18. gRNAs listed in SEQ ID NOs: 1-146 and 183-239 were designed for use with spCas9. SEQ ID NOs: 147-182 and 240-251 were designed for use with SthC3Cas9.
Gene Editing
The present disclosure provides a livestock animal or animal cell, including sperm or egg cells, with improved resistance to coronavirus, specifically transmissible gastroenteritis virus (TEGV). The livestock animals or cells comprise altered expression or activity of an ANPEP protein. The livestock animals or cells can comprise at least one edited chromosomal sequence that reduces expression or activity of the ANPEP protein. The chromosomal sequence may be (1) inactivated, (2) edited, or (3) comprise an integrated sequence. An inactivated chromosomal sequence is altered such that ANPEP protein function is impaired, reduced or eliminated. As used herein, reduced ANPEP protein function or activity (e.g., coronavirus binding activity) refers to a reduction in protein function or activity relative to the function of a wild type ANPEP protein. Thus, a gene edited animal comprising an inactivated chromosomal sequence may be termed a “knock-out”. Similarly, a genetically edited animal comprising an integrated sequence may be termed a “knock in”. Furthermore, a gene edited animal comprising an edited chromosomal sequence may comprise a targeted point edit(s) or other edit such that an altered protein product is produced. Briefly, the process can comprise using a CRISPR system (e.g., a CRISPR/Cas9
system) to edit the genomic sequence. To use Cas9 to edit genomic sequences, the protein can be delivered directly to a cell. Alternatively, an mRNA that encodes Cas9 can be delivered to a cell, or a gene that provides for expression of an mRNA that encodes Cas9 can be delivered to a cell. In addition, either target specific crRNA and a tracrRNA can be delivered directly to a cell or target specific sgRNA(s) can be delivered to a cell (these RNAs can alternatively be produced by a gene constructed to express these RNAs). The process of editing chromosomal sequences using a CRISPR system is rapid, precise, and highly efficient.
In some embodiments, an ANPEP locus is used as a target site for the site-specific editing. This can include insertion of an exogenous nucleic acid (e.g., a nucleic acid comprising a nucleotide sequence encoding a polypeptide of interest) or deletions of nucleic acids from the locus. In particular embodiments, insertions and/or deletions result in an edited locus. For example, integration of the premature nucleic acid and/or deletion of part of the genomic nucleic acid may edit the locus so as to produce a disrupted (/.< ., inactivated) ANPEP gene. The ANPEP protein encoded by the edited nucleic acid can have reduced coronavirus binding relative to an unedited ANPEP protein. In some configurations, the edited ANPEP protein may not bind to a coronavirus.
Any of the animals or cells can be an animal or cell that has been gene edited using a Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/Cas system. The CRISPR/Cas system can suitably comprise any of the guide RNAs (gRNAs) described herein.
For any of the animals, progeny, or cells, the edited chromosomal sequence can be an edited chromosomal sequence that has been produced via homology directed repair (HDR). Alternatively, the edited chromosomal sequence can be an edited chromosomal sequence that has been produced via non-homologous end-joining (NHEJ).
The edited chromosomal sequence reduces the susceptibility of the animal, progeny, or cell to infection by a coronavirus, as compared to the susceptibility of a livestock animal, progeny, or cell that does not comprise the edited chromosomal sequence. In addition, in porcine animals having the edited chromosomal sequence, coronavirus nucleic acid is detected at lower titers in the nasal secretions, feces, or serum relative to a control. Alternatively or in addition, coronavirus antigen is detected at lower concentrations in the tissues of the animal e.g., in lung tissue) relative to a control. The
edited chromosomal sequence preferably substantially eliminates susceptibility of the animal, progeny, or cell to the coronavirus. The edited chromosomal sequence more preferably completely eliminates susceptibility of the animal, progeny, or cell to the coronavirus, such that animals do not show any clinical signs of disease following exposure to the coronavirus. For example, porcine animals having the edited chromosomal sequence do not show any clinical signs of coronavirus (e.g., fever, lethargy, anorexia, weight loss, nasal and ocular discharge, cough, sneezing, conjunctivitis, diarrhea, and/or breathing difficulties) following exposure to the coronavirus. In addition, in porcine animals having the edited chromosomal sequence, coronavirus nucleic acid cannot be detected in the nasal secretions, feces, or serum, coronavirus antigen cannot be detected in the tissues of the animal (e.g., in lung tissue), and serum is negative for coronavirusspecific antibody. Similarly, cells having the edited chromosomal sequence that are exposed to the pathogen do not become infected with the pathogen. Exemplary coronaviruses include TEGV, PEDV, and PRCV.
The edited chromosomal sequence can comprise an insertion, a deletion, a substitution, or a combination of any thereof. The insertion, the deletion, the substitution, or the combination of any thereof can result in a miscoding in the allele of the gene encoding the ANPEP protein. Where the insertion, the deletion, the substitution, or the combination of any thereof results in a miscoding in the allele of the gene encoding the ANPEP protein, the miscoding can result in a premature stop codon in the allele of the gene encoding the ANPEP protein. In some embodiments, editing strategies are designed to introduce premature stop codons in conserved exonic sequences of ANPEP. In certain embodiments, premature stop codons are introduced either early in the coding sequence or in coding regions containing protease active site residues. In some configurations, these editing strategies include the use of two guides to direct the creation of double stranded breaks and then, through non-homologous end joining (NHEJ) create a premature stop codon when the cut ends are ligated. The present inventors have employed this strategy to create premature stop codons early in the ANPEP gene, such as in Exons 2, 3, and 4. Pairs of guides that can be used to introduce a premature stop codon in Exon 2 include SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 102 and 10, SEQ ID NOs: 58 and 52, SEQ ID NOs: 58 and 26, SEQ ID NOs: 58 and 10, SEQ ID NOs: 32 and 26, and SEQ ID NOs: 32 and 10. Pairs of guides that can be used to introduce a premature stop
codon in Exon 3 include SEQ ID NOs: 120 and 111, SEQ ID NOs: 119 and 113, and SEQ ID NOs: 115 and 111. Pairs of guides that can be used to introduce a premature stop codon in Exon 4 include SEQ ID NOs 137 and 130, SEQ ID NOs: 133 and 130, SEQ ID NOs: 137 and 128, and SEQ ID NOs 133 and 128. As an alternative strategy, pairs of guides were designed to introduce a premature stop codon within Exon 15 or 17. An exemplary pair of guides that can be used to introduce a premature stop codon in Exon 15 is SEQ ID NOs: 189 and 183. Pairs of guides that can be used to introduce a premature stop codon in Exon 17 include SEQ ID NOs: 231 and 208, SEQ ID NOs: 228 and 213, SEQ ID NOs: 216 and 213, and SEQ ID NOs: 216 and 208. The edited sequences produced by select guide pairs are shown in Table 2. (The sequences recite 50 base pairs on either side of the join.)
In any of the animals, progeny, or cells described herein, the edited chromosomal sequence preferably causes ANPEP protein production or activity to be reduced, as compared to ANPEP protein production or activity in an animal, progeny, or cell that lacks the edited chromosomal sequence. Specifically, the edited sequence encodes an ANPEP protein that has reduced or no binding to coronaviruses or other pathogens.
The edited chromosomal sequence can result in production of substantially no functional ANPEP protein by the animal, progeny, or cell. By “substantially no functional ANPEP protein,” it is meant that the level of ANPEP protein in the animal, progeny, or cell is undetectable, or if detectable, is at least about 90% lower, at least about 95% lower, at least about 98%, lower, and even more preferably at least about 99% lower than the
level observed in an animal, progeny, or cell that does not comprise the edited chromosomal sequences. For any of the animals, progeny, or cells described herein, the animal, progeny, or cell does not produce ANPEP protein.
Alternatively, or in addition, a gene may be so edited as to delete a partial sequence in protein that responsible for an undesirable trait. For example, but without limitation, a domain of a protein may be removed that binds to a pathogen. This allows the editing of genes that are essential for host survival while still rendering the host resistant to the disease.
The ANPEP gene in the animal, progeny, or cell can comprise any combination of any of the edited chromosomal sequences described herein.
Guide RNAs
Guide RNAs (gRNAs) are provided. The gRNAs have a nucleic acid sequence that is complementary to a sequence of a gene encoding an ANPEP protein and can be used to introduce a chromosomal edit into a gene encoding an ANPEP protein.
Illustrative gRNA sequences complementary to a sequence of a gene encoding an ANPEP protein are provided in SEQ ID NOs: 1-251. Although the sequences are listed with DNA nucleotides, a person of ordinary skill would understand that the sequences are in fact RNA sequences and would be readily able to convert the DNA sequences into RNA sequences. In some embodiments, the gRNA sequences target regions early in the ANPEP coding sequence or in coding regions containing protease active site residues.
The gRNA can comprise a nucleotide sequence comprising one or more of SEQ ID NOs: 1-251. The gRNA can have a length of 100 nucleotides or fewer, 90 nucleotides or fewer, 80 nucleotides or fewer, 70 nucleotides or fewer, 60 nucleotides or fewer, 50 nucleotides or fewer, 40 nucleotides or fewer, 30 nucleotides or fewer, or 20 nucleotides or fewer. For example, the gRNA can have a length of 20 nucleotides.
In certain embodiments, guide RNAs within ANPEP are selected for their ability to generate in-frame stop codons when paired.
DNA-Binding Polypeptides
In some embodiments, site-specific integration may be accomplished by utilizing factors that are capable of recognizing and binding to particular nucleotide sequences, for example, in the genome of a host organism. For instance, many proteins comprise
polypeptide domains that are capable of recognizing and binding to DNA in a site-specific manner. A DNA sequence that is recognized by a DNA-binding polypeptide may be referred to as a “target” sequence. Polypeptide domains that are capable of recognizing and binding to DNA in a site-specific manner generally fold correctly and function independently to bind DNA in a site-specific manner, even when expressed in a polypeptide other than the protein from which the domain was originally isolated. Similarly, target sequences for recognition and binding by DNA-binding polypeptides are generally able to be recognized and bound by such polypeptides, even when present in large DNA structures (e.g., a chromosome), particularly when the site where the target sequence is located is one known to be accessible to soluble cellular proteins (e.g., a gene).
While DNA-binding polypeptides identified from proteins that exist in nature typically bind to a discrete nucleotide sequence or motif (e.g., a consensus recognition sequence), methods exist and are known in the art for modifying many such DNA-binding polypeptides to recognize a different nucleotide sequence or motif. DNA-binding polypeptides include, for example and without limitation: zinc finger DNA-binding domains; leucine zippers; UPA DNA-binding domains; GAL4; TAL; LexA; a Tet repressor; LacR; and a steroid hormone receptor.
In some embodiments, a DNA-binding polypeptide is a zinc finger. Individual zinc finger motifs can be designed to target and bind specifically to any of a large range of DNA sites. Canonical Cys2His2 (as well as non-canonical CyssHis) zinc finger polypeptides bind DNA by inserting an a-helix into the major groove of the target DNA double helix. Recognition of DNA by a zinc finger is modular; each finger contacts primarily three consecutive base pairs in the target, and a few key residues in the polypeptide mediate recognition. By including multiple zinc finger DNA-binding domains in a targeting endonuclease, the DNA-binding specificity of the targeting endonuclease may be further increased (and hence the specificity of any gene regulatory effects conferred thereby may also be increased). See, e.g., Umov, F.D., et al., Nature, 2005, 435, 646-651. Thus, one or more zinc finger DNA-binding polypeptides may be engineered and utilized such that a targeting endonuclease introduced into a host cell interacts with a DNA sequence that is unique within the genome of the host cell.
Preferably, the zinc finger protein is non-naturally occurring in that it is engineered to bind to a target site of choice. See, for example, Beerli, R.R., et al., Nature Biotechnol.,
2002, 20, 135-141; Pabo, C.O., el a Ann. Rev. Biochem., 2001, 70, 313-340; Isalan, M., et al.. Nature Biotechnol., 2001, 19, 656-660; Segal, D.J., et al.. Curr. Opin. Biotechnol., 2001, 12, 632-637; Choo, Y., et al., 2000, Curr. Opin. Struct. Biol., 10, 411-416; U.S. Pat. Nos. 6,453,242; 6,534,261; 6,599,692; 6,503,717; 6,689,558; 7,030,215; 6,794,136; 7,067,317; 7,262,054; 7,070,934; 7,361,635; 7,253,273; and U.S. Patent Publication Nos. 2005/0064474; 2007/0218528; 2005/0267061, all incorporated herein by reference in their entireties.
An engineered zinc finger binding domain can have a novel binding specificity, compared to a naturally occurring zinc finger protein. Engineering methods include, but are not limited to, rational design and various types of selection. Rational design includes, for example, using databases comprising triplet (or quadruplet) nucleotide sequences and individual zinc finger amino acid sequences, in which each triplet or quadruplet nucleotide sequence is associated with one or more amino acid sequences of zinc fingers which bind the particular triplet or quadruplet sequence. See, for example, U.S. Pat. Nos. 6,453,242 and 6,534,261, incorporated by reference herein in their entireties.
Exemplary selection methods, including phage display and two-hybrid systems, are disclosed in U.S. Pat. Nos. 5,789,538; 5,925,523; 6,007,988; 6,013,453; 6,410,248; 6,140,466; 6,200,759; and 6,242,568; as well as WO 98/37186; WO 98/53057; WO 00/27878; WO 01/88197 and GB 2,338,237. In addition, enhancement of binding specificity for zinc finger binding domains has been described, for example, in WO 02/077227.
Selection of target sites; ZFPs and methods for design and construction of fusion proteins (and polynucleotides encoding same) are known to those of skill in the art and described in detail in U.S. Pat. Nos. 6,140,0815; 789,538; 6,453,242; 6,534,261; 5,925,523; 6,007,988; 6,013,453; 6,200,759; WO 95/19431; WO 96/06166; WO 98/53057; WO 98/54311; WO 00/27878; WO 01/60970 WO 01/88197; WO 02/099084; WO 98/53058; WO 98/53059; WO 98/53060; WO 02/016536 and WO 03/016496.
In addition, as disclosed in these and other references, zinc finger domains and/or multi-fingered zinc finger proteins may be linked together using any suitable linker sequences, including for example, linkers of 5 or more amino acids in length. See, also, U.S. Pat. Nos. 6,479,626; 6,903,185; and 7,153,949 for exemplary linker sequences 6 or
more amino acids in length. The proteins described herein may include any combination of suitable linkers between the individual zinc fingers of the protein.
In some examples, a DNA-binding polypeptide is a DNA-binding domain from GAL4. GAL4 is a modular transactivator in Saccharomyces cerevisiae, but it also operates as a transactivator in many other organisms. See, e.g., Sadowski, I., et al., Nature, 1988, 335, 563-564. In this regulatory system, the expression of genes encoding enzymes of the galactose metabolic pathway in S. cerevisiae is stringently regulated by the available carbon source. Johnston, M., Microbiol. Rev., 1987, 51, 458-476. Transcriptional control of these metabolic enzymes is mediated by the interaction between the positive regulatory protein, GAIA, and a 17 bp symmetrical DNA sequence to which GAL4 specifically binds (the UAS).
Native GAIA consists of 881 amino acid residues, with a molecular weight of 99 kDa. GAL4 comprises functionally autonomous domains, the combined activities of which account for activity of GAL4 in vivo (Ma, J. and Ptashne, M., Cell, 1987, 48, 847-853; Brent, R. and Ptashne, M., Cell, 1985 43, 729-736). The N-terminal 65 amino acids of GAL4 comprise the GAIA DNA-binding domain (Keegan, L., et al., Science, 1986, 231, 699-704; Johnston, M., Nature, 1987, 328, 353-355). Sequence-specific binding requires the presence of a divalent cation coordinated by 6 Cys residues present in the DNA binding domain. The coordinated cation-containing domain interacts with and recognizes a conserved CCG triplet at each end of the 17 bp UAS via direct contacts with the major groove of the DNA helix (Marmorstein, M., et al., Nature, 1992, 356, 408-414). The DNA- binding function of the protein positions C-terminal transcriptional activating domains in the vicinity of the promoter, such that the activating domains can direct transcription.
Additional DNA-binding polypeptides that may be utilized in certain embodiments include, for example and without limitation, a binding sequence from a AVRB S3 -inducible gene; a consensus binding sequence from a AVRB S3 -inducible gene or synthetic binding sequence engineered therefrom (e.g., UPA DNA-binding domain); TAL; LexA (see, e.g., Brent, R. and Ptashne, M., Cell, 1985 43, 729-736); LacR (see, e.g., Labow, M.A., et aL, Mol. Cell. Biol., 1990, 10, 3343-3356; Bairn, S.B., et al., Proc. Natl. Acad. Sci. USA 1991, 88, 5072-5076); a steroid hormone receptor (Elliston, J.F., et al., J. Biol. Chem. 1990, 265, 11517-11521); the Tet repressor (U.S. Pat. No. 6,271,341) and a mutated Tet repressor that binds to a tet operator sequence in the presence, but not the absence, of tetracycline (Tc);
the DNA-binding domain of NF-KB; and components of the regulatory system described in Wang, Y., et al., Proc. Natl. Acad. Sci. USA, 1994, 91, 8180-8184, which utilizes a fusion of GAL4, a hormone receptor, and VP 16.
In certain embodiments, the DNA-binding domain of one or more of the nucleases used in the methods and compositions described herein comprises a naturally occurring or engineered (non-naturally occurring) TAL effector DNA binding domain. See, e.g., U.S. Patent Publication No. 20110301073, incorporated by reference in its entirety herein.
In other embodiments, the nuclease comprises a CRISPR system (e.g., CRISPR/Cas9 system). The CRISPR (clustered regularly interspaced short palindromic repeats) locus, which encodes RNA components of the system, and the Cas (CRISPR- associated) locus, which encodes proteins (Jansen, R., et al., Mol. Microbiol., 2002, 43, 1565-1575; Makarova, K.,S., et aL, Nucleic Acids Res. 2002, 30: 482-496; Makarova, K.S., et al., Biol. Direct, 2006, 1, 7; Haft, D.H., et al., PLoS Comput. Biol., 2005, 1, e60) make up the gene sequences of the CRISPR/Cas nuclease system. CRISPR loci in microbial hosts contain a combination of Cas genes as well as non-coding RNA elements capable of programming the specificity of the CRISPR-mediated nucleic acid cleavage.
The Type II CRISPR system is one of the most well characterized systems and carries out targeted DNA double-strand break in four sequential steps. First, two noncoding RNAs, the pre-crRNA array and tracrRNA, are transcribed from the CRISPR locus. Second, tracrRNA hybridizes to the repeat regions of the pre-crRNA and mediates the processing of pre-crRNA into mature crRNAs containing individual spacer sequences. Third, the mature crRNA:tracrRNA complex directs Cas9 to the target DNA via Watson- Crick base-pairing between the spacer on the crRNA and the protospacer on the target DNA next to the protospacer adjacent motif (PAM), an additional requirement for target recognition. Finally, Cas9 mediates cleavage of target DNA to create a double-stranded break within the protospacer. Activity of the CRISPR/Cas system comprises of three steps: (i) insertion of exogenous DNA sequences into the CRISPR array to prevent future attacks, in a process called 'adaptation', (ii) expression of the relevant proteins, as well as expression and processing of the array, followed by (iii) RNA-mediated interference with the exogenous nucleic acid. Thus, in the bacterial cell, several Cas proteins are involved with the natural function of the CRISPR/Cas system and serve roles in functions such as insertion of the foreign DNA etc.
In certain embodiments, Cas protein may be a "functional derivative" of a naturally occurring Cas protein. A "functional derivative" of a native sequence polypeptide is a compound having a qualitative biological property in common with a native sequence polypeptide. "Functional derivatives" include, but are not limited to, fragments of a native sequence and derivatives of a native sequence polypeptide and its fragments, provided that they have a biological activity in common with a corresponding native sequence polypeptide. A biological activity contemplated herein is the ability of the functional derivative to hydrolyze a DNA substrate into fragments. The term "derivative" encompasses both amino acid sequence variants of polypeptide, covalent modifications, and fusions thereof. Suitable derivatives of a Cas polypeptide or a fragment thereof include but are not limited to mutants, fusions, covalent modifications of Cas protein or a fragment thereof. Cas protein, which includes Cas protein or a fragment thereof, as well as derivatives of Cas protein or a fragment thereof, may be obtainable from a cell or synthesized chemically or by a combination of these two procedures. The cell may be a cell that naturally produces Cas protein, or a cell that naturally produces Cas protein and is genetically engineered to produce the endogenous Cas protein at a higher expression level or to produce a Cas protein from an exogenously introduced nucleic acid, which nucleic acid encodes a Cas that is same or different from the endogenous Cas. In some cases, the cell does not naturally produce Cas protein and is genetically engineered to produce a Cas protein.
Cas9 protein comprises a RuvC nuclease domain and an HNH (H-N-H) nuclease domain, each of which can cleave a single DNA strand at a target sequence (the concerted action of both domains leads to DNA double-strand cleavage, whereas activity of one domain leads to a nick). In general, the RuvC domain comprises subdomains I, II and III, where domain I is located near the N-terminus of Cas9 and subdomains II and III are located in the middle of the protein, flanking the HNH domain (Hsu, P.D., et al.. Cell, 2014, 157, 1262-1278). A type II CRISPR system includes a DNA cleavage system utilizing a Cas9 endonuclease in complex with at least one polynucleotide component. For example, a Cas9 can be in complex with a CRISPR RNA (crRNA) and a trans-activating CRISPR RNA (tracrRNA). In another example, a Cas9 can be in complex with a single guide RNA that combines a crRNA and a tracrRNA into a single molecule. The amino acid sequence of a Cas9 protein described herein, as well as certain other Cas proteins herein,
may be derived from a Streptococcus (e.g., S. pyogenes, S. pneumoniae, S. thermophilus, S. agalactiae, S. parasanguinis, S. oralis, S. salivarius, S. macacae, S. dysgalactiae, S. anginosus, S. constellatus, S.pseudoporcinus, S. mutans), Listeria (e.g., L. innocua), Spiroplasma (e.g., S. apis, S. syrphidicola), Peptostreptococcaceae, Atopobium, Porphyromonas (e.g., P. catoniae), Prevotella (e.g., P. intermedia), Veillonella, Treponema (e.g., T socranskii, T denticola), Capnocytophaga, Finegoldia (e.g., F. magno), Coriobacteriaceae (e.g., C. bacterium), Olsenella (e.g., O. profusa), Haemophilus (e.g., H. sputorum, H. pittmaniae), Pasteurella (e.g., P. bettyae), Olivibacter (e.g., O. sitiensis), Epilithonimonas (e.g., E. tenax), Mesonia (e.g., M. mobilis), Lactobacillus (e.g., L. plantarum), Bacillus (e.g., B. cere us), Aquimarina (e.g., A. muelleri), Chryseobacterium (e.g., C. palustre), Bacteroides (e.g., B. graminisolvens), Neisseria (e.g., N. meningitidis), Francisella (e.g., F. novicida), or Flavobacterium (e.g. , F. frigidarium, F. soli) species, for example. As another example, a Cas9 protein can be any of the Cas9 proteins disclosed in Chylinski et al. (Chylinski, K., et al., RNA Biology, 2013, 10, 726-737 and US patent application 62/162377, filed May 15, 2015) which is incorporated herein by reference.
Accordingly, the sequence of a Cas9 protein herein can comprise, for example, any of the Cas9 amino acid sequences disclosed in GenBank Accession Nos. G3ECR1 (S. thermophilus), WP_026709422, WP_027202655, WP_027318179, WP_027347504, WP_027376815, WP_027414302, WP_027821588, WP_027886314, WP_027963583, WP 028123848, WP 028298935, Q03J16 (S. thermophilus), EGP66723, EGS38969, EGV05092, EH165578 (S. pseudoporcinus), EIC75614 (S. oralis), EID22027 (S. constellatus), EU69711, EJP22331 (S. oralis), EJP26004 (S.. anginosus), EJP30321, EPZ44001 (S. pyogenes), EPZ46028 (S. pyogenes), EQL78043 (S. pyogenes), EQL78548 (S. pyogenes), ERL10511, ERL12345, ERL19088 (S. pyogenes), ESA57807 (S. pyogenes), ESA59254 (S. pyogenes), ESU85303 (S. pyogenes), ETS96804, UC75522, EGR87316 (S. dysgalactiae), EGS33732, EGV01468 (S. oralis), EHJ52063 (S. macacae), EID26207 (S. oralis), EID33364, EIG27013 (S. parasanguinis), EJF37476, EJO19166 (Streptococcus sp. BS35b), EJU16049, EJU32481, YP_006298249, ERF61304, ERK04546, ETJ95568 (S. agalactiae), TS89875, ETS90967 (Streptococcus sp. SR4), ETS92439, EUB27844, (Streptococcus sp. BS21), AFJ08616, EUC82735 (Streptococcus sp. CM6), EWC92088, EWC94390, EJP25691, YP_008027038, YP_008868573, AGM26527, AHK22391, AHB36273, Q927P4, G3ECR1, or Q99ZW2 (S. pyogenes),
which are incorporated by reference. A variant of any of these Cas9 protein sequences may be used, but should have specific binding activity, and optionally endonucleolytic activity, toward DNA when associated with an RNA component herein. Such a variant Cas9 protein should have specific binding activity, and optionally cleavage or nicking activity, toward DNA when associated with an RNA component herein. A Cas protein herein such as a Cas9 can comprise a heterologous nuclear localization sequence (NLS). A heterologous NLS amino acid sequence herein may be of sufficient strength to drive accumulation of a Cas protein in a detectable amount in the nucleus of a yeast cell herein, for example. An NLS may comprise one (monopartite) or more (e.g., bipartite) short sequences (e.g., 2 to 20 residues) of basic, positively charged residues (e.g., lysine and/or arginine), and can be located anywhere in a Cas amino acid sequence but such that it is exposed on the protein surface. An NLS may be operably linked to the N-terminus or C-terminus of a Cas protein herein, for example. Two or more NLS sequences can be linked to a Cas protein, for example, such as on both the N- and C-termini of a Cas protein. Non-limiting examples of suitable NLS sequences herein include those disclosed in U.S. Patent No. 7309576, which is incorporated herein by reference.
The Cas endonuclease can comprise a modified form of the Cas9 polypeptide. The modified form of the Cas9 polypeptide can include an amino acid change (e.g., deletion, insertion, or substitution) that reduces the naturally occurring nuclease activity of the Cas9 protein. For example, in some instances, the modified form of the Cas9 protein has less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, or less than 1% of the nuclease activity of the corresponding wild-type Cas9 polypeptide (US patent application US20140068797 Al, published on March 6, 2014). In some cases, the modified form of the Cas9 polypeptide has no substantial nuclease activity and is referred to as catalytically "inactivated Cas9" or "deactivated cas9 (dCas9)." Catalytically inactivated Cas9 variants include Cas9 variants that contain mutations in the HNH and RuvC nuclease domains. These catalytically inactivated Cas9 variants are capable of interacting with sgRNA and binding to the target site in vivo but cannot cleave either strand of the target DNA.
A catalytically inactive Cas9 can be fused to a heterologous sequence (US patent application US20140068797 Al, published on March 6, 2014). Suitable fusion partners include, but are not limited to, a polypeptide that provides an activity that indirectly
increases transcription by acting directly on the target DNA or on a polypeptide (e.g., a histone or other DNA-binding protein) associated with the target DNA. Additional suitable fusion partners include, but are not limited to, a polypeptide that provides for methyltransferase activity, demethylase activity, acetyltransferase activity, deacetylase activity, kinase activity, phosphatase activity, ubiquitin ligase activity, deubiquitinating activity, adenylation activity, deadenylation activity, SUMOylating activity, deSUMOylating activity, ribosylation activity, deribosylation activity, myristoylation activity, or demyristoylation activity. Further suitable fusion partners include, but are not limited to, a polypeptide that directly provides for increased transcription of the target nucleic acid (e.g., a transcription activator or a fragment thereof, a protein or fragment thereof that recruits a transcription activator, a small molecule/drug-responsive transcription regulator, etc.). A catalytically inactive Cas9 can also be fused to a Fok I nuclease to generate double-strand breaks (Guilinger, J.P., et al., Nature Biotechnology, 2014, 32, 577-582).
In particular embodiments, a DNA-binding polypeptide specifically recognizes and binds to a target nucleotide sequence comprised within a genomic nucleic acid of a host organism. Any number of discrete instances of the target nucleotide sequence may be found in the host genome in some examples. The target nucleotide sequence may be rare within the genome of the organism (e.g., fewer than about 10, about 9, about 8, about 7, about 6, about 5, about 4, about 3, about 2, or about 1 copy(ies) of the target sequence may exist in the genome). For example, the target nucleotide sequence may be located at a unique site within the genome of the organism. Target nucleotide sequences may be, for example and without limitation, randomly dispersed throughout the genome with respect to one another; located in different linkage groups in the genome; located in the same linkage group; located on different chromosomes; located on the same chromosome; located in the genome at sites that are expressed under similar conditions in the organism (e.g., under the control of the same, or substantially functionally identical, regulatory factors); and located closely to one another in the genome (e.g., target sequences may be comprised within nucleic acids integrated as concatemers at genomic loci).
Targeting Endonucleases
In particular embodiments, a DNA-binding polypeptide that specifically recognizes and binds to a target nucleotide sequence may be comprised within a chimeric polypeptide,
so as to confer specific binding to the target sequence upon the chimeric polypeptide. In examples, such a chimeric polypeptide may comprise, for example and without limitation, nuclease, recombinase, and/or ligase polypeptides, as these polypeptides are described above. Chimeric polypeptides comprising a DNA-binding polypeptide and a nuclease, recombinase, and/or ligase polypeptide may also comprise other functional polypeptide motifs and/or domains, such as for example and without limitation: a spacer sequence positioned between the functional polypeptides in the chimeric protein; a leader peptide; a peptide that targets the fusion protein to an organelle (e.g., the nucleus); polypeptides that are cleaved by a cellular enzyme; peptide tags (e.g., Myc, His, etc.); and other amino acid sequences that do not interfere with the function of the chimeric polypeptide.
Functional polypeptides (e.g., DNA-binding polypeptides and nuclease polypeptides) in a chimeric polypeptide may be operatively linked. In some embodiments, functional polypeptides of a chimeric polypeptide may be operatively linked by their expression from a single polynucleotide encoding at least the functional polypeptides ligated to each other in-frame, so as to create a chimeric gene encoding a chimeric protein. In alternative embodiments, the functional polypeptides of a chimeric polypeptide may be operatively linked by other means, such as by cross-linkage of independently expressed polypeptides.
In some embodiments, a DNA-binding polypeptide, or guide RNA that specifically recognizes and binds to a target nucleotide sequence may be comprised within a natural isolated protein (or mutant thereof), wherein the natural isolated protein or mutant thereof also comprises a nuclease polypeptide (and may also comprise a recombinase and/or ligase polypeptide). Examples of such isolated proteins include TALENs, recombinases (e.g., Cre, Hin, Tre, and FLP recombinase), CRISPR systems (e.g., CRISPR/Cas9 systems), and meganucleases.
As used herein, the term "targeting endonuclease" refers to natural or engineered isolated proteins and mutants thereof that comprise a DNA-binding polypeptide or guide RNA and a nuclease polypeptide, as well as to chimeric polypeptides comprising a DNA- binding polypeptide or guide RNA and a nuclease. Any targeting endonuclease comprising a DNA-binding polypeptide or guide RNA that specifically recognizes and binds to a target nucleotide sequence comprised within an ANPEP locus (e.g., either because the target sequence is comprised within the native sequence at the locus, or because the target
sequence has been introduced into the locus, for example, by recombination) may be utilized in certain embodiments.
Some examples of chimeric polypeptides that may be useful in particular embodiments of the present teachings include, without limitation, combinations of the following polypeptides: zinc finger DNA-binding polypeptides; a Fok I nuclease polypeptide; TALE domains; leucine zippers; transcription factor DNA-binding motifs; and DNA recognition and/or cleavage domains isolated from, for example and without limitation, a TALEN, a recombinase (e.g., Cre, Hin, RecA, Tre, and FLP recombinases), a CRISPR system (e.g., CRISPR/Cas9 system), a meganuclease; and others known to those in the art. Particular examples include a chimeric protein comprising a site-specific DNA binding polypeptide and a nuclease polypeptide. Chimeric polypeptides may be engineered by methods known to those of skill in the art to alter the recognition sequence of a DNA- binding polypeptide comprised within the chimeric polypeptide, so as to target the chimeric polypeptide to a particular nucleotide sequence of interest.
In certain embodiments, the chimeric polypeptide comprises a DNA-binding domain (e.g., zinc finger, TAL-effector domain, etc.) and a nuclease (cleavage) domain. The cleavage domain may be heterologous to the DNA-binding domain, for example a zinc finger DNA-binding domain and a cleavage domain from a nuclease or a TALEN DNA- binding domain and a cleavage domain, or meganuclease DNA-binding domain and cleavage domain from a different nuclease. Heterologous cleavage domains can be obtained from any endonuclease or exonuclease. Exemplary endonucleases from which a cleavage domain can be derived include, but are not limited to, restriction endonucleases and homing endonucleases. See, for example, 2002-2003 Catalogue, New England Biolabs, Beverly, Mass.; and Belfort, M. and Roberts, J., Nucleic Acids Res., 1997, 25, 3379-3388. Additional enzymes which cleave DNA are known (e.g., 51 Nuclease; mung bean nuclease; pancreatic DNase I; micrococcal nuclease; yeast HO endonuclease; see also Linn et al. (eds.) Nucleases, Cold Spring Harbor Laboratory Press, 1993). One or more of these enzymes (or functional fragments thereof) can be used as a source of cleavage domains and cleavage half-domains.
Similarly, a cleavage half-domain can be derived from any nuclease or portion thereof, as set forth above, that requires dimerization for cleavage activity. In general, two fusion proteins are required for cleavage if the fusion proteins comprise cleavage half-
domains. Alternatively, a single protein comprising two cleavage half-domains can be used. The two cleavage half-domains can be derived from the same endonuclease (or functional fragments thereof), or each cleavage half-domain can be derived from a different endonuclease (or functional fragments thereof). In addition, the target sites for the two fusion proteins are preferably disposed, with respect to each other, such that binding of the two fusion proteins to their respective target sites places the cleavage half-domains in a spatial orientation to each other that allows the cleavage half-domains to form a functional cleavage domain, e.g., by dimerizing. Thus, in certain embodiments, the near edges of the target sites are separated by 5-8 nucleotides or by 15-18 nucleotides. However, any integral number of nucleotides, or nucleotide pairs, can intervene between two target sites (e.g., from 2 to 50 nucleotide pairs or more). In general, the site of cleavage lies between the target sites.
Restriction endonucleases (restriction enzymes) are present in many species and are capable of sequence-specific binding to DNA (at a recognition site), and cleaving DNA at or near the site of binding, for example, such that one or more exogenous sequences (donors/transgenes) are integrated at or near the binding (target) sites. Certain restriction enzymes (e.g., Type IIS) cleave DNA at sites removed from the recognition site and have separable binding and cleavage domains. For example, the Type IIS enzyme Fok I catalyzes double-stranded cleavage of DNA, at 9 nucleotides from its recognition site on one strand and 13 nucleotides from its recognition site on the other. See, for example, U.S. Pat. Nos. 5,356,802; 5,436,150 and 5,487,994; as well as Li, L., et al., Proc. Natl. Acad. Sci. USA, 1992, 89, 4275-4279; Li, L., el al.. Proc. Natl. Acad. Sci. USA, 1993, 90, 2764- 2768; Kim,Y-G., el al.. Proc. Natl. Acad. Sci. USA, 1994, 91, 883-887; Kim, Y-G., el al., J. Biol. Chem., 1994, 269, 31978-31982. Thus, in one embodiment, fusion proteins comprise the cleavage domain (or cleavage half-domain) from at least one Type IIS restriction enzyme and one or more zinc finger binding domains, which may or may not be engineered.
An exemplary Type IIS restriction enzyme, whose cleavage domain is separable from the binding domain, is Fok I. This particular enzyme is active as a dimer (Bitinaite, J., et al., Proc. Natl. Acad. Sci. USA, 1998, 95, 10570-10575). Accordingly, for the purposes of the present disclosure, the portion of the Fok I enzyme used in the disclosed fusion proteins is considered a cleavage half-domain. Thus, for targeted double-stranded cleavage
and/or targeted replacement of cellular sequences using zinc finger-Fok I fusions, two fusion proteins, each comprising a Fok I cleavage half-domain, can be used to reconstitute a catalytically active cleavage domain. Alternatively, a single polypeptide molecule containing a DNA binding domain and two Fok I cleavage half-domains can also be used.
A cleavage domain or cleavage half-domain can be any portion of a protein that retains cleavage activity, or that retains the ability to multimerize (e.g., dimerize) to form a functional cleavage domain.
Exemplary Type IIS restriction enzymes are described in U.S. Patent Publication No. 20070134796, incorporated herein in its entirety. Additional restriction enzymes also contain separable binding and cleavage domains, and these are contemplated by the present disclosure. See, for example, Roberts, R.J., et al., Nucleic Acids Res., 2003, 31, 418-420.
In certain embodiments, the cleavage domain comprises one or more engineered cleavage half-domain (also referred to as dimerization domain mutants) that minimize or prevent homodimerization, as described, for example, in U.S. Patent Publication Nos. 20050064474; 20060188987 and 20080131962, the disclosures of all of which are incorporated by reference in their entireties herein.
Alternatively, nucleases may be assembled in vivo at the nucleic acid target site using so-called "split-enzyme" technology (see e.g. U.S. Patent Publication No. 20090068164). Components of such split enzymes may be expressed either on separate expression constructs, or can be linked in one open reading frame where the individual components are separated, for example, by a self-cleaving 2A peptide or IRES sequence. Components may be individual zinc finger binding domains or domains of a meganuclease nucleic acid binding domain.
Zinc Finger Nucleases
In specific embodiments, a chimeric polypeptide is a custom-designed zinc finger nuclease (ZFN) that may be designed to deliver a targeted site-specific double-strand DNA break into which an exogenous nucleic acid, or donor DNA, may be integrated (See US Patent publication 20100257638, incorporated by reference herein). ZFNs are chimeric polypeptides containing a non-specific cleavage domain from a restriction endonuclease (for example, Fok I) and a zinc finger DNA-binding domain polypeptide. See, e.g., Huang, B., et al., J. Protein Chem., 1996, 15, 481-489; Kim, J-S., et al. Proc. Natl. Acad. Sci. USA, 1997, 94, 3616-3620; Kim, Y-G., el al.. Proc. Natl. Acad. Sci. USA, 1996, 93, 1156-
1160; Kim, Y-G., et al., Proc Natl. Acad. Sci. USA, 1994, 91, 883-887; Kim, Y-G., etal., Proc. Natl. Acad. Sci. USA, 1997, 94, 12875-12879; Kim, Y-G., et al., Gene, 1997, 203, 43-49; Kim, Y-G., et al., Biol. Chem., 1998, 379, 489-495; Nahon, E. and Raveh, D., Nucleic Acids Res., 1998, 26, 1233-1239; Smith, J., et al., Nucleic Acids Res., 1999, 27, 674-681. In some embodiments, the ZFNs comprise non-canonical zinc finger DNA binding domains (see US Patent publication 20080182332, incorporated by reference herein). The Fok I restriction endonuclease must dimerize via the nuclease domain in order to cleave DNA and introduce a double-strand break. Consequently, ZFNs containing a nuclease domain from such an endonuclease also require dimerization of the nuclease domain in order to cleave target DNA (Mani, M. et al., Biochem. Biophys. Res. Commun., 2005, 334, 1191-1197; Smith, J., et al., Nucleic Acids Res. 2000, 28, 3361-3369). Dimerization of the ZFN can be facilitated by two adjacent, oppositely oriented DNA- binding sites (Id).
Optional Exogenous Nucleic Acids for Integration at an ANPEP Locus
Embodiments of the present teachings may include one or more nucleic acids selected from the group consisting of an exogenous nucleic acid for site-specific integration in an ANPEP locus, for example and without limitation, an ORF; a nucleic acid comprising a nucleotide sequence encoding a targeting endonuclease; and a vector comprising at least one of either or both of the foregoing. Thus, particular nucleic acids for use in some embodiments include nucleotide sequences encoding a polypeptide, structural nucleotide sequences, and/or DNA-binding polypeptide recognition and binding sites. Optional Exogenous Nucleic Acid Molecules for Site-Specific Integration
As noted above, insertion of an exogenous sequence (also called a "donor sequence" or "donor") is provided, for example for expression of a polypeptide, correction of a mutant gene or for increased expression of a wild-type gene. It will be readily apparent that the donor sequence is typically not identical to the genomic sequence where it is placed. A donor sequence can contain a non-homologous sequence flanked by two regions of homology to allow for efficient HDR at the location of interest. Additionally, donor sequences can comprise a vector molecule containing sequences that are not homologous to the region of interest in cellular chromatin. A donor molecule can contain several, discontinuous regions of homology to cellular chromatin. For example, for targeted insertion of sequences not normally present in a region of interest, said sequences can be
present in a donor nucleic acid molecule and flanked by regions of homology to sequence in the region of interest.
The donor polynucleotide can be DNA or RNA, single- stranded or double-stranded and can be introduced into a cell in linear or circular form. See e.g., U.S. Patent Publication Nos. 20100047805, 20110281361, 20110207221 and U.S. application Ser. No. 13/889,162. If introduced in linear form, the ends of the donor sequence can be protected (e.g. from exonucleolytic degradation) by methods known to those of skill in the art. For example, one or more dideoxynucleotide residues are added to the 3' terminus of a linear molecule and/or self-complementary oligonucleotides are ligated to one or both ends (see, e.g. Chang, X-B. and Wilson, J.H., Proc. Natl. Acad. Sci. USA, 1987, 84, 4959-4963; Nehls, M., et al., Science, 1996, 272, 886-889). Additional methods for protecting exogenous polynucleotides from degradation include, but are not limited to, addition of terminal amino group(s) and the use of modified intemucleotide linkages such as, for example, phosphorothioates, phosphoramidates, and O-methyl ribose or deoxyribose residues.
A polynucleotide can be introduced into a cell as part of a vector molecule having additional sequences such as, for example, replication origins or promoters. Moreover, donor polynucleotides can be introduced as naked nucleic acid, as nucleic acid complexed with an agent such as a liposome or poloxamer, or can be delivered by viruses; exemplary viruses include adenovirus, AAV, herpesvirus, retrovirus, lentivirus and integrase defective lenti virus (IDLV).
The donor is generally integrated so that its expression is driven by the endogenous promoter at the integration site, namely the promoter that drives expression of the endogenous gene into which the donor is integrated. However, it will be apparent that the donor may comprise a promoter and/or enhancer, for example a constitutive promoter or an inducible or tissue-specific promoter.
Furthermore, although not required for expression, exogenous sequences may also include transcriptional or translational regulatory sequences, for example, promoters, enhancers, insulators, internal ribosome entry sites, sequences encoding 2A peptides and/or polyadenylation signals.
Nucleic Acid Molecules Comprising a Nucleotide Sequence Encoding a Targeting Endonuclease
In some embodiments, a nucleotide sequence encoding a targeting endonuclease may be engineered by manipulation (e.g., ligation) of native nucleotide sequences encoding polypeptides comprised within the targeting endonuclease. For example, the nucleotide sequence of a gene encoding a protein comprising a DNA-binding polypeptide may be inspected to identify the nucleotide sequence of the gene that corresponds to the DNA-binding polypeptide, and that nucleotide sequence may be used as an element of a nucleotide sequence encoding a targeting endonuclease comprising the DNA-binding polypeptide. Alternatively, the amino acid sequence of a targeting endonuclease may be used to deduce a nucleotide sequence encoding the targeting endonuclease, for example, according to the degeneracy of the genetic code.
In exemplary nucleic acid molecules comprising a nucleotide sequence encoding a targeting endonuclease, the last codon of a first polynucleotide sequence encoding a nuclease polypeptide, and the first codon of a second polynucleotide sequence encoding a DNA-binding polypeptide, may be separated by any number of nucleotide triplets, e.g., without coding for an intron or a "STOP." Likewise, the last codon of a nucleotide sequence encoding a first polynucleotide sequence encoding a DNA-binding polypeptide, and the first codon of a second polynucleotide sequence encoding a nuclease polypeptide, may be separated by any number of nucleotide triplets. In these and further embodiments, the last codon of the last of a first polynucleotide sequence encoding a nuclease polypeptide, and a second polynucleotide sequence encoding a DNA-binding polypeptide, may be fused in phase-register with the first codon of a further polynucleotide coding sequence directly contiguous thereto, or separated therefrom by no more than a short peptide sequence, such as that encoded by a synthetic nucleotide linker (e.g., a nucleotide linker that may have been used to achieve the fusion). Examples of such further polynucleotide sequences include, for example and without limitation, tags, targeting peptides, and enzymatic cleavage sites. Likewise, the first codon of the most 5' (in the nucleic acid sequence) of the first and second polynucleotide sequences may be fused in phase-register with the last codon of a further polynucleotide coding sequence directly contiguous thereto, or separated therefrom by no more than a short peptide sequence.
A sequence separating polynucleotide sequences encoding functional polypeptides in a targeting endonuclease (e.g., a DNA-binding polypeptide and a nuclease polypeptide) may, for example, consist of any sequence, such that the amino acid sequence encoded is
not likely to significantly alter the translation of the targeting endonuclease. Due to the autonomous nature of known nuclease polypeptides and known DNA-binding polypeptides, intervening sequences generally will not interfere with the respective functions of these structures.
Other Knockout Methods
Various other techniques known in the art can be used to inactivate genes to make knock-out animals and/or to introduce nucleic acid constructs into animals to produce founder animals, in which the knockout or nucleic acid construct is integrated into the genome. Such techniques include, without limitation, pronuclear microinjection (U.S. Pat. No. 4,873,191), retrovirus mediated gene transfer into germ lines (Van der Putten H., et al., Proc. Natl. Acad. Sci. USA, 1985, 82, 6148-1652), gene targeting into embryonic stem cells (Thompson, S., et al., Cell, 1989, 56, 313-321), electroporation of embryos (Lo, W ., Mol. Cell. Biol., 1983, 3, 1803-1814), sperm-mediated gene transfer (Lavitrano, M., et al., Proc. Natl. Acad. Sci. USA, 2002 99, 14230-14235; Lavitrano, M., et al., Reprod. Fert. Develop., 2006, 18, 19-23), and in vitro transformation of somatic cells, such as cumulus or mammary cells, or adult, fetal, or embryonic stem cells, followed by nuclear transplantation (Wilmut, I, et al., Nature, 1997, 385, 810-813 and Wakayama, T., et al., Nature, 1998, 394, 369-374). Pronuclear microinjection, sperm mediated gene transfer, and somatic cell nuclear transfer are particularly useful techniques. An animal that is gene edited is an animal wherein all of its cells have the genetic edit, including its germ line cells. When methods are used that produce an animal that is mosaic in its genetic edit, the animals may be bred and progeny that are gene edited may be selected. Cloning, for instance, may be used to make a mosaic animal if its cells are edited at the blastocyst state, or gene editing can take place when a single cell is edited. Animals that are edited so that they exhibit reduced susceptibility to infection by a coronavirus can be homozygous or heterozygous for the edit, depending on the specific approach that is used. If a particular gene is inactivated by an edit, homozygosity would normally be required. However, the instant application provides for and includes mosaic animals and heterozygous animals that may be used to breed homozygous animals. Further, if a particular gene is inactivated by an RNA interference or dominant negative strategy, then heterozygosity is often adequate.
Typically, in embryo/zygote microinjection, a nucleic acid construct or mRNA is introduced into a fertilized egg; 1 or 2 cell fertilized eggs are used as the pronuclei
containing the genetic material from the sperm head and the egg are visible within the protoplasm. Pronuclear staged fertilized eggs can be obtained in vitro or in vivo (i.e., surgically recovered from the oviduct of donor animals). In vitro fertilized eggs can be produced as follows. For example, swine ovaries can be collected at an abattoir, and maintained at 22-28° C during transport. Ovaries can be washed and isolated for follicular aspiration, and follicles ranging from 4-8 mm can be aspirated into 50 mL conical centrifuge tubes using 18-gauge needles and under vacuum to suck the oocyte from the follicle on the ovary. Follicular fluid and aspirated oocytes can be rinsed through pre-filters with commercial TL-HEPES (Minitube, Verona, WI). Oocytes surrounded by a compact cumulus mass can be selected and placed into TCM-199 OOCYTE MATURATION MEDIUM (Minitube, Verona, WI) supplemented with 0.1 mg/mL cysteine, 10 ng/mL epidermal growth factor, 10% porcine follicular fluid, 50 pM 2-mercaptoethanol, 0.5 mg/ml cAMP, 10 lU/mL each of pregnant mare serum gonadotropin (PMSG) and human chorionic gonadotropin (hCG) for approximately 22 hours in humidified air at 38.7° C and 5% CO2. Subsequently, the oocytes can be moved to fresh TCM-199 maturation medium, which will not contain cAMP, PMSG or hCG and then the oocytes can be incubated for an additional 22 hours. Matured oocytes can be stripped of their cumulus cells by vortexing in 0.1% hyaluronidase for 1 minute.
For swine, mature oocytes can be fertilized in 500 pl Minitube PORCPRO IVF MEDIUM SYSTEM (Minitube, Verona, Wis.) in Minitube 5-well fertilization dishes. In preparation for in vitro fertilization (IVF), freshly collected or frozen boar semen can be washed and resuspended in PORCPRO IVF Medium to 4xl05 sperm. Sperm concentrations can be analyzed by computer assisted semen analysis (SPERM VISION®, Minitube, Verona, WI). Final in vitro insemination can be performed in a 10 pl volume at a final concentration of approximately 40 motile sperm/oocyte, depending on the boar. The oocytes can be incubated at 38.7° C in 5.0% CO2 atmosphere for 6 hours. Six hours postinsemination, presumptive zygotes can be washed twice in NCSU-23 and moved to 0.5 mL of the same medium. This system can produce 20-30% blastocysts routinely across most boars with a 10-30% polyspermic insemination rate.
Linearized nucleic acid constructs or mRNA can be injected into one of the pronuclei or into the cytoplasm. Then the injected eggs can be transferred to a recipient female (e.g., into the oviducts of a recipient female) and allowed to develop in the recipient
female to produce the genetically edited animals. In particular, in vitro fertilized embryos can be centrifuged at 15,000 x g for 5 minutes to sediment lipids allowing visualization of the pronucleus. The embryos can be injected with using an EPPENDORF® FEMTOJET® injector (EPPENDORF®, Hamburg, Germany) and can be cultured until blastocyst formation. Rates of embryo cleavage and blastocyst formation and quality can be recorded.
Embryos can be surgically transferred into uteri of asynchronous recipients. Typically, 20-200 embryos can be deposited into the ampulla-isthmus junction of the oviduct using a catheter. After surgery, real-time ultrasound examination of pregnancy can be performed.
In somatic cell nuclear transfer (SCNT), a gene edited cell (e.g., a gene edited pig cell) such as an embryonic blastomere, fetal fibroblast, adult ear fibroblast, or granulosa cell that includes a nucleic acid construct described above, can be introduced into an enucleated oocyte to establish a combined cell. Oocytes can be enucleated by partial zona dissection near the polar body and then pressing out cytoplasm at the dissection area. Typically, an injection pipette with a sharp beveled tip is used to inject the gene edited cell into an enucleated oocyte arrested at meiosis 2. In some conventions, oocytes arrested at meiosis-2 are termed eggs. After producing a porcine or bovine embryo (e.g., by fusing and activating the oocyte), the embryo is transferred to the oviducts of a recipient female, about 20 to 24 hours after activation. See, for example, Cibelli, J.B., et al., Science, 1998, 280, 1256-1258 and U.S. Pat. No. 6,548,741. For pigs, recipient females can be checked for pregnancy approximately 20-21 days after transfer of the embryos.
Standard breeding techniques can be used to create animals that are homozygous for the desired edit from the initial heterozygous or mosaic founder animals. Homozygosity may not be required, however. Gene edited animals described herein can be bred with other animals of interest. Regardless of whether or not heterozygous animals are resistant to coronavirus, these animals have utility at least in that they can be used in breeding, for example to breed homozygous animals. In order to add a gene edit to the germplasm of an elite herd, many different zygotes can be edited using methods of the present teachings. These animals can then be interbred to create a herd comprising an elite germplasm that includes the edit. This is especially important in establishing a pig herd with the edit, as it allows for the rapid integration of the edited gene into the pig herd.
In some embodiments, a nucleic acid of interest and a selectable marker can be provided on separate transposons and provided to either embryos or cells in unequal amount, where the amount of transposon containing the selectable marker far exceeds (5- 10 fold excess) the transposon containing the nucleic acid of interest. Gene edited cells or animals expressing the nucleic acid of interest can be isolated based on presence and expression of the selectable marker. Because the transposons will integrate into the genome in a precise and unlinked way (independent transposition events), the nucleic acid of interest and the selectable marker are not genetically linked and can easily be separated by genetic segregation through standard breeding. Thus, gene edited animals can be produced that are not constrained to retain selectable markers in subsequent generations.
Once gene edited animals have been generated, expression of a nucleic acid can be assessed using standard techniques. Initial screening can be accomplished by ILLUMINA® sequencing (ILLUMINA®, Inc., San Diego, CA) or Southern blot analysis to determine whether or not an edit has taken place.
Expression of a nucleic acid sequence encoding a polypeptide in the tissues of gene edited animals can be assessed using techniques that include, for example, northern blot analysis of tissue samples obtained from the animal, in situ hybridization analysis, western analysis, immunoassays such as enzyme-linked immunosorbent assays, and reversetranscriptase PCR (RT-PCR).
Interfering RNAs
A variety of interfering RNA (RNAi) systems are known. Double-stranded RNA (dsRNA) induces sequence-specific degradation of homologous gene transcripts. RNA- induced silencing complex (RISC) metabolizes dsRNA to small 21-23 -nucleotide small interfering RNAs (siRNAs). RISC contains a double-stranded RNAse (dsRNase, e.g., Dicer) and ssRNase (e.g., Argonaut 2 or Ago2). RISC utilizes antisense strand as a guide to find a cleavable target. Both siRNAs and microRNAs (miRNAs) are known. A method of inactivating a gene in a gene edited animal comprises inducing RNA interference against a target gene and/or nucleic acid such that expression of the target gene and/or nucleic acid is reduced.
For example, the exogenous nucleic acid sequence can induce RNA interference against a nucleic acid encoding a polypeptide. For example, double-stranded small interfering RNA (siRNA) or small hairpin RNA (shRNA) homologous to a target DNA
can be used to reduce expression of that DNA. Constructs for siRNA can be produced as described, for example, in Fire, A., et al., Nature, 1998, 391, 806-811; Romano, N. and Masino, G., Mol. Microbiol., 1992, 6, 3343-3353; Cogoni, C., et al., EMBO J., 1996, 15, 3153-3161; Cogoni, C. and Masino, G., Nature, 1999, 399, 166-169; Misquitta, L. and Paterson, B.M., Proc. Natl. Acad. Sci. USA, 1999, 96, 1451-1456; and Kennerdell, K. and Carthew, C., Cell, 1998 95, 1017-1026. Constructs for shRNA can be produced as described by McIntyre, G.J. and Fanning, G.C., BMC Biotechnology, 2006, 6, 1-8. In general, shRNAs are transcribed as a single-stranded RNA molecule containing complementary regions, which can anneal and form short hairpins.
The probability of finding a single, individual functional siRNA or miRNA directed to a specific gene is high. The predictability of a specific sequence of siRNA, for instance, is about 50% but a number of interfering RNAs may be made with good confidence that at least one of them will be effective.
Embodiments include an in vitro cell, an in vivo cell, and a gene edited animal such as a livestock animal that expresses an RNAi directed against an ANPEP gene. The RNAi may be, for instance, selected from the group consisting of siRNA, shRNA, dsRNA, and miRNA.
Vectors and Nucleic Acids
A variety of nucleic acids may be introduced into cells for knockout purposes, for inactivation of a gene, to obtain expression of a gene, or for other purposes. As used herein, the term nucleic acid includes DNA, RNA, and nucleic acid analogs, and nucleic acids that are double-stranded or single-stranded (i.e., a sense or an antisense single strand). Nucleic acid analogs can be modified at the base moiety, sugar moiety, or phosphate backbone to improve, for example, stability, hybridization, or solubility of the nucleic acid. Modifications at the base moiety include deoxyuridine for deoxythymidine, and 5-methyl-2'-deoxycytidine and 5-bromo-2'-doxycytidine for deoxycytidine. Modifications of the sugar moiety include modification of the 2' hydroxyl of the ribose sugar to form 2'-O-methyl or 2'-O-allyl sugars. The deoxyribose phosphate backbone can be modified to produce morpholino nucleic acids, in which each base moiety is linked to a six membered, morpholino ring, or peptide nucleic acids, in which the deoxyphosphate backbone is replaced by a pseudopeptide backbone and the four bases are retained. See, Summerton, J. and Weller, D., Antisense Nucleic Acid Drug Dev., 1997, 7, 187-195; and
Hyrup, B and Nielsen, P.E., Bioorgan. Med. Chem., 1996, 4, 5-23. In addition, the deoxyphosphate backbone can be replaced with, for example, a phosphorothioate or phosphorodithioate backbone, a phosphoroamidite, or an alkyl phosphotri ester backbone.
The target nucleic acid sequence can be operably linked to a regulatory region such as a promoter. Regulatory regions can be porcine regulatory regions or can be from other species. As used herein, operably linked refers to positioning of a regulatory region relative to a nucleic acid sequence in such a way as to permit or facilitate transcription of the target nucleic acid.
Any type of promoter can be operably linked to a target nucleic acid sequence. Examples of promoters include, without limitation, tissue-specific promoters, constitutive promoters, inducible promoters, and promoters responsive or unresponsive to a particular stimulus. Suitable tissue-specific promoters can result in preferential expression of a nucleic acid transcript in beta cells and include, for example, the human insulin promoter. Other tissue-specific promoters can result in preferential expression in, for example, hepatocytes or heart tissue and can include the albumin or alpha-myosin heavy chain promoters, respectively. In other embodiments, a promoter that facilitates the expression of a nucleic acid molecule without significant tissue or temporal specificity can be used (z.e., a constitutive promoter). For example, a beta-actin promoter such as the chicken beta-actin gene promoter, ubiquitin promoter, miniCAGs promoter, glyceraldehyde-3 -phosphate dehydrogenase (GAPDH) promoter, or 3 -phosphoglycerate kinase (PGK) promoter can be used, as well as viral promoters such as the herpes simplex virus thymidine kinase (HSV- TK) promoter, the SV40 promoter, or a cytomegalovirus (CMV) promoter. In some embodiments, a fusion of the chicken beta actin gene promoter and the CMV enhancer is used as a promoter. See, for example, Xu, L., et al.. Hum. Gene Then, 2001, 12, 563-573; and Kiwaki, K., et al., Hum. Gene Then, 1996, 7, 821-830.
Additional regulatory regions that may be useful in nucleic acid constructs, include, but are not limited to, polyadenylation sequences, translation control sequences (e.g., an internal ribosome entry segment, IRES), enhancers, inducible elements, or introns. Such regulatory regions may not be necessary, although they may increase expression by affecting transcription, stability of the mRNA, translational efficiency, or the like. Such regulatory regions can be included in a nucleic acid construct as desired to obtain optimal
expression of the nucleic acids in the cell(s). Sufficient expression, however, can sometimes be obtained without such additional elements.
Nucleic acid constructs can be introduced into embryonic, fetal, or adult animal cells of any type, including, for example, germ cells such as an oocyte or an egg, a progenitor cell, an adult or embryonic stem cell, a primordial germ cell, a kidney cell such as a PK-15 cell, an islet cell, a beta cell, a liver cell, or a fibroblast such as a dermal fibroblast, using a variety of techniques. Non-limiting examples of techniques include the use of transposon systems, recombinant viruses that can infect cells, or liposomes or other non-viral methods such as electroporation, microinjection, or calcium phosphate precipitation, that are capable of delivering nucleic acids to cells.
Nucleic acids can be incorporated into vectors. A vector is a broad term that includes any specific DNA segment that is designed to move from a carrier into a target DNA. A vector may be referred to as an expression vector, or a vector system, which is a set of components needed to bring about DNA insertion into a genome or other targeted DNA sequence such as an episome, plasmid, or even virus/phage DNA segment. Vector systems such as viral vectors (e.g., retroviruses, adeno-associated virus and integrating phage viruses), and non-viral vectors (e.g., transposons) used for gene delivery in animals have two basic components: 1) a vector comprised of DNA (or RNA that is reverse transcribed into a cDNA) and 2) a transposase, recombinase, or other integrase enzyme that recognizes both the vector and a DNA target sequence and inserts the vector into the target DNA sequence. Vectors most often contain one or more expression cassettes that comprise one or more expression control sequences, wherein an expression control sequence is a DNA sequence that controls and regulates the transcription and/or translation of another DNA sequence or mRNA, respectively.
Many different types of vectors are known. For example, plasmids and viral vectors, e.g., retroviral vectors, are known. Mammalian expression plasmids typically have an origin of replication, a suitable promoter and optional enhancer, necessary ribosome binding sites, a polyadenylation site, splice donor and acceptor sites, transcriptional termination sequences, and 5' flanking non-transcribed sequences. Examples of vectors include: plasmids (which may also be a carrier of another type of vector), adenovirus, adeno-associated virus (AAV), lentivirus (e.g., modified HIV-1, SIV or FIV), retrovirus
(e.g., ASV, ALV or MoMLV), and transposons (e.g., Sleeping Beauty, P-elements, Tol-2, Frog Prince, piggyBac).
As used herein, the term nucleic acid refers to both RNA and DNA, including, for example, cDNA, genomic DNA, synthetic (e.g., chemically synthesized) DNA, as well as naturally occurring and chemically modified nucleic acids, e.g, synthetic bases or alternative backbones. A nucleic acid molecule can be double-stranded or single-stranded (i.e., a sense or an antisense single strand).
Founder Animals, Traits, and Reproduction
Founder animals may be produced by cloning and other methods described herein. The founders can be homozygous for a gene edit, as in the case where a zygote or a primary cell undergoes a homozygous edit. Similarly, founders can also be made that are heterozygous. All of a founder's cells genome may carry the gene edit or founders may be mosaic for an edit. Mosaicism depends on the stage at which the gene editing reagents were introduced, the half life of the reagents, and at what stage editing occurred. For example, viable gene editing reagents may still be in a two-cell stage embryo that was injected as a zygote. Progeny of mosaic animals may be tested to identify progeny that have the desired edit.
In livestock, many alleles are known to be linked to various traits such as production traits, type traits, workability traits, and other functional traits. Practitioners are accustomed to monitoring and quantifying these traits, see e.g., Visscher, P.M., et al., Livestock Production Science, 1994, 40, 123-137, U.S. Pat. No. 7,709,206, US 2001/0016315, US 2011/0023140, and US 2005/0153317. An animal may include a trait chosen from a trait in the group consisting of a production trait, a type trait, a workability trait, a fertility trait, a mothering trait, and a disease resistance trait. Further traits include expression of a recombinant gene product. Animals with a desired trait or traits may be edited to increase their resistance to infection with a coronavirus.
In addition to monitoring traits, artisans can look at the genetic background of the animal as a whole. Gene edits such as those of the present teachings may be made in elite genetic backgrounds. Elite PIC™ (Pig Improvement Company, Limited, Basingstoke, UK) lines 2, 3, 15, 19, 27, 62 and 65 are lines selected for superior commercial phenotypes. Fibroblast cell lines can be grown from collagenase treated ear notch samples extracted from the animals from these cell lines using methods known in the art.
Other potential porcine lines include lines can be PIC™ Line 15, PIC™ Line 17, PIC™ Line 27, PIC™ Line 65, PIC™ Line 14, PIC™ Line 62, PIC337, PIC800, PIC280, PIC327, PIC408, PIC™ 399, PIC410, PIC415, PIC359, PIC380, PIC837, PIC260, PIC265, PIC210, PIC™ Line 2, PIC™ Line 3, PIC™ Line 4, PIC™ Line 5, PIC™ Line 18, PIC™ Line 19, PIC™ Line 92, PIC95, PIC™ CAMBOROUGH® (Pig Improvement Company, Limited, Basingstoke, UK), PIC 1070, PIC™ CAMBOROUGH® 40, PIC™ CAMBOROUGH® 22, PIC 1050, PIC™ CAMBOROUGH® 29, PIC™ CAMBOROUGH® 48, or PIC™ CAMBOROUGH® x54.
In various aspects, PIC™ Line 65 is sold under the trade name PIC337. In various aspects, PIC™ line 62 is sold under the tradename PIC408. In various aspects, hybrid pigs made by crossing PIC™ lines 15 and 17 are sold under the tradenames PIC800 or PIC280. In various aspects, PIC™ Line 27 is sold under the tradename PIC327. In various aspects, hybrids created from crossing PIC™ Line 65 and PIC™ Line 62 are sold under the tradenames PIC399, PIC410, or PIC415. In various aspects, hybrids created from crossing PIC™ Line 65 and Pic Line 27 are sold under the tradename PIC359. In various aspects, hybrids prepared from crossing PIC™ Line 800 pigs (which is a hybrid of PIC™ Line 15 and PIC™ Line 17) to PIC™ Line 65 pigs are sold under the tradenames PIC380 or PIC837. In various aspects, PIC™ Line 14 is sold under the trade name PIC260. In various aspects, hybrids created from crossing PIC™ Line 14 and PIC™ Line 65 are sold under the tradename PIC265. In various aspects, hybrids created by crossing PIC™ Line 2 and PIC™ Line 3 are sold under the tradenames PIC210, PIC™ CAMBOROUGH®, and PIC 1050. In various aspects, hybrids of PIC™ Line 3 and PIC™ Line 92 are sold under the tradename PIC95. In various configurations, hybrids made from crossing PIC™ Line 19 and PIC™ Line 3 are sold under the tradename PIC 1070. In various aspects, hybrids created by crossing PIC™ Line 18 and PIC™ Line 3 are sold under the tradename PIC™ CAMBOROUGH® 40. In various aspects, hybrids created from crossing PIC™ Line 19 and PIC 1050 (which is itself a hybrid of PIC™ lines 2 and 3) are sold under the tradename PIC™ CAMBOROUGH® 22. In various aspects, hybrids created from crossing PIC™ Line 2 and PIC 1070 (which is itself a hybrid of PIC™ lines 19 and 3) are sold under the tradename PIC™ CAMBOROUGH® 29. In various aspects, hybrids created from crossing PIC™ Line 18 and PIC 1050 (which is itself a hybrid of PIC™ lines 2 and 3) are sold under the tradename PIC™ CAMBOROUGH® 48. In various aspects, hybrids created from
crossing PIC™ Line 4 and PIC™ Line 5 are sold under the tradename PIC™ CAMBOROUGH® x54.
Confirmation of Edits
One useful method of detecting the desired edit is to use real-time PCR. PCR primers flanking the region of interest and a probe that specifically anneals to the region of interest can be designed. The probe can be labelled with both a fluorophore and a quencher. In the PCR reaction, the primers and probe hybridize in a sequence-dependent manner to the complementary DNA strand of the region of interest. Because the probe is intact, the fluorophore and quencher are in close proximity and the quencher absorbs fluorescence emitted by the fluorophore. The polymerase extends from the primers and begins DNA synthesis. When the polymerase reaches the probe, the exonuclease activity of the polymerase cleaves the hybridized probe. As a result of cleavage, the fluorophore is separated from the quencher and fluoresces. This fluorescence is detected by the real time instrument. These steps are repeated for each PCR cycle and allow detection of specific products. A commercial real-time PCR kit can be used to probe various animals for the desired edit. A variety of commercial real-time PCR kits exist including, such as, but without limitation, PRIMETIME® from IDT, TAQMAN® (Roche Molecular Systems, Inc, Pleasonton, CA) from Applied Biosystems, and various kits from Qiagen and Bio-Rad. Skilled persons will recognize that any such kit can be used with the primers and methods of the present teachings to achieve like results.
EXAMPLES
The present teachings including descriptions provided in the Examples that are not intended to limit the scope of any claim or embodiment. The following non-limiting examples are provided to further illustrate the present teachings. Those of skill in the art, in light of the present disclosure, will appreciate that many changes can be made in the specific embodiments that are disclosed and still obtain a like or similar result without departing from the spirit and scope of the present teachings.
Example 1
This example demonstrates the use of gRNAs, in combination with Cas protein activity, to generate double-stranded breaks in DNA, which upon host-cell mediated nonhom ologous end-joining, results in the introduction of a premature stop codons in coding
regions of ANPEP. Upon protein translation of the resultant mRNA encoded by the new ANPEP gene, premature stop codons in ANPEP mRNA would result in premature termination of protein translation and ultimately truncated and non-functional proteins, and/or initiate nonsense-mediated mRNA decay resulting in elimination of ANPEP mRNA transcripts. Premature stop codons can be introduced via the homology directed repair (HDR) pathway by inclusion of a single- or double-stranded DNA template in editing experiments. However, exogenously added DNA can integrate randomly in the genome. Therefore, it would be advantageous to identify single gRNAs or gRNA pairs that promote the formation of in-frame stop codons, without the introduction of non-wild type amino acids. To accomplish this, one or two gRNAs can be used to direct nuclease cut sites, which are then repaired by NHEJ to affect the desired edit.
Guides within ANPEP were tested computationally for their ability to generate inframe stop codons when paired. Computational predictions were subsequently tested in porcine fetal fibroblasts (PFFs), as described below. The gRNAs listed in Table 3 below were generated by in vitro transcription and complexed with SpyCas9 in water, using 3.2 pg of Cas9 protein and 2.2 pg of gRNA in a total volume of 2.23 pl. Resulting ribonucleoprotein (RNP) complexes were then combined 1 : 1 in a total volume of 2.23 pl to generate gRNA pairs, as indicated in Table 3, and nucleofected into PFFs using a Lonza electroporator. In preparation for nucleofection, PFF cells were harvested using TRYPLE EXPRESS™ (ThermoFisher, recombinant Trypsin): the culture medium was removed from cells, cells were washed once with Hank’s Balanced Salt Solution (HBSS) or Dulbecco’s Phosphate-Buffered Saline (DPBS), and incubated for 3-5 minutes at 38.5° C in the presence of TrypLE. Cells were then harvested with complete medium. Cells were pelleted via centrifugation (300 g x 5 minutes at room temperature), supernatant was discarded, and then the cells were resuspended in 10 mL PBS to obtain single cell suspensions to allow cell counting using trypan blue staining. After counting, cells were pelleted via centrifugation, the supernatant was discarded, and the cells were resuspended in nucleofection buffer P3 at a final concentration of 7.5xl06 cells/ml. 20 pl of the cell suspension was added to each well of a nucleofection cuvette containing the RNP mixture and then mixed gently to resuspend the cells. The RNP/cell mixture was then nucleofected with program CM138 (provided by the manufacturer). Following nucleofection, 80 pl of warm Embryonic Fibroblast Medium (EFM) (Dulbecco’s Modified Eagle’s Medium
(DMEM) containing 2.77 mM glucose, 1.99 mM L-glutamine, and 0.5 mM sodium pyruvate, supplemented with 100 pM 2-Mercaptoethanol, IX Eagle’s minimum essential medium non-essential amino acids (MEM NEAA), 100 pg/mL Penicillin-Streptomycin, and 12% Fetal Bovine Serum) was added to each well. The suspensions were mixed gently by pipetting, and then 100 pl was transferred to a 12-well plate containing 900 pl of EFM pre-incubated at 38.5°C. The plate was then incubated at 38.5°C, 5% CO2, 5% O2, 90% N2 for 48 hours. Forty-eight hours after nucleofection, genomic DNA was prepared from transfected and control PFF cells: 15 pl of QUICKEXTRACT™ DNA Extraction Solution (Lucigen) was added to pelleted cells, and the cells were then lysed by incubating for 10 minutes at 37° C, for 8 minutes at 65° C, and for 5 minutes at 95° C. Lysate was held at 4° C until used for DNA sequencing.
To evaluate NHEJ repair outcomes at ANPEP target sites mediated by the guide RNA/Cas endonuclease system, a region of approximately 250 bp of genomic DNA surrounding the target site was amplified by PCR and then the PCR product was examined by amplicon deep sequencing for the presence and nature of repairs. After transfection in triplicate, PFF genomic DNA was extracted and the region surrounding the intended target site was PCR amplified with Q5 Polymerase (NEB) adding sequences necessary for amplicon-specific barcodes and ILLUMINA® sequencing using tailed primers through two rounds of PCR, respectively. The resulting PCR amplification products were deep sequenced on an ILLUMINA® MISEQ® Personal Sequencer (ILLUMINA®, Inc., San Diego, CA). The resulting reads were examined for the presence and nature of repairs at the expected sites of cleavage by comparison to control experiments where the Cas9 protein and guide RNA were omitted from the transfection or by comparison to the reference genome. To calculate the frequency of NHEJ mutations for a target site/Cas9 protein/guide RNA combination, the total number of mutant reads (amplicon sequences containing insertions or deletions when compared to the DNA sequences from control treatments or reference genome) was divided by total read number (wild-type plus mutant reads) of an appropriate length containing a perfect match to the barcode and forward primer. NHEJ activity is expressed as the average (n=#) mutant fraction in Table 3
A subset of gRNA pairs screened in porcine fetal fibroblasts were additionally tested for their ability to introduce a premature stop codon in the ANPEP coding regions in porcine embryos. The subset of guides to be tested in porcine embryos was chosen based
on their efficacy in generating premature stop codons in porcine fibroblasts. Edited porcine embryos were generated as described below. Briefly, oocytes recovered from slaughterhouse ovaries were in vitro fertilized. The sgRNP solution was injected into the cytoplasm of presumptive zygotes at 16-17 hours post-fertilization by using a single pulse from a FEMTOJET® 4i microinjector (EPPENDORF®; Hamburg, Germany).
Microinjection was performed in TL-Hepes (ABT360, LLC) supplemented with 3 mg/ml BSA (Proliant) on the heated stage of an inverted microscope equipped with Narishige micromanipulators (Narishige International USA, Amityville, NY). Following injections, presumptive zygotes were cultured for 7 days in PZM5 (Cosmo Bio, Co LTD, Tokyo, Japan) in an incubator environment of 5% CO2, 5% O2, 90% N2. Mutation frequency of blastocysts was determined by ILLUMINA® sequencing as described for fetal fibroblasts above. The frequencies of the desired end-to-end NHEJ repairs resulting in premature stop codons in ANPEP are shown in Table 3.
The guides encoded by Guide 1 and Guide 2 each create a double strand break in conjunction with Cas9, and then the blunt ends are joined through NHEJ to create a premature stop codon at the ligated breakpoint. (ND = not done)
This example demonstrates that the porcine ANPEP gene nucleotide sequences can be edited through the stimulation of double-strand breaks mediated by transfecting Cas9 protein with single or paired guide RNAs to effect a de novo in-frame stop codon.
Example 2
This example illustrates generation of pigs having a premature stop codon in ANPEP.
Porcine oocytes will be isolated, fertilized, and then the resulting zygotes will be edited as described in Example 1 to generate gene edited pigs.
ANPEP guide RNA pairs selected from Table 3 will be used to form RNP complexes as described in The present teachings including descriptions provided in the Examples that are not intended to limit the scope of any claim or embodiment. The following non-limiting examples are provided to further illustrate the present teachings. Those of skill in the art, in light of the present disclosure, will appreciate that many changes can be made in the specific embodiments that are disclosed and still obtain a like or similar result without departing from the spirit and scope of the present teachings.
Example 1 and will be microinjected into the cytoplasm of in vivo or in vitro fertilized PIC™ porcine one-cell zygotes or Mil oocytes. These zygotes will then be incubated to generate edited multicellular embryos and transferred to surrogate gilts via standard methods to birth gene edited pigs. To prepare embryo donors and surrogates, PIC™ pubertal gilts will be subjected to estrus synchronization by treatment with 0.22%
altrenogest solution (20-36 mg/animal) for 14 days. Follicular growth will be induced by the administration of PMSG 36 hours following the last dose of Matrix, and ovulation will be induced by the administration of hCG 82 hours after PMSG administration. To generate in vivo fertilized zygotes, females in standing heat will be artificially inseminated (Al) with PIC™ boar semen. In vivo derived zygotes will be recovered surgically 12-24 hours after Al by retrograde flushing the oviduct with sterile TL-HEPES medium supplemented with 0.3% BSA (w/v). Fertilized zygotes will be subjected to a single 2 - 50 picoliter (pl) cytoplasmic injection of Cas9 protein and guide RNA complex (25 - 50 ng/pl and 12.5 - 35 ng/pl) targeting ANPEP and cultured in PZM5 medium (Yoshioka, K., et al., Biol. Reprod., 2002, 60: 112-119; Suzuki, C., et al., Reprod. Fertil. Dev., 2006 18, 789-795; Yoshioka, K., J. Reprod. Dev. 2008, 54, 208-213). Injected zygotes will be surgically implanted into the oviducts of estrus synchronized, un-mated surrogate females by a midline laparotomy under general anesthesia (each surrogate will receive 20-60 injected embryos).
In vitro fertilized embryos for gene editing will be derived from non-fertilized PIC™ oocytes. Immature oocytes from estrus synchronized PIC™ gilts will be collected from medium size (3-6 mm) follicles. Oocytes with evenly dark cytoplasm and intact surrounding cumulus cells will be selected for maturation. Cumulus oocyte complexes will be placed in a well containing 500 pl of maturation medium, TCM-199 (Invitrogen) with 3.05 mM glucose, 0.91 mM sodium pyruvate, 0.57 mM cysteine, 10 ng/ml EGF, 0.5 pg/ml luteinizing hormone (LH), 0.5 pg/ml FSH, 10 ng/ml gentamicin (Sigma), and 10% follicular fluid for 42-44 h at 38.5° C and 5% CO2, in humidified air. At the end of the maturation, the surrounding cumulus cells will be removed from the oocytes by vortexing for 3 min in the presence of 0.1% hyaluronidase. Then, in vitro matured oocytes will be placed in 100 pl droplets of IVF medium (modified Tris-buffered medium containing 113.1 mM NaCl, 3 mM KC1, 7.5 mM CaCh, 11 mM glucose, 20 mM Tris, 2 mM caffeine, 5 mM sodium pyruvate, and 2 mg/ml bovine serum albumin (BSA)) in groups of 25-30 oocytes and will be fertilized according to established protocol (Abeydeera, L.R. and Day, B.N., Biol. Reprod., 1997, 57:729-734) using fresh extended boar semen. One ml of extended semen will be mixed with Dulbecco’s Phosphate Buffered Saline (DPBS) containing 1 mg/ml BSA to a final volume of 10 ml and centrifuged at 1000 x g, 25° C for 4 minutes, and spermatozoa will be washed in DPBS three times. After the final wash,
spermatozoa will be re-suspended in mTBM medium and added to oocytes at a final concentration of 1 x 105 spermatozoa/ml, and co-incubated for 4-5 h at 38.5° C and 5% CO2. Presumptive zygotes will be microinjected 5 hours post IVF and transferred to a surrogate female after 18-42 hours (1-4 cell stage). Each surrogate receives 20-60 injected embryos. Pregnancies will be confirmed by lack of return to estrus (21 days) and ultrasound at 28 days post embryo transfer.
Example 3
This example illustrates molecular characterization of pigs by sequencing.
A tissue sample will be taken from an animal with a genome that was edited according to the examples herein. Tail, ear notch, or blood samples are suitable tissue types. The tissue sample will be frozen at -20°C within 1 hour of sampling to preserve integrity of the DNA in the tissue sample.
DNA will be extracted from tissue samples after proteinase K digestion in lysis buffer. Characterization will be performed on two different sequence platforms, short sequence reads using the ILLUMINA® platform (ILLUMINA®, Inc., San Diego, CA) and long sequence reads on an Oxford NANOPORE™ platform (Oxford NANOPORE™ Technologies, Oxford, UK).
For short sequence reads, two-step PCR will be used to amplify and sequence the region of interest. The first step is a locus-specific PCR which amplifies the locus of interest from the DNA sample using a combined locus-specific primer with a vendorspecific primer. The second step attaches the sequencing index and adaptor sequences to the amplicon from the first step so that sequencing can occur.
The locus-specific primers for the first step PCR are chosen so that they amplify a region <300bp such that ILLUMINA® paired-end sequencing reads could span the amplified fragment. Multiple amplicons are preferred to provide redundancy should deletions or naturally occurring point mutations prevent primers from correctly binding. Sequence data for the amplicon will be generated using an ILLUMINA® sequencing platform (MISEQ®, ILLUMINA®, Inc., San Diego, CA). Sequence reads are analyzed to characterize the outcome of the editing process.
For long sequence reads, two-step PCR will be used to amplify and sequence the region of interest. The first step is a locus-specific PCR which amplifies the locus of interest from the DNA sample using a combined locus-specific primer with a vendor-
specific adapter. The second step PCR attaches the sequencing index to the amplicon from the first-step PCR so that the DNA is ready for preparing a sequencing library. The step 2 PCR products undergo a set of chemical reactions from a vendor kit to polish the ends of the DNA and ligate on the adapter containing the motor protein to allow access to the pores for DNA strand-based sequencing.
The locus specific primers for the first step PCR range will be designed to amplify different regions of the ANPEP gene and will amplify regions different in length. Normalized DNA is then mixed with vendor supplied loading buffer and is loaded onto the NANOPORE™ flowcell.
Long sequence reads, while having lower per base accuracy than short reads, are very useful for observing the long-range context of the sequence around the target site.
Example 4
This example illustrates viral challenge of ANPEP edited pigs.
PIC™ pigs will be edited with guides as described in Example 2. Edits will be confirmed as described in Example 3. Edited pigs will then be crossbred to create pigs that are homozygous for the edit. PEDV viral challenge of homozygous edited pigs will be performed as described in Whitworth, K.M., et al., Transgenic Research, 28, 21-32. Briefly, homozygous edited pigs will be inoculated with PEDV. Fecal swabs will be collected at intervals beginning prior to inoculation. Realtime PCR to determine the presence of virus in the feces will be performed using standard methods (see, e.g. Whitworth, K.M., et al., Transgenic research, 28, 21-32). It is expected that the viral amounts for edited pigs will be significantly lower than control pigs without the edit.
Example 5
This example illustrates an assay for real time PCR verification of desired edits. There will be two assays created for detection of ANPEP edits using real time PCR (rtPCR). For the first assay, two sets of primers and two probes will be designed. One set of primers will flank the spacer sequence of the first guide RNA. A probe, labeled with a fluorescent moiety, will be designed to anneal to the unedited version of the spacer sequence. The other set of primers will be designed to flank the desired edit sequence. A probe, labeled with a different fluorescent moiety, will be designed to anneal to nucleotides spanning the joining region of the edit. For validation, quantitative real time PCR will be performed using a commercial kit using DNA isolated from pigs of confirmed status from
sequencing, and fluorescence will be charted. Validation will occur if, as expected, the homozygotes are close to the y axis (representing the fluorescent moiety for the probe annealing to the desired edit), the heterozygotes group near the center of the chart, and the wild type pigs group close to the X axis (representing the fluorescent moiety for the probe annealing to the spacer sequence).
For the second assay, two sets of primers and two probes will also be designed. One set of primers will flank the second spacer sequence. A probe, labeled with a fluorescent moiety, will be designed to anneal to the unedited version of the spacer sequence. The other set of primers will be designed to flank the desired edit sequence — these may be the same primers and probe used for the first assay. A probe, labeled with a different fluorescent moiety, will be designed to anneal to nucleotides spanning the joining region of the edit. For validation, real time PCR will be performed using a commercial kit using DNA isolated from pigs of confirmed status from sequencing, and fluorescence will be charted. Validation will occur if, as expected, the homozygotes are close to the y axis (representing the fluorescent moiety for the probe annealing to the desired edit), the heterozygotes group near the center of the chart, and the wild type pigs group close to the X axis (representing the fluorescent moiety for the probe annealing to the spacer sequence).
All references, including publications, patents, and patent applications, cited herein are hereby incorporated by reference to the extent they are not inconsistent with the explicit details of this disclosure, and are so incorporated to the same extent as if each reference were individually and specifically indicated to be incorporated by reference and were set forth in its entirety herein. Examples disclosed herein are provided by way of exemplification and are not intended to limit the scope of the present teachings.
Claims
1. A livestock animal or progeny thereof with a genome comprising an edited ANPEP gene comprising a premature stop codon.
2. The livestock animal according to claim 1, wherein an ANPEP protein encoded by the edited ANPEP gene shows reduced binding to a coronavirus relative to an ANPEP protein encoded by an unedited ANPEP gene.
3. The livestock animal according to claim 1, wherein the premature stop codon is in exon 2, 3, 4, 16, or 18.
4. The livestock animal according to claim 1, wherein the livestock animal is resistant to a coronavirus.
5. The livestock animal according to claim 1, wherein the livestock animal is a pig.
6. The pig according to claim 5, wherein the pig is resistant to transmissible gastroenteritis virus (TGEV), porcine epidemic diarrhea virus (PEDV), or Porcine respiratory coronavirus (PRCV).
7. The pig according to claim 5, wherein the edited ANPEP gene comprises SEQ ID NO: 252, 253, 254, 255, 256, or 257.
8. A cell isolated from the livestock animal according to claim 1.
9. An isolated cell line obtained from the livestock animal of claim 1.
10. An isolated fibroblast line obtained from the livestock animal of claim 1.
11. A method of producing a gene edited livestock animal comprising: introducing into an isolated cell: a Cas9 protein or a nucleic acid encoding a Cas9 protein; and
a pair of gRNAs that each create a double stranded break in an ANPEP gene such that a premature stop codon is formed when the double stranded breaks are repaired.
12. The method of claim 11, further comprising producing an animal from the isolated cell.
13. The method according to claim 11, wherein the pair of gRNAs is selected from the group consisting of SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 102 and 10, SEQ ID NOs: 58 and 52, SEQ ID NOs: 58 and 26, SEQ ID NOs: 58 and 10, SEQ ID NOs: 32 and 26, SEQ ID NOs: 32 and 10, SEQ ID NOs: 120 and 111, SEQ ID NOs: 119 and 113, SEQ ID NOs: 115 and 111, SEQ ID NOs 137 and 130, SEQ ID NOs: 133 and 130, SEQ ID NOs: 137 and 128, SEQ ID NOs 133 and 128, SEQ ID NOs: 189 and 183, SEQ ID NOs: 231 and 208, SEQ ID NOs: 228 and 213, SEQ ID NOs: 216 and 213, and SEQ ID NOs: 216 and 208.
14. The method according to claim 11, wherein pair of gRNAs is selected from the group consisting of SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 58 and 52, SEQ ID Nos: 120 and 111, SEQ ID NOs: 231 and 208, and SEQ ID NOs: 228 and 213.
15. The method according to claim 11, wherein the edited ANPEP gene comprises SEQ ID NO: 252, 253, 254, 255, 256, or 257.
16. The method according to claim 11, wherein the isolated cell is an oocyte, a zygote, or a fibroblast cell.
17. The method according to claim 12, wherein the isolated cell is an oocyte.
18. The method according to claim 12, wherein the producing an animal comprises fertilizing the oocyte.
19. The method according to claim 18, wherein the producing an animal further comprises implanting the fertilized oocyte into a surrogate mother.
20. The method according to claim 11, wherein the livestock animal is resistant to a coronavirus.
21. The method according to claim 11, wherein the livestock animal is a pig.
22. At least one gRNA having a sequence selected from the group consisting of SEQ ID NOs: 10, 26, 32, 52, 58, 102, 111, 113, 115, 119, 120, 128, 130, 133, 137, 183, 189, 208, 213, 216, 228, and 231.
23. The at least one gRNA according to claim 22, wherein the at least one gRNA is a pair of gRNAs having sequences selected from the group consisting of SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 102 and 10, SEQ ID NOs: 58 and 52, SEQ ID NOs: 58 and 26, SEQ ID NOs: 58 and 10, SEQ ID NOs: 32 and 26, SEQ ID NOs: 32 and 10, SEQ ID NOs: 120 and 111, SEQ ID NOs: 119 and 113, SEQ ID NOs: 115 and 111, SEQ ID NOs 137 and 130, SEQ ID NOs: 133 and 130, SEQ ID NOs: 137 and 128, SEQ ID NOs: 133 and 128, SEQ ID NOs: 189 and 183, SEQ ID NOs: 231 and 208, SEQ ID NOs: 228 and 213, SEQ ID NOs: 216 and 213, and SEQ ID NOs: 216 and 208.
24. The at least one gRNA according to claim 22, wherein the at least one gRNA is a pair of gRNAs selected from the group consisting of SEQ ID NOs: 102 and 52, SEQ ID NOs: 102 and 26, SEQ ID NOs: 58 and 52, SEQ ID Nos: 120 and 111, SEQ ID NOs: 231 and 208, and SEQ ID NOs: 228 and 213.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202263368603P | 2022-07-15 | 2022-07-15 | |
US63/368,603 | 2022-07-15 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2024013514A2 true WO2024013514A2 (en) | 2024-01-18 |
WO2024013514A3 WO2024013514A3 (en) | 2024-02-22 |
Family
ID=87514184
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/GB2023/051854 WO2024013514A2 (en) | 2022-07-15 | 2023-07-14 | Gene edited livestock animals having coronavirus resistance |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024013514A2 (en) |
Citations (58)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US789538A (en) | 1904-11-11 | 1905-05-09 | Colin E Ham | Dumb-bell. |
US4873191A (en) | 1981-06-12 | 1989-10-10 | Ohio University | Genetic transformation of zygotes |
US5356802A (en) | 1992-04-03 | 1994-10-18 | The Johns Hopkins University | Functional domains in flavobacterium okeanokoites (FokI) restriction endonuclease |
WO1995019431A1 (en) | 1994-01-18 | 1995-07-20 | The Scripps Research Institute | Zinc finger protein derivatives and methods therefor |
US5436150A (en) | 1992-04-03 | 1995-07-25 | The Johns Hopkins University | Functional domains in flavobacterium okeanokoities (foki) restriction endonuclease |
US5487994A (en) | 1992-04-03 | 1996-01-30 | The Johns Hopkins University | Insertion and deletion mutants of FokI restriction endonuclease |
WO1996006166A1 (en) | 1994-08-20 | 1996-02-29 | Medical Research Council | Improvements in or relating to binding proteins for recognition of dna |
US5789538A (en) | 1995-02-03 | 1998-08-04 | Massachusetts Institute Of Technology | Zinc finger proteins with high affinity new DNA binding specificities |
WO1998037186A1 (en) | 1997-02-18 | 1998-08-27 | Actinova Limited | In vitro peptide or protein expression library |
WO1998053058A1 (en) | 1997-05-23 | 1998-11-26 | Gendaq Limited | Nucleic acid binding proteins |
WO1998053059A1 (en) | 1997-05-23 | 1998-11-26 | Medical Research Council | Nucleic acid binding proteins |
WO1998054311A1 (en) | 1997-05-27 | 1998-12-03 | The Scripps Research Institute | Zinc finger protein derivatives and methods therefor |
US5925523A (en) | 1996-08-23 | 1999-07-20 | President & Fellows Of Harvard College | Intraction trap assay, reagents and uses thereof |
GB2338237A (en) | 1997-02-18 | 1999-12-15 | Actinova Ltd | In vitro peptide or protein expression library |
WO2000027878A1 (en) | 1998-11-09 | 2000-05-18 | Gendaq Limited | Screening system for zinc finger polypeptides for a desired binding ability |
US6140081A (en) | 1998-10-16 | 2000-10-31 | The Scripps Research Institute | Zinc finger binding domains for GNN |
US6140815A (en) | 1998-06-17 | 2000-10-31 | Dover Instrument Corporation | High stability spin stand platform |
US6242568B1 (en) | 1994-01-18 | 2001-06-05 | The Scripps Research Institute | Zinc finger protein derivatives and methods therefor |
US6271341B1 (en) | 1997-07-03 | 2001-08-07 | Basf Aktiengesellschaft | Transcriptional activators with graded transactivation potential |
US20010016315A1 (en) | 1996-07-22 | 2001-08-23 | Robert Renaville | Pit-1 gene polymorphism and trait selection in animals |
WO2001060970A2 (en) | 2000-02-18 | 2001-08-23 | Toolgen, Inc. | Zinc finger domains and methods of identifying same |
WO2001088197A2 (en) | 2000-05-16 | 2001-11-22 | Massachusetts Institute Of Technology | Methods and compositions for interaction trap assays |
WO2002016536A1 (en) | 2000-08-23 | 2002-02-28 | Kao Corporation | Bactericidal antifouling detergent for hard surface |
US6410248B1 (en) | 1998-01-30 | 2002-06-25 | Massachusetts Institute Of Technology | General strategy for selecting high-affinity zinc finger proteins for diverse DNA target sites |
US6453242B1 (en) | 1999-01-12 | 2002-09-17 | Sangamo Biosciences, Inc. | Selection of sites for targeting by zinc finger proteins and methods of designing zinc finger proteins to bind to preselected sites |
WO2002077227A2 (en) | 2000-11-20 | 2002-10-03 | Sangamo Biosciences, Inc. | Iterative optimization in the design of binding proteins |
US6479626B1 (en) | 1998-03-02 | 2002-11-12 | Massachusetts Institute Of Technology | Poly zinc finger proteins with improved linkers |
WO2002099084A2 (en) | 2001-04-04 | 2002-12-12 | Gendaq Limited | Composite binding polypeptides |
US6503717B2 (en) | 1999-12-06 | 2003-01-07 | Sangamo Biosciences, Inc. | Methods of using randomized libraries of zinc finger proteins for the identification of gene function |
WO2003016496A2 (en) | 2001-08-20 | 2003-02-27 | The Scripps Research Institute | Zinc finger binding domains for cnn |
US6534261B1 (en) | 1999-01-12 | 2003-03-18 | Sangamo Biosciences, Inc. | Regulation of endogenous gene expression in cells using zinc finger proteins |
US6548741B2 (en) | 1998-10-12 | 2003-04-15 | Geron Corporation | Developmental competence for assisted reproduction and nuclear transfer in pigs |
US6599692B1 (en) | 1999-09-14 | 2003-07-29 | Sangamo Bioscience, Inc. | Functional genomics using zinc finger proteins |
US6689558B2 (en) | 2000-02-08 | 2004-02-10 | Sangamo Biosciences, Inc. | Cells for drug discovery |
US20050064474A1 (en) | 2003-08-08 | 2005-03-24 | Sangamo Biosciences, Inc. | Methods and compositions for targeted cleavage and recombination |
US20050153317A1 (en) | 2003-10-24 | 2005-07-14 | Metamorphix, Inc. | Methods and systems for inferring traits to breed and manage non-beef livestock |
US20050267061A1 (en) | 2004-04-08 | 2005-12-01 | Sangamo Biosciences, Inc. | Methods and compositions for treating neuropathic and neurodegenerative conditions |
US7030215B2 (en) | 1999-03-24 | 2006-04-18 | Sangamo Biosciences, Inc. | Position dependent recognition of GNN nucleotide triplets by zinc fingers |
US7067317B2 (en) | 2000-12-07 | 2006-06-27 | Sangamo Biosciences, Inc. | Regulation of angiogenesis with zinc finger proteins |
US7070934B2 (en) | 1999-01-12 | 2006-07-04 | Sangamo Biosciences, Inc. | Ligand-controlled regulation of endogenous gene expression |
US20060188987A1 (en) | 2003-08-08 | 2006-08-24 | Dmitry Guschin | Targeted deletion of cellular DNA sequences |
US20070134796A1 (en) | 2005-07-26 | 2007-06-14 | Sangamo Biosciences, Inc. | Targeted integration and expression of exogenous nucleic acid sequences |
US7253273B2 (en) | 2004-04-08 | 2007-08-07 | Sangamo Biosciences, Inc. | Treatment of neuropathic pain with zinc finger proteins |
US7262054B2 (en) | 2002-01-22 | 2007-08-28 | Sangamo Biosciences, Inc. | Zinc finger proteins for DNA binding and gene regulation in plants |
US20070218528A1 (en) | 2004-02-05 | 2007-09-20 | Sangamo Biosciences, Inc. | Methods and compositions for targeted cleavage and recombination |
US7309576B2 (en) | 2002-04-12 | 2007-12-18 | O'dowd Brian F | Method of identifying transmembrane protein-interacting compounds |
US7361635B2 (en) | 2002-08-29 | 2008-04-22 | Sangamo Biosciences, Inc. | Simultaneous modulation of multiple genes |
US20080131962A1 (en) | 2006-05-25 | 2008-06-05 | Sangamo Biosciences, Inc. | Engineered cleavage half-domains |
US20080182332A1 (en) | 2006-12-14 | 2008-07-31 | Cai Qihua C | Optimized non-canonical zinc finger proteins |
US20090068164A1 (en) | 2005-05-05 | 2009-03-12 | The Ariz Bd Of Regents On Behald Of The Univ Of Az | Sequence enabled reassembly (seer) - a novel method for visualizing specific dna sequences |
US20100047805A1 (en) | 2008-08-22 | 2010-02-25 | Sangamo Biosciences, Inc. | Methods and compositions for targeted single-stranded cleavage and targeted integration |
US7709206B2 (en) | 2002-12-31 | 2010-05-04 | Metamorphix, Inc. | Compositions, methods and systems for inferring bovine breed or trait |
US20100257638A1 (en) | 2006-08-11 | 2010-10-07 | Dow Agrosciences Llc | Zinc finger nuclease-mediated homologous recombination |
US20110023140A1 (en) | 2008-12-04 | 2011-01-27 | Sigma-Aldrich Co. | Rabbit genome editing with zinc finger nucleases |
US20110207221A1 (en) | 2010-02-09 | 2011-08-25 | Sangamo Biosciences, Inc. | Targeted genomic modification with partially single-stranded donor molecules |
US20110281361A1 (en) | 2005-07-26 | 2011-11-17 | Sangamo Biosciences, Inc. | Linear donor constructs for targeted integration |
US20110301073A1 (en) | 2010-05-17 | 2011-12-08 | Sangamo Biosciences, Inc. | Novel DNA-binding proteins and uses thereof |
US20140068797A1 (en) | 2012-05-25 | 2014-03-06 | University Of Vienna | Methods and compositions for rna-directed target dna modification and for rna-directed modulation of transcription |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107893088A (en) * | 2017-11-16 | 2018-04-10 | 中国农业科学院北京畜牧兽医研究所 | A kind of method of the pig fibroblast for preparing CD13 gene knockouts and gene editing pig |
CN112313330A (en) * | 2018-04-27 | 2021-02-02 | 密苏里大学管理者 | Antipathogenic animals with modified Aminopeptidase N (ANPEP) genes |
KR101961667B1 (en) * | 2018-09-12 | 2019-03-25 | 주식회사 엠젠플러스 | Transgenic cloned pig resistant to the Porcine epidemic diarrhea virus and producing method thereof |
CN113512534B (en) * | 2020-09-23 | 2024-04-23 | 杭州启函生物科技有限公司 | Compositions and methods for genetic modification and targeting |
CN113957069B (en) * | 2021-08-26 | 2022-08-26 | 中国农业科学院北京畜牧兽医研究所 | Composition for simultaneous modification of amino acids at 736 th site and 738 th site of pAPN gene and application thereof |
-
2023
- 2023-07-14 WO PCT/GB2023/051854 patent/WO2024013514A2/en unknown
Patent Citations (67)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US789538A (en) | 1904-11-11 | 1905-05-09 | Colin E Ham | Dumb-bell. |
US4873191A (en) | 1981-06-12 | 1989-10-10 | Ohio University | Genetic transformation of zygotes |
US5356802A (en) | 1992-04-03 | 1994-10-18 | The Johns Hopkins University | Functional domains in flavobacterium okeanokoites (FokI) restriction endonuclease |
US5436150A (en) | 1992-04-03 | 1995-07-25 | The Johns Hopkins University | Functional domains in flavobacterium okeanokoities (foki) restriction endonuclease |
US5487994A (en) | 1992-04-03 | 1996-01-30 | The Johns Hopkins University | Insertion and deletion mutants of FokI restriction endonuclease |
WO1995019431A1 (en) | 1994-01-18 | 1995-07-20 | The Scripps Research Institute | Zinc finger protein derivatives and methods therefor |
US6140466A (en) | 1994-01-18 | 2000-10-31 | The Scripps Research Institute | Zinc finger protein derivatives and methods therefor |
US6242568B1 (en) | 1994-01-18 | 2001-06-05 | The Scripps Research Institute | Zinc finger protein derivatives and methods therefor |
WO1996006166A1 (en) | 1994-08-20 | 1996-02-29 | Medical Research Council | Improvements in or relating to binding proteins for recognition of dna |
US6007988A (en) | 1994-08-20 | 1999-12-28 | Medical Research Council | Binding proteins for recognition of DNA |
US6013453A (en) | 1994-08-20 | 2000-01-11 | Medical Research Council | Binding proteins for recognition of DNA |
US5789538A (en) | 1995-02-03 | 1998-08-04 | Massachusetts Institute Of Technology | Zinc finger proteins with high affinity new DNA binding specificities |
US20010016315A1 (en) | 1996-07-22 | 2001-08-23 | Robert Renaville | Pit-1 gene polymorphism and trait selection in animals |
US5925523A (en) | 1996-08-23 | 1999-07-20 | President & Fellows Of Harvard College | Intraction trap assay, reagents and uses thereof |
US6200759B1 (en) | 1996-08-23 | 2001-03-13 | President And Fellows Of Harvard College | Interaction trap assay, reagents and uses thereof |
GB2338237A (en) | 1997-02-18 | 1999-12-15 | Actinova Ltd | In vitro peptide or protein expression library |
WO1998037186A1 (en) | 1997-02-18 | 1998-08-27 | Actinova Limited | In vitro peptide or protein expression library |
WO1998053060A1 (en) | 1997-05-23 | 1998-11-26 | Gendaq Limited | Nucleic acid binding proteins |
WO1998053057A1 (en) | 1997-05-23 | 1998-11-26 | Gendaq Limited | Nucleic acid binding polypeptide library |
WO1998053058A1 (en) | 1997-05-23 | 1998-11-26 | Gendaq Limited | Nucleic acid binding proteins |
WO1998053059A1 (en) | 1997-05-23 | 1998-11-26 | Medical Research Council | Nucleic acid binding proteins |
WO1998054311A1 (en) | 1997-05-27 | 1998-12-03 | The Scripps Research Institute | Zinc finger protein derivatives and methods therefor |
US6271341B1 (en) | 1997-07-03 | 2001-08-07 | Basf Aktiengesellschaft | Transcriptional activators with graded transactivation potential |
US6410248B1 (en) | 1998-01-30 | 2002-06-25 | Massachusetts Institute Of Technology | General strategy for selecting high-affinity zinc finger proteins for diverse DNA target sites |
US7153949B2 (en) | 1998-03-02 | 2006-12-26 | Massachusetts Institute Of Technology | Nucleic acid encoding poly-zinc finger proteins with improved linkers |
US6479626B1 (en) | 1998-03-02 | 2002-11-12 | Massachusetts Institute Of Technology | Poly zinc finger proteins with improved linkers |
US6903185B2 (en) | 1998-03-02 | 2005-06-07 | Massachusetts Institute Of Technology | Poly zinc finger proteins with improved linkers |
US6140815A (en) | 1998-06-17 | 2000-10-31 | Dover Instrument Corporation | High stability spin stand platform |
US6548741B2 (en) | 1998-10-12 | 2003-04-15 | Geron Corporation | Developmental competence for assisted reproduction and nuclear transfer in pigs |
US6140081A (en) | 1998-10-16 | 2000-10-31 | The Scripps Research Institute | Zinc finger binding domains for GNN |
WO2000027878A1 (en) | 1998-11-09 | 2000-05-18 | Gendaq Limited | Screening system for zinc finger polypeptides for a desired binding ability |
US7070934B2 (en) | 1999-01-12 | 2006-07-04 | Sangamo Biosciences, Inc. | Ligand-controlled regulation of endogenous gene expression |
US6534261B1 (en) | 1999-01-12 | 2003-03-18 | Sangamo Biosciences, Inc. | Regulation of endogenous gene expression in cells using zinc finger proteins |
US6453242B1 (en) | 1999-01-12 | 2002-09-17 | Sangamo Biosciences, Inc. | Selection of sites for targeting by zinc finger proteins and methods of designing zinc finger proteins to bind to preselected sites |
US7030215B2 (en) | 1999-03-24 | 2006-04-18 | Sangamo Biosciences, Inc. | Position dependent recognition of GNN nucleotide triplets by zinc fingers |
US6599692B1 (en) | 1999-09-14 | 2003-07-29 | Sangamo Bioscience, Inc. | Functional genomics using zinc finger proteins |
US6503717B2 (en) | 1999-12-06 | 2003-01-07 | Sangamo Biosciences, Inc. | Methods of using randomized libraries of zinc finger proteins for the identification of gene function |
US6689558B2 (en) | 2000-02-08 | 2004-02-10 | Sangamo Biosciences, Inc. | Cells for drug discovery |
WO2001060970A2 (en) | 2000-02-18 | 2001-08-23 | Toolgen, Inc. | Zinc finger domains and methods of identifying same |
WO2001088197A2 (en) | 2000-05-16 | 2001-11-22 | Massachusetts Institute Of Technology | Methods and compositions for interaction trap assays |
WO2002016536A1 (en) | 2000-08-23 | 2002-02-28 | Kao Corporation | Bactericidal antifouling detergent for hard surface |
US6794136B1 (en) | 2000-11-20 | 2004-09-21 | Sangamo Biosciences, Inc. | Iterative optimization in the design of binding proteins |
WO2002077227A2 (en) | 2000-11-20 | 2002-10-03 | Sangamo Biosciences, Inc. | Iterative optimization in the design of binding proteins |
US7067317B2 (en) | 2000-12-07 | 2006-06-27 | Sangamo Biosciences, Inc. | Regulation of angiogenesis with zinc finger proteins |
WO2002099084A2 (en) | 2001-04-04 | 2002-12-12 | Gendaq Limited | Composite binding polypeptides |
WO2003016496A2 (en) | 2001-08-20 | 2003-02-27 | The Scripps Research Institute | Zinc finger binding domains for cnn |
US7262054B2 (en) | 2002-01-22 | 2007-08-28 | Sangamo Biosciences, Inc. | Zinc finger proteins for DNA binding and gene regulation in plants |
US7309576B2 (en) | 2002-04-12 | 2007-12-18 | O'dowd Brian F | Method of identifying transmembrane protein-interacting compounds |
US7361635B2 (en) | 2002-08-29 | 2008-04-22 | Sangamo Biosciences, Inc. | Simultaneous modulation of multiple genes |
US7709206B2 (en) | 2002-12-31 | 2010-05-04 | Metamorphix, Inc. | Compositions, methods and systems for inferring bovine breed or trait |
US20060188987A1 (en) | 2003-08-08 | 2006-08-24 | Dmitry Guschin | Targeted deletion of cellular DNA sequences |
US20050064474A1 (en) | 2003-08-08 | 2005-03-24 | Sangamo Biosciences, Inc. | Methods and compositions for targeted cleavage and recombination |
US20050153317A1 (en) | 2003-10-24 | 2005-07-14 | Metamorphix, Inc. | Methods and systems for inferring traits to breed and manage non-beef livestock |
US20070218528A1 (en) | 2004-02-05 | 2007-09-20 | Sangamo Biosciences, Inc. | Methods and compositions for targeted cleavage and recombination |
US20050267061A1 (en) | 2004-04-08 | 2005-12-01 | Sangamo Biosciences, Inc. | Methods and compositions for treating neuropathic and neurodegenerative conditions |
US7253273B2 (en) | 2004-04-08 | 2007-08-07 | Sangamo Biosciences, Inc. | Treatment of neuropathic pain with zinc finger proteins |
US20090068164A1 (en) | 2005-05-05 | 2009-03-12 | The Ariz Bd Of Regents On Behald Of The Univ Of Az | Sequence enabled reassembly (seer) - a novel method for visualizing specific dna sequences |
US20110281361A1 (en) | 2005-07-26 | 2011-11-17 | Sangamo Biosciences, Inc. | Linear donor constructs for targeted integration |
US20070134796A1 (en) | 2005-07-26 | 2007-06-14 | Sangamo Biosciences, Inc. | Targeted integration and expression of exogenous nucleic acid sequences |
US20080131962A1 (en) | 2006-05-25 | 2008-06-05 | Sangamo Biosciences, Inc. | Engineered cleavage half-domains |
US20100257638A1 (en) | 2006-08-11 | 2010-10-07 | Dow Agrosciences Llc | Zinc finger nuclease-mediated homologous recombination |
US20080182332A1 (en) | 2006-12-14 | 2008-07-31 | Cai Qihua C | Optimized non-canonical zinc finger proteins |
US20100047805A1 (en) | 2008-08-22 | 2010-02-25 | Sangamo Biosciences, Inc. | Methods and compositions for targeted single-stranded cleavage and targeted integration |
US20110023140A1 (en) | 2008-12-04 | 2011-01-27 | Sigma-Aldrich Co. | Rabbit genome editing with zinc finger nucleases |
US20110207221A1 (en) | 2010-02-09 | 2011-08-25 | Sangamo Biosciences, Inc. | Targeted genomic modification with partially single-stranded donor molecules |
US20110301073A1 (en) | 2010-05-17 | 2011-12-08 | Sangamo Biosciences, Inc. | Novel DNA-binding proteins and uses thereof |
US20140068797A1 (en) | 2012-05-25 | 2014-03-06 | University Of Vienna | Methods and compositions for rna-directed target dna modification and for rna-directed modulation of transcription |
Non-Patent Citations (90)
Title |
---|
"Current Protocols in Molecular Biology", 1995, GREENE PUBLISHING AND WILEY-INTERSCIENCE |
"GenBank", Database accession no. YP_008868573 |
ABEYDEERA, L.R.DAY, B.N., BIOL. REPROD., vol. 57, 1997, pages 729 - 734 |
ALTSCHUL, S.F. ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 - 410 |
ALTSCHUL, S.F. ET AL., NUCLEIC ACIDS RES., vol. 25, 1997, pages 3389 - 3402 |
BAIRN, S.B. ET AL., PROC. NATL. ACAD. SCI. USA, vol. 88, 1991, pages 5072 - 5076 |
BEERLI, R.R. ET AL., NATURE BIOTECHNOL., vol. 20, 2002, pages 135 - 141 |
BELFORT, M.ROBERTS, J., NUCLEIC ACIDS RES., vol. 25, 1997, pages 3379 - 3388 |
BITINAITE, J. ET AL., PROC. NATL. ACAD. SCI. USA, vol. 95, 1998, pages 10570 - 10575 |
BRENT, R.PTASHNE, M., CELL, vol. 43, 1985, pages 729 - 736 |
CHANG, X-B.WILSON, J.H., PROC. NATL. ACAD. SCI. USA, vol. 84, 1987, pages 4959 - 4963 |
CHOO, Y. ET AL., CURR. OPIN. STRUCT. BIOL., vol. 10, 2000, pages 411 - 416 |
CHYLINSKI, K. ET AL., RNA BIOLOGY, vol. 10, 2013, pages 726 - 737 |
CIBELLI, J.B. ET AL., SCIENCE, vol. 280, 1998, pages 1256 - 1258 |
CLAVERIE, J.M.STATES, D.J., COMPUT. CHEM., vol. 17, 1993, pages 191 - 201 |
COGONI, C. ET AL., EMBO J., vol. 15, 1996, pages 3153 - 3161 |
COGONI, C.MASINO, G., NATURE, vol. 399, 1999, pages 166 - 169 |
CORPET, F., NUCLEIC ACIDS RESEARCH, vol. 16, 1988, pages 10881 - 10890 |
DOYLE, L. P. ET AL., JOURNAL OF THE AMERICAN VETERINARY MEDICAL ASSOCIATION, vol. 108, 1946, pages 257 - 259 |
ELLISTON, J.F. ET AL., J. BIOL. CHEM., vol. 265, 1990, pages 11517 - 11521 |
GUILINGER, J.P. ET AL., NATURE BIOTECHNOLOGY, vol. 32, 2014, pages 577 - 582 |
HAFT, D.H. ET AL., PLOS COMPUT. BIOL., vol. 1, 2005, pages e60 |
HENIKOFF, S.HENIKOFF, J.G., PROC. NATL. ACAD. SCI. USA, vol. 89, 1989, pages 10915 - 10919 |
HENIKOFF, S.HENIKOFF, J.G., PROC. NATL. ACAD. SCI. USA, vol. 89, 1992, pages 10915 - 10919 |
HIGGINS, D.G.SHARP, P.M., CABIOS, vol. 5, 1989, pages 151 - 153 |
HIGGINS, D.G.SHARP, P.M., GENE, vol. 73, 1988, pages 237 - 244 |
HSU, P.D. ET AL., CELL, vol. 157, 2014, pages 1262 - 1278 |
HUANG, B. ET AL., J. PROTEIN CHEM., vol. 15, 1996, pages 481 - 489 |
HUANG, X. ET AL., COMPUTER APPLICATIONS IN THE BIOSCIENCES, vol. 8, 1992, pages 155 - 165 |
HYRUP, BNIELSEN, P.E., BIOORGAN. MED. CHEM., vol. 4, 1996, pages 5 - 23 |
ISALAN, M. ET AL., NATURE BIOTECHNOL., vol. 19, 2001, pages 656 - 660 |
JANSEN, R. ET AL., MOL. MICROBIOL., vol. 43, 2002, pages 1565 - 1575 |
JOHNSTON, M., MICROBIOL. REV., vol. 51, 1987, pages 458 - 476 |
JOHNSTON, M., NATURE, vol. 328, 1987, pages 353 - 355 |
KARLIN, SALTSCHUL, S.F., PROC. NAT'L. ACAD. SCI. USA, vol. 90, 1993, pages 5873 - 5877 |
KEEGAN, L ET AL., SCIENCE, vol. 231, 1986, pages 699 - 704 |
KENNERDELL, K.CARTHEW, C., CELL, vol. 95, 1998, pages 1017 - 1026 |
KILLOREN, K. E. ET AL.: "Porcine Respiratory Coronavirus", 2016, SWINE HEALTH INFORMATION CENTER |
KIM, Y-G. ET AL., BIOL. CHEM., vol. 379, 1998, pages 489 - 495 |
KIM, Y-G. ET AL., GENE, vol. 203, 1997, pages 43 - 49 |
KIM, Y-G. ET AL., J. BIOL. CHEM., vol. 269, 1994, pages 31978 - 31982 |
KIM, Y-G. ET AL., PROC NATL. ACAD. SCI. USA, vol. 91, 1994, pages 883 - 887 |
KIM, Y-G. ET AL., PROC. NATL. ACAD. SCI. USA, vol. 93, 1996, pages 1156 - 1160 |
KIM, Y-G. ET AL., PROC. NATL. ACAD. SCI. USA, vol. 94, 1997, pages 12875 - 12879 |
KIM,Y-G. ET AL., PROC. NATL. ACAD. SCI. USA, vol. 91, 1994, pages 8180 - 8184 |
KIWAKI, K. ET AL., HUM. GENE THER., vol. 7, 1996, pages 821 - 830 |
LABOW, M.A. ET AL., MOL. CELL. BIOL., vol. 10, 1990, pages 3343 - 3356 |
LAVITRANO, M. ET AL., PROC. NATL. ACAD. SCI. USA, vol. 99, 2002, pages 14230 - 14235 |
LAVITRANO, M. ET AL., REPROD. FERT. DEVELOP., vol. 18, 2006, pages 19 - 23 |
LI, L. ET AL., PROC. NATL. ACAD. SCI. USA, vol. 90, 1993, pages 2764 - 2768 |
LIN, C.-M. ET AL., JOURNAL OF VIROLOGY, vol. 89, 2015, pages 1954 - 1964 |
LO, W., MOL. CELL. BIOL., vol. 3, 1983, pages 1803 - 1814 |
MA, J.PTASHNE, M., CELL, vol. 48, 1987, pages 847 - 853 |
MADSON, D. M. ET AL., VETERINARY PATHOLOGY, vol. 53, 2016, pages 44 - 52 |
MAKAROVA, K. S. ET AL., BIOL. DIRECT, vol. 1, 2006, pages 7 |
MAKAROVA, K.,S. ET AL., NUCLEIC ACIDS RES., vol. 30, 2002, pages 482 - 496 |
MANI, M ET AL., BIOCHEM. BIOPHYS. RES. COMMUN., vol. 334, 2005, pages 1191 - 1197 |
MARMORSTEIN, M. ET AL., NATURE, vol. 356, 1992, pages 408 - 414 |
MCINTYRE, G.J.FANNING, G.C., BMC BIOTECHNOLOGY, vol. 6, 2006, pages 1 - 8 |
MEYERS. E.W.MILLER, W., COMPUTER APPLIC. BIOL. SCI., vol. 4, 1988, pages 11 - 17 |
MISQUITTA, L.PATERSON, B.M., PROC. NATL. ACAD. SCI. USA, vol. 96, 1999, pages 1451 - 1456 |
NAHON, E.RAVEH, D., NUCLEIC ACIDS RES., vol. 26, 1998, pages 1233 - 1239 |
NATIONAL CENTER FOR BIOTECHNOLOGY INFORMATION: "THE NCBI HANDBOOK", 2002, article "The BLAST Sequence Analysis Tool" |
NEEDLEMAN, S.B.WUNSCH, C.D., J. MOL. BIOL., vol. 48, 1970, pages 443 - 453 |
NEHLS, M. ET AL., SCIENCE, vol. 272, 1996, pages 886 - 889 |
PABO, C.O. ET AL., ANN. REV. BIOCHEM., vol. 70, 2001, pages 313 - 340 |
PEARSON, W.R, METHODS IN MOLECULAR BIOLOGY, vol. 24, 1994, pages 307 - 331 |
PEARSON, W.R.LIPMAN, D.J., PROC. NATL. ACAD. SCI., vol. 85, 1988, pages 2444 - 2448 |
ROBERTS, R.J. ET AL., NUCLEIC ACIDS RES., vol. 31, 2003, pages 418 - 420 |
ROMANO, N.MASINO, G., MOL. MICROBIOL., vol. 6, 1992, pages 3343 - 3353 |
SADOWSKI, I. ET AL., NATURE, vol. 335, 1988, pages 563 - 564 |
SAIF, L. J. ET AL.: "Diseases of Swine", 2012, WILEY-BLACKWELL, article "Coronaviruses" |
SEGAL, D.J. ET AL., CURR. OPIN. BIOTECHNOL., vol. 12, 2001, pages 632 - 637 |
SMITH, J. ET AL., NUCLEIC ACIDS RES., vol. 27, 1999, pages 674 - 681 |
SMITH, J. ET AL., NUCLEIC ACIDS RES., vol. 28, 2000, pages 3361 - 3369 |
SMITH, T.F.WATERMAN, M.S., ADV. APPL. MATH., vol. 2, 1981, pages 482 - 489 |
STEVENSON, G. W. ET AL., JOURNAL OF VETERINARY DIAGNOSTIC INVESTIGATION, vol. 25, 2013, pages 649 - 654 |
SUMMERTON, J.WELLER, D., ANTISENSE NUCLEIC ACID DRUG DEV., vol. 7, 1997, pages 187 - 195 |
SUZUKI, C. ET AL., REPROD. FERTIL. DEV., vol. 18, 2006, pages 789 - 795 |
THOMPSON, S. ET AL., CELL, vol. 56, 1989, pages 313 - 321 |
UMOV, F.D. ET AL., NATURE, vol. 435, 2005, pages 646 - 651 |
VAN DER PUTTEN H. ET AL., PROC. NATL. ACAD. SCI. USA, vol. 82, 1985, pages 6148 - 1652 |
VAN REETH, K. ET AL., VETERINARY MICROBIOLOGY, vol. 48, 1996, pages 325 - 335 |
VISSCHER, P.M. ET AL., LIVESTOCK PRODUCTION SCIENCE, vol. 40, 1994, pages 123 - 137 |
WAKAYAMA, T. ET AL., NATURE, vol. 391, 1998, pages 806 - 811 |
WHITWORTH, K.M. ET AL., TRANSGENIC RESEARCH, vol. 28, pages 21 - 32 |
WILMUT, I ET AL., NATURE, vol. 385, 1997, pages 810 - 813 |
XU, L. ET AL., HUM. GENE THER., vol. 12, 2001, pages 563 - 573 |
YOSHIOKA, K. ET AL., BIOL. REPROD., vol. 60, 2002, pages 112 - 119 |
YOSHIOKA, K., J. REPROD. DEV., vol. 54, 2008, pages 208 - 213 |
Also Published As
Publication number | Publication date |
---|---|
WO2024013514A3 (en) | 2024-02-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Kalds et al. | Sheep and goat genome engineering: from random transgenesis to the CRISPR era | |
US20240122164A1 (en) | Pathogen-resistant animals having modified cd163 genes | |
AU2020267283B2 (en) | Nanos knock-out that ablates germline cells | |
AU2015404563B2 (en) | Pathogen-resistant animals having modified CD163 genes | |
WO2015153647A1 (en) | Method of preventing or reducing virus transmission in animals | |
WO2024013514A2 (en) | Gene edited livestock animals having coronavirus resistance | |
US11732273B2 (en) | Methods and compositions for in situ germline genome engineering | |
WO2023105244A1 (en) | Editing tmprss2/4 for disease resistance in livestock | |
EP3954208A1 (en) | Hemophilia b rat model | |
US11793179B2 (en) | Livestock animals with improved growth performance | |
WO2022101641A1 (en) | Influenza a-resistant animals having edited anp32 genes | |
WO2022164993A1 (en) | Hyperprolactinemia or lactation without pregnancy | |
BR122020004913B1 (en) | METHOD OF REPRODUCTION OF FARM ANIMALS, METHOD FOR GENERATING A FARM ANIMAL WITHOUT ANY FUNCTIONAL SPERMATOGONIC CELLS, METHOD FOR PRODUCING A RECIPIENT MALE AND METHOD FOR PRODUCING FARM ANIMALS |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23748108 Country of ref document: EP Kind code of ref document: A2 |