Gene Symbol
HEG1
Entrez Gene ID
57493
Full Name
heart development protein with EGF like domains 1
Synonyms
HEG,MST112,MSTP112
General protein information
Preferred Names: heart development protein with EGF like domains 1
Names: protein HEG homolog 1; HEG homolog 1; heart of glass
Gene Type
protein-coding
Organism
Homo sapiens (human)
Genome
Chromosome: 3
Map Location: 3q21.2
Disorder MIM:
614182
mRNA and Protein(s)
mRNA              Protein           Name
NM_020733.1       NP_065784.1       protein HEG homolog 1 precursor
XM_005247666.1    XP_005247723.1    protein HEG homolog 1 isoform X1
NM_020733.2       NP_065784.1       protein HEG homolog 1 precursor
Organism                          Gene    Protein
Homo sapiens (human)              HEG1    NP_065784.1
(species not given)               HEG1    XP_002802724.1
Bos taurus (cattle)               HEG1    XP_002684860.1
Mus musculus (house mouse)        Heg1    NP_780465.4
Rattus norvegicus (Norway rat)    Heg1    XP_006248505.1
Pan troglodytes (chimpanzee)      HEG1    XP_516708.4
Canis lupus familiaris (dog)      HEG1    XP_005639641.1
Related articles in PubMed
HEG1 is a novel mucin-like membrane protein that serves as a diagnostic and therapeutic target for malignant mesothelioma.
Tsuji S, Washimi K, Kageyama T, Yamashita M, Yoshihara M, Matsuura R, Yokose T, Kameda Y, Hayashi H, Morohoshi T, Tsuura Y, Yusa T, Sato T, Togayachi A, Narimatsu H, Nagasaki T, Nakamoto K, Moriwaki Y, Misawa H, Hiroshima K, Miyagi Y, Imai K. Scientific reports 7:45768 (2017 Mar)
Heart of glass anchors Rasip1 at endothelial cell-cell junctions to support vascular integrity.
de Kreuk BJ, Gingras AR, Knight JD, Liu JJ, Gingras AC, Ginsberg MH. eLife 5:e11394 (2016 Jan)
Cerebral cavernous malformations arise independent of the heart of glass receptor.
Zheng X, Riant F, Bergametti F, Myers CD, Tang AT, Kleaveland B, Pan W, Yang J, Tournier-Lasserve E, Kahn ML. Stroke 45(5):1505-1509 (2014 May)
The structure of the ternary complex of Krev interaction trapped 1 (KRIT1) bound to both the Rap1 GTPase and the heart of glass (HEG1) cytoplasmic tail.
Gingras AR, Puzon-McLaughlin W, Ginsberg MH. The Journal of biological chemistry 288(33):23639-49 (2013 Aug)
The following HEG1 cDNA ORF clone sequences were retrieved from the NCBI Reference Sequence Database (RefSeq). Each sequence represents the protein-coding region, that is, the open reading frame (ORF), of an HEG1 transcript. ORF sequences can be delivered in our standard vector, pcDNA3.1+ /C-(K)DYK, or in the vector of your choice, as an expression/transfection-ready ORF clone.
CloneID
OHu03459
Clone ID Related Accession (Same CDS sequence)
NM_020733.1, NM_020733.2
Accession Version
NM_020733.1
Documents for ORF clone product in default vector
Sequence Information
ORF Nucleotide Sequence (Length: 4146bp)
Protein sequence
SNP
Vector
pcDNA3.1+ /C-(K)DYK or customized vector
User Manual
Clone information
Clone Map
MSDS
Tag on pcDNA3.1+ /C-(K)DYK
C terminal DYKDDDDK tags
ORF Insert Method
CloneEZ™ Seamless cloning technology
Insert Structure
linear
Update Date
2017-12-26
Organism
Homo sapiens (human)
Product
protein HEG homolog 1 precursor
Comment
VALIDATED REFSEQ: This record has undergone validation or preliminary review. The reference sequence was derived from DR003209.1, AB033063.2, AC092983.13 and AC026342.34. On or before Jul 26, 2007 this sequence version replaced XM_087386.9, XM_001129251.1.
##Evidence-Data-START##
RNAseq introns :: mixed/partial sample support SAMEA1965299, SAMEA1966682 [ECO:0000350]
##Evidence-Data-END##
COMPLETENESS: complete on the 3' end.
1 61 121 181 241 301 361 421 481 541 601 661 721 781 841 901 961 1021 1081 1141 1201 1261 1321 1381 1441 1501 1561 1621 1681 1741 1801 1861 1921 1981 2041 2101 2161 2221 2281 2341 2401 2461 2521 2581 2641 2701 2761 2821 2881 2941 3001 3061 3121 3181 3241 3301 3361 3421 3481 3541 3601 3661 3721 3781 3841 3901 3961 4021 4081 4141 ATGGCCTCGC CGCGCGCCTC GCGGTGGCCG CCGCCGCTCC TGCTGCTGTT GCTGCCGCTG CTGCTGCTGC CGCCGGCGGC CCCCGGGACG CGGGACCCGC CGCCTTCCCC GGCTCGCCGC GCGCTGAGCC TGGCGCCCCT CGCGGGAGCG GGGCTGGAGC TGCAGCTGGA GCGCCGCCCG GAGCGCGAGC CGCCGCCCAC GCCGCCCCGG GAGCGCCGCG GGCCCGCGAC CCCCGGCCCC AGCTACAGGG CCCCTGAGCC AGGCGCCGCG ACACAGCGGG GACCCTCCGG CCGGGCCCCC AGAGGCGGGA GCGCGGATGC TGCCTGGAAA CATTGGCCAG AAAGTAACAC TGAGGCCCAT GTAGAAAACA TCACCTTCTA TCAGAATCAA GAGGACTTTT CAACAGTGTC CTCCAAAGAG GGCGTGATGG TTCAGACCTC TGGGAAGAGC CATGCTGCTT CGGATGCTCC AGAAAACCTC ACTCTACTCG CTGAAACAGC AGATGCTAGA GGAAGGAGCG GCTCTTCAAG TAGAACAAAC TTCACCATTT TGCCTGTTGG GTACTCACTG GAGATAGCAA CAGCTCTGAC TTCCCAGAGT GGCAACTTAG CCTCAGAAAG TCTTCACCTG CCATCCAGCA GTTCAGAGTT CGATGAAAGA ATTGCCGCTT TTCAAACAAA GAGTGGAACA GCCTCGGAGA TGGGAACAGA GAGGGCGATG GGGCTGTCAG AAGAATGGAC TGTGCACAGC CAAGAGGCCA CCACTTCGGC TTGGAGCCCG TCCTTTCTTC CTGCTTTGGA GATGGGAGAG CTGACCACGC CTTCTAGGAA GAGAAATTCC TCAGGACCAG ATCTCTCCTG GCTGCATTTC TACAGGACAG CAGCTTCCTC TCCTCTCTTA GACCTTTCCT CATCTTCTGA AAGTACAGAG AAGCTTAACA ACTCCACTGG CCTCCAGAGC TCCTCAGTCA GTCAAACAAA GACAATGCAT GTTGCCACCG TGTTCACTGA TGGTGGCCCG AGAACGCTGC GATCTTTGAC GGTCAGTCTG GGACCTGTGA GCAAGACAGA AGGCTTCCCC AAGGACTCCA GAATTGCCAC GACTTCATCC TCAGTCCTTC TTTCACCCTC TGCAGTGGAA TCGAGAAGAA ACAGTAGAGT AACTGGGAAT CCAGGGGATG AGGAATTCAT TGAACCATCC ACAGAAAATG AATTTGGACT TACGTCTTTG CGTTGGCAAA ATGATTCCCC AACCTTTGGA GAACATCAGC TTGCCAGCAG CTCTGAGGTG CAAAATGGAA GTCCCATGTC TCAGACTGAG ACTGTGTCTA GGTCAGTCGC ACCCATGAGA GGTGGAGAGA TCACTGCACA CTGGCTCTTG ACCAACAGCA CAACATCTGC AGATGTGACA GGAAGCTCTG CTTCATATCC TGAAGGTGTG AATGCTTCAG TGTTGACCCA GTTCTCAGAC TCTACTGTAC AGTCTGGAGG AAGTCACACA GCATTGGGAG 
ATAGGAGTTA TTCAGAGTCT TCATCTACAT CTTCCTCGGA AAGCTTGAAT TCATCAGCAC CACGTGGAGA ACGTTCGATC GCTGGGATTA GCTACGGTCA AGTGCGTGGC ACAGCTATTG AACAAAGGAC TTCCAGCGAC CACACAGACC ACACCTACCT GTCATCTACT TTCACCAAAG GAGAACGGGC GTTACTGTCC ATTACAGATA ACAGTTCATC CTCAGACATT GTGGAGAGCT CAACTTCTTA TATTAAAATC TCAAACTCTT CACATTCAGA GTATTCCTCC TTTTTTCATG CTCAGACTGA GAGAAGTAAC ATCTCATCCT ATGACGGGGA ATATGCTCAG CCTTCTACTG AGTCGCCAGT TCTGCATACA TCCAACCTTC CGTCCTACAC ACCCACCATT AATATGCCGA ACACTTCGGT TGTTCTGGAC ACTGATGCTG AGTTTGTTAG TGACTCCTCC TCCTCCTCTT CCTCCTCCTC CTCTTCTTCT TCTTCAGGGC CTCCTTTGCC TCTGCCCTCT GTGTCACAAT CCCACCATTT ATTTTCATCA ATTTTACCAT CAACCAGGGC CTCTGTGCAT CTACTAAAGT CTACCTCTGA TGCATCCACA CCATGGTCTT CCTCACCATC ACCTTTACCA GTATCCTTAA CGACATCTAC ATCTGCCCCA CTTTCTGTCT CACAAACAAC CTTGCCACAG TCATCTTCTA CCCCTGTCCT GCCCAGGGCA AGGGAGACTC CTGTGACTTC ATTTCAGACA TCAACAATGA CATCATTCAT GACAATGCTC CATAGTAGTC AAACTGCAGA CCTTAAGAGC CAGAGCACCC CACACCAAGA GAAAGTCATT ACAGAATCAA AGTCACCAAG CCTGGTGTCT CTGCCCACAG AGTCCACCAA AGCTGTAACA ACAAACTCTC CTTTGCCTCC ATCCTTAACA GAGTCCTCCA CAGAGCAAAC CCTTCCAGCC ACAAGCACCA ACTTAGCACA AATGTCTCCA ACTTTCACAA CTACCATTCT GAAGACCTCT CAGCCTCTTA TGACCACTCC TGGCACCCTG TCAAGCACAG CATCTCTGGT CACTGGCCCT ATAGCCGTAC AGACTACAGC TGGAAAACAG CTCTCGCTGA CCCATCCTGA AATACTAGTT CCTCAAATCT CAACAGAAGG TGGCATCAGC ACAGAAAGGA ACCGAGTGAT TGTGGATGCT ACCACTGGAT TGATCCCTTT GACCAGTGTA CCCACATCAG CAAAAGAAAT GACCACAAAG CTTGGCGTTA CAGCAGAGTA CAGCCCAGCT TCACGTTCCC TCGGAACATC TCCTTCTCCC CAAACCACAG TTGTTTCCAC GGCTGAAGAC TTGGCTCCCA AATCTGCCAC CTTTGCTGTT CAGAGCAGCA CACAGTCACC AACAACAGTG TCCTCTTCAG CCTCAGTCAA CAGCTGTGCT GTGAACCCTT GTCTTCACAA TGGCGAATGC GTCGCAGACA ACACCAGCCG TGGCTACCAC TGCAGGTGCC CGCCTTCCTG GCAAGGGGAT GATTGCAGTG TGGATGTGAA TGAGTGCCTG TCGAACCCCT GCCCATCCAC AGCCATGTGC AACAATACTC AGGGATCCTT TATCTGCAAA TGCCCGGTTG GGTACCAGTT GGAAAAAGGG ATATGCAATT TGGTTAGAAC CTTCGTGACA GAGTTTAAAT TAAAGAGAAC TTTTCTTAAT ACAACTGTGG AAAAACATTC AGACCTACAA GAAGTTGAAA ATGAGATCAC CAAAACGTTA AATATGTGTT TTTCAGCGTT 
ACCTAGTTAC ATCCGATCTA CAGTTCACGC CTCTAGGGAG TCCAACGCGG TGGTGATCTC ACTGCAAACA ACCTTTTCCC TGGCCTCCAA TGTGACGCTA TTTGACCTGG CTGATAGGAT GCAGAAATGT GTCAACTCCT GCAAGTCCTC TGCTGAGGTC TGCCAGCTCT TGGGATCTCA GAGGCGGATC TTTAGAGCGG GCAGCTTGTG CAAGCGGAAG AGTCCCGAAT GTGACAAAGA CACCTCCATC TGCACTGACC TGGACGGCGT TGCCCTGTGC CAGTGCAAGT CGGGATACTT TCAGTTCAAC AAGATGGACC ACTCCTGCCG AGCATGTGAA GATGGATATA GGCTTGAAAA TGAAACCTGC ATGAGTTGCC CATTTGGCCT TGGTGGTCTC AACTGTGGAA ACCCCTATCA GCTTATCACT GTGGTGATCG CAGCCGCGGG AGGTGGGCTC CTGCTCATCC TAGGCATCGC ACTGATTGTT ACCTGTTGCA GAAAGAATAA AAATGACATA AGCAAACTCA TCTTCAAAAG TGGAGATTTC CAAATGTCCC CGTATGCTGA ATACCCCAAA AATCCTCGCT CACAAGAATG GGGCCGAGAA GCTATTGAAA TGCATGAGAA TGGAAGTACC AAAAACCTCC TCCAGATGAC GGATGTGTAC TACTCGCCTA CAAGTGTAAG GAATCCAGAA CTTGAACGAA ACGGACTCTA CCCGGCCTAC ACTGGACTGC CAGGATCACG GCATTCTTGC ATTTTCCCCG GACAGTATAA CCCGTCTTTC ATCAGTGATG AAAGCAGAAG AAGAGACTAC TTTTAA
The stop codon will be deleted if the pcDNA3.1+ /C-(K)DYK vector is selected.
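The note above reflects a general requirement of C-terminal fusions: a tag placed after the ORF can only be read in frame if the ORF's own stop codon is removed first. A minimal sketch of that step is given here. The function name `strip_stop_codon` is hypothetical, and the test input is simply the last five codons of the ORF listed above; this is not a description of the vendor's actual cloning procedure.

```python
# Standard stop codons (DNA alphabet).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def strip_stop_codon(orf: str) -> str:
    """Return the ORF without its terminal stop codon, if one is present.

    Hypothetical helper for illustration only.
    """
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    if orf[-3:] in STOP_CODONS:
        return orf[:-3]
    return orf


# Last five codons of the ORF shown above: AGA GAC TAC TTT TAA
tail = "AGAGACTACTTTTAA"
print(strip_stop_codon(tail))  # AGAGACTACTTT
```

A sequence that does not end in a stop codon is returned unchanged, so the helper can be applied safely to an ORF that has already been trimmed.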
RefSeq
NP_065784.1
CDS 69..4214
Translation
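The CDS coordinates can be checked against the ORF lengths stated on this page. The sketch below assumes the coordinates are 1-based and inclusive, as in GenBank-style feature locations; the function name `cds_length` is hypothetical, and the three coordinate pairs are the ones listed for the HEG1 records on this page.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a CDS given 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


# CDS coordinates as listed for each accession on this page.
cds_ranges = {
    "NM_020733.1": (69, 4214),
    "NM_020733.2": (108, 4253),
    "XM_005247666.1": (64, 4509),
}

for accession, (start, end) in cds_ranges.items():
    length = cds_length(start, end)
    codons, remainder = divmod(length, 3)
    # One codon is the stop codon, so the protein has codons - 1 residues.
    print(accession, length, "nt;", codons - 1, "aa; remainder", remainder)
```

Under that assumption the two NM_020733 versions both give 4146 nt and XM_005247666.1 gives 4446 nt, matching the stated ORF lengths, and each length is a whole number of codons.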
Target ORF information:
RefSeq Version: NM_020733.1
Organism: Homo sapiens (human)
Definition: Homo sapiens heart development protein with EGF like domains 1 (HEG1), mRNA.
Epitope: DYKDDDDK
Bacterial selection: AMPR
Mammalian selection: NeoR
Vector: pcDNA3.1+ /C-(K)DYK
CloneID
OHu03459
Clone ID Related Accession (Same CDS sequence)
NM_020733.1, NM_020733.2
Accession Version
NM_020733.2 (latest version)
Documents for ORF clone product in default vector
Sequence Information
ORF Nucleotide Sequence (Length: 4146bp)
Protein sequence
SNP
Vector
pcDNA3.1+ /C-(K)DYK or customized vector
User Manual
Clone information
Clone Map
MSDS
Tag on pcDNA3.1+ /C-(K)DYK
C terminal DYKDDDDK tags
ORF Insert Method
CloneEZ™ Seamless cloning technology
Insert Structure
linear
Update Date
2019-12-30
Organism
Homo sapiens (human)
Product
protein HEG homolog 1 precursor
Comment
VALIDATED REFSEQ: This record has undergone validation or preliminary review. The reference sequence was derived from AC117488.11, DR003209.1, AB033063.2, AC092983.13 and AC026342.34. On Nov 23, 2018 this sequence version replaced NM_020733.1.
##Evidence-Data-START##
RNAseq introns :: mixed/partial sample support SAMEA1965299, SAMEA1966682 [ECO:0000350]
##Evidence-Data-END##
##RefSeq-Attributes-START##
MANE Ensembl match :: ENST00000311127.9/ ENSP00000311502.3
RefSeq Select criteria :: based on conservation
##RefSeq-Attributes-END##
COMPLETENESS: full length.
1 61 121 181 241 301 361 421 481 541 601 661 721 781 841 901 961 1021 1081 1141 1201 1261 1321 1381 1441 1501 1561 1621 1681 1741 1801 1861 1921 1981 2041 2101 2161 2221 2281 2341 2401 2461 2521 2581 2641 2701 2761 2821 2881 2941 3001 3061 3121 3181 3241 3301 3361 3421 3481 3541 3601 3661 3721 3781 3841 3901 3961 4021 4081 4141 ATGGCCTCGC CGCGCGCCTC GCGGTGGCCG CCGCCGCTCC TGCTGCTGTT GCTGCCGCTG CTGCTGCTGC CGCCGGCGGC CCCCGGGACG CGGGACCCGC CGCCTTCCCC GGCTCGCCGC GCGCTGAGCC TGGCGCCCCT CGCGGGAGCG GGGCTGGAGC TGCAGCTGGA GCGCCGCCCG GAGCGCGAGC CGCCGCCCAC GCCGCCCCGG GAGCGCCGCG GGCCCGCGAC CCCCGGCCCC AGCTACAGGG CCCCTGAGCC AGGCGCCGCG ACACAGCGGG GACCCTCCGG CCGGGCCCCC AGAGGCGGGA GCGCGGATGC TGCCTGGAAA CATTGGCCAG AAAGTAACAC TGAGGCCCAT GTAGAAAACA TCACCTTCTA TCAGAATCAA GAGGACTTTT CAACAGTGTC CTCCAAAGAG GGCGTGATGG TTCAGACCTC TGGGAAGAGC CATGCTGCTT CGGATGCTCC AGAAAACCTC ACTCTACTCG CTGAAACAGC AGATGCTAGA GGAAGGAGCG GCTCTTCAAG TAGAACAAAC TTCACCATTT TGCCTGTTGG GTACTCACTG GAGATAGCAA CAGCTCTGAC TTCCCAGAGT GGCAACTTAG CCTCAGAAAG TCTTCACCTG CCATCCAGCA GTTCAGAGTT CGATGAAAGA ATTGCCGCTT TTCAAACAAA GAGTGGAACA GCCTCGGAGA TGGGAACAGA GAGGGCGATG GGGCTGTCAG AAGAATGGAC TGTGCACAGC CAAGAGGCCA CCACTTCGGC TTGGAGCCCG TCCTTTCTTC CTGCTTTGGA GATGGGAGAG CTGACCACGC CTTCTAGGAA GAGAAATTCC TCAGGACCAG ATCTCTCCTG GCTGCATTTC TACAGGACAG CAGCTTCCTC TCCTCTCTTA GACCTTTCCT CATCTTCTGA AAGTACAGAG AAGCTTAACA ACTCCACTGG CCTCCAGAGC TCCTCAGTCA GTCAAACAAA GACAATGCAT GTTGCCACCG TGTTCACTGA TGGTGGCCCG AGAACGCTGC GATCTTTGAC GGTCAGTCTG GGACCTGTGA GCAAGACAGA AGGCTTCCCC AAGGACTCCA GAATTGCCAC GACTTCATCC TCAGTCCTTC TTTCACCCTC TGCAGTGGAA TCGAGAAGAA ACAGTAGAGT AACTGGGAAT CCAGGGGATG AGGAATTCAT TGAACCATCC ACAGAAAATG AATTTGGACT TACGTCTTTG CGTTGGCAAA ATGATTCCCC AACCTTTGGA GAACATCAGC TTGCCAGCAG CTCTGAGGTG CAAAATGGAA GTCCCATGTC TCAGACTGAG ACTGTGTCTA GGTCAGTCGC ACCCATGAGA GGTGGAGAGA TCACTGCACA CTGGCTCTTG ACCAACAGCA CAACATCTGC AGATGTGACA GGAAGCTCTG CTTCATATCC TGAAGGTGTG AATGCTTCAG TGTTGACCCA GTTCTCAGAC TCTACTGTAC AGTCTGGAGG AAGTCACACA GCATTGGGAG 
ATAGGAGTTA TTCAGAGTCT TCATCTACAT CTTCCTCGGA AAGCTTGAAT TCATCAGCAC CACGTGGAGA ACGTTCGATC GCTGGGATTA GCTACGGTCA AGTGCGTGGC ACAGCTATTG AACAAAGGAC TTCCAGCGAC CACACAGACC ACACCTACCT GTCATCTACT TTCACCAAAG GAGAACGGGC GTTACTGTCC ATTACAGATA ACAGTTCATC CTCAGACATT GTGGAGAGCT CAACTTCTTA TATTAAAATC TCAAACTCTT CACATTCAGA GTATTCCTCC TTTTTTCATG CTCAGACTGA GAGAAGTAAC ATCTCATCCT ATGACGGGGA ATATGCTCAG CCTTCTACTG AGTCGCCAGT TCTGCATACA TCCAACCTTC CGTCCTACAC ACCCACCATT AATATGCCGA ACACTTCGGT TGTTCTGGAC ACTGATGCTG AGTTTGTTAG TGACTCCTCC TCCTCCTCTT CCTCCTCCTC CTCTTCTTCT TCTTCAGGGC CTCCTTTGCC TCTGCCCTCT GTGTCACAAT CCCACCATTT ATTTTCATCA ATTTTACCAT CAACCAGGGC CTCTGTGCAT CTACTAAAGT CTACCTCTGA TGCATCCACA CCATGGTCTT CCTCACCATC ACCTTTACCA GTATCCTTAA CGACATCTAC ATCTGCCCCA CTTTCTGTCT CACAAACAAC CTTGCCACAG TCATCTTCTA CCCCTGTCCT GCCCAGGGCA AGGGAGACTC CTGTGACTTC ATTTCAGACA TCAACAATGA CATCATTCAT GACAATGCTC CATAGTAGTC AAACTGCAGA CCTTAAGAGC CAGAGCACCC CACACCAAGA GAAAGTCATT ACAGAATCAA AGTCACCAAG CCTGGTGTCT CTGCCCACAG AGTCCACCAA AGCTGTAACA ACAAACTCTC CTTTGCCTCC ATCCTTAACA GAGTCCTCCA CAGAGCAAAC CCTTCCAGCC ACAAGCACCA ACTTAGCACA AATGTCTCCA ACTTTCACAA CTACCATTCT GAAGACCTCT CAGCCTCTTA TGACCACTCC TGGCACCCTG TCAAGCACAG CATCTCTGGT CACTGGCCCT ATAGCCGTAC AGACTACAGC TGGAAAACAG CTCTCGCTGA CCCATCCTGA AATACTAGTT CCTCAAATCT CAACAGAAGG TGGCATCAGC ACAGAAAGGA ACCGAGTGAT TGTGGATGCT ACCACTGGAT TGATCCCTTT GACCAGTGTA CCCACATCAG CAAAAGAAAT GACCACAAAG CTTGGCGTTA CAGCAGAGTA CAGCCCAGCT TCACGTTCCC TCGGAACATC TCCTTCTCCC CAAACCACAG TTGTTTCCAC GGCTGAAGAC TTGGCTCCCA AATCTGCCAC CTTTGCTGTT CAGAGCAGCA CACAGTCACC AACAACAGTG TCCTCTTCAG CCTCAGTCAA CAGCTGTGCT GTGAACCCTT GTCTTCACAA TGGCGAATGC GTCGCAGACA ACACCAGCCG TGGCTACCAC TGCAGGTGCC CGCCTTCCTG GCAAGGGGAT GATTGCAGTG TGGATGTGAA TGAGTGCCTG TCGAACCCCT GCCCATCCAC AGCCATGTGC AACAATACTC AGGGATCCTT TATCTGCAAA TGCCCGGTTG GGTACCAGTT GGAAAAAGGG ATATGCAATT TGGTTAGAAC CTTCGTGACA GAGTTTAAAT TAAAGAGAAC TTTTCTTAAT ACAACTGTGG AAAAACATTC AGACCTACAA GAAGTTGAAA ATGAGATCAC CAAAACGTTA AATATGTGTT TTTCAGCGTT 
ACCTAGTTAC ATCCGATCTA CAGTTCACGC CTCTAGGGAG TCCAACGCGG TGGTGATCTC ACTGCAAACA ACCTTTTCCC TGGCCTCCAA TGTGACGCTA TTTGACCTGG CTGATAGGAT GCAGAAATGT GTCAACTCCT GCAAGTCCTC TGCTGAGGTC TGCCAGCTCT TGGGATCTCA GAGGCGGATC TTTAGAGCGG GCAGCTTGTG CAAGCGGAAG AGTCCCGAAT GTGACAAAGA CACCTCCATC TGCACTGACC TGGACGGCGT TGCCCTGTGC CAGTGCAAGT CGGGATACTT TCAGTTCAAC AAGATGGACC ACTCCTGCCG AGCATGTGAA GATGGATATA GGCTTGAAAA TGAAACCTGC ATGAGTTGCC CATTTGGCCT TGGTGGTCTC AACTGTGGAA ACCCCTATCA GCTTATCACT GTGGTGATCG CAGCCGCGGG AGGTGGGCTC CTGCTCATCC TAGGCATCGC ACTGATTGTT ACCTGTTGCA GAAAGAATAA AAATGACATA AGCAAACTCA TCTTCAAAAG TGGAGATTTC CAAATGTCCC CGTATGCTGA ATACCCCAAA AATCCTCGCT CACAAGAATG GGGCCGAGAA GCTATTGAAA TGCATGAGAA TGGAAGTACC AAAAACCTCC TCCAGATGAC GGATGTGTAC TACTCGCCTA CAAGTGTAAG GAATCCAGAA CTTGAACGAA ACGGACTCTA CCCGGCCTAC ACTGGACTGC CAGGATCACG GCATTCTTGC ATTTTCCCCG GACAGTATAA CCCGTCTTTC ATCAGTGATG AAAGCAGAAG AAGAGACTAC TTTTAA
The stop codon will be deleted if the pcDNA3.1+ /C-(K)DYK vector is selected.
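The sequence listing above is given as a run of position markers (1, 61, 121, ...) followed by the bases in blocks of ten. To work with it as a single string, the markers and whitespace have to be dropped. The sketch below shows one way to do that; the function name `clean_sequence_dump` is hypothetical, and the input is a short illustrative excerpt (three markers plus the first three blocks of the ORF), not the full listing.

```python
def clean_sequence_dump(text: str) -> str:
    """Drop numeric position markers and whitespace, keeping only bases.

    Hypothetical helper for illustration only.
    """
    tokens = text.split()
    # Purely numeric tokens are position markers, not sequence.
    seq = "".join(t.upper() for t in tokens if not t.isdigit())
    invalid = set(seq) - set("ACGTN")
    if invalid:
        raise ValueError(f"unexpected characters in sequence: {sorted(invalid)}")
    return seq


# Illustrative excerpt in the same layout as the listing above.
dump = "1 61 121 ATGGCCTCGC CGCGCGCCTC GCGGTGGCCG"
seq = clean_sequence_dump(dump)
print(len(seq), seq)  # 30 ATGGCCTCGCCGCGCGCCTCGCGGTGGCCG
```

Applied to a complete listing, the length of the cleaned string can then be compared with the ORF length stated in the record as a basic integrity check.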
RefSeq
NP_065784.1
CDS 108..4253
Translation
Target ORF information:
RefSeq Version: NM_020733.2
Organism: Homo sapiens (human)
Definition: Homo sapiens heart development protein with EGF like domains 1 (HEG1), mRNA.
Epitope: DYKDDDDK
Bacterial selection: AMPR
Mammalian selection: NeoR
Vector: pcDNA3.1+ /C-(K)DYK
CloneID
OHu38076
Clone ID Related Accession (Same CDS sequence)
XM_005247666.1
Accession Version
XM_005247666.1 (latest version)
Documents for ORF clone product in default vector
Sequence Information
ORF Nucleotide Sequence (Length: 4446bp)
Protein sequence
SNP
Vector
pcDNA3.1+ /C-(K)DYK or customized vector
User Manual
Clone information
Clone Map
MSDS
Tag on pcDNA3.1+ /C-(K)DYK
C terminal DYKDDDDK tags
ORF Insert Method
CloneEZ™ Seamless cloning technology
Insert Structure
linear
Update Date
2019-12-05
Organism
Homo sapiens (human)
Product
protein HEG homolog 1 isoform X1
Comment
MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NC_000003.12) annotated using gene prediction method: Gnomon, supported by mRNA and EST evidence.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Updated annotation
Annotation Name :: Homo sapiens Updated Annotation Release 109.20191205
Annotation Version :: 109.20191205
Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline
Annotation Software Version :: 8.3
Annotation Method :: Best-placed RefSeq; propagated RefSeq model
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
1 61 121 181 241 301 361 421 481 541 601 661 721 781 841 901 961 1021 1081 1141 1201 1261 1321 1381 1441 1501 1561 1621 1681 1741 1801 1861 1921 1981 2041 2101 2161 2221 2281 2341 2401 2461 2521 2581 2641 2701 2761 2821 2881 2941 3001 3061 3121 3181 3241 3301 3361 3421 3481 3541 3601 3661 3721 3781 3841 3901 3961 4021 4081 4141 4201 4261 4321 4381 4441 ATGGCCTCGC CGCGCGCCTC GCGGTGGCCG CCGCCGCTCC TGCTGCTGTT GCTGCCGCTG CTGCTGCTGC CGCCGGCGGC CCCCGGGACG CGGGACCCGC CGCCTTCCCC GGCTCGCCGC GCGCTGAGCC TGGCGCCCCT CGCGGGAGCG GGGCTGGAGC TGCAGCTGGA GCGCCGCCCG GAGCGCGAGC CGCCGCCCAC GCCGCCCCGG GAGCGCCGCG GGCCCGCGAC CCCCGGCCCC AGCTACAGGG CCCCTGAGCC AGGCGCCGCG ACACAGCGGG GACCCTCCGG CCGGGCCCCC AGAGGCGGGA GCGCGGATGC TGCCTGGAAA CATTGGCCAG AAAGTAACAC TGAGGCCCAT GTAGAAAACA TCACCTTCTA TCAGAATCAA GAGGACTTTT CAACAGTGTC CTCCAAAGAG GGCGTGATGG TTCAGACCTC TGGGAAGAGC CATGCTGCTT CGGATGCTCC AGAAAACCTC ACTCTACTCG CTGAAACAGC AGATGCTAGA GGAAGGAGCG GCTCTTCAAG TAGAACAAAC TTCACCATTT TGCCTGTTGG GTACTCACTG GAGATAGCAA CAGCTCTGAC TTCCCAGAGT GGCAACTTAG CCTCAGAAAG TCTTCACCTG CCATCCAGCA GTTCAGAGTT CGATGAAAGA ATTGCCGCTT TTCAAACAAA GAGTGGAACA GCCTCGGAGA TGGGAACAGA GAGGGCGATG GGGCTGTCAG AAGAATGGAC TGTGCACAGC CAAGAGGCCA CCACTTCGGC TTGGAGCCCG TCCTTTCTTC CTGCTTTGGA GATGGGAGAG CTGACCACGC CTTCTAGGAA GAGAAATTCC TCAGGACCAG ATCTCTCCTG GCTGCATTTC TACAGGACAG CAGCTTCCTC TCCTCTCTTA GACCTTTCCT CATCTTCTGA AAGTACAGAG AAGCTTAACA ACTCCACTGG CCTCCAGAGC TCCTCAGTCA GTCAAACAAA GACAATGCAT GTTGCCACCG TGTTCACTGA TGGTGGCCCG AGAACGCTGC GATCTTTGAC GGTCAGTCTG GGACCTGTGA GCAAGACAGA AGGCTTCCCC AAGGACTCCA GAATTGCCAC GACTTCATCC TCAGTCCTTC TTTCACCCTC TGCAGTGGAA TCGAGAAGAA ACAGTAGAGT AACTGGGAAT CCAGGGGATG AGGAATTCAT TGAACCATCC ACAGAAAATG AATTTGGACT TACGTCTTTG CGTTGGCAAA ATGATTCCCC AACCTTTGGA GAACATCAGC TTGCCAGCAG CTCTGAGGTG CAAAATGGAA GTCCCATGTC TCAGACTGAG ACTGTGTCTA GGTCAGTCGC ACCCATGAGA GGTGGAGAGA TCACTGCACA CTGGCTCTTG ACCAACAGCA CAACATCTGC AGATGTGACA GGAAGCTCTG CTTCATATCC TGAAGGTGTG AATGCTTCAG TGTTGACCCA GTTCTCAGAC TCTACTGTAC AGTCTGGAGG 
AAGTCACACA GCATTGGGAG ATAGGAGTTA TTCAGAGTCT TCATCTACAT CTTCCTCGGA AAGCTTGAAT TCATCAGCAC CACGTGGAGA ACGTTCGACC TTGGAAGACA GCCGAGAGCC AGGCCAAGCA CTAGGTGACA GTTCCGCCAA TGCAGAGGAC AGGACTTCTG GGGTGCCCTC TCTCGGCACC CACACCTTGG CTACTGTCAC TGGAAACGGG GAACGCACAC TGCGGTCTGT CACCCTCACC AACACCAGCA TGAGCACGAC TTCTGGGGAA GCAGGCAGCC CTGCAGCGGC CATGCACCAA GAAACAGAGG GTGCCTCTCT GCACGTAAAC GTGACGGACG ACATGGGCCT GGTCTCACGG TCACTGGCCG CCTCCAGTGC ACTCGGAGTC GCTGGGATTA GCTACGGTCA AGTGCGTGGC ACAGCTATTG AACAAAGGAC TTCCAGCGAC CACACAGACC ACACCTACCT GTCATCTACT TTCACCAAAG GAGAACGGGC GTTACTGTCC ATTACAGATA ACAGTTCATC CTCAGACATT GTGGAGAGCT CAACTTCTTA TATTAAAATC TCAAACTCTT CACATTCAGA GTATTCCTCC TTTTTTCATG CTCAGACTGA GAGAAGTAAC ATCTCATCCT ATGACGGGGA ATATGCTCAG CCTTCTACTG AGTCGCCAGT TCTGCATACA TCCAACCTTC CGTCCTACAC ACCCACCATT AATATGCCGA ACACTTCGGT TGTTCTGGAC ACTGATGCTG AGTTTGTTAG TGACTCCTCC TCCTCCTCTT CCTCCTCCTC CTCTTCTTCT TCTTCAGGGC CTCCTTTGCC TCTGCCCTCT GTGTCACAAT CCCACCATTT ATTTTCATCA ATTTTACCAT CAACCAGGGC CTCTGTGCAT CTACTAAAGT CTACCTCTGA TGCATCCACA CCATGGTCTT CCTCACCATC ACCTTTACCA GTATCCTTAA CGACATCTAC ATCTGCCCCA CTTTCTGTCT CACAAACAAC CTTGCCACAG TCATCTTCTA CCCCTGTCCT GCCCAGGGCA AGGGAGACTC CTGTGACTTC ATTTCAGACA TCAACAATGA CATCATTCAT GACAATGCTC CATAGTAGTC AAACTGCAGA CCTTAAGAGC CAGAGCACCC CACACCAAGA GAAAGTCATT ACAGAATCAA AGTCACCAAG CCTGGTGTCT CTGCCCACAG AGTCCACCAA AGCTGTAACA ACAAACTCTC CTTTGCCTCC ATCCTTAACA GAGTCCTCCA CAGAGCAAAC CCTTCCAGCC ACAAGCACCA ACTTAGCACA AATGTCTCCA ACTTTCACAA CTACCATTCT GAAGACCTCT CAGCCTCTTA TGACCACTCC TGGCACCCTG TCAAGCACAG CATCTCTGGT CACTGGCCCT ATAGCCGTAC AGACTACAGC TGGAAAACAG CTCTCGCTGA CCCATCCTGA AATACTAGTT CCTCAAATCT CAACAGAAGG TGGCATCAGC ACAGAAAGGA ACCGAGTGAT TGTGGATGCT ACCACTGGAT TGATCCCTTT GACCAGTGTA CCCACATCAG CAAAAGAAAT GACCACAAAG CTTGGCGTTA CAGCAGAGTA CAGCCCAGCT TCACGTTCCC TCGGAACATC TCCTTCTCCC CAAACCACAG TTGTTTCCAC GGCTGAAGAC TTGGCTCCCA AATCTGCCAC CTTTGCTGTT CAGAGCAGCA CACAGTCACC AACAACAGTG TCCTCTTCAG CCTCAGTCAA CAGCTGTGCT GTGAACCCTT GTCTTCACAA TGGCGAATGC 
GTCGCAGACA ACACCAGCCG TGGCTACCAC TGCAGGTGCC CGCCTTCCTG GCAAGGGGAT GATTGCAGTG TGGATGTGAA TGAGTGCCTG TCGAACCCCT GCCCATCCAC AGCCATGTGC AACAATACTC AGGGATCCTT TATCTGCAAA TGCCCGGTTG GGTACCAGTT GGAAAAAGGG ATATGCAATT TGGTTAGAAC CTTCGTGACA GAGTTTAAAT TAAAGAGAAC TTTTCTTAAT ACAACTGTGG AAAAACATTC AGACCTACAA GAAGTTGAAA ATGAGATCAC CAAAACGTTA AATATGTGTT TTTCAGCGTT ACCTAGTTAC ATCCGATCTA CAGTTCACGC CTCTAGGGAG TCCAACGCGG TGGTGATCTC ACTGCAAACA ACCTTTTCCC TGGCCTCCAA TGTGACGCTA TTTGACCTGG CTGATAGGAT GCAGAAATGT GTCAACTCCT GCAAGTCCTC TGCTGAGGTC TGCCAGCTCT TGGGATCTCA GAGGCGGATC TTTAGAGCGG GCAGCTTGTG CAAGCGGAAG AGTCCCGAAT GTGACAAAGA CACCTCCATC TGCACTGACC TGGACGGCGT TGCCCTGTGC CAGTGCAAGT CGGGATACTT TCAGTTCAAC AAGATGGACC ACTCCTGCCG AGCATGTGAA GATGGATATA GGCTTGAAAA TGAAACCTGC ATGAGTTGCC CATTTGGCCT TGGTGGTCTC AACTGTGGAA ACCCCTATCA GCTTATCACT GTGGTGATCG CAGCCGCGGG AGGTGGGCTC CTGCTCATCC TAGGCATCGC ACTGATTGTT ACCTGTTGCA GAAAGAATAA AAATGACATA AGCAAACTCA TCTTCAAAAG TGGAGATTTC CAAATGTCCC CGTATGCTGA ATACCCCAAA AATCCTCGCT CACAAGAATG GGGCCGAGAA GCTATTGAAA TGCATGAGAA TGGAAGTACC AAAAACCTCC TCCAGATGAC GGATGTGTAC TACTCGCCTA CAAGTGTAAG GAATCCAGAA CTTGAACGAA ACGGACTCTA CCCGGCCTAC ACTGGACTGC CAGGATCACG GCATTCTTGC ATTTTCCCCG GACAGTATAA CCCGTCTTTC ATCAGTGATG AAAGCAGAAG AAGAGACTAC TTTTAA
The stop codon will be deleted if the pcDNA3.1+ /C-(K)DYK vector is selected.
RefSeq
XP_005247723.1
CDS 64..4509
Translation
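The "Translation" entry refers to the protein encoded by the CDS. As a small illustration, the sketch below translates the first ten codons of the ORF (the opening 30 nt are the same in every ORF listed on this page) using the standard genetic code. The names `CODON_TABLE` and `translate` are hypothetical, and the code assumes an unambiguous A/C/G/T sequence read in frame from its first base.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(orf: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    orf = orf.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(orf) - len(orf) % 3, 3):
        residue = CODON_TABLE[orf[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the HEG1 ORF as listed on this page.
print(translate("ATGGCCTCGCCGCGCGCCTCGCGGTGGCCG"))  # MASPRASRWP
```

The same function applied to the last five codons of the ORF (AGA GAC TAC TTT TAA) yields RDYF, with translation ending at the TAA stop codon.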
Target ORF information:
RefSeq Version XM_005247666.1
Organism Homo sapiens (human)
Definition Homo sapiens heart development protein with EGF like domains 1 (HEG1), transcript variant X1, mRNA.
Target ORF information:
Epitope DYKDDDDK
Bacterial selection AMPR
Mammalian selection NeoR
Vector pcDNA3.1+ /C-(K)DYK
ORF Insert Sequence:
ATGGCCTCGC CGCGCGCCTC GCGGTGGCCG CCGCCGCTCC TGCTGCTGTT GCTGCCGCTG CTGCTGCTGC CGCCGGCGGC CCCCGGGACG CGGGACCCGC CGCCTTCCCC GGCTCGCCGC GCGCTGAGCC TGGCGCCCCT CGCGGGAGCG GGGCTGGAGC TGCAGCTGGA GCGCCGCCCG GAGCGCGAGC CGCCGCCCAC GCCGCCCCGG GAGCGCCGCG GGCCCGCGAC CCCCGGCCCC AGCTACAGGG CCCCTGAGCC AGGCGCCGCG ACACAGCGGG GACCCTCCGG CCGGGCCCCC AGAGGCGGGA GCGCGGATGC TGCCTGGAAA CATTGGCCAG AAAGTAACAC TGAGGCCCAT GTAGAAAACA TCACCTTCTA TCAGAATCAA GAGGACTTTT CAACAGTGTC CTCCAAAGAG GGCGTGATGG TTCAGACCTC TGGGAAGAGC CATGCTGCTT CGGATGCTCC AGAAAACCTC ACTCTACTCG CTGAAACAGC AGATGCTAGA GGAAGGAGCG GCTCTTCAAG TAGAACAAAC TTCACCATTT TGCCTGTTGG GTACTCACTG GAGATAGCAA CAGCTCTGAC TTCCCAGAGT GGCAACTTAG CCTCAGAAAG TCTTCACCTG CCATCCAGCA GTTCAGAGTT CGATGAAAGA ATTGCCGCTT TTCAAACAAA GAGTGGAACA GCCTCGGAGA TGGGAACAGA GAGGGCGATG GGGCTGTCAG AAGAATGGAC TGTGCACAGC CAAGAGGCCA CCACTTCGGC TTGGAGCCCG TCCTTTCTTC CTGCTTTGGA GATGGGAGAG CTGACCACGC CTTCTAGGAA GAGAAATTCC TCAGGACCAG ATCTCTCCTG GCTGCATTTC TACAGGACAG CAGCTTCCTC TCCTCTCTTA GACCTTTCCT CATCTTCTGA AAGTACAGAG AAGCTTAACA ACTCCACTGG CCTCCAGAGC TCCTCAGTCA GTCAAACAAA GACAATGCAT GTTGCCACCG TGTTCACTGA TGGTGGCCCG AGAACGCTGC GATCTTTGAC GGTCAGTCTG GGACCTGTGA GCAAGACAGA AGGCTTCCCC AAGGACTCCA GAATTGCCAC GACTTCATCC TCAGTCCTTC TTTCACCCTC TGCAGTGGAA TCGAGAAGAA ACAGTAGAGT AACTGGGAAT CCAGGGGATG AGGAATTCAT TGAACCATCC ACAGAAAATG AATTTGGACT TACGTCTTTG CGTTGGCAAA ATGATTCCCC AACCTTTGGA GAACATCAGC TTGCCAGCAG CTCTGAGGTG CAAAATGGAA GTCCCATGTC TCAGACTGAG ACTGTGTCTA GGTCAGTCGC ACCCATGAGA GGTGGAGAGA TCACTGCACA CTGGCTCTTG ACCAACAGCA CAACATCTGC AGATGTGACA GGAAGCTCTG CTTCATATCC TGAAGGTGTG AATGCTTCAG TGTTGACCCA GTTCTCAGAC TCTACTGTAC AGTCTGGAGG 
AAGTCACACA GCATTGGGAG ATAGGAGTTA TTCAGAGTCT TCATCTACAT CTTCCTCGGA AAGCTTGAAT TCATCAGCAC CACGTGGAGA ACGTTCGACC TTGGAAGACA GCCGAGAGCC AGGCCAAGCA CTAGGTGACA GTTCCGCCAA TGCAGAGGAC AGGACTTCTG GGGTGCCCTC TCTCGGCACC CACACCTTGG CTACTGTCAC TGGAAACGGG GAACGCACAC TGCGGTCTGT CACCCTCACC AACACCAGCA TGAGCACGAC TTCTGGGGAA GCAGGCAGCC CTGCAGCGGC CATGCACCAA GAAACAGAGG GTGCCTCTCT GCACGTAAAC GTGACGGACG ACATGGGCCT GGTCTCACGG TCACTGGCCG CCTCCAGTGC ACTCGGAGTC GCTGGGATTA GCTACGGTCA AGTGCGTGGC ACAGCTATTG AACAAAGGAC TTCCAGCGAC CACACAGACC ACACCTACCT GTCATCTACT TTCACCAAAG GAGAACGGGC GTTACTGTCC ATTACAGATA ACAGTTCATC CTCAGACATT GTGGAGAGCT CAACTTCTTA TATTAAAATC TCAAACTCTT CACATTCAGA GTATTCCTCC TTTTTTCATG CTCAGACTGA GAGAAGTAAC ATCTCATCCT ATGACGGGGA ATATGCTCAG CCTTCTACTG AGTCGCCAGT TCTGCATACA TCCAACCTTC CGTCCTACAC ACCCACCATT AATATGCCGA ACACTTCGGT TGTTCTGGAC ACTGATGCTG AGTTTGTTAG TGACTCCTCC TCCTCCTCTT CCTCCTCCTC CTCTTCTTCT TCTTCAGGGC CTCCTTTGCC TCTGCCCTCT GTGTCACAAT CCCACCATTT ATTTTCATCA ATTTTACCAT CAACCAGGGC CTCTGTGCAT CTACTAAAGT CTACCTCTGA TGCATCCACA CCATGGTCTT CCTCACCATC ACCTTTACCA GTATCCTTAA CGACATCTAC ATCTGCCCCA CTTTCTGTCT CACAAACAAC CTTGCCACAG TCATCTTCTA CCCCTGTCCT GCCCAGGGCA AGGGAGACTC CTGTGACTTC ATTTCAGACA TCAACAATGA CATCATTCAT GACAATGCTC CATAGTAGTC AAACTGCAGA CCTTAAGAGC CAGAGCACCC CACACCAAGA GAAAGTCATT ACAGAATCAA AGTCACCAAG CCTGGTGTCT CTGCCCACAG AGTCCACCAA AGCTGTAACA ACAAACTCTC CTTTGCCTCC ATCCTTAACA GAGTCCTCCA CAGAGCAAAC CCTTCCAGCC ACAAGCACCA ACTTAGCACA AATGTCTCCA ACTTTCACAA CTACCATTCT GAAGACCTCT CAGCCTCTTA TGACCACTCC TGGCACCCTG TCAAGCACAG CATCTCTGGT CACTGGCCCT ATAGCCGTAC AGACTACAGC TGGAAAACAG CTCTCGCTGA CCCATCCTGA AATACTAGTT CCTCAAATCT CAACAGAAGG TGGCATCAGC ACAGAAAGGA ACCGAGTGAT TGTGGATGCT ACCACTGGAT TGATCCCTTT GACCAGTGTA CCCACATCAG CAAAAGAAAT GACCACAAAG CTTGGCGTTA CAGCAGAGTA CAGCCCAGCT TCACGTTCCC TCGGAACATC TCCTTCTCCC CAAACCACAG TTGTTTCCAC GGCTGAAGAC TTGGCTCCCA AATCTGCCAC CTTTGCTGTT CAGAGCAGCA CACAGTCACC AACAACAGTG TCCTCTTCAG CCTCAGTCAA CAGCTGTGCT GTGAACCCTT GTCTTCACAA TGGCGAATGC 
GTCGCAGACA ACACCAGCCG TGGCTACCAC TGCAGGTGCC CGCCTTCCTG GCAAGGGGAT GATTGCAGTG TGGATGTGAA TGAGTGCCTG TCGAACCCCT GCCCATCCAC AGCCATGTGC AACAATACTC AGGGATCCTT TATCTGCAAA TGCCCGGTTG GGTACCAGTT GGAAAAAGGG ATATGCAATT TGGTTAGAAC CTTCGTGACA GAGTTTAAAT TAAAGAGAAC TTTTCTTAAT ACAACTGTGG AAAAACATTC AGACCTACAA GAAGTTGAAA ATGAGATCAC CAAAACGTTA AATATGTGTT TTTCAGCGTT ACCTAGTTAC ATCCGATCTA CAGTTCACGC CTCTAGGGAG TCCAACGCGG TGGTGATCTC ACTGCAAACA ACCTTTTCCC TGGCCTCCAA TGTGACGCTA TTTGACCTGG CTGATAGGAT GCAGAAATGT GTCAACTCCT GCAAGTCCTC TGCTGAGGTC TGCCAGCTCT TGGGATCTCA GAGGCGGATC TTTAGAGCGG GCAGCTTGTG CAAGCGGAAG AGTCCCGAAT GTGACAAAGA CACCTCCATC TGCACTGACC TGGACGGCGT TGCCCTGTGC CAGTGCAAGT CGGGATACTT TCAGTTCAAC AAGATGGACC ACTCCTGCCG AGCATGTGAA GATGGATATA GGCTTGAAAA TGAAACCTGC ATGAGTTGCC CATTTGGCCT TGGTGGTCTC AACTGTGGAA ACCCCTATCA GCTTATCACT GTGGTGATCG CAGCCGCGGG AGGTGGGCTC CTGCTCATCC TAGGCATCGC ACTGATTGTT ACCTGTTGCA GAAAGAATAA AAATGACATA AGCAAACTCA TCTTCAAAAG TGGAGATTTC CAAATGTCCC CGTATGCTGA ATACCCCAAA AATCCTCGCT CACAAGAATG GGGCCGAGAA GCTATTGAAA TGCATGAGAA TGGAAGTACC AAAAACCTCC TCCAGATGAC GGATGTGTAC TACTCGCCTA CAAGTGTAAG GAATCCAGAA CTTGAACGAA ACGGACTCTA CCCGGCCTAC ACTGGACTGC CAGGATCACG GCATTCTTGC ATTTTCCCCG GACAGTATAA CCCGTCTTTC ATCAGTGATG AAAGCAGAAG AAGAGACTAC TTTTAA
The stop codons will be deleted if the pcDNA3.1+ /C-(K)DYK vector is selected.
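The note above implies that, for a C-terminally tagged construct, the ORF must end without a stop codon so that translation reads through into the tag. The sketch below illustrates that single step on a toy sequence built from the first and last few codons of the insert. It is an assumption-laden illustration, not the vendor's cloning procedure: the function names `clean_sequence` and `strip_stop_codon` are hypothetical, and the code only handles the standard stop codons TAA, TAG and TGA.

```python
import re

STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Drop position numbers and whitespace, keeping only nucleotide letters."""
    return re.sub(r"[^ACGTNacgtn]", "", raw).upper()


def strip_stop_codon(orf: str) -> str:
    """Return the ORF without its terminal stop codon, if one is present."""
    if len(orf) % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    if orf[-3:] in STOP_CODONS:
        return orf[:-3]
    return orf


# Toy input: leading position number plus the first 9 nt and last 12 nt of the insert.
toy = "1 ATGGCCTCG GACTACTTT TAA"
tag_ready = strip_stop_codon(clean_sequence(toy))
print(tag_ready)  # ATGGCCTCGGACTACTTT
```

Applied to the full 4446 nt insert, the same step would leave 4443 nt ending in TTT, the codon immediately before the TAA stop.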
HEG1 is a novel mucin-like membrane protein that serves as a diagnostic and therapeutic target for malignant mesothelioma.
Scientific Reports 7:45768 (2017 Mar)
Tsuji S, Washimi K, Kageyama T, Yamashita M, Yoshihara M, Matsuura R, Yokose T, Kameda Y, Hayashi H, Morohoshi T, Tsuura Y, Yusa T, Sato T, Togayachi A, Narimatsu H, Nagasaki T, Nakamoto K, Moriwaki Y, Misawa H, Hiroshima K, Miyagi Y, Imai K