Conserved domains on  [gi|488218433|ref|WP_002289641|]

type II CRISPR RNA-guided endonuclease Cas9 [Streptococcus mutans]

Protein Classification


List of domain hits

Name Accession Description Interval E-value
cas_Csn1 TIGR01865
CRISPR subtype II/NMENI RNA-guided endonuclease Cas9/Csn1; CRISPR loci appear to be mobile ...
4-1042 0e+00

CRISPR subtype II/NMENI RNA-guided endonuclease Cas9/Csn1; CRISPR loci appear to be mobile elements with a wide host range. This model represents a protein found only in CRISPR-containing species, near other CRISPR-associated proteins (cas), as part of the NMENI subtype of CRISPR/Cas locus. The species range so far for this protein is animal pathogens and commensals only.



Pssm-ID: 273840  Cd Length: 805  Bit Score: 874.44  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433     4 PYSIGLDIGTNSVGWAVVTDDYKVPAKKMKVLGNtdkshiKKNLLGALLFDSGNTAE-DRRLKRTTRRRYTRRRNRILYL 82
Cdd:TIGR01865    1 EYILGLDIGIASVGWAIVEDDYKVPAAKRLIDGG------VRNFTGAELPKTGETAAlDRRLARGARRRIRRRKHRLLRL 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433    83 QEIFSEEMGKVDDSFFHRLEDSFLVTEDKRGerhpifgnleeevkyhenfpTIYHLRQYLADNPEKTDLrlVYLALAHII 162
Cdd:TIGR01865   75 QELFSREGSLTDFDFFSRLENSFLVEEDKRN--------------------TIYHLRKAALENKLKPDE--LYLALLHII 132
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   163 KFRGHFLIEG-KFDTRNndvqrlfqeflavydntfensslqeqnvqveeiltdkisksakkdrvlklfpneksngcfaef 241
Cdd:TIGR01865  133 KHRGHFLIEGnDFDTAN--------------------------------------------------------------- 149
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   242 lklivgnqadfkkhfeleekaplqfskdtyeeelevllaqigdnyaelflsakklydsillsgiltvtdVSTKAPLSASM 321
Cdd:TIGR01865  150 ---------------------------------------------------------------------KETGALLSAVM 160
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   322 IQRYNEHQMDLTQLKQFIRQKLSDKYNEVFSDvskdgyagyidgktnqeafykylkgllnkiegsgyfldkiereDFLRK 401
Cdd:TIGR01865  161 INRYLEHEADLRTLKELILKKFPKKYKEIFSE-------------------------------------------TFLRN 197
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   402 QRTFDNGSIPHQIHLQEMRAIIRRQAEFYPFladnqdriEKLLTFRIPYYVGPLASGKSDFAwlsrksadkitpwnfdeI 481
Cdd:TIGR01865  198 QRGFYNGSIPRQLLLEELEAIFRKQREYYPF--------IKLLTFRIPYYIGPLAEGKSEFA-----------------F 252
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   482 VDKESSAEAFINRMTNYDLYLPNQKVLPKHSLLYEKFTVYNELTKVKYK-TEQGKTAFFDANMKQEIFDGVFKVYRKVTK 560
Cdd:TIGR01865  253 VDKPASAENFIEKMTGKCTYLPEEKRAPKHSLLAEKFTVLNELNNVRIIiLEQGETKILSKEEKQELLDLLFKKKKLTYK 332
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   561 DKLMDFLEKEFDEFRIVDLTGLDKENKVFNASYGTYHDLCKIL-DKDFLDNSKNEKILEDIVLTLTLFEDREMIRKRLEN 639
Cdd:TIGR01865  333 KLRKLLGLSEDAIFKGLRYEGLDNAEKAFNISLKTYHKLRKALgDKDLLDNPKNPKDLDEIVKILTLYKDREMIKKRLEL 412
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   640 YSDLLTKEQVKKLERRHYTGWGRLSAELIHGIRNKESRKTILDYLIDDGNSNRNFMQLINDDALSFKEEIAKAqvigetd 719
Cdd:TIGR01865  413 YKDVLNEEQVKKLVRLHFTGWGRLSLKALRGIRPLMEQGKRYDEAILELGGNRNFMQNINDSQLLPKINITKA------- 485
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   720 nlnqvvSDIAGSPAIKKGILQSLKIVDELVKIMGhQPENIVVEMARENQFTNQGRRNSQQRLKGLTDSIKEFGS----QI 795
Cdd:TIGR01865  486 ------KDEILNPVVKRALLQARKVVNELVKKYG-PPDKIVIEMAREEQGTNFGKRNSKERYKKNEDKIKEFASalgkEI 558
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   796 LKEHPVENSQLQNDRLFLYYLQNGRDMYTGEELDID---YLSQYDIDHIIPQAFIKDNSIDNRVLTSSKENRGKSDDVPS 872
Cdd:TIGR01865  559 LKEEPTENSSKNILKLRLYYQQNGKCMYTGKEIDIDdlfDLSYYEIDHILPQSRSFDDSISNKVLVLASENQEKGDQTPY 638
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   873 K-DVVRKMKSYWSKLLSAKLITQRKFDNLTKAERGGLTDDDKAGFIKRQLVETRQITKHVARILDERFNTEtdennKKIR 951
Cdd:TIGR01865  639 EaEIVKKDSAFWNKFEAYVLISKRKSDKLTRAERGGLSDDDKAGFIDRNLNDTRYITRVVANYLKDRFNFH-----LKKR 713
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   952 QVKIVTLKSNLVSNFRKEFELYKVREINDYHHAHDAYLNAVIGKALLGVYPQLEPEFVYGDYPHFHGHKENK-ATAKKFF 1030
Cdd:TIGR01865  714 KVKVVTLKGQLTSQLRKKWGLYKKREINNYHHAHDAYINAVSTNALVKKFSQLEPEFRYKEYHNFDGRKKKKsATDKKVK 793
                         1050
                   ....*....|..
gi 488218433  1031 YSNIMNFFKKDD 1042
Cdd:TIGR01865  794 FSNPMEFFKQKV 805
Cas9_PI pfam16595
PAM-interacting domain of CRISPR-associated endonuclease Cas9; Cas9_PI is a family found at ...
1081-1335 2.78e-51

PAM-interacting domain of CRISPR-associated endonuclease Cas9; Cas9_PI is a family found at the C-terminus of the bacterial type II CRISPR system Cas9 endonuclease. This domain adopts a novel protein fold that is unique to the Cas9 family. In the structure of the Cas9-DNA complex, it is positioned to recognize the PAM sequence on the DNA strand that is not complementary to the crRNA. The PAM (protospacer-adjacent motif) is a short sequence on the target DNA. See family CRISPR-DR2, Rfam:RF01315. Cas9 carries two nuclease domains, HNH and RuvC, which cleave the DNA strands that are complementary and non-complementary to the 20-nucleotide guide sequence in crRNAs, respectively.



Pssm-ID: 435449  Cd Length: 264  Bit Score: 181.75  E-value: 2.78e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  1081 TGGFSKESILP--KGNSDKLIPRKTKKFYWDTKKYGGFDSPIVAYSILVIADIEKGKSKKLktvkaLVGVTIMEKMTFER 1158
Cdd:pfam16595    1 KGGLFNQTILPahKKKGKGLIPLKKDERGLDVEKYGGYSSLTAAYFSLVEYTGKKGKRKRT-----IEGVPLYLAAKIEE 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  1159 DPV--AFLERKGYrnVQEENII--KLPKYSLFKlENGRKRLLASARE---LQKGNEIVLPNHLGTLLYH---AKNIHKVD 1228
Cdd:pfam16595   76 NKDllEYLEEKLG--LKEPKIIlpKIKKNSLIK-IDGFRMLLTGKTEnrlLKNAVQLVLSNDDEKYIKKiekFVKKNKDD 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  1229 EPKHLDY--VDKHKDEFKELLDVVSNFSkKYTLAEGNLEKIKELYAQNNGEDLKELASSFINLLTFTAIGA-PATFKFFD 1305
Cdd:pfam16595  153 IIEEKDGltEEKNIKLYDELLDKMKNTI-YYKRPSNQGEKLEKLKEKFIKLSLEEKCKVLIEILKLTHANPtSADLKLIG 231
                          250       260       270
                   ....*....|....*....|....*....|...
gi 488218433  1306 KNIDRKRYTSTTEIL---NATLIHQSITGLYET 1335
Cdd:pfam16595  232 GSKHAGRIKISNNISkasNIKLINQSVTGLYEK 264
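Each hit in this listing carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value). The following is a minimal Python sketch for pulling those fields out of such a line. The function name `parse_summary` and the returned dictionary keys are illustrative choices, not part of any NCBI tool, and the pattern assumes the line layout exactly as it appears in this text dump.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The optional bracketed flag covers lines such as "[Multi-domain]".
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),  # "0e+00" parses to 0.0
    }


if __name__ == "__main__":
    line = "Pssm-ID: 435449  Cd Length: 264  Bit Score: 181.75  E-value: 2.78e-51"
    print(parse_summary(line))
```

An E-value printed as `0e+00` parses to `0.0`, which only means the value fell below the precision of the display; compare such hits by bit score instead.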
 
Csn1 cd09643
CRISPR/Cas system-associated protein Cas9; CRISPR (Clustered Regularly Interspaced Short ...
4-1041 0e+00

CRISPR/Cas system-associated protein Cas9; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA. Very large protein containing a McrA/HNH-nuclease related domain and a RuvC-like nuclease domain; signature gene for type II


Pssm-ID: 187774 [Multi-domain]  Cd Length: 799  Bit Score: 861.73  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433    4 PYSIGLDIGTNSVGWAVVTDDYKVPAKKMKvlgntdkSHIKKNLLGALLFDSGNTAE-DRRLKRTTRRRYTRRRNRILYL 82
Cdd:cd09643     1 EYILGLDIGIASVGWAIVEDDYKVPAKKMI-------DCGVKIFTGAELFKTGETAAlDRRLARGARRRIRRRKHRLLRL 73
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   83 QEIFSEEMGKVDDSFFHRLEDSFLvtedkrgerhpifgnleeevKYHENFPTIYHLRQYLADNPEKTDLrlVYLALAHII 162
Cdd:cd09643    74 QELFAREGSLTDFDFFSRLEDSFL--------------------EYHKNYPTIYHLRKAALENKLKPDE--LYLALLHII 131
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  163 KFRGHFLIEGKFDTRNndvqrlfqeflavydntfensslqeqnvqveeiltdkisksakkdrvlklfpneksngcfaefl 242
Cdd:cd09643   132 KHRGHFLIEGDEDTTA---------------------------------------------------------------- 147
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  243 klivgnqadfkkhfeleekaplqfskdtyeeelevllaqigdnyaelflsakklydsillsgiltvtDVSTKAPLSASMI 322
Cdd:cd09643   148 -------------------------------------------------------------------DKETGALLSASMI 160
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  323 QRYNEHQMDLTQLKQFIRQKLSDKYNEVFSDvskdgyagyidgktnqeafykylkgllnkiegsgyfldkierEDFLRKQ 402
Cdd:cd09643   161 KRYDEHKADLRKLKELIKKEFFKKYKEIFGD------------------------------------------ETFLRNQ 198
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  403 RTFDNGSIPHQIHLQEMRAIIRRQAEFYPFladnqdriEKLLTFRIPYYVGPLASGKSDFAWLSRKSADkitpwnfdeiv 482
Cdd:cd09643   199 RGFYNGSIPRQLLLEELEAIFRKQREYYPF--------EKILTFRIPYYIGPLAEGKSEFAWLTRPALS----------- 259
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  483 dkessaEAFINRMTNYDLYLPNQKVLPKHSLLYEKFTVYNELTKVKYKTEQGKTAFFDANMKQEIFDGVFKVYRKVTKDK 562
Cdd:cd09643   260 ------EAFIEKMTGKCTYLPEEKRAPKHSLLAEKFTVLNELNNLRIIEEQGETKILSKEEKQELLDLLFKKNKLTYKQK 333
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  563 LMDFLEKEFDEFRIVDLTGLdKENKVFNASYGTYHDLCKILDKDFL-DNSKNEKILEDIVLTLTLFEDREMIRKRLENYS 641
Cdd:cd09643   334 RKLLGLKEEEIFKGLRYEGL-KAEKNFNISLKTYHDLRKALGKEFLkDLELNEKILDEIVKILTLYKDREMIEKILELYK 412
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  642 DLLTKEQVKKLERRHYTGWGRLSAELIHGIRNKESRKTILDYLIDDGNSNRNfmQLINDDALSFKEEIAKAQvigetdnl 721
Cdd:cd09643   413 DLLNEEQLKKLLKRHFTGWGRLSLKALRGIRPLMEQGKRYDEAILELGGNHN--QKINSDELKFLPIIKKAQ-------- 482
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  722 nqvVSDIAGSPAIKKGILQSLKIVDELVKIMGhQPENIVVEMARENQfTNQGRRNSQQRLKGLTDSIKEFGS---QILKE 798
Cdd:cd09643   483 ---VKDEILNPVVKRALLQARKVVNELVKKYG-PPDKIVIEMARENG-TNKGTKNRKKRQKKNEDNIKEAASaleQKLKE 557
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  799 HPVENSQLQNDRLFLYYLQNGRDMYTGEELDID---YLSQYDIDHIIPQAFIKDNSIDNRVLTSSKENRGKSDDVPSKDV 875
Cdd:cd09643   558 LPLDIKSKNILKLRLYYQQNGKCMYTGKEIDIDdlfDLSYYEIDHILPQSRSFDDSISNKVLVLASENQEKGDQTPYEEI 637
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  876 VRKMKSYWSKLLSAKLITQR---KFDNLtKAERgGLTDDDKAGFIKRQLVETRQITKHVARILDERFNTEtdennKKIRQ 952
Cdd:cd09643   638 VSKMSAFWNKLEAAKLISQRgdsKKDRL-LLEK-GISDDEKAGFIDRNLNDTRYITRVVANYLKDRFNFH-----LKKRK 710
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  953 VKIVTLKSNLVSNFRKEFELYKVREINDYHHAHDAYLNAVIGKALLGVYPQLEpefVYGDYPHFHGHKENKATA---KKF 1029
Cdd:cd09643   711 VKVVTLKGQLTSQLRKKWGLYKKREINNYHHAHDAYINAVVTNALVKKFSQLE---RYKEYKRFDSEKGNKKTLdenKKF 787
                        1050
                  ....*....|..
gi 488218433 1030 FYSNIMNFFKKD 1041
Cdd:cd09643   788 FFANPMNFFKQE 799
Cas9_REC pfam16592
REC lobe of CRISPR-associated endonuclease Cas9; The REC lobe of Cas9 - the CRISPR-associated ...
181-711 0e+00

REC lobe of CRISPR-associated endonuclease Cas9; The REC lobe of Cas9 includes the REC1 and REC2 domains. REC1 forms an elongated, alpha-helical structure consisting of 25 alpha helices and two beta-sheets, whereas REC2, inserted within REC1, adopts a six-helix bundle structure. The REC lobe and the NUC lobe of Cas9 fold to present a positively charged groove at their interface, which accommodates the negatively charged sgRNA:target DNA heteroduplex. The CRISPR (clustered regularly interspaced short palindromic repeat)-Cas system occurs naturally in bacteria as a defence against invasion by phages or other mobile genetic elements. Cas9 is targeted to specific genomic locations by single guide RNAs (sgRNAs), where it complexes with the invading DNA, cleaves it, and renders it inactive.


Pssm-ID: 435447  Cd Length: 539  Bit Score: 732.33  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   181 VQRLFQEFLAVYDNTFENSSLQEQNVQVEEILT-DKISKSAKKDRVLKLFPNEK-SNGCFAEFLKLIVGNQADFKKHFEL 258
Cdd:pfam16592    1 VEESFQDLLNILYEQLENLELETQNVEIEKILKkTKISKKAKLDELLALPPNEKnSKKIFAEILKLILGNKADFTKIFEL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   259 E------EKAPLQFSKDTYEEELEVLLAQIGDNYAELFLSAKKLYDSILLSGILTVTDVSTKAPLSASMIQRYNEHQMDL 332
Cdd:pfam16592   81 EkfveepKKIKLSFSDSNYDEKIEELENQLGDEKAEIILILKKIYDWVVLSDILTVSTDNGKAYLSEAMVNRYDKHKEDL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   333 TQLKQFIRQKLSDKYNEVFSDVSKDGYAGYID----GKTNQEAFYKYLKGLLNKIEGS--GYFLDKIEREDFLRKQRTFD 406
Cdd:pfam16592  161 AQLKKVIKQNLSEKYNDMFRKEKKKGYSAYINgknnGKTSKEDFYKYIKKLINKVETSeaQYILSKIDNENFLPKQRTKS 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   407 NGSIPHQIHLQEMRAIIRRQAEFYPFLADNQDRIEKLLTFRIPYYVGPLASGKSDFAWLSRKSADKITPWNFDEIVDKES 486
Cdd:pfam16592  241 NGSIPYQVHLQELKKIIKNQAEYYPFLKENQEKILKLLTFRIPYYVGPLAEKKSKFAWMKRKEQGKIYPWNFEQKVDIDK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   487 SAEAFINRMTNYDLYLPNQKVLPKHSLLYEKFTVYNELTKVKYKTEqgktaFFDANMKQEIFDGVFKVYRKVTKDKLMDF 566
Cdd:pfam16592  321 TAEAFITRMTNYCTYLPDEKVLPKNSLLYSKFTVLNELNKIKINGE-----KISVELKQDIFNGLFKKNKKVTKKKLKDW 395
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   567 LEKEFDEFRIVDLTGLDKENkVFNASYGTYHDLCKILdKDFLDNSKNEKILEDIVLTLTLFEDREMIRKRLEN-YSDLLT 645
Cdd:pfam16592  396 LVKEGYNFKAVEIKGFDKEN-NFNNSLTTYIDLAKIF-GDFLDNPDNEDIIEDIIYWLTLFEDRKILKRRLQKkYSNLLT 473
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 488218433   646 KEQVKKLERRHYTGWGRLSAELIHGIRNKESR---KTILDYLIDDgnsNRNFMQLINDDALSFKEEIAK 711
Cdd:pfam16592  474 EKQIKQILKLKYKGWGRLSKELLNGIRGADRQgeiKTIIDLLWND---NRNLMQLINDERLSFKEEIEK 539
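The paired rows in the alignment blocks above can be summarized numerically. The sketch below counts aligned and identical columns for one query/model row pair. It assumes the display convention that `-` marks a gap and that lowercase letters mark residues left out of the alignment; the function name `alignment_stats` is hypothetical, and the two strings must be the residue portions of the rows only (without the `gi ...`/`Cdd:...` labels and coordinates).

```python
def alignment_stats(query_row, model_row):
    """Count (aligned, identical) columns for two equal-length alignment rows.

    Columns containing a gap ('-') or a lowercase (unaligned) residue in
    either row are skipped; this convention is an assumption about the display.
    """
    if len(query_row) != len(model_row):
        raise ValueError("rows must have the same number of columns")
    aligned = 0
    identical = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return aligned, identical


if __name__ == "__main__":
    # Residue strings copied from the final pfam16592 block (query 646-711).
    query = "KEQVKKLERRHYTGWGRLSAELIHGIRNKESR---KTILDYLIDDgnsNRNFMQLINDDALSFKEEIAK"
    model = "EKQIKQILKLKYKGWGRLSKELLNGIRGADRQgeiKTIIDLLWND---NRNLMQLINDERLSFKEEIEK"
    aligned, identical = alignment_stats(query, model)
    print(f"{identical}/{aligned} aligned columns identical")
```

Summing the two counts over every block of a hit gives an overall percent identity for that hit, restricted to the columns the display treats as aligned.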
Cas9 COG3513
CRISPR-Cas system type-II protein Cas9 [Defense mechanisms]; CRISPR-Cas system type-II protein ...
388-1112 8.20e-123

CRISPR-Cas system type-II protein Cas9 [Defense mechanisms]; CRISPR-Cas system type-II protein Cas9 is part of the Pathway/BioSystem: CRISPR-Cas system


Pssm-ID: 442735 [Multi-domain]  Cd Length: 812  Bit Score: 401.26  E-value: 8.20e-123
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  388 YFLDKIEREDFLRKQRTFDNGSIPHQIHLQEMRAIIRRQAEFYPFLADN--QDRIEKLLTFRIPYYVGplasgksdfawl 465
Cdd:COG3513   168 YLYRRLQENGKVRNRKGDYDFYIPREDLEDEFEAIWAAQAEFGPALLTEelRDELLEIIFFQRPLKSG------------ 235
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  466 srksadkitpwnfdeivdkessaeafiNRMTNYDLYLPNQKVLPKHSLLYEKFTVYNELTKVKYKTEQGKTAFFDANMKQ 545
Cdd:COG3513   236 ---------------------------KKLVGKCTFEPDEKRAPKASPLFQRFRILQKLNNLRIVDDGGEERPLTLEERQ 288
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  546 EIFDGVFKvYRKVTKDKLMDFLEKEfDEFRIVDLTGLDKENKVFnASYGTYHDLCKILDKDFLdNSKNEKILEDIVLTLT 625
Cdd:COG3513   289 KIIDLLEN-KKKLTFKKLRKLLGLP-DGVIFKGFNYEDDDRAKL-KGDKTYAKLAKIFGKAWL-NEFDPEILDDIVEALT 364
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  626 LFEDREMIRKRLENYSDLlTKEQVKKLERRH-YTGWGRLSAELIHGIrnkesrktiLDYLIDDgnsnrnfmqlinddaLS 704
Cdd:COG3513   365 LFKDDEELKEWLKKLYGL-DEEQAEALANLPlPDGYGNLSLKALRKI---------LPLLEEG---------------LD 419
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  705 FKEEIAKAQVIGETDNLN--------QVVSDIAGSPAIKKGILQSLKIVDELVKIMGHqPENIVVEMARENQFTNQGRRN 776
Cdd:COG3513   420 YDEAVKAAGYDHSSLEILdrlppigeEKRKGSIRNPVVHRALNQLRKVVNALIRKYGK-PDEIHIELARDLKKSKKERKE 498
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  777 SQQRLKGLTDSIKEFGSQILKEHPVENSQLQNDRLFLYYLQNGRDMYTGEELDIDYL--SQYDIDHIIPQAFIKDNSIDN 854
Cdd:COG3513   499 IQKRQRENEKAREKAREEIAEEGGGEPSRRDILKYRLWEEQNGRCPYTGKPISISDLldGSVEIDHILPRSRTLDDSFNN 578
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  855 RVLTSSKENRGKSDDVP----SKDVVRKMKSYWSKLLSAKLITQRKFDNLTKAERgglTDDDKAGFIKRQLVETRQITKH 930
Cdd:COG3513   579 KVLCLADANREKGNRTPyealGGDEAEKWEEILARVENLKLIPQKKKKRFLKKEL---DRDDDEGFIARQLNDTRYISRL 655
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  931 VARILDERFNTEtdennkKIRQVKIVTLKSNLVSNFRKEFELYKV-------REINDYHHAHDAYLNAVIGKALLGVYPQ 1003
Cdd:COG3513   656 AAEYLKSLYPFE------DKGKRKVRVVPGQLTAMLRRAWGLNKIlsddgekNRDDHRHHAIDALVIACTTQGLLQRLAK 729
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433 1004 LEPEfvygdyphfhghKENKATAKKFFYSNIMNFFkkDDVRtdkngeiiwkkdehiSNIKKVLsypqvnIVKKVEEQ-TG 1082
Cdd:COG3513   730 ASRE------------REDAEKAEEHFPPPWDGFR--QDVA---------------EAVDEIF------VSHAPRRKvTG 774
                         730       740       750
                  ....*....|....*....|....*....|
gi 488218433 1083 GFSKESILPKGNsDKLIPRKTkkfyWDTKK 1112
Cdd:COG3513   775 QLHKETIYSTGE-GKVVLRKP----LTSLK 799
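The hit intervals listed on this page overlap heavily, so it can help to merge them and see which stretch of the query is covered by at least one domain model. The following sketch uses the five intervals from the hit list; `merge_intervals` and the `hits` dictionary are illustrative names, and the code makes no claim about how CD-Search itself resolves overlapping hits.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous interval: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Query intervals copied from the hit list on this page.
hits = {
    "TIGR01865": (4, 1042),
    "pfam16595": (1081, 1335),
    "cd09643": (4, 1041),
    "pfam16592": (181, 711),
    "COG3513": (388, 1112),
}

merged = merge_intervals(hits.values())
covered = sum(end - start + 1 for start, end in merged)

if __name__ == "__main__":
    print(merged, covered)
```

With these five hits the intervals collapse into a single span from residue 4 to residue 1335, because COG3513 (388-1112) bridges the gap between the cas_Csn1 hit and the PAM-interacting domain.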
Cas9_PI pfam16595
PAM-interacting domain of CRISPR-associated endonuclease Cas9; Cas9_PI is a family found at ...
1081-1335 2.78e-51

PAM-interacting domain of CRISPR-associated endonuclease Cas9; Cas9_PI is a family found at the C-terminal of bacterial type II CRISPR system Cas9 endonuclease. This domain adopts a novel protein fold that is unique to the Cas9 family. It is positioned in the structure-DNA-complex to recognize the PAM sequence on the non-complementary DNA strand of the crRNA. PAM sequence is protospacer-adjacent motifs on DNA. See family CRISPR-DR2, Rfam:RF01315. Cas9 carries two nuclease domains, HNH and RuvC, which cleave the DNA strands that are complementary and non-complementary to the 20 nucleotide guide sequence in crRNAs, respectively.


Pssm-ID: 435449  Cd Length: 264  Bit Score: 181.75  E-value: 2.78e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  1081 TGGFSKESILP--KGNSDKLIPRKTKKFYWDTKKYGGFDSPIVAYSILVIADIEKGKSKKLktvkaLVGVTIMEKMTFER 1158
Cdd:pfam16595    1 KGGLFNQTILPahKKKGKGLIPLKKDERGLDVEKYGGYSSLTAAYFSLVEYTGKKGKRKRT-----IEGVPLYLAAKIEE 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  1159 DPV--AFLERKGYrnVQEENII--KLPKYSLFKlENGRKRLLASARE---LQKGNEIVLPNHLGTLLYH---AKNIHKVD 1228
Cdd:pfam16595   76 NKDllEYLEEKLG--LKEPKIIlpKIKKNSLIK-IDGFRMLLTGKTEnrlLKNAVQLVLSNDDEKYIKKiekFVKKNKDD 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  1229 EPKHLDY--VDKHKDEFKELLDVVSNFSkKYTLAEGNLEKIKELYAQNNGEDLKELASSFINLLTFTAIGA-PATFKFFD 1305
Cdd:pfam16595  153 IIEEKDGltEEKNIKLYDELLDKMKNTI-YYKRPSNQGEKLEKLKEKFIKLSLEEKCKVLIEILKLTHANPtSADLKLIG 231
                          250       260       270
                   ....*....|....*....|....*....|...
gi 488218433  1306 KNIDRKRYTSTTEIL---NATLIHQSITGLYET 1335
Cdd:pfam16595  232 GSKHAGRIKISNNISkasNIKLINQSVTGLYEK 264
 
Csn1 cd09643
4-1041 0e+00

CRISPR/Cas system-associated protein Cas9; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Cas9 is a very large protein containing a McrA/HNH-nuclease-related domain and a RuvC-like nuclease domain; signature gene for type II systems


Pssm-ID: 187774 [Multi-domain]  Cd Length: 799  Bit Score: 861.73  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433    4 PYSIGLDIGTNSVGWAVVTDDYKVPAKKMKvlgntdkSHIKKNLLGALLFDSGNTAE-DRRLKRTTRRRYTRRRNRILYL 82
Cdd:cd09643     1 EYILGLDIGIASVGWAIVEDDYKVPAKKMI-------DCGVKIFTGAELFKTGETAAlDRRLARGARRRIRRRKHRLLRL 73
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   83 QEIFSEEMGKVDDSFFHRLEDSFLvtedkrgerhpifgnleeevKYHENFPTIYHLRQYLADNPEKTDLrlVYLALAHII 162
Cdd:cd09643    74 QELFAREGSLTDFDFFSRLEDSFL--------------------EYHKNYPTIYHLRKAALENKLKPDE--LYLALLHII 131
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  163 KFRGHFLIEGKFDTRNndvqrlfqeflavydntfensslqeqnvqveeiltdkisksakkdrvlklfpneksngcfaefl 242
Cdd:cd09643   132 KHRGHFLIEGDEDTTA---------------------------------------------------------------- 147
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  243 klivgnqadfkkhfeleekaplqfskdtyeeelevllaqigdnyaelflsakklydsillsgiltvtDVSTKAPLSASMI 322
Cdd:cd09643   148 -------------------------------------------------------------------DKETGALLSASMI 160
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  323 QRYNEHQMDLTQLKQFIRQKLSDKYNEVFSDvskdgyagyidgktnqeafykylkgllnkiegsgyfldkierEDFLRKQ 402
Cdd:cd09643   161 KRYDEHKADLRKLKELIKKEFFKKYKEIFGD------------------------------------------ETFLRNQ 198
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  403 RTFDNGSIPHQIHLQEMRAIIRRQAEFYPFladnqdriEKLLTFRIPYYVGPLASGKSDFAWLSRKSADkitpwnfdeiv 482
Cdd:cd09643   199 RGFYNGSIPRQLLLEELEAIFRKQREYYPF--------EKILTFRIPYYIGPLAEGKSEFAWLTRPALS----------- 259
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  483 dkessaEAFINRMTNYDLYLPNQKVLPKHSLLYEKFTVYNELTKVKYKTEQGKTAFFDANMKQEIFDGVFKVYRKVTKDK 562
Cdd:cd09643   260 ------EAFIEKMTGKCTYLPEEKRAPKHSLLAEKFTVLNELNNLRIIEEQGETKILSKEEKQELLDLLFKKNKLTYKQK 333
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  563 LMDFLEKEFDEFRIVDLTGLdKENKVFNASYGTYHDLCKILDKDFL-DNSKNEKILEDIVLTLTLFEDREMIRKRLENYS 641
Cdd:cd09643   334 RKLLGLKEEEIFKGLRYEGL-KAEKNFNISLKTYHDLRKALGKEFLkDLELNEKILDEIVKILTLYKDREMIEKILELYK 412
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  642 DLLTKEQVKKLERRHYTGWGRLSAELIHGIRNKESRKTILDYLIDDGNSNRNfmQLINDDALSFKEEIAKAQvigetdnl 721
Cdd:cd09643   413 DLLNEEQLKKLLKRHFTGWGRLSLKALRGIRPLMEQGKRYDEAILELGGNHN--QKINSDELKFLPIIKKAQ-------- 482
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  722 nqvVSDIAGSPAIKKGILQSLKIVDELVKIMGhQPENIVVEMARENQfTNQGRRNSQQRLKGLTDSIKEFGS---QILKE 798
Cdd:cd09643   483 ---VKDEILNPVVKRALLQARKVVNELVKKYG-PPDKIVIEMARENG-TNKGTKNRKKRQKKNEDNIKEAASaleQKLKE 557
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  799 HPVENSQLQNDRLFLYYLQNGRDMYTGEELDID---YLSQYDIDHIIPQAFIKDNSIDNRVLTSSKENRGKSDDVPSKDV 875
Cdd:cd09643   558 LPLDIKSKNILKLRLYYQQNGKCMYTGKEIDIDdlfDLSYYEIDHILPQSRSFDDSISNKVLVLASENQEKGDQTPYEEI 637
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  876 VRKMKSYWSKLLSAKLITQR---KFDNLtKAERgGLTDDDKAGFIKRQLVETRQITKHVARILDERFNTEtdennKKIRQ 952
Cdd:cd09643   638 VSKMSAFWNKLEAAKLISQRgdsKKDRL-LLEK-GISDDEKAGFIDRNLNDTRYITRVVANYLKDRFNFH-----LKKRK 710
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  953 VKIVTLKSNLVSNFRKEFELYKVREINDYHHAHDAYLNAVIGKALLGVYPQLEpefVYGDYPHFHGHKENKATA---KKF 1029
Cdd:cd09643   711 VKVVTLKGQLTSQLRKKWGLYKKREINNYHHAHDAYINAVVTNALVKKFSQLE---RYKEYKRFDSEKGNKKTLdenKKF 787
                        1050
                  ....*....|..
gi 488218433 1030 FYSNIMNFFKKD 1041
Cdd:cd09643   788 FFANPMNFFKQE 799
Cas9_REC pfam16592
181-711 0e+00

REC lobe of CRISPR-associated endonuclease Cas9; The REC lobe of the CRISPR-associated endonuclease Cas9 includes the REC1 and REC2 domains. REC1 forms an elongated, alpha-helical structure consisting of 25 alpha helices and two beta-sheets, whereas REC2, inserted within REC1, adopts a six-helix bundle structure. The REC lobe and the NUC lobe of Cas9 fold to present a positively charged groove at their interface, which accommodates the negatively charged sgRNA:target DNA heteroduplex. The CRISPR (clustered regularly interspaced short palindromic repeat)-Cas system occurs naturally in bacteria as a defence against invasion by phages or other mobile genetic elements. Cas9 is targeted to specific genomic locations by single guide RNAs (sgRNAs), so that it forms a complex with the invading DNA, cleaves it, and renders it inactive.


Pssm-ID: 435447  Cd Length: 539  Bit Score: 732.33  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   181 VQRLFQEFLAVYDNTFENSSLQEQNVQVEEILT-DKISKSAKKDRVLKLFPNEK-SNGCFAEFLKLIVGNQADFKKHFEL 258
Cdd:pfam16592    1 VEESFQDLLNILYEQLENLELETQNVEIEKILKkTKISKKAKLDELLALPPNEKnSKKIFAEILKLILGNKADFTKIFEL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   259 E------EKAPLQFSKDTYEEELEVLLAQIGDNYAELFLSAKKLYDSILLSGILTVTDVSTKAPLSASMIQRYNEHQMDL 332
Cdd:pfam16592   81 EkfveepKKIKLSFSDSNYDEKIEELENQLGDEKAEIILILKKIYDWVVLSDILTVSTDNGKAYLSEAMVNRYDKHKEDL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   333 TQLKQFIRQKLSDKYNEVFSDVSKDGYAGYID----GKTNQEAFYKYLKGLLNKIEGS--GYFLDKIEREDFLRKQRTFD 406
Cdd:pfam16592  161 AQLKKVIKQNLSEKYNDMFRKEKKKGYSAYINgknnGKTSKEDFYKYIKKLINKVETSeaQYILSKIDNENFLPKQRTKS 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   407 NGSIPHQIHLQEMRAIIRRQAEFYPFLADNQDRIEKLLTFRIPYYVGPLASGKSDFAWLSRKSADKITPWNFDEIVDKES 486
Cdd:pfam16592  241 NGSIPYQVHLQELKKIIKNQAEYYPFLKENQEKILKLLTFRIPYYVGPLAEKKSKFAWMKRKEQGKIYPWNFEQKVDIDK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   487 SAEAFINRMTNYDLYLPNQKVLPKHSLLYEKFTVYNELTKVKYKTEqgktaFFDANMKQEIFDGVFKVYRKVTKDKLMDF 566
Cdd:pfam16592  321 TAEAFITRMTNYCTYLPDEKVLPKNSLLYSKFTVLNELNKIKINGE-----KISVELKQDIFNGLFKKNKKVTKKKLKDW 395
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433   567 LEKEFDEFRIVDLTGLDKENkVFNASYGTYHDLCKILdKDFLDNSKNEKILEDIVLTLTLFEDREMIRKRLEN-YSDLLT 645
Cdd:pfam16592  396 LVKEGYNFKAVEIKGFDKEN-NFNNSLTTYIDLAKIF-GDFLDNPDNEDIIEDIIYWLTLFEDRKILKRRLQKkYSNLLT 473
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 488218433   646 KEQVKKLERRHYTGWGRLSAELIHGIRNKESR---KTILDYLIDDgnsNRNFMQLINDDALSFKEEIAK 711
Cdd:pfam16592  474 EKQIKQILKLKYKGWGRLSKELLNGIRGADRQgeiKTIIDLLWND---NRNLMQLINDERLSFKEEIEK 539
Cas9 COG3513
388-1112 8.20e-123

CRISPR-Cas system type-II protein Cas9 [Defense mechanisms]; CRISPR-Cas system type-II protein Cas9 is part of the Pathway/BioSystem: CRISPR-Cas system


Pssm-ID: 442735 [Multi-domain]  Cd Length: 812  Bit Score: 401.26  E-value: 8.20e-123
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  388 YFLDKIEREDFLRKQRTFDNGSIPHQIHLQEMRAIIRRQAEFYPFLADN--QDRIEKLLTFRIPYYVGplasgksdfawl 465
Cdd:COG3513   168 YLYRRLQENGKVRNRKGDYDFYIPREDLEDEFEAIWAAQAEFGPALLTEelRDELLEIIFFQRPLKSG------------ 235
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  466 srksadkitpwnfdeivdkessaeafiNRMTNYDLYLPNQKVLPKHSLLYEKFTVYNELTKVKYKTEQGKTAFFDANMKQ 545
Cdd:COG3513   236 ---------------------------KKLVGKCTFEPDEKRAPKASPLFQRFRILQKLNNLRIVDDGGEERPLTLEERQ 288
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  546 EIFDGVFKvYRKVTKDKLMDFLEKEfDEFRIVDLTGLDKENKVFnASYGTYHDLCKILDKDFLdNSKNEKILEDIVLTLT 625
Cdd:COG3513   289 KIIDLLEN-KKKLTFKKLRKLLGLP-DGVIFKGFNYEDDDRAKL-KGDKTYAKLAKIFGKAWL-NEFDPEILDDIVEALT 364
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  626 LFEDREMIRKRLENYSDLlTKEQVKKLERRH-YTGWGRLSAELIHGIrnkesrktiLDYLIDDgnsnrnfmqlinddaLS 704
Cdd:COG3513   365 LFKDDEELKEWLKKLYGL-DEEQAEALANLPlPDGYGNLSLKALRKI---------LPLLEEG---------------LD 419
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  705 FKEEIAKAQVIGETDNLN--------QVVSDIAGSPAIKKGILQSLKIVDELVKIMGHqPENIVVEMARENQFTNQGRRN 776
Cdd:COG3513   420 YDEAVKAAGYDHSSLEILdrlppigeEKRKGSIRNPVVHRALNQLRKVVNALIRKYGK-PDEIHIELARDLKKSKKERKE 498
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  777 SQQRLKGLTDSIKEFGSQILKEHPVENSQLQNDRLFLYYLQNGRDMYTGEELDIDYL--SQYDIDHIIPQAFIKDNSIDN 854
Cdd:COG3513   499 IQKRQRENEKAREKAREEIAEEGGGEPSRRDILKYRLWEEQNGRCPYTGKPISISDLldGSVEIDHILPRSRTLDDSFNN 578
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  855 RVLTSSKENRGKSDDVP----SKDVVRKMKSYWSKLLSAKLITQRKFDNLTKAERgglTDDDKAGFIKRQLVETRQITKH 930
Cdd:COG3513   579 KVLCLADANREKGNRTPyealGGDEAEKWEEILARVENLKLIPQKKKKRFLKKEL---DRDDDEGFIARQLNDTRYISRL 655
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  931 VARILDERFNTEtdennkKIRQVKIVTLKSNLVSNFRKEFELYKV-------REINDYHHAHDAYLNAVIGKALLGVYPQ 1003
Cdd:COG3513   656 AAEYLKSLYPFE------DKGKRKVRVVPGQLTAMLRRAWGLNKIlsddgekNRDDHRHHAIDALVIACTTQGLLQRLAK 729
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433 1004 LEPEfvygdyphfhghKENKATAKKFFYSNIMNFFkkDDVRtdkngeiiwkkdehiSNIKKVLsypqvnIVKKVEEQ-TG 1082
Cdd:COG3513   730 ASRE------------REDAEKAEEHFPPPWDGFR--QDVA---------------EAVDEIF------VSHAPRRKvTG 774
                         730       740       750
                  ....*....|....*....|....*....|
gi 488218433 1083 GFSKESILPKGNsDKLIPRKTkkfyWDTKK 1112
Cdd:COG3513   775 QLHKETIYSTGE-GKVVLRKP----LTSLK 799
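Each hit's summary line packs four fields into one string, and the first and last "Cdd:" rows of its alignment give the span of the domain model that was matched. The following is a minimal Python sketch, assuming the line layout shown on this page, that pulls those fields out and estimates how much of the model the alignment spans. The helper names `parse_summary` and `model_coverage` are hypothetical and not part of any NCBI tool; the example values (Cd Length 812, model positions 168 to 799) are read from the COG3513 hit above.

```python
import re

# Pattern for the per-hit summary line as it appears on this page.
# Some hits carry an optional "[Multi-domain]" tag after the Pssm-ID.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line: str) -> dict:
    """Parse one summary line into typed fields (hypothetical helper)."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["tag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        # float() accepts both "0e+00" and "8.20e-123"
        "e_value": float(match["e_value"]),
    }


def model_coverage(first: int, last: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the alignment (gaps included)."""
    return (last - first + 1) / cd_length


line = "Pssm-ID: 442735 [Multi-domain]  Cd Length: 812  Bit Score: 401.26  E-value: 8.20e-123"
hit = parse_summary(line)
coverage = model_coverage(168, 799, hit["cd_length"])
print(hit)
print(f"model span covered: {coverage:.1%}")
```

The span counts every model position between the first and last aligned residue, so it is an upper bound on the aligned fraction when the alignment contains gaps.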
HNH_4 pfam13395
821-871 2.82e-17

HNH endonuclease; This HNH nuclease domain is found in CRISPR-related proteins.


Pssm-ID: 433172 [Multi-domain]  Cd Length: 55  Bit Score: 76.90  E-value: 2.82e-17
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 488218433   821 DMYTGEELDIDYLS---QYDIDHIIPQAFIKDNSIDNRVLTSSKENRGKSDDVP 871
Cdd:pfam13395    1 CPYTGEQISIDDLFsekNYDIDHILPYSRSFDDSFSNKVLVLRSANQEKGNRTP 54
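The paired rows of an alignment block can be compared column by column. The sketch below is one way to count identical residues, assuming the usual CD-Search display convention that dashes mark gaps and lowercase letters mark unaligned residues, so that only columns with an uppercase letter in both rows count as aligned. The function name `aligned_identity` is hypothetical; the two strings are copied from the pfam13395 block above.

```python
def aligned_identity(query_row: str, model_row: str) -> tuple:
    """Return (identical, aligned) column counts for one alignment row pair.

    Assumption: a column is aligned only if both rows show an uppercase
    residue there; dashes and lowercase letters are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


query = "DMYTGEELDIDYLS---QYDIDHIIPQAFIKDNSIDNRVLTSSKENRGKSDDVP"
model = "CPYTGEQISIDDLFsekNYDIDHILPYSRSFDDSFSNKVLVLRSANQEKGNRTP"

identical, aligned = aligned_identity(query, model)
print(f"{identical} identical of {aligned} aligned columns "
      f"({identical / aligned:.1%})")
```

For multi-row alignments, the same function can be applied to each row pair and the counts summed.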
COG3472 COG3472
813-945 1.61e-05

Uncharacterized conserved protein domain, often C-terminal to DUF262 [Function unknown]


Pssm-ID: 442695 [Multi-domain]  Cd Length: 566  Bit Score: 49.23  E-value: 1.61e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 488218433  813 LYYLQNGRDMYTGEELDIDYLSQY--DIDHIIPQAFIKD--------NSIDNRVLTSSKENRGKSDDVPSkdvvrkmkSY 882
Cdd:COG3472   435 LLAKLGARDFLSGQKIDLSNLFDNklEIHHIFPKAYLKKqgisrslyNSIANRTPLSARTNRKIGDKAPS--------EY 506
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 488218433  883 WSKLLSAKLITQRKfDNLTK----AERGGLTDDDKAGFIkrqlvETRQitKHVARILDERFNTETDE 945
Cdd:COG3472   507 LAELEEKAGEEELD-EILAShlipEDLELLRADDYEDFL-----EARR--ELLAEAIERAMGKLIDD 565
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
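The E-value threshold of 0.01 determines which hits appear in the list. As a rough illustration, the Python sketch below re-applies that cutoff to the accessions, intervals and E-values listed on this page. It assumes a hit is kept when its E-value is at or below the threshold; the tuple layout and the name `passes_threshold` are hypothetical.

```python
E_VALUE_THRESHOLD = 0.01  # "E-value threshold: 0.01" from the search parameters

# (accession, interval on the query, E-value string) as listed on this page
hits = [
    ("TIGR01865", "4-1042", "0e+00"),
    ("cd09643", "4-1041", "0e+00"),
    ("pfam16592", "181-711", "0e+00"),
    ("COG3513", "388-1112", "8.20e-123"),
    ("pfam16595", "1081-1335", "2.78e-51"),
    ("pfam13395", "821-871", "2.82e-17"),
    ("COG3472", "813-945", "1.61e-05"),
]


def passes_threshold(e_value_text: str, threshold: float = E_VALUE_THRESHOLD) -> bool:
    """Assumed rule: keep a hit whose E-value is at or below the threshold."""
    return float(e_value_text) <= threshold


kept = [hit for hit in hits if passes_threshold(hit[2])]
# The hit with the largest E-value is the least significant one that was kept.
weakest = max(kept, key=lambda hit: float(hit[2]))

for accession, interval, e_value in kept:
    start, end = (int(part) for part in interval.split("-"))
    print(f"{accession:<10} {interval:>10}  length {end - start + 1:>4}  E = {e_value}")
print("least significant hit kept:", weakest[0])
```

Raising or lowering `E_VALUE_THRESHOLD` in the sketch shows which hits would drop out under a stricter cutoff.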
