Select the input type:
A list of sequences in FASTA format, ordered by a parameter of interest (for example: expression level).
DRIMust will search for motifs enriched at the top of the list compared to the rest of the list.
The number of sequences must not exceed 40,000 and the total number of characters is limited to 4,000,000.
The dataset may contain 'AGCTUN' characters for DNA and RNA sequences and 'ACDEFGHIKLMNPQRSTVWYX' for protein sequences.
Sequences that contain more than 5% 'N' or 'X' will be omitted from the analysis.
Upload an ordered list in FASTA format:
Two lists of sequences in FASTA format - a target list and a background list (the order within each list is not important).
DRIMust will search for motifs enriched in the target list compared to the background list.
The total number of sequences (in both lists) must not exceed 40,000 and the total number of characters is limited to 4,000,000.
The dataset may contain 'AGCTUN' characters for DNA and RNA sequences and 'ACDEFGHIKLMNPQRSTVWYX' for protein sequences.
Sequences that contain more than 5% 'N' or 'X' will be omitted from the analysis.
Upload two lists in FASTA format.
Target list:
Background list:
In single-strand search mode, DRIMust treats the input sequences as the plus strand and searches for over-represented motifs at the top of the list.
This mode is suitable for DNA, RNA and protein sequences.
In double-strand search mode, DRIMust takes into account both the given input sequences (as the plus strand) and their reverse-complement sequences (as the minus strand)
and searches for motifs that are enriched in both strands.
Note: the double-strand mode is suitable for DNA or RNA sequences - the dataset may contain 'AGCTUN' characters only.
Sample data is automatically loaded according to the selected input type and search mode (no need to upload input file).
Other parameters can be set as well.
Detailed information about the examples used as sample data can be found in the manual page, under "Examples and sample data".