Input Format:
The input to this tool is a FASTA file containing one or more RNA sequences. Each sequence should follow the standard FASTA format:
FASTA Format Example:
>seq_1
GGGAGGGCGGGTGGG
>seq_2
GGGACTGGGCCCGGGTAAGGG
- Each sequence entry starts with a header line, beginning with the > symbol, immediately followed by a unique sequence identifier (SequenceID).
- The sequence is placed on the following line. Only RNA/DNA bases (A, T or U, G, C) should be included in the sequence.
Output Format:
The output of this tool will be a TSV (tab-separated values) file, where each row corresponds to one sequence from the input FASTA file. The output file will contain the following columns:
- seq_id
- The unique identifier of the RNA sequence, as provided in the input FASTA file.
- sequence
- The RNA sequence itself as extracted from the FASTA file.
- G4mer
- Column of G4mer score calculated for the sequence, which assess the likelihood of the sequence forming rG-quadruplex structures (rG4s).
Output example
seq_id sequence G4mer
seq_1 GGGAGGGCGGGTGGG 0.997177004814148
seq_2 GGGACTGGGCCCGGGTAAGGG 0.9845851063728333
How to Use the Tool:
- Upload/paste your FASTA file containing the RNA sequences of interest
- Submit the file for analysis using the provided interface
- View or download the output as a TSV file, which will include the G4mer score for each sequence