For scripts
- Place all scripts in the main folder.
- Ensure all scripts have parameters.
- Ensure all scripts can process multiple data sets.
For file and folder names
- If the table has all contigs, the name will start with
All
. If the table only has part of the information, like contigs with defense records, the name will start with all
.
- The second word in the name should reflect the table's subject. If the main ID in the file is contig, it should be Contig.
- Use “_” to connect all file names.
- Rename all file contents from - to _.
- During the merging process, remove duplicate titles.
- Remove outliers from the samples. Similarly, but mark why it is an outlier.
- Sequence
- Table: file
- Statistic_file: the file can be used to draw pictures.
- Use csv format to store data.
For file and folder save location
- Place the original tables on Google Drive.
- Place the results in the results folder.
For Bioinformatics data preparation
For FASTA file