王旭蕾,高进,齐鑫,王永波,陈傅晓,刘金叶,符书源.5种石斑鱼全基因组微卫星筛选与特征分析.渔业科学进展,2024,45(3):149-158 |
5种石斑鱼全基因组微卫星筛选与特征分析 |
Screening and characteristics analysis of microsatellites in the whole genome of five groupers |
投稿时间:2023-01-09 修订日期:2023-03-12 |
DOI:10.19663/j.issn2095-9869.20230109002 |
中文关键词: 石斑鱼 全基因组 微卫星 分布特征 |
英文关键词: Grouper (Epinephelus) Whole genome Microsatellite Distribution characteristics |
基金项目: |
|
摘要点击次数: 677 |
全文下载次数: 1337 |
中文摘要: |
为了解赤点石斑鱼(Epinephelus akaara)、斜带石斑鱼(E. coioides)、云纹石斑鱼(E. moara)、棕点石斑鱼(E. fuscoguttatus)和鞍带石斑鱼(E. lanceolatus)全基因组中微卫星的分布特征,本研究使用Micro-Satellite (MISA)软件对公共数据库中获取的5种石斑鱼全基因组序列进行微卫星筛选,分析了微卫星重复类型、重复拷贝类别及核心拷贝数的分布特征。结果显示,在5种石斑鱼全基因组中,均筛选出超过28万个微卫星位点,相对丰度介于271~296个/Mb之间,平均长度在22 bp左右,微卫星数量在全基因组中的占比为0.59%~0.67%,其重复类型数量、占比和相对丰度的分布趋势一致,二碱基重复最多,其次为单碱基,且随着重复单元碱基数目的增加而减少。重复拷贝类别A、AC、AAT、AAG、AGC、AATC、AAAT、AGAT、AATG、AGAGG、AAAAT、AAGAT、ACAGAG、AAANNN和AANNNN (N为除A外其他3种碱基)为优势类别。不同重复类型微卫星的拷贝数变化范围较大,但每种重复类型的拷贝数变化趋势一致,即随着拷贝数增加,微卫星数目随之递减,拷贝数为6和12时微卫星数目出现峰值。其中,各个重复类型中均有拷贝数量尤为突出的位点:鞍带石斑鱼中T、TA和AGACAG分别拷贝502、803和48次,云纹石斑鱼中GAG、CACT和CCACA分别拷贝205、652和111次。5种石斑鱼全基因组微卫星分布特征基本一致,鞍带石斑鱼和云纹石斑鱼中存在与其他3种石斑鱼显著差异的重复拷贝位点。本研究可为5种石斑鱼高质量微卫星分子标记开发提供数据参考,并为其基因组进化、遗传变异、亲缘关系和新品种选育等方面的工作奠定基础。 |
英文摘要: |
Grouper, a species of coral reef fish, exhibits a wide geographical distribution within the warm waters of the tropical and subtropical regions across the globe, primarily inhabiting the middle and lower layers of water. Characterized as a substantial marine economic fish, grouper possesses considerable nutritional value, boasts a high market worth, and garners significant consumer demand. Its popularity among consumers is attributed to its inherent attributes, and it holds immense potential for further cultivation and breeding endeavors. This study utilized micro-satellite (MISA) software to investigate the distribution characteristics of microsatellites in the genomes of five grouper species (Epinephelus akaara, E. coioides, E. fuscoguttatus, E. lanceolatus, and E. moara). A custom script was developed to analyze the screening results, and statistical analyses were conducted on the microsatellite repeat types, duplicate copy types, and core copy numbers in the genomes of the five grouper species. Over 280 000 microsatellite sites were identified from the entire genomes of the five grouper species. The relative abundance of microsatellites ranged from 271–296, with a total length ranging from 6.30–7.06 Mb. The average length of the microsatellites was approximately 22 bp, and their proportion in the genomes ranged from 0.59%–0.67%. These results provide insights into the distribution characteristics of microsatellites in the genomes of these five grouper species and can inform future studies on their genomic architecture and evolution. The repetitive types of microsatellites were analyzed in terms of number, proportion, and relative abundance. The number, proportion, and relative abundance of repetitive types followed a consistent pattern, with the highest number of double base repeats, followed by single base repeats. This pattern decreased as the number of repeat units increased. A, AC, AAT, AAG, AGC, AATC, AAAT, AGAT, AATG, AGAGG, AAAAT, AAGAT, ACAGAG, AAANNN, and AANNNN (N represents any of the three bases except A) were the most dominant types of each duplicate copy type. Type A accounted for 90.00% of single base repeats, while type AC was the most dominant in double base repeats, accounting for nearly 80.00%. Interestingly, the content of the CG duplication category was the least, accounting for only 0.04%–0.10% in the five grouper species. This may be owing to the fact that the composition content of the four bases in the different species' genomes is different, and there may be structural problems with different bases. The results of this study provide insight into the distribution characteristics of microsatellites in the genomes of these five grouper species. The high frequency distribution of AGG and AGC in the dominant types of triple base repeats may play a crucial role in regulating genes involved in immunity, disease, and other genes in groupers. Previous studies show that AGG is a well-known binding site for numerous transcription factors involved in early growth and development of various species. Additionally, the change of base repeat polymorphism of the AGC category is directly linked to genetic diseases and holds significant evolutionary and medical research value. AAAN, AAAAN, and AAAAAN are dominant repeat types that are widely distributed in mammals among the four, five, and six base repeat types, respectively. Different types of microsatellites show significant variability in the number of core copy numbers. Nevertheless, the number of duplicate copies of each type of microsatellite exhibit a consistent trend in the five groupers, and the number of microsatellites decrease with an increase in the number of duplicate copies. The analysis of microsatellite distribution revealed several key findings. First, over 95% of single base repeat copies were concentrated in a range of 12 to 25 times. The main number of copies for two base repeats ranged from 6 to 32 times, with a small peak between 11 and 14 copies, and decreasing numbers with increasing copies. The number of copies for four and five base repeats was mainly concentrated in the ranges of 5–16 and 5–17, and 5–14, respectively. Notably, AGAT, AAAG, AAGAG, AATAT, and AGAGG repeats exhibited a large number of copies, even when the number of copies was high. The increase in copy number may represent changes in polymorphism at these loci that may lead to disease or changes in corresponding functions. Overall, these findings provide important insights into the distribution and potential functional significance of microsatellites in the genome of the studied species. The distribution characteristics of microsatellites in the genomes of the five groupers provide a valuable basis to understand the evolutionary mechanisms and functional expression of these species. The distribution of duplicate copy numbers of each type of duplication displays two peaks at 6 and 12 repetitions, with the number of microsatellites decreasing with increasing numbers of core copies. Some duplication types show particularly prominent numbers in specific species, such as T, TA, and AGACAG in E. lanceolatus, copied 502, 803, and 48 times respectively; GAG, CACT, and CCACA in E. moara, copied 205, 652, and 111 times respectively. These variations highlight the importance of exploring the role of microsatellite loci to develop a better understanding of the genetic distance and kinship among the five groupers. This analysis lays the groundwork to develop high-quality microsatellite molecular markers, and facilitates the selection of favorable varieties and the development of new varieties. In general, these research results provide important data to understand the genomic characteristics of the five groupers and helps to conduct advanced genetic research on these species. |
附件 |
查看全文
查看/发表评论 下载PDF阅读器 |
关闭 |
|
|
|