求助,将基因组fasta格式的每条序列切割成100kb的片段,相邻片段间以50kb重叠

primary assemblies were transformed into very long
overlapping sequences with a maximum of 100 kb (50 kb overlap)

你好,你有这个代码了吗?能分享一下吗?

思路是将每条序列看成一个字符串,通过split滑窗方式拆分