### Contact Information

1370296886@qq.com

### MaxKB Version

v1.10.4-lts

### Problem Description

For knowledge bases of the web site type, the check that decides whether a URL refers to a file or a folder is incomplete. As a result, the links in the generated documents are broken.

### Steps to Reproduce

1. Use this web site address: https://hzqcgc.htc.edu.cn/jxky.htm
2. Fetch the document segments.
3. Check the link information in the segments:

   <img width="1182" alt="Image" src="https://github.com/user-attachments/assets/a5f411f4-6e0e-41e1-89ba-2c116ef8101e" />
4. Compare with the actual information on the site:

   <img width="716" alt="Image" src="https://github.com/user-attachments/assets/e696b25d-24f8-438e-8460-f79e18227f6e" />

### The expected correct result

_No response_

### Related log output

```shell
```

### Additional Information

_No response_
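
To illustrate the kind of failure described in this report, here is a minimal sketch of relative link resolution in Python. It is not MaxKB's actual code. The function names and the relative link `info/1001/2345.htm` are hypothetical, and the "always a folder" variant is only an assumption about how a file URL such as `jxky.htm` might end up being handled.

```python
from urllib.parse import urljoin


def resolve_link(page_url: str, href: str) -> str:
    """Resolve href against the page URL using standard RFC 3986 rules.

    urljoin drops the last path segment of the base unless the base ends
    with "/", so no custom file-or-folder check is needed.
    """
    return urljoin(page_url, href)


def resolve_link_as_folder(page_url: str, href: str) -> str:
    """Hypothetical buggy variant: always treats the page URL as a folder."""
    base = page_url if page_url.endswith("/") else page_url + "/"
    return urljoin(base, href)


page = "https://hzqcgc.htc.edu.cn/jxky.htm"
href = "info/1001/2345.htm"  # hypothetical relative link, for illustration only

print(resolve_link(page, href))
print(resolve_link_as_folder(page, href))
```

In this sketch the first call resolves the link relative to the site root, because `jxky.htm` is the last path segment and is dropped. The second call keeps `jxky.htm` as a path component, which would yield a link that does not exist on the site.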