请您Star/Please Star
如果您觉得此工具不错,请轻轻点击此页面右上角Star按钮增加项目曝光度,谢谢!软件完全免费(商用除外),只求大家Star和宣传给其他需要的朋友,谢谢!
If you think this tool is good, please gently click the Star button in the upper right corner at this page to increase the project exposure, thank you! The software is completely free (except for commercial use), only ask everyone to Star and promote it to other friends in need, thank you!
官方网站/Official Website
访问易采集官网:www.easyspider.cn
Visit the official website of EasySpider: www.easyspider.net
易采集/EasySpider: Visual Code-Free Web Crawler
一个可视化爬虫软件,可以使用图形化界面,无代码可视化的设计和执行爬虫任务。只需要在网页上选择自己想要爬的内容并根据提示框操作即可完成爬虫设计和执行。同时软件还可以单独以命令行的方式进行执行,从而可以很方便的嵌入到其他系统中。
A visual code-free/no-code web crawler/spider, just select the content you want to crawl on the web page and operate according to the prompt box to complete the design and execution of the crawler. At the same time, the software can be executed by command line alone, so it can be easily embedded into other systems.
示例1/Example 1
(右键)选中一个大商品块 -> 软件自动检测到同类型商品块 -> 点击“选中全部”选项 -> 点击“选中子元素”选项 -> 点击“采集数据”选项,即可采集到所有商品的所有信息,并分成不同字段保存。
(Right click) Select a large product block -> The software will automatically detect similar blocks -> Click the 'Select All' option -> Click the 'Select Child Elements' option -> Click the 'Collect Data' option, you can collect the information of all products, and will be saved by sub-field.
示例2/Example 2
(右键)选中一个商品标题,同类型标题会被自动匹配,点击“选中全部”选项 -> 点击“采集数据”选项,即可采集到所有商品的标题信息。
同时,选中全部后如果选择“循环点击每个元素”选项,即可自动打开每个商品的详情页,然后可以再继续设置采集详情页的信息。
(Right Click) Select a product title, the same type of title will be automatically matched, click the 'Select All' option -> Click the 'Collect Data' option, you can collect the title information of all products.
At the same time, if you select the 'Loop-click every element' option after selecting all, you can automatically open the details page of each product, and then can set to collect the information of the details page.
更多特性/More Features
更多特性请翻到页面底部查看。
More features please scroll to the bottom of this page to view.
下载易采集/Download EasySpider
进入 Releases Page 下载最新版本。如果下载速度慢,可以考虑中国境内下载地址:中国境内下载地址。
Refer to the Releases Page to download the latest version of EasySpider.
支持作者/Support Author
易采集EasySpider是一款完全免费无广告的开源软件,软件开发和维护全靠作者用爱发电,因此您可以选择支持作者让作者有更多的热情和精力维护此软件,或者您使用了此软件进行了盈利,欢迎您通过下面的方式支持作者:
- 支付宝账号:[email protected],也可以扫描下方二维码。
- 微信收款:扫描下方二维码。
- PayPal账号:naibowang,也可以扫描下方二维码。
Support author at paypal if you like this software, or use it to make profit: naibowang
文档/Documentation
请点此进入教程文档,如有英文可暂时翻译一下,或看作者的硕士毕业论文(主要看第三章和第五章)。
Ebay样例博客:https://blog.csdn.net/ihero/article/details/130805504。
Documentation can be found from GitHub Wiki.
视频教程/Video Tutorials
Bilibili/B站视频教程:
如何无代码可视化的爬取需要登录才能爬的网站 - 知乎网站案例
【重要】自定义条件判断之使用循环项内的JS命令返回值 - 第二弹
Refer to Youtube Playlist to see the video tutorials of EasySpider.
样例任务/Sample Tasks
从本项目的Examples文件夹中下载样例任务,更名为大于0的数字,导入到EasySpider中的tasks
文件夹中,然后在EasySpider中打开即可。
Download sample tasks from the Examples folder of this project, rename them to numbers greater than 0, import them into the tasks
folder in EasySpider, and then open them in EasySpider.
声明/Declaration
本软件仅供学习交流使用,严禁使用软件进行任何违法违规的操作,如爬取不允许爬取的政府/军事机关网站等。使用本软件所造成的一切后果由使用者自负,与作者本人无关,作者不会承担任何责任。
This software is for learning and communication only. It is strictly forbidden to use the software for any illegal operations, such as crawling government/military websites that are not allowed to be crawled. All consequences caused by the use of this software are at the user's own risk, and the author is not responsible for any consequences.
对于政府和军事机关等网站的爬虫操作,作者将不会进行任何答疑,以免违反国家相关法律法规和政策。
For the crawler operations of government and military websites, the author will not answer any questions in order to avoid violating relevant national laws, regulations and policies.
同时,软件受到专利权保护,如要用于商业用途,如使用软件进行盈利接单,出售采集到的数据等,请联系杭州天勤知识产权代理有限公司进行专利授权等付费操作。
At the same time, the software is protected by patent rights. If it is used for commercial purposes, such as using the software to make profits, selling the collected data, etc., please contact Hangzhou Tianqin Intellectual Property Agency Co., Ltd. for patent authorization and other paid operations.
答疑QQ群
群号:682921940,建议通过Github提Issue的方式答疑,如果实在有需要才请加QQ群,因为群人数有上限。
出版物/Publications
-
This software has been accepted by The Web Conference (WWW) 2023 (中国计算机学会顶级会议,CCF A): EasySpider: A No-Code Visual System for Crawling the Web, April 2023.
-
中国国家知识产权局发明专利,一种自定义提取流程的服务封装系统, 2022年5月。
-
浙江大学硕士论文,面向WEB应用的智能化服务封装系统设计与实现,2020年6月。
编译说明/Compilation Instructions
查看编译说明。
Refer to Compilation Instructions.