All Projects → OpenScraper → Similar Projects or Alternatives

1209 Open source projects that are alternatives of or similar to OpenScraper

Mailinglistscraper

A python web scraper for public email lists.

Stars: ✭ 19 (-76.25%)

Mutual labels: scraper, spider, scrapy

Fbcrawl

A Facebook crawler

Stars: ✭ 536 (+570%)

Mutual labels: scraper, spider, scrapy

Django Dynamic Scraper

Creating Scrapy scrapers via the Django admin interface

Stars: ✭ 1,024 (+1180%)

Mutual labels: scraper, spider, scrapy

Python Spider

豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章

Stars: ✭ 615 (+668.75%)

Mutual labels: spider, xpath, scrapy

scrapy facebooker

Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.

Stars: ✭ 22 (-72.5%)

Mutual labels: scraper, spider, scrapy

Goribot

[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。

Stars: ✭ 190 (+137.5%)

Mutual labels: scraper, spider, scrapy

Fp Server

Free proxy server, continuously crawling and providing proxies, based on Tornado and Scrapy. 免费代理服务器，基于Tornado和Scrapy，在本地搭建属于自己的代理池

Stars: ✭ 154 (+92.5%)

Mutual labels: spider, tornado, scrapy

arachnod

High performance crawler for Nodejs

Stars: ✭ 17 (-78.75%)

Mutual labels: scraper, spider

Xcrawler

快速、简洁且强大的PHP爬虫框架

Stars: ✭ 344 (+330%)

Mutual labels: scraper, spider

factory

Docker microservice & Crawler by scrapy

Stars: ✭ 56 (-30%)

Mutual labels: tornado, scrapy

Scrapyrt

HTTP API for Scrapy spiders

Stars: ✭ 637 (+696.25%)

Mutual labels: scraper, scrapy

aliexscrape

Get Aliexpress product details in JSON

Stars: ✭ 80 (+0%)

Mutual labels: scraper, spider

Spydan

A web spider for shodan.io without using the Developer API.

Stars: ✭ 30 (-62.5%)

Mutual labels: scraper, spider

wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: ✭ 52 (-35%)

Mutual labels: scraper, spider

Java Spider

一个基于webmagic框架二次开发的java爬虫框架实战，已实现能爬取腾讯，搜狐，今日头条（单独集成功能）等资讯内容，配合elasticsearch框架用法，实现了自动爬虫，已投入线上生产使用。

Stars: ✭ 276 (+245%)

Mutual labels: scraper, spider

OLX Scraper

📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.

Stars: ✭ 15 (-81.25%)

Mutual labels: scraper, scrapy

163Music

163music spider by scrapy.

Stars: ✭ 60 (-25%)

Mutual labels: spider, scrapy

Advanced Web Scraping Tutorial

The Zipru scraper developed in the Advanced Web Scraping Tutorial.

Stars: ✭ 384 (+380%)

Mutual labels: scraper, scrapy

Scrapit

Scraping scripts for various websites.

Stars: ✭ 25 (-68.75%)

Mutual labels: scraper, spider

Spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+720%)

Mutual labels: scraper, spider

Voyages Sncf Api

A scrapy spider that scraps times and prices from Voyages Sncf. It uses scrapyrt to provide an API interface.

Stars: ✭ 7 (-91.25%)

Mutual labels: scraper, scrapy

Avbook

AV 电影管理系统， avmoo , javbus , javlibrary 爬虫，线上 AV 影片图书馆，AV 磁力链接数据库，Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database

Stars: ✭ 8,133 (+10066.25%)

Mutual labels: scraper, spider

Seleniumcrawler

An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site

Stars: ✭ 117 (+46.25%)

Mutual labels: scraper, scrapy

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (+113.75%)

Mutual labels: scraper, spider

Querylist

🕷️ The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。

Stars: ✭ 2,392 (+2890%)

Mutual labels: scraper, spider

Web-Iota

Iota is a web scraper which can find all of the images and links/suburls on a webpage

Stars: ✭ 60 (-25%)

Mutual labels: spider, scrapy

Spider job

招聘网数据爬虫

Stars: ✭ 234 (+192.5%)

Mutual labels: spider, scrapy

Spiderkeeper

admin ui for scrapy/open source scrapinghub

Stars: ✭ 2,562 (+3102.5%)

Mutual labels: spider, scrapy

Colly

Elegant Scraper and Crawler Framework for Golang

Stars: ✭ 15,535 (+19318.75%)

Mutual labels: scraper, spider

crawler-chrome-extensions

爬虫工程师常用的 Chrome 插件 | Chrome extensions used by crawler developer

Stars: ✭ 53 (-33.75%)

Mutual labels: scraper, spider

photo-spider-scrapy

10 photo website spiders, 10 个国外图库的 scrapy 爬虫代码

Stars: ✭ 17 (-78.75%)

Mutual labels: spider, scrapy

Gerapy

Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js

Stars: ✭ 2,601 (+3151.25%)

Mutual labels: spider, scrapy

Xidel

Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.

Stars: ✭ 335 (+318.75%)

Mutual labels: scraper, xpath

Linkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy

Stars: ✭ 309 (+286.25%)

Mutual labels: scraper, scrapy

Freshonions Torscraper

Fresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion

Stars: ✭ 348 (+335%)

Mutual labels: scraper, spider

Py Elasticsearch Django

基于python语言开发的千万级别搜索引擎

Stars: ✭ 207 (+158.75%)

Mutual labels: spider, scrapy

Awesome Crawler

A collection of awesome web crawler,spider in different languages

Stars: ✭ 4,793 (+5891.25%)

Mutual labels: scraper, spider

Crawly

Crawly, a high-level web crawling & scraping framework for Elixir.

Stars: ✭ 440 (+450%)

Mutual labels: scraper, spider

Gosint

OSINT Swiss Army Knife

Stars: ✭ 401 (+401.25%)

Mutual labels: scraper, spider

Crawler

A high performance web crawler in Elixir.

Stars: ✭ 781 (+876.25%)

Mutual labels: scraper, spider

Scrapydweb

Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI. DEMO 👉

Stars: ✭ 2,385 (+2881.25%)

Mutual labels: spider, scrapy

TikTokDownloader PyWebIO

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音|TikTok数据爬取工具，支持API调用，在线批量解析及下载。

Stars: ✭ 919 (+1048.75%)

Mutual labels: scraper, spider

Not Your Average Web Crawler

A web crawler (for bug hunting) that gathers more than you can imagine.

Stars: ✭ 107 (+33.75%)

Mutual labels: scraper, spider

Scrapoxy

Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!

Stars: ✭ 1,322 (+1552.5%)

Mutual labels: scraper, scrapy

robotstxt

robots.txt file parsing and checking for R

Stars: ✭ 65 (-18.75%)

Mutual labels: scraper, spider

NScrapy

NScrapy is a .net core corss platform Distributed Spider Framework which provide an easy way to write your own Spider

Stars: ✭ 88 (+10%)

Mutual labels: spider, scrapy

Geziyor

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

Stars: ✭ 1,246 (+1457.5%)

Mutual labels: scraper, spider

scrapy helper

Dynamic configurable crawl (动态可配置化爬虫)

Stars: ✭ 84 (+5%)

Mutual labels: spider, scrapy

Ruiji.net

crawler framework, distributed crawler extractor

Stars: ✭ 220 (+175%)

Mutual labels: scraper, scrapy

scrapy-LBC

Araignée LeBonCoin avec Scrapy et ElasticSearch

Stars: ✭ 14 (-82.5%)

Mutual labels: scraper, scrapy

Email Extractor

The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url

Stars: ✭ 81 (+1.25%)

Mutual labels: scraper, scrapy

small-spider-project

日常爬虫

Stars: ✭ 14 (-82.5%)

Mutual labels: spider, scrapy

scraper

图片爬取下载工具，极速爬取下载站酷https://www.zcool.com.cn/, CNU 视觉 http://www.cnu.cc/ 设计师/用户上传的图片/照片/插画。

Stars: ✭ 64 (-20%)

Mutual labels: scraper, spider

python-crawler

爬虫学习仓库，适合零基础的人学习，对新手比较友好