• Stars: 102
• Rank: 335,584 (Top 7 %)
• Language: Python
• Created almost 14 years ago
• Updated over 7 years ago

Repository Details

MongoDB support for Scrapy

Description

It's a pipeline that allows you to store Scrapy items in a MongoDB database.

Install

pip install "ScrapyMongoDB"

Configure your settings.py:

ITEM_PIPELINES = [
  'scrapymongodb.MongoDBPipeline',
]

MONGODB_SERVER = 'localhost'
MONGODB_PORT = 27017
MONGODB_DB = 'scrapy'
MONGODB_COLLECTION = 'items'
MONGODB_UNIQ_KEY = 'url'
MONGODB_ITEM_ID_FIELD = '_id'
MONGODB_SAFE = True
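
The settings above suggest that items are matched on a unique key (here 'url') and that only the fields defined on the item are written. The following is a minimal sketch of that idea, not the package's actual code: the helper name build_upsert is hypothetical, and the assumption is that MONGODB_UNIQ_KEY names the field used to find an existing document. It only builds the filter and update documents; with pymongo they could be passed to collection.update_one(filter, update, upsert=True).

```python
def build_upsert(item, uniq_key=None):
    """Return (filter, update) documents for an upsert-style write.

    `item` is any mapping of field names to values (a Scrapy Item
    behaves like one). `uniq_key` plays the role of MONGODB_UNIQ_KEY.
    This helper is illustrative and is not part of scrapymongodb.
    """
    fields = dict(item)
    if uniq_key is None or uniq_key not in fields:
        # No usable key: the caller would do a plain insert of `fields`.
        return None, fields
    # Match on the unique key and set only the fields present on the item,
    # leaving any other fields of an existing document untouched.
    return {uniq_key: fields[uniq_key]}, {"$set": fields}


if __name__ == "__main__":
    item = {"url": "http://example.com", "title": "Example"}
    flt, update = build_upsert(item, uniq_key="url")
    print(flt)
    print(update)
    # With pymongo (assumed installed), the write could then be:
    #   collection.update_one(flt, update, upsert=True)
```

Using "$set" rather than replacing the whole document mirrors the 0.4.0 changelog entry about updating only the fields defined in the item.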

Changelog

0.4.1 Stop using the deprecated settings import

0.4.0 Update only those fields that are defined in the Scrapy Item, by Mikhail Fedosov (https://github.com/fedosov)

0.3.0 Code improvements by Jordi Llonch (https://github.com/llonchj)

0.2.1 pip now installs the required modules scrapy and pymongo

License

Copyright 2011 Julien Duponchelle

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.