Extensible RSS 2.0 Feed Generator written in Python

Overview

rfeed is a library to generate RSS 2.0 feeds in Python. It is based on Andrew Dalke's work on the PyRSS2Gen library (see the Inspiration section below for more information).

rfeed is extensible, and in my opinion very easy to use. Besides the standard RSS 2.0 specification, it also includes iTunes support for podcast feeds.

Installation

The library is a single file, rfeed.py, so you can simply copy it wherever you need it. You can also install it using the following command:

% python setup.py install

Usage

I don't think you are going to find a better reference for using the library than the test suite in tests.py. However, unit tests exercise one feature at a time and can be hard to follow in isolation, so here is a simple end-to-end example:

import datetime 
from rfeed import *

item1 = Item(
    title = "First article",
    link = "http://www.example.com/articles/1", 
    description = "This is the description of the first article",
    author = "Santiago L. Valdarrama",
    guid = Guid("http://www.example.com/articles/1"),
    pubDate = datetime.datetime(2014, 12, 29, 10, 00))

item2 = Item(
    title = "Second article",
    link = "http://www.example.com/articles/2", 
    description = "This is the description of the second article",
    author = "Santiago L. Valdarrama",
    guid = Guid("http://www.example.com/articles/2"),
    pubDate = datetime.datetime(2014, 12, 30, 14, 15))

feed = Feed(
    title = "Sample RSS Feed",
    link = "http://www.example.com/rss",
    description = "This is an example of how to use rfeed to generate an RSS 2.0 feed",
    language = "en-US",
    lastBuildDate = datetime.datetime.now(),
    items = [item1, item2])

print(feed.rss())

It's a very succinct example, but it exposes the following concepts:

  • The main object of the RSS 2.0 feed is the Feed class.
  • The Feed class supports a list of Item instances.
  • To specify the guid attribute of an item, you can use a Guid instance.
  • To get the final RSS content, you can use the rss() method of the Feed class.
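
Since rss() returns the feed as a string, one way to sanity-check it is to parse it back with the standard library. The snippet below is a minimal sketch, not part of rfeed: summarize_rss is a hypothetical helper, and SAMPLE_RSS is a hand-written stand-in for the value returned by feed.rss() (the real output may differ in whitespace and element order).

```python
import xml.etree.ElementTree as ET

# Hand-written stand-in for the string returned by feed.rss().
# This is an assumption for illustration, not actual rfeed output.
SAMPLE_RSS = """<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
  <channel>
    <title>Sample RSS Feed</title>
    <link>http://www.example.com/rss</link>
    <description>An example feed</description>
    <item>
      <title>First article</title>
      <link>http://www.example.com/articles/1</link>
    </item>
    <item>
      <title>Second article</title>
      <link>http://www.example.com/articles/2</link>
    </item>
  </channel>
</rss>
"""


def summarize_rss(xml_text):
    """Return (channel title, list of item titles) for an RSS 2.0 document."""
    root = ET.fromstring(xml_text)
    channel = root.find("channel")
    if root.tag != "rss" or channel is None:
        raise ValueError("not an RSS 2.0 document")
    title = channel.findtext("title", default="")
    # Every <item> under <channel> corresponds to one Item passed to Feed.
    item_titles = [
        item.findtext("title", default="") for item in channel.findall("item")
    ]
    return title, item_titles


title, item_titles = summarize_rss(SAMPLE_RSS)
print(title, item_titles)
```

In a real script you would pass feed.rss() to summarize_rss instead of SAMPLE_RSS.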

Of course, there's way more than what the example above illustrates. Here is the full list of exposed classes and a brief description of each one of them:

  • Feed: This is the main class that represents the RSS 2.0 feed.
  • Item: Represents an item of a feed's channel.
  • Category: Represents one or more categories that the channel or item belongs to.
  • Cloud: Represents a web service that supports the rssCloud interface which can be implemented in HTTP-POST, XML-RPC or SOAP 1.1.
  • Image: Represents a GIF, JPEG or PNG image that can be displayed with the channel.
  • TextInput: Represents a text input box that can be displayed with the channel.
  • SkipHours: Represents a hint for aggregators telling them which hours they can skip.
  • SkipDays: Represents a hint for aggregators telling them which days they can skip.
  • Enclosure: Represents a media object that is attached to a feed's item.
  • Guid: Represents a string that uniquely identifies the item.
  • Source: Represents the RSS channel that the item came from.

(For more information about each one of these classes, you can check the official RSS 2.0 specification, and check out the rfeed.py source file.)
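
As an illustration of the kind of data these classes carry, the RSS 2.0 specification gives an enclosure three attributes: a URL, a length in bytes, and a MIME type. The sketch below shows one way to derive those values from a local media file using only the standard library. enclosure_fields is a hypothetical helper that is not part of rfeed, and the assumption that the file is served under base_url with the same file name is mine.

```python
import mimetypes
import os


def enclosure_fields(path, base_url):
    """Build the url, length and type values for a local media file.

    Assumes the file is published under base_url with the same file name.
    """
    media_type, _ = mimetypes.guess_type(path)
    return {
        "url": base_url.rstrip("/") + "/" + os.path.basename(path),
        "length": os.path.getsize(path),  # size of the file in bytes
        # Fall back to a generic type when the extension is unknown.
        "type": media_type or "application/octet-stream",
    }
```

The keys match the keyword arguments used with Enclosure in the iTunes example later in this document, so the result could be passed as Enclosure(**enclosure_fields(path, base_url)).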

Extending the library

The RSS 2.0 specification is extensible, and so is rfeed. Adding extra content to your feed is very simple:

  1. Create a class that extends the Extension class.
  2. Override the Extension.get_namespace method to return the namespace of your extension (the one that will be included in the <rss/> element of your feed). If you don't need to add a namespace, you can simply extend the Serializable class instead.
  3. Use the Feed.add_extension() method, or the extensions array in the constructor to provide your extension.

Here is an example of extending your feed with a content:encoded element:

import datetime 
from rfeed import *

class Content(Extension):
    def get_namespace(self):
        return {"xmlns:content": "http://purl.org/rss/1.0/modules/content/"}

class ContentItem(Serializable):
    def __init__(self, content):
        Serializable.__init__(self)
        self.content = content

    def publish(self, handler):
        Serializable.publish(self, handler)
        self._write_element("content:encoded", self.content)

item = Item(
    title = "Sample article",
    link = "http://www.example.com/articles/1", 
    description = "This is the description of the first article",
    author = "Santiago L. Valdarrama",
    guid = Guid("http://www.example.com/articles/1"),
    pubDate = datetime.datetime(2014, 12, 29, 10, 00),
    extensions = [ContentItem('This is the value of the encoded content')])

feed = Feed(
    title = "Sample RSS Feed",
    link = "http://www.example.com/rss",
    description = "This is an example of how to use rfeed to generate an RSS 2.0 feed",
    language = "en-US",
    lastBuildDate = datetime.datetime.now(),
    items = [item],
    extensions = [Content()])

print(feed.rss())

A few notes about this example:

  • Note that we want to add our Content instance to the list of extensions at the feed level. This way we make sure the namespace is included in the feed.
  • In this case the Content instance doesn't provide a publish method because there's nothing to add to the <channel/> element of the feed.
  • The ContentItem class extends Serializable because it doesn't need to provide a namespace (it was already provided by the Content instance).
  • The ContentItem instance implements the publish method and uses the _write_element method to output the specific XML content.
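
To see why the namespace declaration matters, it can help to read the element back the way a feed consumer would. The following is a minimal sketch using only the standard library. SAMPLE_RSS is a hand-written stand-in for the output of feed.rss() (the real output may differ in layout), and encoded_content is a hypothetical helper, not part of rfeed.

```python
import xml.etree.ElementTree as ET

# Namespace URI taken from Content.get_namespace() in the example above.
NAMESPACES = {"content": "http://purl.org/rss/1.0/modules/content/"}

# Hand-written stand-in for feed.rss(); an assumption for illustration only.
SAMPLE_RSS = """<rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Sample RSS Feed</title>
    <item>
      <title>Sample article</title>
      <content:encoded>This is the value of the encoded content</content:encoded>
    </item>
  </channel>
</rss>
"""


def encoded_content(xml_text):
    """Return the text of each content:encoded element found in the items."""
    root = ET.fromstring(xml_text)
    # The "content" prefix is resolved through NAMESPACES, so the lookup
    # works regardless of the prefix the document itself declares.
    matches = root.iterfind("channel/item/content:encoded", NAMESPACES)
    return [element.text for element in matches]


print(encoded_content(SAMPLE_RSS))
```

If the xmlns:content declaration were missing from the <rss/> element, the document would not be well-formed XML and the parser would reject it, which is why the Content extension is registered at the feed level.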

For a more exhaustive example, check the implementation of the iTunes extension in the rfeed.py file.

iTunes Support

Podcasts are a huge medium in 2014, and iTunes is the preferred way of publishing them. This is the reason rfeed provides an extension for iTunes support. Here is an example of how to use it:

import datetime 
from rfeed import *

itunes_item = iTunesItem(
    author = "Santiago L. Valdarrama",
    image = "http://www.example.com/artwork.jpg",
    duration = "01:11:02",
    explicit = "clean",
    subtitle = "The subtitle of the podcast episode",
    summary = "Here is the summary of this specific episode")

item = Item(
    title = "Sample article",
    link = "http://www.example.com/articles/1", 
    description = "This is the description of the first article",
    author = "Santiago L. Valdarrama",
    guid = Guid("http://www.example.com/articles/1"),
    pubDate = datetime.datetime(2014, 12, 29, 10, 00),
    enclosure = Enclosure(url="http://www.example.com/articles/1.mp3", length=0, type='audio/mpeg'),
    extensions = [itunes_item])

itunes = iTunes(
    author = "Santiago L. Valdarrama",
    subtitle = "A sample podcast that will never be produced",
    summary = "This is just a fake description",
    image = "http://www.example.com/artwork.jpg",
    explicit = "clean",
    categories = iTunesCategory(name = 'Technology', subcategory = 'Software How-To'),
    owner = iTunesOwner(name = 'Santiago L. Valdarrama', email = '[email protected]'))

feed = Feed(
    title = "Sample Podcast RSS Feed",
    link = "http://www.example.com/rss",
    description = "An example of how to generate an RSS 2.0 feed",
    language = "en-US",
    lastBuildDate = datetime.datetime.now(),
    items = [item],
    extensions = [itunes])

print(feed.rss())
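
The example above passes the episode duration as an already formatted "01:11:02" string. If you only know the length of an episode in seconds, a small helper can build that string for you. This is a sketch under that assumption: format_duration is a hypothetical function and is not provided by rfeed.

```python
def format_duration(total_seconds):
    """Format a length given in seconds as an HH:MM:SS string."""
    if total_seconds < 0:
        raise ValueError("duration cannot be negative")
    # Split the total into hours, minutes and seconds.
    minutes, seconds = divmod(int(total_seconds), 60)
    hours, minutes = divmod(minutes, 60)
    return "{:02d}:{:02d}:{:02d}".format(hours, minutes, seconds)


print(format_duration(4262))  # prints 01:11:02
```

The result could then be used as the duration argument of iTunesItem, for example duration = format_duration(4262).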

Inspiration

I created my own blog engine in Python for Google App Engine, thus I needed a way to generate my RSS feed. Later on, I added a podcast site that also needed an RSS feed, but this time with iTunes support.

The only help I could find was the amazing PyRSS2Gen library written by Andrew Dalke. The library is very simple, but it didn't help with the iTunes support, so I found myself modifying it to get as much as I could out of it.

In the end I didn't like what I had done to the original library: it was messy all around. It wasn't the library's fault, but my own. I decided to fix the problem from scratch by rewriting the library in a different way.

I'm not claiming this new version is better than the original. It's just different and I think a little bit easier to extend and maintain (since it comes with a suite full of unit tests). Since I needed iTunes support from the beginning, I also coded an iTunes extension for the library. Now I'm powering my blog and podcast sites with it, and I hope it serves well to anyone with similar needs.

Thanks to Andrew Dalke for writing (what I consider) the first version a long time ago. This project is based on his original work, borrowing ideas and code from it, but with enough differences that I felt it deserved a new name.

Contributing

Contributions, questions and comments are all welcome and encouraged. If you run into any problems, please submit an issue and I'll take a look. If you want to get your hands dirty and submit a pull request, even better. Also, take a look at the test suite in tests.py and test your changes to make sure nothing else breaks. To run the tests, execute the following command:

$ python tests.py

I really appreciate anything you can contribute to the library.

License

MIT License

Copyright (c) 2014 Santiago Valdarrama
