V
V
v0rtex2011-09-14 09:28:07
RSS
v0rtex, 2011-09-14 09:28:07

Make a full-text RSS feed if the site gives only a short announcement?

Good afternoon.
I'm a big fan of reading in the subway RSS news of my favorite sites. Fortunately, there is enough software for Android, which can download news with pictures and store them in the cache.
For the rss feeds themselves, I use popular services that “expand” an article from a short announcement into a full-text one. But they are not ideal - ads, comments, tags and other crap that I don't need are added to the news :( Is it possible to do this cleanly?
For example, I take the NOMOBILE.RU feed www.nomobile.ru/rss-reviews.xml
a little more than nothing =)
7360a20e09f77a235400d74f9737fd4e.png
2) The feedex.net/
service expands well, but at the end it clings to links to other articles. And in huge numbers.
3dfc9b37fc8520796a89e23bdb1e9f1a.png
3) Service rss-farm.ru
When I try to create my own feed, I follow the instructions ( www.rss-farm.ru/docs.dev.quickstart/)
Step 1. Launching DevTools
In the Internet Explorer 6+ browser, follow the link rss-farm.ru/devtools_latest.zip. After a few seconds, after you agree to launch, the DevTools program should start. If something went wrong, then most likely you need to download the .net Framework 3.5 and everything will work.
I get 404.
0ad79965d4a5f8d3fd3105e6b3badb93.png
4) Service readbox.info
You can specify XPath in it.
8aae334594cfde5eeec2fb9d2397e4a9.png
Nevertheless, successfully "tear out" does not work. Anyway, some unnecessary block is hooked :(
Any options?

Answer the question

In order to leave comments, you need to log in

6 answer(s)
L
lakb, 2011-09-14
@v0rtex

Everything stolen before us is already done: feeds.rss-farm.ru/Reset2891/nomobile-reviews

@
@zzeneg, 2011-09-14
_

Yahoo pipes . Not the easiest way, but everything is customized for you.

1
1337, 2011-09-14
@1337

habrahabr.ru/blogs/p2p/123594/
Maybe this topic will help

K
Kindman, 2011-09-14
@Kindman

Well, you can also try to write a simple RSS filter that would remove all "enemy" fragments in the feed.

A
asm0dey, 2011-09-14
@asm0dey

There is also fivefilters.org/content-only
They also have a self-hosted version.

P
png, 2011-09-14
@png

The way of the Jedi is to write yourself.
If you look at nomobile in RSS. There is a link tag.
it encoded a link to the article itself 0L0Snomobile0Bru0Creviews0C1292490Bhtml
0B is the point.
0C - this is a slash
, we take RSS, we pull out the link, we parse the site page with our hands. You can use regular expressions, you can use DOM.
We cut extra tags.
We fill in the results of parsing in a new RSS.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question