dsandler.org

Tag: feedtree

Over the weekend I decided to fire up Google Reader again. (I’m still a fan of NewsFire, but now that I’m using the other GApps I’m sort of hooked on instant, painless, in-sync access from multiple machines.)

The UI is better than I remember it, but what is the deal with the message latency? At 4PM central time I took the following screenshot:

Five hours? I’m all for fetching feeds less frequently, but this delay is absurd. Is Google trying to cure my ADD?

Someone out there has developed a crawler that attacks Trac wiki pages. Once it’s found a Trac installation, it posts an update to the WikiStart and TracIni pages. The new version appends a number of links, hidden from view using Trac’s syntax to allow arbitrary HTML:

{{{
#!html
<u style="display:none">
...nasty links...
</u>
}}}

I’ve been hit over at the FeedTree trac a few times; it’s infrequent enough that periodic checking of the timeline view is sufficient to spot and clean out the crud.

(I guess you know your software has “made it” when someone else writes a piece of software specifically to attack it.)

Elsewhere on the Interthing there’s some concern that because FeedTree moves feeds away from a polling architecture to one of on-demand notification, it takes away control from the user:

Mm, one last things. Several people have noted how happy they are that FeedTree can turn RSS from a push to a pull mechansim. I don’t like that concept. I like RSS for its meta-data, and for the control I’m afforded because I have complete control over it. “Push” the content and you take that away from me. Want push? Just get email updates. Setup your filters to organzie and categorize your emails. Ta da, instant, existing, pretty dang easy push system. :/

I assume that the “control” the author’s worried about losing is control over unsubscription, and if so, this is a very legitimate concern—even for email.

The fundamental problem with push-by-email is that your email address acts like a capability: by giving out your address, you offer someone the irrevocable ability to spam you. You must trust that the other side will never share your address, and will honor any unsubscription request. (As an aside, this is one of many reasons that email is becoming less popular as an interpersonal communication medium; the signal-noise ratio is so low—thanks to spam, mailing lists, and email update services—that many are abandoning it in favor of IM and SMS. Unfortunately, these services suffer most of the same capability-related access control weaknesses, and so it’s only a matter of time before these channels also become polluted.) Throwaway email addresses are one simple way to address the irrevocable-capability problem with email, but they’re hardly user-friendly and are not available to all users.

Feeds, on the other hand, solve this problem by retaining control over subscription locally, rather than deferring that control to outside parties. You can always unsubscribe to a feed: just stop polling it. It’s crucial to observe that FeedTree works exactly the same way. You can always unsubscribe to any feed whose updates are pushed via FeedTree; your local client will cease to receive updates via the Scribe multicast system. (Even if a malicious entity attempts to assault you with updates after you’ve unsubscribed, or updates for a topic which you’ve never been interested in, the Scribe framework will ignore these uninteresting messages.) In short, FeedTree’s “push” service model does nothing to reduce your control over which feeds you see.

The author has other harsh words for the FeedTree research project—most of which are, in my opinion, undeserved. His chief complaint is that all Web feed bandwidth issues are 100% solved by conditional HTTP GET (ETag/If-None-Match and If-Modified-Since). Conditional GET is hugely important for polling clients, but it’s hardly sufficient: if a feed changes frequently (many popular ones do), or has any dynamic content (read: advertisements, becoming increasingly common), this protocol breaks down and the entire feed is always transferred.

There are other aspects of the article which I disagree with, but these are the key areas in which I feel the author misunderstands both the architectural problems inherent in RSS and the way FeedTree intends to address them. As with all academic research projects in the field computer systems, FeedTree is a part of a larger discussion about how technology can be made more efficient, more powerful, and more usable. I welcome the debate.

Crossposted from the FeedTree blog: Bugfix release: 0.7.1. This version addresses some network problems users were having, and adds additional debugging tools to help identify other network issues that may be out there.

FeedTree had a pretty good week. After I released version 0.7.0, I sent the article to Slashdot, where it was run on the front page. The comments told me a few things:

  1. This is not a hot-button topic for Slashdotters; there were very few comments at all. (Furthermore, the Slashdot Effect was nowhere to be seen on our servers; the big iron that runs the main site didn’t really notice, and even the Pentium-III that runs the Trac server and Subversion repository had no issues.)
  2. A few people don’t get it. “Isn’t this just NNTP?” “Why not use BitTorrent?” etc.
  3. A lot of Slashdotters (those reading the article, anyway) do get it, and were quick to set straight the naysayers and the clueless (e.g. 1 2).

So after a front-page Slashdot article (which generated approx. 3500 direct hits to feedtree.net), you’d think I’d be drowning in users, right? Well, we spiked at 30, and are currently hovering around 15 users. That’s … well, it’s not a lot. I have a few users who are patiently waiting around while I try to figure out why their routers are blocking Pastry packets, but I think this is pretty much all I’m going to get for a while. Hopefully it’s enough to generate some meaningful data!

Psst, FeedTree users: you do know that if you get more people to use the system, your own service (speed of updates, amount of polling, etc.) should improve, right? Just a suggestion.

Fortunately, the Slashdot story did result in a number of mentions here and there across the Interthing. Perhaps the most surprising was a podcast mention; the GeekNights ‘cast discussed FeedTree in their 2/20 show (starting at 53:46

FeedTree version 0.7.0 is out. Bug fixes, podcasting compatibility, ease-of-use improvements for publishers, and more. The project needs users, so give it a download.

Update: Slashdotted. (Screenshotted.)

Crossposted from the FeedTree weblog:

FeedTree is a research project, and all research projects need data: preferably lots and lots of it. Each FeedTree node sends a small amount of anonymized data back to a statistics server; it’s enough to reconstruct the Scribe multicast trees that distribute new feed events. Since the FT network is still pretty small, these “trees” are usually more like sprouts, but I noticed this morning that the Reddit tree looked pretty interesting. So here’s a snapshot (PDF) of the current Scribe tree for the main Reddit RSS feed. As the size of the network increases, these trees (which are constructed organically as new subscribers join the group) will look more and more interesting.


subscribe to dsandler.org

  •  
  • for faster updates, subscribe with FeedTree

mac software made on premises

toastycode.com: toasty software for the mac pyrotheque: a new (old) fireworks screensaver for the mac
Cuckoo—the bell tolls for your Mac.

twitter/dsandler [RSS]

    loading…

elsewhere

highlights

between the couch cushions

strongly connected

  • erinmak is not to be trifled with
  • pixelknave says moof when upside-down
  • dave is dangerous
  • rod is one groovy mother
  • adam is googling us all
  • amar is not really a pirate
  • angi sees little blue dots
  • harbinger lets you know it's coming
  • jason looks like an idiot in that hat
  • jeff is keeping austin weird
  • regan seems to tolerate jason
  • emann will not abide your IM-speak
  • jim is a stranger in ein anderes Land
  • liscio is pronounced "lee-show"
  • darryl has no need of identifying objects
  • friends as they appear on dsandler.org
  • sportsgirl reports…on all the pro courts

Search

Recent

Archives

dsandler.org is Dan Sandler's website and notebook.

Powered by WordPress and here's why.