News Feeds from Voice of America

Over the last few weeks, I have been preparing PDF news articles derived from Voice of America www.voanews.com. They have all been uploaded very quickly through the new Filecast Center www.outernet.is/filecast-center/

I selected them because the content is in the public domain and their articles render into small PDFs very easily using my doPDF program www.dopdf.com. Another source of public domain news content is WikiNews, but I have found their coverage of current news to be too dated, and many of their articles render into PDFs that exceed Outernets 100 kB file size.

I was wondering if other Discussion Group members could provide some feedback? Frequently in posts, news ranks high on the list of desired Outernet content. Perhaps by putting our heads together, we could do magic as has been done in the Weather forecast area.

I’m also curious to know how news content would eventually get included in Outernet downloads by Outernet-Corporate. Back in the Ku band days, Outernet used www.dw.com as a source and prepared the data packages.

Your thoughts. Ken

2 Likes

Personally I think it’s too time consuming to try and do news manually.

I think getting RSS working is the way to go with this.

2p

Sam

2 Likes

Sam, as it turns out Voice of America has RSS feeds. Given their content is public domain, that could be a solution for Outernet to explore and implement.

I don’t see a clear path for you or I to do it. The only down side of VOA is some may consider it a biased source too USA centric.

What say you, Syed?

I have not heard much about the News Feed issue lately, but yesterday signed up for the NY Times Digest, which is a 10 page PDF (200 kB file size) of world and national news.

It’s the same product we see in various international hotels and on cruise ships. Unfortunately, it is not public domain, so Outernet can’t post it now. So maybe you could look at a sample at their site www.nytimesdigest.com to see what you think of it.

There are other Digests available in various languages from other sources too, but this might be something we could get Outernet to subscribe to as a satellite news provider. Ken

After an amazing start, Amateur Radio messages, Gribs, NOAA Wether, Online Graphical Map of user location the Outernet team, probably needs to take a break. BUT I think news will be the compelling product that really sells Outernet Library L-Band service to remote communities.

I think NEWS needs to be the next push in the Outernet success story.

Kens Idea of these PDF concise mini newspapers would be very nice.

1 Like

Completely agree with Seasalt - - NEWS will make Outernet a force to be reckoned with.

Seasalt, I have a complementary 2 week trial subscription with the NY Times Digest. If you’d like me to send you some copies to take a look at, send your e mail address to me at [email protected]

I don’t how much it costs yet, but I’ve approached them as an individual user rather than a company like Outernet.

I’m still searching out other options.

Ken

1 Like

Here are some creative commons newsfeeds:

http://www.20minutos.es/rss/internacional/
https://theconversation.com/articles.atom?language=en
https://theconversation.com/articles.atom?language=fr
http://www.voanews.com/api/zq$omekvi_
http://www.voanews.com/api/z-$otevtiq
http://www.voanews.com/api/zo$o_egviy
http://www.voanews.com/api/zr$opeuvim
http://www.voanews.com/api/zj$oveytit
http://www.voanews.com/api/zoripegtim
http://www.rfa.org/english/RSS
http://www.rfa.org/mandarin/RSS
http://www.rfa.org/cantonese/RSS
http://www.rfa.org/burmese/RSS

You can turn them into PDF’s using this £free tool

Sam any idea on size of these newsfeed files.

If the Outernet feed is multiplexing then we could receive these files as soon as they come out.

200k for a 8 page pdf means 1 MB an hour so one fifth equals 12 minutes of transmission.

Newsfeeds plus 5 or 6 small pdf newspapers would not tax the download much.

Hello Sam

I have visited Creative Commons a few times, but am not familiar enough with them to find the above news feed list on their site - - help!!! I’m new to this field of endeavor and want to learn.

What’s interesting with your list is the individual addresses render as XML files in my Chrome browser, and as readable documents with Firefox. Using Firefox to view USA - Voice of America then my PDF creator www.dopdf.com, it renders into a 9 page 110 KB PDF. When I do that with FiveFilters.org, I get an 8 page 753 KB PDF. The FiveFilters document looks cleaner, buts lacks a VOA Header. It has a FiveFilters Header instead,

Otherwise, it is very nice looking. I can compress the FiveFilters 753 KB PDF to 141 KB using www.onlinepdf2.

The same RSS feed viewed directly in my Firefox browser and rendered into a PDF with my doPDF converter looks like this:

which is not as clean, but the file size is smaller. I cannot compress the doPDF version any more than it is.

So we have a lot to learn here, and collectively need to make more recommendations to the Outernet Staff on which direction to go. Ken

I think that news content shall be saved in a text format with Mark down syntax, an then rendered in a newspaper view on the viewer side.

I guess that PDF format is nice, but uses a “high” amount of bytes only for the format, not the content.

By the way, RSS feeds are a great idea, and I think they use only a few kbytes…
What do you think?

I’ve been playing around with this a bit today here

You can right click > Save as to get a local copy (93.4kb)

Great! :grinning:

Now we’re really talking - - :heart_eyes:

The next step is to provide Outernet Staff some of the tools they can use to do the same putting an HTML page like this up on the system.

By the way, Sam, I put your 94 KB page up on Filecast to see how it looks on the rxOSs. Its title is Outernet News 26 Oct 2016 from VOA.html

Hope you don’t mind. Ken

1 Like

Hi Ken, not at all!

@Syed would you be up for using these files on a regular basis? I realise there is probably a more elegant solution in the pipeline but this might work for now?

Set up a script on your box to wget an English, French, Spanish & Chinese version? Each at 100kb or so?

I’ll fiddle with it more if you think it will be used (Wget returns a load of bumph at the moment & Each item should be properly attributed)

Thanks

Sam

I’m not opposed to VOA-content (though not personally thrilled by it). What I’m most concerned with is file size. We’ll be reducing the maximum size from 100kB to 50kB pretty soon, so spending time on chunking files would be a good idea.

We may even go so far as to limit the uploads to 10kB–just so we can ensure that something new is always coming down the pipe.

2 Likes

I’m not thrilled by VOA either, but it is creative commons & regularly updated…

VOA already split into regions, so we could do;

6 Africa items (18kb)
6 USA items (18kb)
6 Asia items (18kb)
6 Middle East items (18kb)
6 Europe items (18kb)
6 Americas items (18kb)

Plus 12 general Spanish languge items from 20minutos (46kb) (I’ll work out a way of removing the images from this)

And 12 In italian (36kb)

And maybe 12 Mandarin (If I can work out how to sort characters)

So a total of 220kb split over 9 files per day?

1 Like

Sounds like a plan.

1 Like

This is good - - please let me know if I can help out.

As to VOA - - I agree it’s not the best, but it is public domain, it’s there TODAY, and it is very newsy. I’ve looked at other public domain sites, but they don’t have the “look and feel” of global unbiased content I believe Outernet needs to offer.

Perhaps other Forum Members could make some additional suggestions. Ken

PS I hope you all like my new Profile Picture - - it is the antennas at dusk at the Varberg Radio Station at Grimeton - - an old (1922) VLF transmission facility still occasionally in operation (using an Alexanderson alternator) close to Varberg, in Halland, Sweden - - a World Heritage site.

1 Like

Hi Ken

You could have a look for a Wordpress plugin / method to strip images out Rss feeds and add the author to them?

thanks

Sam

Not all the feeds have a lot of pictures - - VOA has only 1 VOA logo. The RFA feeds only have titles and short summaries with no pictures - - so they may not be so good to use.

The 20MINUTOS.ES - Internacional is probably a good one to experiment with as it has a good bit of text and pictures.

Am I correct? Am I approaching this correctly? I’m new at this aspect of rendition. Ken