
Write a bot for this dataset: 'Finance Licences (Securities) - Financial Market Council (Tunisia)'

Briefing

Your mission is to write a bot (some code) to transform a website into open data.

This is the site in question: http://www.cmf.org.tn/htm/publication_cmf/intervenants.htm

This is one for people with technical skills - specifically coding in Ruby or Python (other languages are in the pipeline).

Example

This is the website, which contains links to the PDFs holding the data.
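
As a starting point, collecting the PDF links from that page might look something like the sketch below. It is only an illustration, not part of the Turbot framework: the names `PdfLinkParser` and `find_pdf_links` are made up for this example, it assumes the PDFs are linked through ordinary `<a href="...">` tags, and it expects you to have already downloaded the page's HTML as a string.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# The page named in the briefing; relative links are resolved against it.
PAGE_URL = "http://www.cmf.org.tn/htm/publication_cmf/intervenants.htm"


class PdfLinkParser(HTMLParser):
    """Collects absolute URLs of every <a> tag whose href ends in .pdf."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Ignore any query string when checking the extension.
        if href and href.lower().split("?")[0].endswith(".pdf"):
            self.links.append(urljoin(self.base_url, href))


def find_pdf_links(html, base_url=PAGE_URL):
    """Return the PDF links found in an HTML string, in page order."""
    parser = PdfLinkParser(base_url)
    parser.feed(html)
    return parser.links
```

Calling `find_pdf_links(html)` on the downloaded page should give you the list of PDFs to fetch and parse next.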


This is an example from the PDF to show the kind of data you should scrape. Extract the headings and associate them with company names. Take a look at the example below:

{"category": "List of approved companies for the exercise of the activity of Listing Sponsor", "company_name": "Amen Invest", "business_activity": "Intermédiation en bourse", "address": "9, rue du Lac Neuchatel. Les Berges du Lac - 1053 Tunis."}
{"category": "List of approved companies for the exercise of the activity of Listing Sponsor", "company_name": "Biat Capital", "business_activity": "Intermédiation en bourse", "address": "Boulevard principal-angle rue Turkana et rue Malaoui- Les Berges du Lac, 1053 Tunis."}
...
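
One way to associate each heading with the companies listed under it is to remember the most recent heading while walking through the rows. The sketch below assumes you have already extracted the PDF text into rows of cells with a PDF library of your choice; the function names `rows_to_records` and `to_json_lines`, and the rule that a single-cell row is a heading, are assumptions made for illustration and may need adjusting to the real PDFs.

```python
import json


def rows_to_records(rows):
    """Yield one record per company row, tagged with the latest heading.

    Assumption: a row with a single non-empty cell is a heading, and a
    company row has at least three cells (name, activity, address).
    """
    category = None
    for row in rows:
        cells = [c.strip() for c in row if c and c.strip()]
        if len(cells) == 1:
            category = cells[0]
        elif len(cells) >= 3 and category is not None:
            # Addresses often wrap across lines in the PDF; collapse them.
            address = " ".join(" ".join(cells[2:]).split())
            yield {
                "category": category,
                "company_name": cells[0],
                "business_activity": cells[1],
                "address": address,
            }


def to_json_lines(records):
    """Serialise records as one JSON object per line, keeping accents."""
    return [json.dumps(r, ensure_ascii=False) for r in records]
```

Printing each string returned by `to_json_lines(rows_to_records(rows))` gives output in the same shape as the example above.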


Here's how we suggest you go about it:

  1. Start by clicking 'Accept this mission' on this page. Don't worry, you can always give up if you can't finish it.
  2. You'll write the scraper using our "Turbot" framework. Head over to the Turbot website and click "Start contributing" to read a getting started guide.
  3. If you have any questions, whether they are technical or about the data, get in touch and ask!
  4. When you think you've written a suitable bot, submit it for review using the Turbot command line tool.
  5. Once we've checked over the data, we'll either tell you what needs to be fixed or accept the bot, which means your mission is complete!

Still not sure? Don't worry!

Whilst this does require you to be able to code, it's probably not as hard as you think. Take a look at our example bots to get a feel for what's required.


