ILUGC Hackathon – 2 – Wikipedia Hackathon – July 23, 2017


Announcing our second hackathon on July 23. This time it is all about Wikipedia.

Venue :

Hexolabs Interactive Tech Pvt Ltd, Type II/17, Dr.VSI Estate,
Thiruvanmiyur, Chennai 41. Phone – 044 42169699  Near NIFT, Opposite
to Origin Towers.

https://goo.gl/maps/XtTZXXDf3Ku
https://www.openstreetmap.org/node/4978570060#map=17/12.98271/80.25278

Date : July 23, 2017
Time : 10.00 AM – 5.00 PM

Must:

* Bring your laptop
* Knowledge in any programming language

Good:

* To bring any internet device like dongle or 4g smartphone to get your own internet, as there is limited speed in available internet

Pre-Learning:

Exploring these links and installing them is desired.

Installing medaiwiki-
https://www.digitalocean.com/community/tutorials/how-to-install-mediawiki-on-ubuntu-14-04

https://www.mediawiki.org/wiki/Manual:Installing_MediaWiki_on_XAMPP

wikitools – Python Library
https://github.com/alexz-enwp/wikitools/wiki/Documentation

Mediawiki API https://www.mediawiki.org/wiki/API:Main_page

Gadgets https://www.mediawiki.org/wiki/Gadget_kitchen

Hack Ideas:

If you, or Tamil or any other language wiki needs any programming solutions, share the ideas here.

Examples :
1. Report of contributions of all TN school teachers. Usernames will
start as TNSE. Need a report like https://ta.wikipedia.org/s/6s9e

2. Fixing the titles, moving the pages automatically, if they have
errors on page title.

3. Install Tamil TTS – https://www.iitm.ac.in/donlab/tts/index.php
and try to use it for wiki pages.

Registration :

To register, add your name in the following wiki page.
bit.ly/2u5AnT1

If you dont know tamil, just mail me your interest to attend.

Contact:

T Shrinivasan  tshrinivasan@gmail.com 98417 9546 Eight

Minutes – Intro to Wikipedia – Villupuram


Last Sunday, I gave a talk on Wikipedia at Villupuram GLUG. We had around 30 participants. Got few School Teachers, Writers, Media people too.

Started the session with a game. Started a story with one line. Asked everyone to continue one line after another. Thus the story was built collaboratively.

This experience helped them to understand how wiki pages are being written. Explored about wikipedia history, Foundation, Languages, /various wiki projects like wikisource, wiktionary etc.

Explained about issues by copyright, Creative commons license and Commons.

Then, Poet Ramamurthy expressed his thoughts about wikipedia and public contributions.

Journalist Ko.senguttuvan shared his thoughts on copyrights. He requested all to document their knowledge so that the next generation can use it. He presented me a book he wrote about his Journalism experiences.

Teacher Dhilip narrated his efforts on enhancing the government schools with various ICT activities.

Then, we started the practical sessions. Asked all to create an account on wiki. Then, explored about the pages, history, talk page, visual editor, language tools. Asked them to edit the page for villupuram and asked to add few points.

This was a very initial intro session. Hope they get some idea about wiki ecosystem.

Thanks for the organizers, Puduvai GLUG and Villupuram GLUG. They are doing great activities there by introducing Free Software every sunday.

Special thanks to Karkee, Khaleel and Sathish for great efforts on this event.

Few Photos are here – https://goo.gl/photos/JwXmZwYN1QCvnDfH8

Thanks for Dinamani Newspaper for writing about the event.

Intro to wikipedia – a talk at Villupuram


Tomorrow, Sunday, 18-06-1983, I am giving a talk at Villupuram on “Introduction to Wikipedia”.

Inviting you all to the event. Thanks for Villupuram GNU Linux Users Group and Puduvai GNU Linux Users Group for organizing this event.

Date : 18-06-2017 Sunday

Time : 10.00 am

Venu : Bodhi IAS Academy,
Shanthi Nilayam
No:10, Vishvalingam Layout,
Villupuram – 605602

Contact : 995 253 408 Three , 750 227 341 Eight

https://lh3.googleusercontent.com/6z78qfC9Y5TfaFLpPO-bfhS8v2GhgXDhwigcnkb-YaL5huakn1p0jSKfsQ5SzdYCcvRoh_vArXBvW96FgyIP4i3K3d35Vrlw9m3D3mcFvGK0z3Q21BHJt8waEbdoCGEopfGq5dGlU5rD77CEUt6CXUrsups6BpFnHBItZ64fRsvlMOsH7cO_vJOkUZr7RtmPxqJwcDu46pcajvrFaGbNva7CGe3qvrBB8FES0ci_7X5LwNgpQf_iej2cubPF2QgqspdOHLiBxG4u97GcoWgQQmQhfDgwyCo-G8SVroFykIC7oA9nJf0fLJlZ6kjxAwCDCXiUfbjWEX0WuX1WBsIWZMlNgbDy_rwAaBdb94jgP8bTkDpdnrR4enF5YLay8J6l0l62pJVDNFh-l8CpSSysndDWyV_8sBOQKgd8P3_7OdCzs29tGgR62T6OriCcEfC2dkOpFg3gGMuGcA55xCPDbyiuq2F_j_0-yn8IBvI6FCdZYtnzRdGWtjbfimgLJkD42lMLOV2KAL6febwmeZB6kdKEhyQYiF4jzUHvjga_9elV2Z5uWyyRiXGVQUWZHiJxMDk1n0lmAlPValsHNiipwmeW0H0TUxggF0wqx-0kfDIbGCuM3Gw5bQ=w389-h550-no

 

Project Idea – Telegram bot to translate strings for Open Source Projects


telegram bot க்கான பட முடிவு

In wikimedia hackathon, I saw a demo of using telegram bot to translate strings from translatewiki.net

here are the notes about it.

============

Telegram Translation Bot: https://phabricator.wikimedia.org/T131664 DONE

Translate on translatewiki.net without leaving your Telegram app

Code: https://github.com/amire80/mediawiki-telegram-bot/

mediawiki.org page: https://www.mediawiki.org/wiki/User:Amire80/chat_bot_draft

Phabricator: amire80 * Wikipedia: Amire80 * Twitter: @aharoni

Amir E. Aharoni and Taras Bunyk presenting

Justin Du (MtDu), Taras Bunyk, and help from Brian Wolff, Madhvuvishy, bd808, Niklas Laxström, Jon Robson, and more people!

“Most people don’t speak English”

Translatewiki.net – thousands of messages to translate

can now translate through this simple mobile app instead of needing to load the full site in a browser

selects untranslated strings, in your preferred languages, sends them to you, and you translate, and it submits them to translatewiki

Long messages are automatically skipped to fit a use on mobile.

============

Thinking as we can build a bot to translate the strings for mozilla and openstreetmaps.

Need to get your inputs/thoughts/ideas for this.

translate க்கான பட முடிவு

These links may help to build a telegram bot for translations.

https://github.com/zanata/zanata-python-client

https://translate.zanata.org

https://translate.zanata.org/iteration/view/TamilMap/1/settings?dswid=1182

use this command to get the po file in /tmp/ta.po

zanata po pull –url https://translate.zanata.org/ –project-id TamilMap –project-version 1 –transdir /tmp

We can process the po file using polib

http://polib.readthedocs.io/en/latest/quickstart.html

There are many python libraries to create a telegram bot.

http://telepot.readthedocs.io/en/latest/

https://khashtamov.com/en/how-to-create-a-telegram-bot-using-python/

https://blog.pythonanywhere.com/148/

https://www.codementor.io/garethdwyer/building-a-telegram-bot-using-python-part-1-goi5fncay
With all these tools to create a bot, to process Po files and zanata to host the translations, we can connect them all.
If any one is interested in programming for this, reply here.

Thanks.

Image sources:

http://www.asktrustdee.com/2016/03/my-top-5-telegram-bot.html
https://commons.wikimedia.org/wiki/File:Translate_en-ta.png | CC-By-SA

How Wikimedia movement should be in 2030?


Today, we had a discussion on strategy for wikimedia movement for 2030, with few Tamil wikipedians, media, government, academic friends.

Wikimedia Foundation is planning on what are the things we should focus on wiki ecosystem, to make it even better for the people in 2030.

 

 

I added the following thoughts.

1. Space for adding tiny data.

In future, there will be a drastic change on computers and input devices. There will be voice inputs. Computers will be embedded in all devices. They will be communicating to each other. They will be enabled with augemented reality, virtual reality, artificial intellience to get and express various data. There should be a common data source to get any data from. Wikimedia movement should be that common data source, for all devices.

There should be options to give input as tiny bits. Knowledge should be shared by anyone, in any form. It should not be only in text form or as article. sound input and tiny bits of inputs should be allowed in wiki. Those content should be automatically translated into many languages.

For example, I should ask a device like “Hey, what is the movie shown in nearby theatres to me? ” The device should get that data from wiki. I should ask “What is the price of a TV in chennai and in Austria? ” It should get the details from wiki and reply me in voice in my language. Wiki should allow these kind of data.

2. Decentralised Wikis

Git like decentralised wiki editing will enable, more content coming from the poor internet countries.

The following are the inputs from other friends.

1. Archiving old books like google’s one million books project

2. Archiving old photos, pamphlets, advertisements, magazines

3. Connect with many organizations, governments to get their archives released in CC license

4. Connect with mobile/camera manufactures to add CC license details within the device

5. Connect with social media sites like Facebook to add options to release media files in CC license

6. Finding false news within the flood of information will be a huge problem in the coming days. Is it possible for wikipedia to verify and authenticate the news that are being shared on social media?

7. wiki should be help to build education materials for school/college students, so that all the world get free resources for studies.

8. Data should be added in a unique way, so that it is transformed to multi formats, multi languages automatically.

Hope these will be discussed by the foundation team and taken forward further.

 

After the meeting, met photographers dillibabu and sudharsan. They agreed to give their 1000s of high quality photographs taken around India. I will upload them to commons. Will share the details once started.

Thanks for Wikimedia Foundation for starting these kind of discussions. We need to plan for the future and make it happens. Visit the website and http://2030.wikimedia.org/ and share your thoughts on building a better future for wikimedia projects.

Learning on how to set plans for FreeTamilEbooks.com and kaniyam.com projects. Let us dream a lot and make them true.

 

Few more photos are here – https://www.flickr.com/photos/tshrinivasan/albums/72157684407240936

The things I liked at Mediawiki Hackathon 2017, Vienna


Wikimedia hackathon mark horizontal.svg

The Mediawiki Hackathon 2017 was organized at Vienna, on May 19,20,21 2017 very well. I liked many things in the entire event. Listing them here.

1. Awesome Event Organizing Team

Wikimedia Austria has 3 full time employees. They provided their full support all the times, from the event announcement. The announcement pages are full with all the required information. Annemarie Buchmann helped me to get done with all the visa processing on time. For every email I sent, got reply within a hour. It was so awesome to get helping hands from far away, so quickly.

2. Venue/Stay/Food at same place

JUFA Wien City was the venue for the event. It is a big Hotel with conference room, mini halls, Rooms for stay, bar, restaurant, play area, kids area, park and 24 hrs free snacks room.

As we stayed in same place, there is no delay in reaching the hackathon rooms. Just wake up, cleanup, jump into the event, till midnight. Then, reachout to room and sleep.

3. WiFI everywhere

The JUFA team provided good WiFI for the event and rooms. Never felt any disconnection.

4. Free travel Tickets

Anne, sent me few travel tickets to roam in and around the city for 5 days. From airport to return to airport, all the travel was covered by the tickets provided. With that we roamed around the city, in trams, in underground trains etc.

5. City Tour

The event team arranged for two city tours. They took people to a grad church and palace. Vienna is a historic city, full heritage monuments.

6. Party

On the Second day, we were invited for a party, at nearby pub/bar. Danced with the hackers. The rain, made us to dance till  early morning 2 am.

7. Free Snacks/Tea/Coffee

The food provided was good. But to make all of us awake anytime and keep energized, there were free snacks/fruits/tea/coffee.

8. Regular updates from the organizing team

We received emails in regular intervals for preparing the event, calling for volunteering for blogging,photographing etc. Those emails made us to be excited about the event a month ahead.

9. Telegram/IRC chat

We had an IRC channel and telegram channel for quick chat. we can ask anything, search for anyone, ask for chargers, connectors, projectors, etc. All the requests were solved by entire team.

10. Mentor program

Experienced programmers willing to mentor, volunteered as mentors for newbies. They trained,shared their skills and helped to build applications. They met daily twice and discussed on how they can improve their assistance to others.

11. Dedicated Photography/Videography

There were dedicated photographers/videographers to cover the entire event. They are wikipedia volunteers who are contributing to commons. They happily volunteered for this. Wondered to know that wikimedia austria lended them high quality cameras and lenses.

12. Media Coverage

Local Press/Media people interviewed about the event and published online and other channels.

13. Short intro talks

On the sessions like inauguration, valedictory, there are only very short intro talks. These short talks save a lot of time and gave us plenty of time to hack.

14. Multiple Connectors for Projector on stage

We had connectors like VGA/HDMI/Apple for various laptops on every projector.

15. Lot of power sockets

We could roam anywhere in the venue and we found power sockets to plugin our devices. There was no shortage of sockets.

16. Freebies/Cards/Chocolates

We got a little watercan and other freebies on the registration desk.

17. Volunteer to take notes on etherpad on all meetings

On all meetings, trainings, we found one volunteers is taking notes on etherpad. It helped the speaker and listeners to concentrate on the talk, without worrying about jotting something on a notepad.

18. Privacy on Photographs

To give respect to the privacy on photographing people, they provided a Orange color tag. No one should take photo of people who are wearing the Orange Tags. For others, it is a blue tag.

Following people are behind this awesome people.

Program

Event management

Scholarship committee

Photos Courtesy: https://www.mediawiki.org/wiki/Wikimedia_Hackathon_2017/Participants#Team

I thank you all for all arrangements and smooth orchestration of the event.

I suggest all the above features to be in all the events, we conduct. Even very small actions, add more value to the events.

Thanks for the Wikimedia Foundation, Wikimedia Austria and all participants, mentors and volunteers.

Notes : Mediawiki Hackathon 2017 – Day 2


Just now returned from a party by the event organizers, at a nearby pub Arena. It is so much refreshing, to dance after a long time.

Today, was filled with so much of hacking the project I am working on. Learned a lot of new jargons in Docker arena. Finally, I could install the LinguaLibre software in my laptop with the help of dockerisation by Pablo.

LinguaLibre is a web application, which can help you to record audio files with the web browser itself. I am thinking on adding a backend job for this software, to send all the audio files to commons and add them in relevant wiktionary pages.

Schematic image

1.
I tested all the docker fixes by pablo for new installations, so that any one with docker can install the software easily. I am working on integrating python backend scripts. stuck somewhere in the deep hole. Hope pablo will help to create new docker containers for the python scripts.

2.
I helped Dafna, who is next to pablo, on installing Lingualibre on her laptop. She speaks arabic language. She wanted to give support for arabic writing. i.e right to left to this software. She started to hack the Javascript files and to play with PHP files immediately. Pablo is a great mentor. He explained all the internals of symphony, doctorine, git etc to her. She is almost done with her fixes. Good to see the things are happening so fast.

3.
LibraLingue is in French. I translated all the french strings to English, with the help of google translator. Hmmm. Now, I feel better to use the software. Finally, I know what I am clicking on the screen.

4.
Was discussing with praveen, to have some time on fixing the iOS app, for commons. Few years ago, it was there on the iTunes store. But now, it is not available. He agreed to explore on this. If you are an iOS developer, and can help to work on this app, comment to this post. Will connect the interested people. The source code is on GitHub .

5.
Met Tim on the steps. He worked with OpenStreetMaps. Asked him for the tools available to add street names to OSM with any mobile. Most of the mobile apps for OSM, give the facility to add Point of Interest(POI) like shops, buildings. But, India, what we need and miss mostly are the street names. He asked to explore vespucci. we can get it from Googles play store and from amazon and from f-droid.

6.
Attended a session on Future of LDAP Extensions, by Robert Vogel. We discussed about dropping this and moving the Pluggable SSO extension, as it gives more features. Brainstromed on the requirements and ideas to implement. I asked to give a web interface to test the server and authentication details, and to show the error messages on the screen. You can read the notes here.

 

7.
Attend a session on ORES. It is an XRay engine to the wikipedia contents. It has all the interesting buzz words like machine learning, artificial intelligence, building models, training dataset etc. explored on how to use it in English wikipedia, how the training and scoring works. Asked him if this can be used to score and filter the contents apart from wikipedia like wordpress/facebook comments. He will look on this.

Someone asked as what will happen if I train the ORES wrongly and use it to score all my bad edits to good. He replied as to remove all yours training data and train again with some good person.

Remembered this XKCD in machine learning.

 

8.
Checked with the design team, if it is possible to add a floating tool bar to the wikisource text area, so that we can easily add the formatting strings and symbols. They agreed for this. Tried to do as a simple hack. But it needs more efforts. So added an issue on phabricator to work later.

9.
Petr Bena demonstrated his tool Huggle.  One of the great part with this hackathon, I found is to meet and discuss with the people, who created the software we use in our daily life. ORES, Huggle and lot of software were demonstrated by their original creators. Huggle is a desktop application to do gatekeeping the wikipedia articles easily. We can see the recent changes, allow them or delete them quickly.

File:Huggle3 kde ubuntu.png

Asked him to provide support for Tamil wikipedia. He raised a ticket for this immediately. Dear Tamil Wikipedia patrons, Here is a great tool for you. Let us explore it.

10.
Met a photographer, Manfred werner. He lives in Vienna, doing photography as hobby. He helps to shoot all the events of Wikimedia Austria. The chapters lends him good cameras with big lenses. He is so passionate on commons. He explained me the guideliens he follows on uploading to commons, the photography workflow he follows, about copyright issues on various countries.

11.
There was a group photo session. A paronamic view is available here.

12.
There was a DJ party for us, in a nearby pub. We reached there around 10. It was excited to see the hardcore hackers are dancing like regular dancers here. Enjoyed dancing with the geeks. These english songs are for mild, easy dancing. I asked the DJ guy, if he can play any Tamil Rap song for a high energy dancing. Downloaded the song, “Night Varia” from Pudupettai. That is one of the song for which I dance like a monster, with all my heart and bones. Thought of introducing this song to Vienna. Unfortunately, the DJ system does not support input from mobile. Have to check with my audio engineering friends on how to fix this.

Then, DJ played some fast instruments. Those drums made all of us to dance. It started to rain. So, we could not go out. Continued to hangout there till 1 pm. Then reached JUFA hotel.

It is 3.41 am now. Can see still there are some monks are sitting in the lobby and hacking on their projects. I am done for the day. Tomorrow, will be more interesting.

Thanks for all who encouraged my blog posts and photos.

Here are todays photos collections – https://www.flickr.com/photos/tshrinivasan/albums/72157683964819586

Notes : Mediawiki Hackathon 2017 – Vienna, Austria – Day 1


Reached Vienna yesterday evening. It is a nice, clean, low crowd, cold city. Walked around the city till midnight.

The Day 1 of the hackathon started with a welcome session. Organizors explained about the event. Then, mentors introduced themself and many people pitched for working together with their projects.

We are 260 participants from 48 counties. This is the very first time, I work with these many multi country people. Though we are from around the world, everyone followed friendly space policy. It gave a good feel of being with my own friends.

After the intro session, I was roaming around the halls/sessions to find a team for me. I did not even know what project to do. Was thinking on few issues on the task board.

1.
Met Abbas and Ibrahim in Cafe. Abbas is from Iran and living in Austria. He is a Journalist. He explained the issues faced by refugees. It takes around 5 years for them to be accepted here as refugees. Till then, they can not work anywhere. They have to live politely here. He is thinking of making them to contribute for wikipedia and get good score from the wikimedia austria, so that they can get some goodwill and faith to get accepted as refugee.

He spoke about how the arabic wiki community is growing slowly and the issues they face. It is similar to Tamil community’s issues and growth.

2.
Met Praveen. He is an indian student who studies masters here. He is an iOS developer. Gave some project ideas like audio recording app for wiktionary for iOS. He is interested on this. Routed him to the newbies intro session to wiki development.

3.
Met Aaron Halfaker. He is the developer of ORES. An Xray engine for the wikipedia. It can be used to determine the edits are good or bad. It involves machine learning based on the manual scoring training. Asked him, if we can use it for Tamil wikipedia. He demonstrated how it works. Will attend his session tomorrow for a deeper demo. He asked me to collect the list of bad words in Tamil and to collect the patterns of the pages we delete in Tamil Wikipedia. Will explore on this further.

4.
Met the Magioladitis, the developer of AutoWikiBrowser. Happy to know that they are working on a web version of it. They are using javascript to develop it. Wished and thank the team on behalf of GNU/Linux users community.

5.
Had a discussion with Maxlath, founder of http://inventaire.io A book inventory application based on the wikidata. He uses wikidata to get all the info about the books automatically. Explored the site and gave few feature requests so that all of us can use the site.

6.
Met Vivek Maskara, a guy from Banglore. He was a windows application developer then. Now an android developer. His team is adding more features to commons android upload tool. Explained him about the windows tool to bulk upload the images to commons. We need a windows application to add metadata line name, details to images offline. Then, with single click, all the images should be uploaded to commons.

This is the very first community event for him. He asked why you guys are contributing to wiki and other open sources.

Demonstrated my wiki tools.

  1. OCR4WikiSource – https://github.com/tshrinivasan/OCR4wikisource
  2. Audio recorder for wiktionary – https://github.com/tshrinivasan/voice-recorder-for-tawictionary
  3. Photos uploader for commons – https://github.com/tshrinivasan/mediawiki-uploader

Explained him how these open source projects are helping indian language communities to grow and how programmers can help to the core wikipedia content contributors.

Dear photographers. Wait for few more days. The mass upload tool for windows is on the way. Vivek, Spend some time on this in the upcoming days.

9.
Found Hugo Lopez, from France, who is working on a Massive Open Audio Recording Project, LinguaLibre Got the  source and tried to install. Got some issues on installation. Will fix it soon.

He is a linguist, teacher, OSM, wikipedia contributor. He found his anchestors language Gascon, was destroyed by the french, just to promote french as one language form France. In 1910, there were 6.7 million people who spoke Gascon. In 1990, it is 2.11 and in 2010 it is only 0.2 million people who knows Gascon. The French government pushed french everywhere. School kids were beaten of gascon or any other language was spoken except French. This is how a language dies. The same thing is happening for Tamil. To kill and to promote Hindi as one national language, Indian government is doing all the things, it can do.

Huge found Lingua Libre, as a great tool to preserve the languages. We can speak any words and it can record all the sounds. This is the one web application I am dreaming of so many years. And it has been done already. Agreed to add few features like connection with commons and wiktionary. For this have to learn PHP. But it will be worthy effort to add more features to this.

10.
Met Johan Uhle from mapbox.com. He came with his cute girl kid. Asked him about the workflow, tools for translating the OpenStreetMaps in Tamil. He is wondering to know these possible with mapbox and the localized maps are happening slowly.

Evening was filled with fun events with Karoke. Enjoyed the songs sung by various people. Had dinner with fun discussions about marriages on India VS Europe, education systems etc.

There were many sessions for newcomers, wikidata, semantic wiki, and more. Will plan for attending few talks tomorrow. Wikidata is getting more like a rockstar. Seems it can do tons of magics. Will explore it soon.

The day 1 is ending now. Its around 12.00 midnight. Hearing the “Wow.Awesome” sounds from the Karoke floor. Still they are singing. Going to join with them for a group song now. See you all tomorrow.

Here are few snaps I took – https://www.flickr.com/photos/tshrinivasan/albums/72157680964356743

 

 

 

 

 

Attending MediaWiki Hackathon at Vienna, Austria


Hackathons are the events I love most. We sit together with fellow developers, pick any task, focus and work till we make some minimum viable product or complete the project.

I have attended mediawiki hackathon, chennaigeeks hackathons, and Tamil Open Source hackathons in chennai. They gave great results and good projects.

For the first time, I am going to attend an international hackathon at Vienna, Austria on 19-21, May 2017.

 

Wien1-pan

Get full details here – https://www.mediawiki.org/wiki/Wikimedia_Hackathon_2017

Happy to know that my application got approved. Visa process went smoothly and I am all set to go.

Planning to work on these projects.

1. Upload/import wizard for Wikisource works – https://phabricator.wikimedia.org/T154413

2. Notification: Your file was used – https://phabricator.wikimedia.org/T77154

3. Statistics dashboard for data on Wikidata – https://phabricator.wikimedia.org/T138697

Will blog my experiences here.

Hoping to meet other contributors for mediawiki.

Thanks for the organizers for the opportunity.

Announcing OCR4wikisource


There are many PDF files and DJVU files in WikiSource in various languages. In many wikisource projects, those files are splited into individual page as an Image, using proofRead extension.

Contributors see those images and type them manually.

This project helps the wikisource team to OCR the entire PDF or DJVU file, using the google drive OCR. Then it will update the relevant page in the wikisource with the text.

Grab the python code from here and run in your GNU/linux machines.

https://github.com/tshrinivasan/OCR4wikisource

It is based on
https://github.com/tshrinivasan/google-ocr-python

Reply here with your suggestions and improvements.