Our friends at Bengali Wikisource are asking for help with web scraping PDF files from a DSpace-based library.
If you are interested, reply here or mail me at firstname.lastname@example.org
Last week, I had an interesting meeting with the Panjabi Wikimedian community and the CIS-A2K team.
The Panjabi Wikimedia community is small, but each member contributes their best. Many of them do 100-days-of-wiki, a personal wiki edit-a-thon that runs for 100 days. A few of them do it on multiple sites, several times a year.
Their interest in contributing and their passion for their language are awesome.
We talked about Wikisource, Wiktionary and Wikipedia, and I shared many ideas to improve their workflow. They are looking for many tools to automate their tasks. Those tools will be useful for all wiki communities.
Then I had some great discussions with the CIS-A2K team. We spoke about many interesting project ideas.
Listing all the ideas here.
1. List the top 10 tricks/hacks/must-knows for any Wikisource project.
2. Make simple tutorials on how to start contributing to wiki, in all possible languages. We still don't have an ebook or an easy starter guide in Tamil. There may be video tutorials. Curate them and present them so that they are easy to find.
3. Telegram bot to proofread Wikisource content. Get a page from Wikisource. Split it into lines, then words. Show a word and the OCRed content in the Telegram bot. The user should verify it or type the correct spelling in Telegram itself. Submit the changes to Wikisource. Thus, we can make collaborative proofreading easy.
4. Explore how to use Flickr to help photographers donate their photos to Commons. Flickr makes it easy for them to upload and showcase their work. From there, we should move the photos to Commons. A few tools are already available. Explore them and train photographers to use them.
5. We should celebrate the volunteers who contribute to wiki, through events, news announcements, interviews etc. CIS may explore this.
6. Web application for OCR4WikiSource
7. Make a web application to record audio, upload it to Commons and add it to Wiktionary words. Explore Lingua-Libre for the web app.
8. Make a mobile application to record audio, upload it to Commons and add it to Wiktionary words.
9. CIS may ask language-based organizations to release their works/tools under public licenses.
10. A one/two day meeting/conference to connect various language technologies. Each team can demonstrate the tools they are working on. Others can learn and use them for their languages. CIS may organize this soon.
11. Building spell checkers for Tamil. Learn how other languages are doing it. Odia seems to have a good spell checker. Explore that.
12. For iOS, there is no Commons app to upload photos. There was one some time ago. Fix the iOS Commons app and release it again.
13. Build maps in local languages with OSM.
14. One/two day training on wiki tech, like gadgets, tools, Toolserver, API, etc.
15. Tweet marketing to promote the ebooks released in wikisource projects. Measure the downloads.
16. CIS may talk with Amazon about keeping the ebooks from Wikisource permanently free on Amazon.
17. Explore Valmigi project of malayalam, chikubuku of kannada – for their ebooks.
18. Download ebooks from dspace, bengali books – West Bengal Public Library Network – url – http://dspace.wbpublibnet.gov.in:8080/jspui/
19. Explore paid works for wikisource proofreading.
20. Blog on how Tamil (ta) Wikisource obtained 2000 ebooks from the TN government under a public domain license. Send it to CIS. They may try to do the same for other languages.
21. The ASI website has info about all monuments. Scrape it all and add it to wiki.
22. Scrape details from tourism sites and add them to wiki.
23. A Kannada archaeology site has tons of images, but with 3 seals added on every image. Scrape them, remove the seals and add them to Commons.
24. Tool to audit wiki sites, covering new users, edits, measurements, KPIs, reports etc.
25. Discuss with wiki writers and help them automate their tasks. Build new tools to help them. Train them on existing tools.
26. Get existing photos from many photographers. Get license doc. Add in OTRS. Have a team to upload the photos to commons.
27. Find the pages that don’t have images. Search in commons and add 1 image automatically.
28. The infobox in a wiki page may have 1 image. Check the same page in other languages, get the image from its infobox and use it in the pages that lack one.
29. Tito showed a broken JS script. Explore it and fix it.
30. Discuss with Victor and the Google team how to improve the OCR feature and integrate it with Wikisource. Explore existing tools like http://tools.wmflabs.org/ws-google-ocr/ and https://wikisource.org/wiki/Wikisource:Google_OCR
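The text handling behind idea 3 (split a page into lines, then words, and write a verified word back) can be sketched in a few lines of Python. This is only a rough illustration: the function names are made up for this post, fetching the page from Wikisource and talking to Telegram are left out, and writing a correction back collapses repeated spaces within that line.

```python
def split_into_words(page_text):
    """Yield (line_number, word_number, word) for every word on the page."""
    for line_no, line in enumerate(page_text.splitlines(), start=1):
        for word_no, word in enumerate(line.split(), start=1):
            yield line_no, word_no, word


def apply_correction(page_text, line_no, word_no, corrected):
    """Return the page text with one word replaced by the user's correction.

    Line and word numbers are 1-based, matching split_into_words().
    Note: words on the edited line are re-joined with single spaces.
    """
    lines = page_text.splitlines()
    words = lines[line_no - 1].split()
    words[word_no - 1] = corrected
    lines[line_no - 1] = " ".join(words)
    return "\n".join(lines)
```

A bot could walk through `split_into_words(page)` one word at a time, ask the user to confirm or retype it, and call `apply_correction()` before submitting the page back.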
Thanks to Ravi, Tito, Tanveer, Dan, Charan Singh, Manavpreet, Rupika, Gurlaal and Stain for the interesting meeting and great ideas.
We can work on these ideas and implement them soon.
If you are interested in working on any of these ideas, reply here or mail me at firstname.lastname@example.org
The Tamil TTS system provided by IITM and SSN College of Engineering had one issue. It can convert only one Tamil string to audio at a time.
Because of this, we cannot do parallel conversion. A few full-text books took 4-5 hours to convert. So we could not offer it as a web application for public use.
Mohan helped to make the Tamil TTS Script simpler to process multiple
Here is the super script that does the magic.
Thanks, Mohan, for your great work.
Now, we need to turn this into a web application, so that anyone can use it easily.
The requirements are below.
1. user registration with gmail
2. user should upload a tamil text file
3. once it is converted, user should receive an email with the link to
the audio file
4. we can keep the audio files for 1 week
5. REST API support with authentication
6. A queue system
All of this will be released under the GPL.
If you are interested in doing this, reply here or write to me.
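To give an idea of requirements 4 and 6, here is a minimal Python sketch of a job queue with a one-week retention check. It is not the real system: `convert_to_audio()` is a hypothetical stand-in for the actual Tamil TTS call, and the single worker thread is an assumption that follows from the engine handling one string at a time.

```python
import queue
import threading
from datetime import timedelta

RETENTION = timedelta(days=7)  # requirement 4: keep audio files for 1 week


def convert_to_audio(text):
    """Hypothetical placeholder for the real Tamil TTS conversion."""
    return text.encode("utf-8")


def is_expired(created_at, now):
    """True if an audio file created at created_at is older than one week."""
    return now - created_at > RETENTION


def worker(jobs, results):
    """Take jobs off the queue one by one until a None sentinel arrives."""
    while True:
        job = jobs.get()
        if job is None:
            break
        job_id, text = job
        results[job_id] = convert_to_audio(text)


def run_jobs(texts, num_workers=1):
    """Queue every text, convert them in order of arrival, return the results."""
    jobs = queue.Queue()
    results = {}
    threads = [
        threading.Thread(target=worker, args=(jobs, results))
        for _ in range(num_workers)
    ]
    for thread in threads:
        thread.start()
    for job_id, text in enumerate(texts):
        jobs.put((job_id, text))
    for _ in threads:
        jobs.put(None)  # one stop signal per worker
    for thread in threads:
        thread.join()
    return results
```

In a real web application the queue would need to survive restarts, so a database-backed or broker-backed queue would replace the in-memory `queue.Queue` used here.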
A few months ago, I was talking to Siva. He is part of Tamil Heritage Activities. They often take people to Mahabalipuram and explain all its great history.
They are looking for a mobile app that automatically explains the nearby place, using the person's geolocation.
If you go near the Tiger Cave of Mahabalipuram, the app should explain all about the Tiger Cave. The data can come from a Wikipedia page or some custom web service.
We can extend the same to any place, for example to all the temples in Kanchipuram, or even all over the world.
Wikipedia and Wikidata can be great starting points for providing the required information.
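The core lookup, finding the place nearest to the visitor, can be sketched in Python with the haversine distance formula. The coordinates below are rough placeholder values typed in for illustration only; a real app would load them from Wikidata or a custom web service. The 200 m trigger radius and all names are assumptions too.

```python
import math

EARTH_RADIUS_M = 6371000  # mean Earth radius in metres

# Placeholder data: approximate coordinates, for illustration only.
PLACES = [
    {"name": "Tiger Cave", "lat": 12.6584, "lon": 80.2115},
    {"name": "Shore Temple", "lat": 12.6165, "lon": 80.1993},
]


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points, in metres."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def nearest_place(lat, lon, places, max_distance_m=200):
    """Return the closest place within max_distance_m, or None."""
    if not places:
        return None
    best = min(places, key=lambda p: haversine_m(lat, lon, p["lat"], p["lon"]))
    if haversine_m(lat, lon, best["lat"], best["lon"]) > max_distance_m:
        return None
    return best
```

The app would call `nearest_place()` whenever the phone reports a new location and start the explanation for the returned place, if any.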
Now, we are looking for Android/iOS developers to build this as an open source mobile application.
If you are interested in this, mail me at email@example.com or reply here.
Feb 18 was my birthday. Nithya and I don't celebrate each other's birthdays, as every day is very precious for us.
I never gifted her anything, and neither did she. But this time, she surprised me with a great gift.
Yes. It is a Free Software 🙂
She wrote a grammar checker (Conjunction Checker) for Tamil and released on Feb 18, with a nice blog post. It is here in Tamil https://nithyashrinivasan.wordpress.com/2018/02/19/birthday-wishes-for-shrini-with-a-gift/
For a decade, it was my dream to create a great spell checker for Tamil as Free Software. The Conjunction Checker is one of the big parts of it.
It has a great many rules; she read them all, understood them and converted them into Python code.
It is an awesome birthday gift.
Nithya! You are my best gift. Thanks is a simple word. It can't express what I mean.
She is improving it a lot with more rules and testing.
Here is the code – https://github.com/nithyadurai87/tamil-sandhi-checker
She made a video about this tool – https://youtu.be/eC82S7wOr3E
Today, she spoke about this at the “ILUGC – Indian Linux Users Group – Chennai” meet. So happy to see her giving a talk in my homeland.
I am working on making it a web application, so that anyone can use it.
Thanks to the open-tamil Python library for all its great features for handling Tamil text easily.
I see Nithya as an avid reader, writer, dancer, homemaker and now a programmer.
You can read all her books here – http://freetamilebooks.com/authors/nithyaduraisamy/
Hoping to see more women contributors like Nithya in the upcoming days.
2017 was a great year for me, like every year. Sharing here how it went. Yes, it is too late to write this now. That is because of the few lifestyle changes I am following. It is OK to do many things slowly and drop a few things. Am I right?
I had been trying to reduce my ever-growing tummy for years. In 2017, I made a serious series of efforts.
Got fever 4 times. Those bad days reminded me how bad my health is. It is time to eat good food and take the siddha medicines which gave me enormous immunity in my school/college days. I started taking the medicines prepared by my dad.
Stopped having cool drinks and fried items, pizza, burgers etc.
Office work kept me learning new things. Elasticsearch, Spark, Druid, Python3 and the AWS API are some great things I explored. For each technology I started to work on, I read one book on it in full.
Wrote 57 blog posts in this blog and 6 posts in Tamil in my Tamil blog. This is low. Will try to increase it next year. I am writing a book on Python in Tamil and hope to release it by next year. But I have read plenty of books this year. I love reading more than writing. Lazy me.
Thanks to PacktPub’s One free ebook per day, Kindle Unlimited Plan and All authors of FreeTamilEbooks.com for providing tons of great books to read.
With the Kindle device, I enjoyed plenty of hours in continuous reading.
Got an award for contributions to Tamil computing in 2016 from The Tamil Literary Garden, Canada. Received Rs. 50,000 as prize money. Working on creating an organization with this money to do more contributions with more volunteers. Will ask for donations soon.
Was living a simple, minimalistic lifestyle.
Quit TV, Facebook and the smartphone two years ago. Still able to live without them. Prioritized life in the order of Health, Family, Office, Personal growth, FOSS Contributions.
Thus, the year 2017 was so great. It gave me great lessons via bad experiences too. Those are opportunities for me to learn about people and world.
Hope you too had a great year. Hoping that 2018 will be even more exciting for all of us.
It uses offline data to provide a smooth user experience. OsmContributor and OsmAnd are a few other Android apps for contributing to OSM by adding POIs. But we can feel the lag when navigating the map, as they fetch real-time data on every move over the map.
Maps.me is super fast, but that is because it uses only offline data. Maps.me provides data updates only once or twice a month.
We can see the POIs added by us. But when we run a mapathon kind of event, many volunteers will be roaming around a city. Sometimes, they come to a street where the POIs have already been added by someone else.
When you use maps.me for adding POIs, you cannot get the POIs added by other volunteers immediately. You have to wait for the updates provided by the maps.me team.
This leads to the same POI being added many times. Even checking for the same POI using another app is tedious.
Checked with the maps.me team about providing an option to update the data whenever required. But it is not on their roadmap.
Discussed this with the “OpenStreetMap Asia” Telegram group. They replied with a few ideas.
All we need now is a mobile app to download the desired data from hotosm and place it in the proper path of maps.me.
We can develop this as a separate mobile app and distribute to all.
If this is done, anyone using maps.me can click another app, and update the osm data whenever required.
Reply here / Contact me if you are interested in doing this as a Free/OpenSource Project.
My mail id – firstname.lastname@example.org
Update – 1
Last week Saturday, met Mr. Mohamed Kasim, from IIT Bombay.
He demonstrated a tiny, lightweight, cute laptop to me.
It is an 11.6″ laptop, based on an Atom quad-core processor. It has 4 GB RAM, 32 GB NAND flash, an expansion slot through which one can attach an optional 2.5″ HDD/SSD, and most importantly, 10 hours of running time, thanks to a 10,000 mAh battery. It has two USB ports, a mini HDMI port and a microSD card slot. It also has WiFi and Bluetooth.
This laptop comes with Debian Linux operating system and a lot of useful software, such as LibreOffice, LaTeX, Scilab, OpenModelica, Inkscape, GIMP, Blender, OpenFOAM, eSim, Synfig Studio, and various editors and browsers.
The cost is 10,000 INR.
I checked its performance. Opened LibreOffice, Eclipse, GIMP and Blender, and explored their basic operations. The performance was smooth. Couldn't see any lag or heating.
Currently, the laptop is available to institutes and organizations for bulk orders. The service and support centers are being set up across the country.
It is really lightweight. Its price is very low for its performance. You will want to buy it immediately if you see it and experience it. The same happened to me.
Read here the announcement from Prof. Kannan, IITB
A brochure on this laptop is available here: https://goo.gl/hKfGfs.
If you are interested in buying it for your organization/college/school, contact Mr. Mohamed Kasim on mohamedkasim AT iitb DOT ac DOT in
I try to watch movies in theaters most of the time. But when some good movies come, they are gone from the theaters by the time I decide to watch them. In Chennai, most movies wind up in 2 weeks.
Last week, Nithya and I watched the movie “Mayavan”, which I got from a friend who had downloaded it from the internet. The movie is really interesting, and the concept, acting, direction and cinematography are all really nice. Never expected such things from a Tamil movie. The next day, we had a discussion about the film “Anbe Sivam” and immediately watched it on YouTube. Yes, the full movie is available there.
We regretted missing “Mayavan” in the theater. Neither of us is active on social media or news sites. We get to know about good movies by word of mouth only. When friends praise a movie, we try to book in a nearby theater. Most of the time, it would have gone by then. We wait a few more weeks to get a good HD pirated digital copy of the movie and watch it.
Piracy is evil. It is killing the cine industry. But it is unstoppable.
Yes. I agree with that. But what should we do when there is no screening of the movie near us? Do we have to cross cities to watch a movie?
When I was a kid, in my native Kanchipuram, there was Indira Theater. It would screen good movies a second time. When a movie went out of its first-run screens, Indira Theater would screen it again within a month. There was also Pandiyan Theater, which gave good movies a third and fourth run. Nowadays, those two theaters are closed. Can't see such theaters in Kanchipuram or in Chennai.
Discussed with Nithya a system to pay for the movies we see as pirated copies or on YouTube-like websites. She said that she can pay 100 Rs for each movie she sees from home.
Here is a concept.
1. A mobile app or web site to get donations.
2. It will list all the movies
3. User can select a movie
4. The producer's bank details are displayed here. The user can add them in their own bank and transfer money, or donate via our app/site.
5. The user clicks the “Donate” button
6. It asks for the amount and redirects to a payment gateway
7. Payment can be made via credit card, debit card, net banking, PayTM-like systems etc
8. Once the payment is done, the amount should be credited to Producer’s account.
If such a system is available, most of us will donate money for the movies we like. We can start with Tamil movies, then expand to other languages.
The alternative now available is “Amazon Prime Movies”. I am still not a Prime member and have never experienced it. Don't know whether it will be a hit in a country like India, where broadband is still a dream in most places.
Offline watching is still the usual thing here. Most of the people who travel by train or bus are watching a movie. For them, offline sharing is the easiest way to watch movies. I think Amazon Prime cannot reach them, as there is no good internet connection while traveling here.
I believe that such a donation system will work out, if done properly and marketed by all the movie stars.
We need the following to build such a donation system.
1. servers to run the application
2. mobile apps for android/iOS
3. Web site/ web application
4. Movies list
5. Producer’s account details
Need to discuss with the producers and get their account details.
We can expand the same idea to software industry, music industry and ebook industry too.
Comment here if you agree, disagree, or are interested in doing this as a project.
Promote this idea to startups and investors. If someone can do it, that's happy news. If no one does, we can do it.
I went to my home town Kanchipuram, India for the Christmas holidays. We had a good, active Linux Users Group called KanchiLUG there a few years ago. We still have a few members there doing nice work.
Decided to have a mini mapping party at Kanchipuram on Dec 25, 2017. Sent a mail to our mailing list and asked people to join the party.
One volunteer replied: T. Dhanasekar. We met on Dec 25 at 10 am. Created an account for him on OpenStreetMap.
Neither of us uses a smartphone. I borrowed my wife Nithya's phone. He did not get one. I don't have a motorbike there, and I had already borrowed his bike. Hence, we decided to roam around the city together and add interesting places to OSM.
I explained the maps.me app and how to add POIs. We found many schools, temples, shops, clinics etc and added them. For a few POIs, we did not find suitable types in maps.me.
Will ask the maps.me team to add more types.
In 2 hours, we added 75 places. There are still tons of places to add at Kanchipuram. We will add them in upcoming days.