Backend & UI Desktop Application Development (OCR In C++) Jobs in Mumbai - IIT Bombay

Backend UI Desktop Application Development OCR In C

IIT Bombay

0 Years

Not disclosed

Expired

Posted: 09 Sep 20

Job Description

About the work from home job/internship

Selected intern's day-to-day responsibilities include:

1. Work on algorithm optimization, interface, and application development
2. Write well designed, testable, & efficient code by using best software development practices
3. Work on the development of applications employing the backend of C++
4. Engage in UI development using QT creator
5. Stay plugged into emerging technologies/industry trends and applying them to operations and activities
6. Occasionally working on the pipeline of OCR text correction, to understand the ground scenario (converting scanned text to digital text, with manual correction of OCRed text)
7. Develop the next generation of algorithms to allow our users to enjoy a smooth experience whilst using the application
8. Debug and resolve issues using open communities like Stack Overflow and GitHub

Skill(s) required

Linux

Algorithms

User Interface (UI) Development

C++ Programming

English Proficiency (Spoken)

English Proficiency (Written)

Learn these skills on Internshala Trainings

Learn C++ Programming

Learn Business Communication

Who can apply

Only those candidates can apply who:

1. are available for the work from home job/internship

2. can start the work from home job/internship between 3rd Sep'20 and 8th Oct'20

3. are available for duration of 6 months

4. have relevant skills and interests

Other requirements

1. Expertise in key C++ terminologies including vectors, templates, etc.

2. Understanding of pointers and OOPs is a must

3. Proven background in coding competitions like HackerRank

4. Ability to formulate a development problem, design, experiment, and implement solutions in C++

5. Should be self-motivated to fix the issues while coding and look for the solution in open communities like Stackoverflow and Github

6. Good to have experience working with UI development with C++ using tools like QT Creator

7. Must carry their own laptops

Perks

Certificate

Letter of recommendation

Flexible work hours

5 days a week

Additional Information

Optical character recognition (OCR) is the process of converting the document images into an editable electronic format. This has many advantages like data compression, enabling search or edit options in the images/text, and creating the database for other applications like machine translation, speech recognition, and enhancing dictionaries and language models. OCR in Indian languages is quite challenging due to richness in inflections. Using open-source and commercial OCR systems, we have observed the word error rates (WER) of around 20-50% on printed documents in four different Indic languages. Moreover, developing a highly accurate OCR system with accuracy as high as 90% is not useful unless aided by the mechanism to identify errors. So, we started with the problem of developing 'OpenOCRCorrect', an end-to-end framework for error detection and corrections in Indic-OCR. Our models outperform state-of-the-art results in 'Error Detection in Indic-OCR' for six Indic languages with varied inflections and we have solved the out of vocabulary problem for “Error Correction in Indic-OCR” in our ICDAR-2017 conference paper. We further improve the results with the help of sub-word embeddings in our ICDAR-2019 conference paper. Currently, we are targeting Sanskrit. Although the OCR tools available online do a decent job on English texts, they are not optimized for Indic languages. Thus developing an OCR model for the same is our concern. The model should be able to detect text with maximum level accuracy and should be able to draw bounding boxes on each line of the text. Further, in the digitization process of such texts, the second step would be spelling correction and formatting of the text detected by the OCR models. 'ICDAR 2019 Post-OCR competition': Our team 'CLAM' secured 2nd position in the multilingual PostOCR competition at ICDAR'19. Our model achieved the highest corrections of 44% in Finnish, which is significantly higher than the overall topper (8% in Finnish).

Number of openings

Job Particulars

Role it software engineer

Education Diploma, B.Com, M.Com, BCA, BE/B.Tech, BSc, MCA, ME/M.Tech, MSc, PG Diploma, 12th Pass (HSE)

Who can apply Freshers

Hiring Process Face to Face Interview

Employment TypeInternship / Projects

Job Id994616

Job Category IT/Software , Diploma

Locality Address

State Maharashtra

Country India

About Company

About IIT Bombay

The Indian Institute of Technology, Bombay (IITB) is one of the fifteen higher institutes of technology in the country set up with the objective of making facilities available for higher education, research, and training in various fields of science and technology. Prof. Ganesh Ramakrishnan is attempting to facilitate the empowerment of people in rural areas in terms of livelihood, education, and skill generation through Information and Communication Technology (ICT). IIT Bombay has also honored Prof Ganesh's work on 'Adaptive framework for end-to-end corrections in Indic OCR'.

Jobs By Location

Mumbai

Noida

Bangalore

Hyderabad

Pune

Chennai

Delhi

Kolkata

Ahmedabad

Gurgaon

See All mumbai Jobs

See All noida Jobs

See All bangalore Jobs

See All hyderabad Jobs

See All pune Jobs

See All chennai Jobs

See All delhi Jobs

See All kolkata Jobs

See All ahmedabad Jobs

See All gurgaon Jobs

Others also searched for

IT/Software Jobs

Diploma Jobs

Jobs in Mumbai

ARE YOU A FRESHER? REGISTER NOW

Looking for your first Dream Job?

Update Resume

Upload Resume

Active Jobs By Type

View all

Active Jobs By Category

View all

Backend & UI Desktop Application Development (OCR In C++) Jobs in Mumbai - IIT Bombay