AM Research is a division of Aspiring Minds. Aspiring Minds aspires to build an assessment-driven job marketplace (a SAT/GRE for jobs) to drive accountability in higher education and meritocracy in labor markets. The products developed based on our research has impacted more than two million lives and the resulting data is a source of continuous new research.


A cocktail of assessment, HR, machine learning, data science, education, social impact with two teaspoons of common sense stirred in it.

AI powered coding platform ‘understands’ programs that do not compile!

If you ever took a coding job test on a machine, you will probably frown if you couldn’t make your code to compile. Your program might be almost right, but due to some silly bug, unidentified in a small time frame, you will get a ZERO.

 

syntax_error

 

Not any more! Aspiring Minds’ research team has created a technology which can detect how good the program’s algorithm is, even if it doesn’t compile.

How do we do it? First, we can fix some of the codes using artificial intelligence. By looking at patterns in good compilable codes, our algorithms minimally modify existing programs to make them compilable. By using this approach, we can compile 40% of uncompilable codes. Once compilable, our patented machine learning based algorithm can generate a grade which mimics human raters.

Fancy as it may seem, we had a harder problem to solve. What about the codes that do not compile? Using smart static analysis of codes, we are able to derive features, signatures of the logic of the program, from these codes automatically. With these features and a customized form of our machine learning algorithm, we can provide grades as accurately as you could think!

On a set of programs attempted for a job in a large e-commerce player in USA, we find that 46% codes were not compiling, but weren’t blank.

Our AI based algorithm found that 6% of these codes, for 596 students, had nearly correct logic. Another 29% candidates, with a little bit of guidance, would have reached the right logic. All these candidates deserved a shot with the company!

In another data set of a technology giant in China, we find that 27% candidates whose codes do not compile, have sound programming logic.

What is more? Our AI algorithm can provide feedback to all candidates whose code do not compile. To some, we can tell how to fix their programs and make them to compile. To all, we can give them feedback on their algorithmic approach, tips to reach the correct logic and provide feedback on the stylistic and maintainability issues in their code.

Disappointed with coding platforms which gives everyone a poor score and no feedback… We have corrected this for all times to come!

- Varun Aggarwal with Rohit Takhar
Learn more about Automata – our coding platform.

Plan what NOT to do in 2017!

A new year is on the horizon. For many people it is time to make resolutions about what to do in the coming year. This year, instead of focusing entirely on what you want to do, consider thinking more carefully about those things you want to avoid. Our recent research, also covered by WSJ print, found that the secret to success is knowing what NOT to do and then not doing it! For instance, there were many things during this past year that experts advised should not be done – such as NOT to do a Brexit, NOT to elect ultra-nationalist voices and NOT to demonetize one’s currency without a plan. Only time will tell whether these were actually bad decisions. We find that recognizing a bad decision and avoiding it is far more important for success than focusing on the best things to do.

NT01

Figure 1

Our evidence came from tracking job success. We found that the most successful salespersons, customer service agents and managers weren’t those who chose the “best” course of action in a given situation, but rather were those who knew what NOT to do in a situation and avoided those actions. For instance, in a situation when you are very late for a sales meeting, what one absolutely should not do is fail to apologize. On the other hand, there might be different ways one could apologize or show regret, some being better than others, as deemed by experts. However, our work showed choosing among these different ways of expressing regret was not predictive of one’s success in a sales job. What mattered was the ability to identify what should not be done (i.e., expressing no regret). The wrong response may seem obvious in this situation, but it isn’t obvious to everyone and is also not obvious for many other situations.

Our study was based on a methodology called situational judgment testing. We provided candidates with a series of specific situations and asked them to choose from among a number of possible ways to respond to each situation – a technique known as situational judgment testing or SJT (see Figure 2 for an example). We asked them to choose which of the options presented for each situation would be the best way to respond and which would be the worst. We then analyzed the data to see if their choices predicted actual job performance (such as sales targets achieved) for a few different roles.

Figure 2: Sample question from a SJT

We expected that the people who were most successful in the workplace would be those who were able to identify what experts in the field said were the best ways to respond to each scenario. It turns out that was not the case. Instead, what we found was that the people who were most successful on the job were those who were correctly able to identify the worst answer to a larger number of situations. They knew what course of action was important to avoid for more scenarios. Specifically, the correlation between the ability to correctly identify the worst responses and job performance ranged r = 0.28 to 0.33 and was statistically significant. By contrast, the correlation between the ability to correctly identify the best responses to the scenarios and performance ranged r = 0.14 to 0.16 and was not statistically significant.

This work has important ramifications, the first and most immediate of which is being able to filter and hire better performers simply by concentrating on whether they know how to avoid doing the wrong things, which are typically widely-agreed upon, rather than trying to find people who pick the best answer. This should influence interview methodologies, case based discussions and other ways of candidate evaluation.

Another significant contribution is to the field of situational judgment testing. Unlike IQ tests, situational judgment tests are traditionally hard to standardize. Different organizations, functions and cultures have different notions of the ‘best’ way to handle a situation. Thus, with the best answer philosophy, one needs to build different tests and scoring mechanisms for each. On the other hand, the contribution of our work is that the ‘worst’ answer is more universal and consistent across diverse environment. It suggests that the development of SJTs can be relatively standardized across fields of study in a way that has not previously been possible.

Above and beyond all of these, the results have implications for our daily lives. Specifically, the results suggest that maybe this year you ought to concentrate on what ‘not to do’ and train your mind to avoid those things! Our conjecture is that it will lead to better happiness to your lives.

Make a start and list out what things you will avoid doing in 2017. We have our top on the list…Not write boring blogs!

-Varun and Steve

World’s first automated motor skill test – exploiting the power of touch tablets

Results presented at Ubicomp 2016, Heidelberg, Germany

Knowledge and cognitive ability tests have been automated and are taken on computers for more than three decades now. Pretty much all of you would have taken a SAT, GMAT or a GRE. What about motor skills? They are needed for almost all vocational jobs, say a plumber’s manual dexterity in fixing a screw. The best tests for them still are these bulky boards, pegs and instruments.

pegpin
Fig 1. Manual motor skill testing equipment – Pegboard and Pinboard

No one till date really thought about exploiting the power of the touch interfaces to develop such tests. Touch screen based devices are now ubiquitous in form of mobile phones and tablets. We wanted to find whether we can test people’s skills, say in tailoring and machining, by making them do things on the tablet. We wrote creative apps to make them do various actions on a tablet — rotating their fingers, pinching them, moving their elbows and shoulders to trace… and so on.

touch1motor3

 

 

 

 

 

 

 

 

 

         Fig 2. Touch interface device (Tab)                                            Fig 3. Snapshots of our motor skills assessment apps

We reported in our Ubicomp paper, presented last week, that the scores from these tests actually do predict the speed and accuracy of industrial tasks done by machinists, tailors and machine operators. In fact, they are better predictors than the bulky manual tests! Our test scores can predict all parameters of task performance measured by us. The correlation ranges 0.19 to 0.37, similar to what a logical ability test would predict for a knowledge worker. In comparison, manual test scores correlate significantly only for 4 out of 7 task performance ratings and ranges 0.19-0.33.

This has great implications for the training and job matching of vocational workers. Using these apps, vocational job aspirants can test their motor skills at the comfort of their homes. They can get feedback and work on improving their skills. Also, if they perform well, they can generate credentials such as “Motor skills certified for a tailor” and highlight them to employers. The same assessments can be used by the industry to filter and recruit high performing employees.

We are happy to present the world’s first validated motor skill test. There is so much more opportunity for further research – figuring out which scores correlate to performance in which task, creating a job to score map, creating more innovative apps and so on… Let us do it with the power of the touch interface.

-Varun

The first interactive US Skill Demand Map- A big data approach

Jobseekers wish to know what skills are required by the industry in their region and also, what skills pay the most. So do institutions of higher and vocational education. Unfortunately, there is no information about this. It is considered hard to collate such information and the old school way of running surveys with corporations is time-consuming, expensive and mired by subjectivity.

We went after this problem the big data way – we scrapped some 4 million job openings from the web for the US, automatically matched them to our taxonomy of 1064 job roles and the 200+ skills required for these job roles. What did we get out of this? The US Skill Demand Map – For each state in the US, we know what percent of open jobs require a given skill and how much does a skill pay. For instance, see the Heat Map below — it shows how much does the software engineering skill pays in different US states.  All this is generated automatically and be updated in minutes every month based on the current open jobs in the market!

 Figure 1: Compensation for software engineering skill

Figure 1: Compensation for software engineering skill

This map is interactive. A jobseeker can enter his key skill to find which states demand it the most and which states pay for it the most. Additionally, s/he can scroll across the map to find the demand/compensation in each state for a given skill. On the other hand, the candidate can enter a state and find out top paying and high-demand skills in the state. Try it now!

Such analysis also helps us uncover policy trends (See our report). We found that agreeableness and finger dexterity are the most in demand skills after Information Gathering and Synthesis, which has the highest demand. One may see in the map below the states which have more percent of jobs requiring agreeableness and those where finger dexterity is required more often.

 

Figure 2: Skills in highest demand in each U.S. state (other than Information Gathering & Synthesis)

Figure 2: Skills in highest demand in each U.S. state (other than Information Gathering & Synthesis)

On the other hand, we can find the states which have the most demand and pay the most for say, analytical skills. New York pays the most for the skill, whereas the highest percent of jobs in Virginia need analytical skills. (See Figure 3)

Figure 3: Heat maps for demand and compensation for analytical skills

Figure 3: Heat maps for demand and compensation for analytical skills

The U.S. Skill Demand Map fills a major information gap in the labor market. To our knowledge, this is the first effort to objectively present the demand for skills across US states to aid better decision-making by job seekers. It is based on objective data, it is quick, accurate and user-friendly.

Trying to understand what skill to gain or how best to utilize your skills? Use our interactive map now!

-Varun

Scaling up machine learning to grade computer programs for 1000s of questions in multiple languages

Machine learning has helped solved many grading challenges – spoken english, essay grading, program grading and math problem grading to cite a few examples. However, there is a big impedance in using these methods in real world settings. This is because one needs to build an ML model for every question/prompt – for instance, in essay grading, a different model designed to grade an essay on ‘Socialism’ will be very different from one which can grade essays on ‘Theatre’. These models require a large number of expert rated samples and a fresh model building exercise each time. A real-world practical assessment works on 100s of questions which then translates to requiring 100s of graders and 100s of models. The approach doesn’t yield to be scalable, takes too much time and most of the times, is impractical.

In our KDD paper accepted today, we solve this challenge quite a bit for grading computer programs. In KDD 2014, we had presented the first machine learning approach to grade computer programs, but we had to build a model per problem. We have now invented a technique where we need no expert graded samples for a new problem and we don’t need to build any new models! As soon as we have around a few tens of ‘good’ codes for a problem (automatically identified using test case coverage and static analysis), our newly invented question-agnostic models automatically take charge. How will this help us? With this technology, our machine learning based models can scale, in an automated way, to grade 1000s of questions in multiple languages in a really short span of time. Within a couple of weeks of a new question being introduced into our question pool, the machine learning evaluation kicks in.

There were couple of innovations which led to this work, a semi-supervised approach to model building:

  • We can identify a subset of the ‘good’ set automatically. In the case of programs, the ‘good set’, codes which get a high grade, can be identified automatically using test cases. We exploit this to find other programs similar to these in a feature space that we define. To get a sense of this, think of a distance measure from programs identified as part of the ‘good set’. Such a ‘nearness’ feature would then correlate with grades across questions irrespective of whether it is a binary search problem or a tree traversal problem. Such features help us build generic models across questions.

  • We design a number of such features which are invariant to the question and correlate to the expert grade. These features are inspired by the grammar we proposed in our earlier work. For instance, one feature is how different is an unseen program from the set of keywords present in the ‘good set’; while another is the difference in the programs in the kind of computations they are doing. Using such features, we learn generic models for a set of problems using supervised learning. These generic models work super well for any new problem as soon as we get our set of good codes!

Check out this illustrative and easy-to-grasp video which demonstrates our latest innovation.

 

The table presents a snapshot of the results presented in the paper. As shown in the last two columns, the ‘question-independent’ machine learning model (ML Model) constantly outperforms the test suite based baseline (Baseline). The claim of ‘question-independence’ is corroborated by similar and encouraging results (depicted in last three rows) obtained on totally unseen questions, which were not used to train the model.

Metric
Question Set
#Questions
ML Model
Baseline
Correl
All questions
19
0.80
0.65
Bias
All questions
19
0.24
0.35
MAE
All questions
19
0.57
0.85
Correl
Unseen questions only
11
0.81
0.65
Bias
Unseen questions only
11
0.27
0.31
MAE
Unseen questions only
11
0.59
0.84

What does this all mean?

  • We can really scale ML based grading of computer programs. We can continue to add new problems and the models will automatically start working within a couple of weeks.
  • These set of innovations apply to a number of other problems where we can automatically identify a good set. For instance, in circuit solving problems, the ones with the correct final answer could be considered a good set; this can similarly be applied to mathematics problems or an automata design problem; problems where computer science techniques are mature to verify functional correctness of a solution. Machine learning can automatically then help grade other unseen responses using this information.

Hoping to see more and more ML applied to grading!

Varun

Work done with Gursimran Singh and Shashank Srikant

Tweets

Hear Pralay Mondal, Senior Group President at @YESBANK speak on The #future of #Businesses and #technology led tran… twitter.com/i/web/status/9…

We are back! We thank everyone for your extended support and eagerly look forward to meeting you. 13 December 2017… twitter.com/i/web/status/9…

With deep regret, we inform you that due to #MumbaiRains & cyclone warning issued by Govt, we have postponed the As… twitter.com/i/web/status/9…

The stage is set! We eagerly look forward to meeting you today at the @AspiringMindsAM product launch event.… twitter.com/i/web/status/9…

Start early! #placement preparation should start from 1st year of college and not in the #final year! @varaggarwaltwitter.com/i/web/status/9…

Pralay Mondal, Senior Group President @YESBANK to speak on The #Future of #Business and #Technology led transformat… twitter.com/i/web/status/9…

Meet our impressive line up of speakers at Aspiring Minds #DIGIHRSUMMIT. @Joydeeproy Pralay Mondal @YESBANKtwitter.com/i/web/status/9…

7 days to go! Register today to participate in the launch of the latest innovation from Aspiring Minds Research lab… twitter.com/i/web/status/9…

#ArtificialIntelligence is powering the #nextgen #recruitment process. Are you up for it? Register for Aspiring Min… twitter.com/i/web/status/9…

Join us for Aspiring Minds new product launch event. Date:05 December 2017 | Place - The Grand Hyatt, Mumbai. Block… twitter.com/i/web/status/9…

Team AM & #ESSCI exhibit vocational & motor #skills assessments at @MSDESkillIndia National Entrepreneurship Awards… twitter.com/i/web/status/9…

Demand of Spoken #English #skill across job roles. Philippines National Employability Report 2017 Read-… twitter.com/i/web/status/9…

"Migration musts" - Robust #skills #assessment AND International Skill Certificates says @himanshu0820 @skillbharat financialexpress.com/india-news/why…

#AMNERPhili, the first #employability study on grads of #Philippines. How job-ready are the youth of Phili? Read on… twitter.com/i/web/status/9…

"Tamil Nadu ranks 4th in job openings in India" reveals AM's #skillmapindia Read report by @timesofindia #skills bit.ly/2z1y3h9

Get privy to amazing facts on employability & #skills of graduates in Philippines Read full report here #AMNERPhili aspiringminds.com/philippines-na…

Bots and humans will need to work as team as their #skills complement each other~@varaggarwal #automationtwitter.com/i/web/status/9…

#AMNERPhili launches at Contact Islands 2017 @ccapph today! Watch this space for more updates. #ContactCenter #BPOtwitter.com/i/web/status/9…

#Software developers and sales ppl most-in-demand by Ind employers Skill Map - India, 2017 Read on… twitter.com/i/web/status/9…

65% Philippines grads lack right job skills- AM's Philippines National Employability on 12 Oct@#ccapph #AMNERPhili news.abs-cbn.com/business/10/05…

Catch @varaggarwal share a sneak preview of 1st ever '#Philippines National Employability Report' @ #ccapph Stay tu… twitter.com/i/web/status/9…

#skillmapindia 2017 is a #BigData view of the top #jobs and #skills preferred by employers across the country. pic.twitter.com/inRoR0TUbz

Twitter Media

AM's #SkillmapIndia reveals #SoftwareDeveloper #Jobs top dd chart in Corp India. Eng more desired than Logic&Maths… twitter.com/i/web/status/9…

Aspiring Minds launches #SkillMapIndia 1st ever interactive platform showing #jobs & #skills dd in India #SkillIndia business-standard.com/article/pti-st…

Hear @varaggarwal CTO & Co-Founder Aspiring Minds speak at #DigitalHaryanaSummit #DigitalTransformation Haryana twitter.com/nasscom/status…

AspiringMinds #NER shows glaring reality of India's #employability quotient Triggers @NITIAayog 3Y action plan #jobs bit.ly/2xNLyBq

Serious #training intervention needed in #skills which #Nursing assistants lack. #healthcare #jobs #careertwitter.com/i/web/status/9…

Team @AspiringMindsAM thanks our honored guests for gracing #AMNEC2017, Pune Chapter. @sanjyotkhare @sonyadctwitter.com/i/web/status/9…

Using #datascience to assess employees and find the #right organizational fit: Sushant Dwivedy @AspiringMindsAM at… twitter.com/i/web/status/9…

Rajat Mathur-VP Client Solutions, Aspiring Minds addressing #career office heads at #AMNEC2017 #Pune @myamcat #jobstwitter.com/i/web/status/9…

The stage is set! Aspiring Minds kick starts National Employability Conclave 2017 | Pune Chapter #AMNEC2017twitter.com/i/web/status/9…

Sushant Dwivedy, Head, Enterprise Client Solutions @AspiringMindsAM addressing #IBPAP HR Summit #AMIBPAP pic.twitter.com/QBfxOd7gWo

Twitter Media

Very interesting discussions on Day 1 #IBPAP HR Summit 2017. #AMIBPAP #HRTech #talent #disruption pic.twitter.com/msBuVjR5yj

Twitter Media

Team AM at #IBPAP #HR Summit 2017 #Philippines. The stage is set for some interesting conversation around humanizin… twitter.com/i/web/status/9…

Just 1 day to go! Hear Sushant Dwivedy, Head Enterprise Client Solutions, Aspiring Minds speak at #IBPAP #HR Summit… twitter.com/i/web/status/8…

45% #nursingschool graduates lack basic medical knowledge | Skills Nursing Assistant -2017 by Aspiring Minds #skills indiatoday.intoday.in/story/45percen…

Consolidation of #edtech in #India will see a few select players emerging as leaders - @varaggarwal @livemint bit.ly/2wvQyMW

Aspiring Minds at #IBPAP #HR Summit 2017. Watch this space for interesting conversations around #talent quality,… twitter.com/i/web/status/8…

80% employee turnover is due to bad hiring decisions @HarvardBiz Soak up sum really gr8 hiring & #InterviewTipstwitter.com/i/web/status/8…

Aspiring Minds' predictive #assessment algorithms improve recruit efficiency by over 35% Read on @livemint @myamcattwitter.com/i/web/status/8…