Singapore
3K followers 500+ connections

Join to view profile

About

Experienced (5+ years) data scientist with expertise in prototyping and delivering AI…

Activity

Join now to see all activity

Experience & Education

View Raja’s full experience

See their title, tenure and more.

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Licenses & Certifications

Publications

Projects

  • A bottom-up recommender system for tourism industry

    - Present

    Developed a content based recommender system for travel destinations by aggregating cosine similarities between user preferences and the underlying Points Of Interests (POIs)

    Created a dataset containing 230k+ POIs by web scraping

    Extracted POI features using the TF-iDF approach and Universal Sentence Encoder architecture

    Executed an end-to-end pipeline involving data collection, data exploration, data visualization, data cleaning, feature extraction and model…

    Developed a content based recommender system for travel destinations by aggregating cosine similarities between user preferences and the underlying Points Of Interests (POIs)

    Created a dataset containing 230k+ POIs by web scraping

    Extracted POI features using the TF-iDF approach and Universal Sentence Encoder architecture

    Executed an end-to-end pipeline involving data collection, data exploration, data visualization, data cleaning, feature extraction and model development

    See project
  • Exploration of Neural Machine Translation architectures

    -

    Built end-to-end encoder-decoder models to generate English translation of Spanish sentences

    Implemented advanced sequence to sequence machine translation algorithms from scratch (using Pytorch) such as the Transformer} and sub-word modelling

    Achieved high BLEU score of 24.55 (Stanford CS224 en-es dataset) by exploring different attention mechanisms, beam search strategies and hyperparameter tuning

  • Mitigating unintended bias in filtering offensive online conversations

    -

    Detected toxic (rude, obscene, insulting) comments in online conversations while minimizing unintended biases associated with mention of identities referring to race, gender and sexual orientations

    Developed and fine-tuned NLP models using state of the art algorithms such as BERT, GPT-2 and XLNET to achieve high model performance

    Designed a custom loss function accounting for the unintended biases to optimize the target AUC metric and secured a silver medal for finishing in…

    Detected toxic (rude, obscene, insulting) comments in online conversations while minimizing unintended biases associated with mention of identities referring to race, gender and sexual orientations

    Developed and fine-tuned NLP models using state of the art algorithms such as BERT, GPT-2 and XLNET to achieve high model performance

    Designed a custom loss function accounting for the unintended biases to optimize the target AUC metric and secured a silver medal for finishing in the top 2% of the Kaggle competition

  • Product Classifier for Shopee National Data Science Challenge 2019

    -

    Predicted category of e-commerce products based on their images and title descriptions

    Built a CNN image classifier by fine-tuning pre-trained VGG-16 and MobileNet model architectures

    Trained LSTM and GRU models with GloVe and fastText embeddings for text classification

    Secured 3rd rank among 360 teams by ensembling the predictions from machine learning models with XGBoost, adopting K-Fold cross-validation and incorporating creative feature engineering

    See project

Honors & Awards

  • 3rd place: Feedback Prize - Predicting Effective Arguments Kaggle Competition

    Kaggle

    https://www.kaggle.com/competitions/feedback-prize-effectiveness/overview

  • 3rd place: NBME - Score Clinical Patient Notes Kaggle Competition

    Kaggle

    https://www.kaggle.com/c/nbme-score-clinical-patient-notes

  • Silver Medal: Jigsaw Unintended Bias in Toxicity Classification Kaggle Competition

    Kaggle

    https://www.kaggle.com/competitions/jigsaw-unintended-bias-in-toxicity-classification

  • Student Competition Winner: EMI 2017 Conference

    Engineering Mechanics Institute Conference (EMI 2017)

    http://emi.ucsd.edu/
    Best student paper presentation

  • President's Graduate Fellowship (PGF) in NUS

    National University of Singapore (NUS)

    Exceptional promise and accomplishment in research

  • Academic Excellence Award IIT Kanpur

    -

    Excellent coursework performance

  • DAAD-WISE Scholarship for Summer Intern

    German academic exchange

Languages

  • English

    Native or bilingual proficiency

  • Hindi

    Native or bilingual proficiency

  • Bengali

    Native or bilingual proficiency

More activity by Raja

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named Raja Biswas

View Raja’s full profile

  • See who you know in common
  • Get introduced
  • Contact Raja directly
Join to view full profile