TAHSIN MAYEESHA

https://tahsin-mayeesha.github.io/

Dhaka, Bangladesh
+8801872124350
tasmiah.tahsin@northsouth.edu

Work experience

Data Scientist

2020/122021/3

NSU HCI DIAL Lab

Project : Alor Akash Research on intersection of gender, technology usage and financial inclusion among Bangladeshi women. Facilitated by NSU ECE dept, funded by Bill and Melinda Gates Foundation. Worked on text visualization and analytics of interview transcript data of digital financial inclusion.

Google Summer of Code Participant

2019/42019/8

Tensorflow, Google

Worked with Tensorflow-Hub team on text embedding modules. Features prototype ULMFiT implementation, pretrained embedding exporter and a Bangla text classification notebook.

Google Summer Of Code Participant

2018/32018/8

Berkman Klein Center of Internet And Society, Harvard University

Worked on Mediaviz , a python network visualization library created for my project Automating Force Based Layout Scaling And Network Visualization For Media Source Networks From MediaCloud Topic Graphs during GSOC 2018.

Data Scientist

2017/62017/9

Cramstack

Worked on government project for cleaning 5 years worth of electricity data(time series) with pandas library and python programming language to build an interactive dashboard prototype.

Education

B.Sc in Software Engineering

20132014

University of Waterloo, Canada

B.Sc in Computer Science

20152020

North South University, Bangladesh

Relevant Coursework : Data Structures, Computer Architecture, Discrete Math, Digital Logic, Calculus, Linear Algebra, Probability and Stats, Machine Learning, Artificial Intelligence, Natural Language Processing.

Portfolio

Papers :

Deep learning for Question Answering in Bangla. Journal of Information and Telecommunication, Taylor and Francis, 2020.

First Bengali Question Answering System trained on synthetic translated dataset BanglaSQuAD using multilingual BERT models. Benchmark dataset released in this link.

Applying Text Mining To Protest Stories as Voice Against Media Censorship. ACM CSCW 2018 - Solidary Across Borders Workshop.

Text mining and network analysis applied on protest related personal story dataset.

Technical Reports :

DOBHASHI: Deep Learning Based Machine Translation System from English to Bangla.

Bangla machine translation System trained on SUPARA Benchmark Bangla-English parallel corpus with LSTM and transformer models.CSE 495, NLP course project. Dobhashi means translator in Bangla.

Trends in NLP for Low Resource Languages.

Survey paper on trends in NLP for low resource languages featuring transfer learning and translation attempts.

Using News to Predict Stock Market Movements

Historical data from Yahoo Finance was combined with hackernews article related dataset to analyze trends in stock market closing prices. Final models were XGBoost and LSTM based.

Projects :

Bangla News Article Classifier

Tensorflow hub NLP project focusing on classifiying BARD bangla dataset into 5 classes. Uses pretrained embedding exporter to export FastText bangla embeddings to TF-Hub module exporter. The exported embedding module is used to classify bangla articles. Achieves 94% accuracy and precision in a heavily imbalanced dataset.

Mediaviz

Mediaviz uses force atlas 2 layout as default and scales the layout automatically for graphs with 100-1000 nodes that has a power law linking structure. Features for network filtering, coloring, node resizing, prevention of label overlap and community visualization are also added. Python package deployed on pip. Project link & Github repository. Blog post here and here.

Credit Card Recommendation System
Credit card recommendation system built with scikit-learn and deployed with Django and Google Dialogflow. Recommends within 120 cards collected from 40+ banks in Bangladesh based on similarity measures compared with user preference input.
Transfer Learning on Multi-Class Fish Image Classification Contest
Transfer Learning with VGG16 neural network architecture implemented by Keras on multi-class fish classification problem with data from Nature Conservancy Fishery Monitoring Competition on Kaggle. Udacity capstone project using deep learning.
Network Visualization of Media coverage of violence against women in Bangladesh

This project explore the media coverage on the articles about harassment or violence against women. Using named entity extraction data is first filtered using keywords with python and networkx. Network visualization is done using gephi. Interactive link.

Take-Home Challenges :

Classifying Actionable Sentences from Emails
Extracted text from Enron dataset to get sentences to build a rule based model with keyword and textacy features. Also based on company given small positive-only labelled dataset merged with output from rule based models, build the actual actionable sentence classifier with NB,SVM,LSTM and Logistic regression models.

Scholarships And Certification

Secure and Private AI Scholarship Challenge, Udacity-Facebook
Fast. AI International Fellowship
International fellowship by non-profit AI education organization Fast.AI for completing their partnership courses with USF Institute of data science. Featured in : "Deep Learning, not just for Silicon Valley" article in company blog.
Udacity Machine Learning Nanodegree
Offered by education provider Udacity for completing their machine learning nanodegree . Coursework and capstone project in Github.
Presidents Scholarship of Distinction, University of Waterloo.

Media

Writer, Contributor and Publication Owner, Medium
Editor of "Learning Machine Learning" publication, Link to my profile. Guest Author in Udacity. Featured : "All about Udacity Machine Learning Nanodegree" in Udacity.
Talks : Udacity School Of AI Open House Webinar, Data For Democracy Hackathon Presentation
Hackathon : Won Banglalink SDG Hackathon with Team Quirkybits(as team leader) for building a legal assistant chatbot Nayona. Write up.

Created with