Scikit-learn Pandas Pipeline
2021-03-12
Scikit-learn pipeline helper for pandas dataframe. This package created to simplify data preprocessing using scikit-learn pipeline. Instead return numpy array this package will return pandas dataframe.
Scikit-learn
Pandas
Read More
Safe Driver Prediction - Kaggle
2021-03-09
Predict if a driver will file an insurance claim next year. This project use custom Scikit-learn pipeline and LightGBM for modeling. W&B was used for hyper parameters sweeps, experiments tracking, and artifacts logging.
Scikit-learn
LightGBM
W&B
Read More
Chicago Taxi Tips
2021-03-02
Predict if a taxi driver will get high tips. This project use custom Scikit-learn pipeline and LightGBM for modeling. W&B was used for experiments tracking, and artifacts logging.
Scikit-learn
LightGBM
W&B
Read More
People Analytics - BRI Datahackathon
2021-02-27
31st place solution for BRI Datahackathon - People Analytics. LightGBM was used with model bagging technique. Final AUC score is 56.3%.
Scikit-learn
LightGBM
Read More
House Price Multi Input
2021-02-15
Predict house price with structured data and images. We use Keras Functional API and preprocessing data in neural network graph using preprocessing layer. For deployment we use TFServing and Streamlit for app demo.
TensorFlow
TFServing
Streamlit
Read More
Furniture Detection TFOD
2021-01-29
Furniture detection using Tensorflow Object Detection API. Dataset was obtained from OpenImages and converted into TFRecord format. After training EfficientDet-D0 for 100k steps we got 17.1% mAP on test dataset. For deployment we use Docker and Streamlit for app demo.
TensorFlow
Object Detection
TFServing
Streamlit
Read More
Product Matching Siamese Network
2021-01-14
Multiple input siamese network using image and title pairs to predict if they are the same or different products.
TensorFlow
Read More
Cassava Disease TFLite
2021-01-11
Simple implementation of cassava disease identification for mobile device. We train efficientnet_lite3 model using tflite-model-maker and create simple android app.
TensorFlow
TFLite
Android
Read More
Indonesian Vaccination Tweets
2021-01-10
Scrapping tweets about vaccination program in Indonesia from 11 January 2021.
Tweepy
Read More
Replicating Product Matching - NDSC
2020-12-30
Replicating 1st place solution of NDSC - Product Matching. ResNet50 and Fastext were used to extract image and title pairs, and LightGBM were used to predict if they are the same or different products.
TensorFlow
Fastext
LightGBM
Read More
Cassava Disease Classification - Kaggle
2020-12-15
Implement useful technique in images classification task to speed up training process and increase accuracy like image augmentation, mixed precision, bagging, and test time augmentation.
TensorFlow
Read More
Payment Matching - NDSC
2020-10-10
Solution for NDSC - Payment Matching using recordlinkage library. This solution got 27th place in private leaderboard.
Competition
Read More
Bussines Clustering Yogyakarta
2020-05-29
Finding the best place to open a business in Yogyakarta. Popular venue near urban and college area in Yogyakarta obtained from Foursquare API. K-means algorithm was used to identify similarity each area.
Scikit-learn
Foursquare API
Read More
Simple KNN Movie Recommendation
2020-05-26
KNN algorithm from scratch, and its implementation for movie recommendation.
Numpy
Read More
Rock Paper Scissors
2020-05-25
Machine learning classification project for Dicoding. This project use keras ImageDataGenerator along with data augmentation.
TensorFlow
Read More