Showing posts from March, 2018

Predicting user clicks using XGBoost!

Another post starts with you beautiful people! I hope you are enjoying our machine learning journey. Now that you are familiar with many of the real-world problems we have seen earlier, you know that this skill can help make the world a better place to live! To continue our journey, today we are going to analyze a problem from TalkingData, China's largest Big Data service platform.

About the problem: TalkingData covers over 70% of active mobile devices nationwide. They handle 3 billion clicks per day, of which 90% are potentially fraudulent. Yes, you read that right! Up to 90% of the clicks may be fraudulent, and that causes them unnecessary server load.

Our challenge: As data scientists, our task is to build an algorithm that predicts whether a user will download an app after clicking a mobile app ad.

Data: To support our modeling, TalkingData has provided a generous dataset covering approximately 200 million clicks over 4 days, which you can download/see from here.
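To get a feel for the scale behind these numbers, here is a small back-of-the-envelope sketch in Python. It uses only the figures quoted above. The function name scale_summary and the idea that the dataset's clicks are spread evenly over its 4 days are my own assumptions for illustration, not something TalkingData states.

```python
# Figures quoted in the post.
CLICKS_PER_DAY = 3_000_000_000   # clicks TalkingData handles per day
FRAUD_PERCENT = 90               # percent described as potentially fraudulent
DATASET_CLICKS = 200_000_000     # approximate number of clicks in the dataset
DATASET_DAYS = 4                 # number of days the dataset covers


def scale_summary(clicks_per_day, fraud_percent, dataset_clicks, dataset_days):
    """Derive rough scale figures from the quoted numbers.

    Assumption (not from the source): the dataset's clicks are spread
    evenly across the days it covers.
    """
    # Integer arithmetic keeps the click count exact.
    suspect_per_day = clicks_per_day * fraud_percent // 100
    dataset_per_day = dataset_clicks / dataset_days
    return {
        "suspect_clicks_per_day": suspect_per_day,
        "dataset_clicks_per_day": dataset_per_day,
        # How large one dataset day is relative to one day of full traffic.
        "dataset_share_of_daily_traffic": dataset_per_day / clicks_per_day,
    }


summary = scale_summary(CLICKS_PER_DAY, FRAUD_PERCENT, DATASET_CLICKS, DATASET_DAYS)
for name, value in summary.items():
    print(f"{name}: {value:,.4f}")
```

So roughly 2.7 billion clicks a day are suspect. Under the even-spread assumption the dataset holds about 50 million clicks per day, which is under 2% of a single day of TalkingData's traffic.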