Analyze, process, evaluate and document large data sets
Identify and develop appropriate machine learning/deep learning/natural language understanding/natural language processing techniques to uncover the value of the data
Design data structure and data storage schemes for efficient data manipulation and information retrieval
Develop tools for data processing and information retrieval
Develop data-driven models to quantify the value of a given data set
Apply, modify and invent algorithms to solve challenging business problems
Validate score performance and conduct ROI and benefit analysis
Document and present model process and model performance
Required Skills:
Advanced degree (PhD or Master's) in Machine Learning, Data Science, AI, Computer Science, Computer Engineering, Electrical Engineering, Physics, Statistics, Applied Math or another quantitative field
0-3 years of working experience in data science and/or predictive modeling
Demonstrated ability to lead and execute end-to-end projects
Ability to independently support existing products
Proven track record in modifying and applying advanced algorithms to address practical problems
Proficient in deep learning (CNN, RNN, LSTM, attention models, etc.), machine learning (SVM, GLM, boosting, random forest), graph models, and/or reinforcement learning
Experience with open-source tools for deep learning and machine learning such as Keras, TensorFlow, PyTorch, scikit-learn, and pandas
Proven ability to work independently on development of complex models with extremely large and complex data structures
Proficient in one or more languages such as Python, R, Java, C++, or C
Experience in large-scale data analysis using Spark (PySpark preferred)
Robust knowledge and experience with statistical methods
Desired Skills:
Good knowledge of Python and related libraries
Experience with Hadoop- and NoSQL-related technologies such as MapReduce, Spark, Hive, HBase, MongoDB, and Cassandra
Experience with online and mobile marketing analytics, and with GPU programming
Strong understanding of Natural Language Processing, Natural Language Understanding, and the relevant open-source tools
Solid knowledge of Bayesian statistical inference and related machine learning methods
Experience with Agile methods for software development