Text copied to clipboard!

Title

Text copied to clipboard!

Data Scientist

Description

Text copied to clipboard!
We are looking for a Data Scientist who will support our product, sales, leadership and marketing teams with insights gained from analyzing company data. The ideal candidate is adept at using large data sets to find opportunities for product and process optimization and using models to test the effectiveness of different courses of action. They must have strong experience using a variety of data mining/data analysis methods, using a variety of data tools, building and implementing models, using/creating algorithms and creating/running simulations. They must have a proven ability to drive business results with their data-based insights. They must be comfortable working with a wide range of stakeholders and functional teams. The right candidate will have a passion for discovering solutions hidden in large data sets and working with stakeholders to improve business outcomes.

Responsibilities

Text copied to clipboard!
  • Work with stakeholders throughout the organization to identify opportunities for leveraging company data to drive business solutions.
  • Mine and analyze data from company databases to drive optimization and improvement of product development, marketing techniques and business strategies.
  • Assess the effectiveness and accuracy of new data sources and data gathering techniques.
  • Develop custom data models and algorithms to apply to data sets.
  • Use predictive modeling to increase and optimize customer experiences, revenue generation, ad targeting and other business outcomes.
  • Coordinate with different functional teams to implement models and monitor outcomes.
  • Develop processes and tools to monitor and analyze model performance and data accuracy.
  • Conduct research from which you'll develop prototypes and proof of concepts.
  • Look for opportunities to use insights/datasets/code/models across other functions in the organization.
  • Create automated anomaly detection systems and constant tracking of its performance.

Requirements

Text copied to clipboard!
  • Strong problem solving skills with an emphasis on product development.
  • Experience using statistical computer languages (R, Python, SLQ, etc.) to manipulate data and draw insights from large data sets.
  • Experience working with and creating data architectures.
  • Knowledge of a variety of machine learning techniques (clustering, decision tree learning, artificial neural networks, etc.) and their real-world advantages/drawbacks.
  • Knowledge of advanced statistical techniques and concepts (regression, properties of distributions, statistical tests and proper usage, etc.) and experience with applications.
  • Excellent written and verbal communication skills for coordinating across teams.
  • A drive to learn and master new technologies and techniques.
  • We’re looking for someone with 5-7 years of experience manipulating data sets and building statistical models, has a Master’s or PHD in Statistics, Mathematics, Computer Science or another quantitative field, and is familiar with the following software/tools: Coding knowledge and experience with several languages: C, C++, Java, JavaScript, etc.
  • Knowledge and experience in statistical and data mining techniques: GLM/Regression, Random Forest, Boosting, Trees, text mining, social network analysis, etc.
  • Experience querying databases and using statistical computer languages: R, Python, SLQ, etc.

Potential interview questions

Text copied to clipboard!
  • Can you give an example of a time when you used data to solve a business problem?
  • What is your experience with predictive modeling?
  • Can you explain a time when you used machine learning in your work?
  • What programming languages are you most comfortable with?
  • Can you explain a time when you had to explain complex data to non-technical stakeholders?
  • What is your experience with data visualization tools?
  • How do you ensure the accuracy of your data?
  • What is your experience with big data platforms?
  • How do you handle missing or corrupted data in a dataset?
  • Can you give an example of how you have used data to drive business strategy?