What you will be doing;
We have an exciting opportunity for a Data Scientist to join our newly formed Machine Learning within our Digital and Innovation department on a permanent basis (with the potential for further fixed term opportunities).
We're driving a lot of innovation related to Machine Learning and AI in the rail industry and the scope of possibilities is growing rapidly. The Machine Learning team is developing predictive solutions to support decision making across multiple teams in the company. We also work with teams across the company to increase the data literacy and help teams see where Machine Learning and AI can integrate into their solutions, ease and improve decision making thus making the work more enjoyable and creative.
Our Data Scientist role forms an essential part of LNER's Machine Learning team, primarily by building statistical models to enable the benefits of business-challenge led use cases to be sought after and then realised.
This role with also involve;
- Supporting the end-to-end business-challenge led process from feasibility testing an idea at conception, to testing at a proof of concept phase to realising the benefits at production and BAU maintenance. This will be done by taking a key role in the ‘scrum team' as the data science expert, collaborating with Machine Learning Product Managers and Machine Learning Engineers to collectively be responsible for successful delivery of capabilities.
- Being responsible for building machine learning models that meet the requirements of a business challenge or opportunity, including selecting which model is best for each unique project and seeing this through to model delivery and testing.
- Supporting the Machine Learning Technical Lead by implementing best practices for code development, training and testing models, deployment, ethics, and approach to productionising solutions.
- Being a subject matter expert in data science theory and practice, to support the Machine Learning team to demonstrate to the business how data science solutions could benefit their areas.
- Critically assessing and validating all modelling related tasks in machine learning production pipeline.
- Designing, building, and implementing the quality assurance safeguards for machine learning production pipeline.
- Ensuring the high-quality documentation of the machine learning pipelines.
- Hybrid working with Tuesdays and Thursdays based in either our York or Kings Cross office.
What you'll need:
We are looking for someone who has proven experience in a Data Scientist role working on a wide variety of projects, within an in house function. You'll also have;
- Experience of taking on novel, business-challenge led problems, developing algorithms iteratively and implementing productionised operational solutions to drive the benefit of machine learning.
- Knowledge of best practices for statistical data modelling, analysis and exploration, machine learning.
- Fluent in writing well-structured Python code for machine learning algorithms.
- An excellent knowledge of basic machine learning libraries, such as NumPy, SciPy, Pandas, Dask, PyTorch, Tensorflow, etc.
- A solid grasp of standard data science techniques, for example, supervised/unsupervised machine learning, model cross validation, Bayesian inference.
- Experience in working with large amounts of raw data; preparing, cleansing and processing.
- An understanding of coding best practices and experience with code and data versioning (using Git/CodeCommit), code quality and optimisation, error handling, logging, monitoring, validation and alerting.
- A ‘test and learn' approach mentality.
- Experience in working with AWS Machine Learning Stack i.e. Sagemaker Studio (desirable).
- Familiarity with Tableau and/or other visualisation tools (e.g. Power BI) (desirable).
- Strong stakeholder management, communication, and engagement skills tailored to the audience (exec team, senior stakeholders, data engineers) (desirable).
If this sound like you, what are you waiting for?
Apply now!