Lead Data scientist
Georghios Joseph
I have a background in applied data science, technical leading, analytics, information systems management, project management, and greenfield environment experience.
10+ years of experience in artificial intelligence and data space. Scroll down to find out more.
Technical Skills
I can identify problems and come up with innovative solutions. I can understand and commercialise cutting edge software technologies from peer-reviewed academic articles. If the right tools aren’t available, I make them.
My journey has led me to rapid prototyping, product/service development and design, risk & fraud detection, and board level advisory projects for market launches.
- Artificial Intelligence
- End-to-end data product/service development
- Cloud computing
- Coding
artificial intelligence
- Large language models
- Deep learning
- Machine learning
- Causal, Explainable, Fair
- Computer vision
- Data analytics
- Statistics
Product/Service Development
- LLMOps & MLOps
- Prototyping
- Minimum viable product
- End-to-end
Cloud computing
- Azure & OpenAI
- Google Cloud Platform
- Amazon Web Services
Coding
- Python
- SQL
- Version control
Soft and business Skills
I possess strong business acumen and I can extract requirements from stakeholders, turn them into projects, form teams around them and lead them to deployment. Proven expertise in digital-self serve projects that provide increased revenue, operational cost reduction, increased service capacity, risk avoidance of non-compliance, and a competitive/innovative edge.
I can speak and advise board (C-level) executives, technical staff, subject matter experts and everyone between. I work closely with progressive clients who are committed to exploring their data to reveal insights beneficial to their organisations’ performance and opportunities for growth.
- Data Communication & Outreach
- Team management
- Business Accumen
- Multi-field exposure
Data Comms & outreach
- Data analytics
- Write, format, and present technical prose
- Facilitate data-informed discussions
- Help non-technical executives act on data findings
Team Management
- Team leader
- Technical lead
- Hiring and managing
- Agile values and principles
Business
Accumen
- Ability to identify high impact and capitalizable projects
- Triaging and prioritizing
- Ability to bring together high performing and value adding teams
Multi-Field
Exposure
- Battery testing
- Fin-tech
- Supply-chain
- Chemical engineering
- Chemical testing
- Tissue Engineering
- Regenerative medicine
My Experience
Data Scientist | Large Language Models | Artificial Intelligence
Extensive experience of working with data. Extracting value from data, solution prototyping and development, builder and leader of value delivering teams. Multi-modal communicator with strong business acumen aiming to create intellectual property and software assets.
My background is in applied data science, analytics, information systems management and analysis, project management, business start-up, and computer networks. Here is a glimpseof my industry experience:
Lead data Scientist – 2022 October to present
Reporting to data directors and I provide data-driven services as capitalised assets. Managing a team, hands-on, focusing on innovation.
Senior Data Scientist – 2020 August to 2022 September
Reported to the head of product for analytics and data. Created data-driven products as capitalised assets. Managed a team, but also hands-on, focusing on product innovation.
Chief of Data Science – 2020 February to 2020 August
Built the company with C-level executives and technical staff. Devised business strategies. Acted as a communication bridge between technical and C-executives while managing expectations.
Chief of Data Science – 2020 January to 2020 August
Reported to the CEO and I helped built the company focusing on tech innovation. Board member for tech decisions made. Participated in meetings with commercial impact.
Senior Data Scientist – 2017 December to 2020 January
Streamlined and improved invoice analysis and enhanced staff training material from ML. Involved in devising the data strategy and culture, built, and led projects and teams conceived with C-executives.
Data Scientist – 2013 October to 2017 November
Project: data science informing chemical engineering for stem cell control in synthetic environments.
Data Analyst – 2012 October to 2013 September
Helped biomaterial/tissue engineering scientists develop therapies by understanding and designing surface chemistries of artificial environments to enhance cell performance.
Projects
I have spent the last 10+ years working on a plethora of projects allowing stakeholders to make better decisions. Here are the noteworth ones:
ELLM – 2024 to PREsENT
Developing processing pipelines, vector databases, and Large Language Models (LLMs) to unlock business value from unstructured data. Delivered prototypes for test/consultation service determination and lead generation applications. Streamlined customer relationship services from 2 weeks down to 2 hours.
Element Detectron – 2023 to present
Near real-time anomaly detection of battery discharging test results. 1+ billions records ingestion and pre-processing in minutes using Databricks. Seconds to detect anomalies using statistical tests combined with machine learning achieves 92+% detection. The deliverable is a PowerBI dashboard and auto-alerting with the ability to provide detection feedback using PowerAutomate.
Collection Success – 2021 to 2022
Automatically re-collect direct debits intelligently using machine learning on transactional data from all transactional products. When enabled, it reduces re-collection cost by 5x, re-collects 94+% more DDs, and operates in seconds. The deliverable is a premium feature on the PCL platform that overrides the default re-collection date.
PCL Insights – 2020 to 2022
PCL as a platform is a black box for the customers. As our first data initiative, we return their aggregated usage, monthly, in a bespoke report available through the platform. We compared their historic performance along with an industry benchmark for adjusted for their company size and transactional volume.
Analytics platform built on AWS – 2020
Built with a team an easy-to-use analytical platform on AWS, utilizing tools like Lambda functions (Python), S3 for storage, and SageMaker for machine lerning. This platform was designed to be accessible even for novices, streamlining the data science workflow and enhancing the company’s service offerings.
Customer journey optimization – 2020
Leveraged POS data to identify bottlenecks, optimize store layouts and drive-throughs, and enhance preparation efficiency. Utilizing Python in Google Colab, BigQuery for big data, and TensorFlow for machine learning, Our suggestions had the potential to increase customer throughput by 30%.
VAT AI – 2018 to 2020
Implemented gradient-boosted machine classification for NLP tasks to identify invoice elements eligible for VAT recovery. This solution drastically reduced the time needed to process annual data from two weeks for one client to 30 minutes for 150 clients, saving millions for the NHS.
Forensics AI – 2019 to 2020
Utilized machine learning and subject matter expertise to detect duplicated invoices with slight mutations in codes, values, and dates, significantly reducing investigation time.
AI self-service platform – 2020
Developed a platform to house the above solutions with smart task queuing, enhancing the company’s digital self-serve capabilities.
Get-Chem AI – 2013 to 2017
GetChem uses machine learning to streamline the discovery of optimized biomaterial designs. It estimates cell behaviour from input variables from the surface’s chemical characteristics. The aim is to unravel the relationship between cells and biomaterial surface.
Chem-ID AI – 2012 to 2013
Linking surface chemistry with cell reactions is challenging for designing of biomedical materials. The tool uses chemical data from the outermost surface of hundreds of synthetic materials. Key surface features affecting human stem cell adhesion can now be identified much faster compared to conventional methods.
Awards & Publications
Frontrunner award for element detectron – 2023
Automatically detects anomalous behaviour in battery testing
Academic article – 2021
Effects of Surface Chemistry Interaction on Primary Neural Stem Cell Neurosphere Responses
Doctoral thesis – 2018
Application of data science to inform surface engineering for in vitro neural stem cell control
Fully funded scholarship – 2012
For a PhD in Regenerative Medicine
Education
Certifications – 2023
MLOps practitioner, Advanced Designer, Developer
PhD in Artificial Intelligence for Regenerative Medicine – 2017
Fully funded scholarship by the EPSRC
Bsc Computer networks and cybersecurity – 2012
First class Honours
Hobbies
Skydiving – 2023 to present
Facing my fears
Weighted Callisthenics – 2019 to present
Waving that flag