Innowise Sp. z o.o logo

Innowise Sp. z o.o

Vlad A. logo

Vlad A.

Available now

Data Scientist / Machine Learning engineer @ Innowise Sp. z o.o

Senior
Seniority
GMT+01:00, Warsaw, Poland
Location & Timezone
$55 - $60/hr
Average Hourly Rate
English, Polish
Languages

Top skills

Python
PyTorch
TensorFlow
LangChain
Flask

About

Data Scientist / Machine Learning engineer with more than 6 years of experience. Experienced in classic machine learning, recommendation systems, data analysis, and computer vision. Quick learner, ready to develop and gain new knowledge.

Top skills

Verified by Pangea.ai due diligence

Top SkillsCurrent UsageSeniority
Python
20%10 years
PyTorch
20%10 years
TensorFlow
20%10 years
LangChain
20%10 years
Flask
20%10 years

All skills

Roles and tools, to bring ideas to life and create meaningful experiences

AI & ML Solutions
CRM
ERP
Web Apps
Enterprise Software
Integrations

Professional experience

Explore a curated selection of projects highlighting Vlad A.'s expertise and experience. Each project aims to showcase challenges, solutions, and the final outcome, along with the tools and technologies used.

Compliance verification with LLM project preview
NDA
Compliance verification with LLM — Data scientist / Machine Learning engineer
The project focuses on detecting document forgery in financial contracts, particularly during the initial stages. Platform meticulously checks for compliance with regulatory requirements and identifies inconsistencies in dates, asset types, and quantities involved in share management or procurement. The project ensures agreement integrity by employing advanced technologies for comprehensive form verification and robust protection against fraudulent activities while prioritizing anonymity.
Python
PostgreSQL
Docker

E-Commerce Recommendation System project preview
NDA
E-Commerce Recommendation System — Lead Data scientist
The system that suggests relevant products to online shoppers based on their past browsing and purchasing history. The system uses data mining techniques to analyze user behavior and generate personalized recommendations, making the shopping experience more enjoyable and efficient.
Python
PyTorch
Docker
MariaDB
Redis
+1

AI Music project preview
NDA
AI Music — Lead Machine Learning engineer
The goal of the project was to create a neural music generation system, giving high quality music data and a large token generation limit.
Python
PyTorch
Flask
Redis
DynamoDB
+1

Preferred tools

View the preferred tools and apps used by Vlad A. to assess compatibility and alignment.

Azure
Azure
Git
Git
Github
Github
Gitlab
Gitlab
Google Chat
Google Chat
Google Drive
Google Drive
Google Meet
Google Meet

Career highlights

Discover Vlad A.’s professional journey, including employment history, certifications, and educational background.

Data scientist / Machine Learning engineer
Innowise Group2023 - Present
Collaborated with multiple cross-disciplinary teams and provided technical leadership, guidance, and coaching to engineers through brainstorming sessions and design reviews; Machine Learning pipeline architecture design; Formulation and testing of hypotheses; Setting up the infrastructure for efficient LLM models finetuning and validation; Experimented with quantization approaches to decrease memory usage and speed up the model at the inference stage; Employed Parameter-Efficient Fine-Tuning strategies to efficiently finetune Large Language Models for a relatively small amount of custom domain data and optimize training computing resources usage; Experimented with different objectives for self-supervised LLM finetuning. Implemented Langchain system with dialogue-like q&a chains with OpenAI API for initial creation of auto labeled data; Implemented document search to take into account the contexts of large documents and regulations; Prepared basic document vector index building pipeline and set up data validation layer; Processing of the US legislative body to be able to obtain the golden standard document closest to the requested document in 5 seconds; Parsing of weak structure documents and creation of mechanisms for quality control of extracted document clauses; Extracting financial terms (tested several approaches for a Named Entity Recognition task) from documents for further compliance analysis; Utilize advanced anomaly detection techniques to identify unusual patterns or behaviors in the execution of financial contracts, signaling potential front running attempts; Implemented different approaches for document classification by financial domains to customize verification pipelines for each specific domain; Vector search and inference pipeline optimization; Integrate model serving platforms that support dynamic batching and parallelism, maximizing throughput and minimizing latency during LLM inference; Code Review.
Lead Data scientist
Innowise Group2022 - 2023
Managing the whole cycle process of the development from scratch to deliver solution to the client; Close communication with the customer and stakeholders; Providing guidance, coaching, and mentoring to team members; Translate business needs into technical requirements; Conducted in depth exploratory data analysis with an eye to derive initial hypothesis and core issues to tackle later on; Set up environment to be able to run and customize pipeline through its configuration; Created a statistical outlier detection layer that improved overall ML model performance and delivered various insights; Proposed and integrated a recommendation system that made relevant advice for the end businesses and optimized their buys; Deliver useful analytical reports to the end customer; Optimization of algorithms.
Lead Machine Learning engineer
Innowise Group2020 - 2022
Conducted research to identify the most effective approach to address the business task; Managed the collection and processing of musical data in mp3 midi format; Experimented with different DL model architectures; Developed and applied cutting-edge Transformer-based architecture; Utilized Azure for data storage and processing, as well as for model training and deployment; Implemented a complex and uncommon data approach, resulting in a significant improvement in model quality; Applied advanced pre-trained Transformer architecture for music generation tasks; Managed Machine Learning lifecycles utilizing MLFlow; Performed DevOps and established Continuous Integration/Continuous Deployment (CI/CD) pipelines; Achieved studio quality sound of generated music increasing token generation limit from 0.5-4 minutes of default models to 30-60 minutes.
Machine Learning engineer / Data Scientist
Innowise Group2019 - 2020
Managed the acquisition and preprocessing of vast datasets; Engineered predictive models with Machine Learning libraries such as XGBoost to forecast energy consumption patterns; Employed mathematical optimization to create a dynamic energy resource allocation system; Communicated data insights effectively to stakeholders using data visualization tools such as Matplotlib and Seaborn; Ensured robustness and scalability of the model on AWS cloud infrastructure, performing rigorous testing and modifications; Conducted A/B testing and incorporated model enhancements using performance metrics; Collaborated with business leaders to align the model with business needs and operational constraints; Achieved a 20% reduction in energy costs for factory facilities.
Machine Learning engineer / Data Scientist
Innowise Group2018 - 2019
Participating in designing the overall structure and schema of the ml inference systems; Cleaning and processing large streams of complexly structured transactional data; Leveraged SQL for data querying and extraction from relational databases, and PySpark for processing large datasets in a distributed computing environment; Worked on data drift detection layer and along with the team tested several handling techniques; Integrated SHAP and LIME for model explainability, allowing end-users to understand model predictions with transparency, fostering trust in ML decisions; Developed models using logistic regression and gradient boosting frameworks like XGBoost, ensuring high accuracy in predicting default risks; Design and implement credit scoring models based on customer behavior, income, and transaction history; Employed K-means clustering and decision trees using Python’s Pandas for data manipulation and Scikit-learn for model development; Utilized Python and Scikit-learn for prototyping and TensorFlow for deploying scalable models. Applied ensemble learning techniques and neural networks to enhance fraud detection capabilities. Integrated the models into the banking system using Apache Kafka for real-time data streaming and processing; Visualized segmentation results with Tableau, providing actionable insights for marketing strategies. Used Apache Spark for handling large-scale customer data efficiently; Utilize ensemble methods, anomaly detection techniques, and predictive modeling to enhance the accuracy of fraud detection systems; Apply Natural Language Processing (NLP) techniques to categorize and label transactions based on textual descriptions. Enhance transactional data understanding by extracting meaningful information from transaction narratives; Managed model lifecycles from development to deployment using AWS SageMaker for model training and deployment, and Git for version control of model code and configurations; Implemented MLflow for experiment tracking and model versioning, coupled with Jenkins for automating the CI/CD pipeline, streamlining the deployment of updated models; Created custom monitoring layer to provide maintenance and insights on overall flow; Increased the stability of the system, increased the speed of the system by 5 times due refactoring; Facilitated collaborative code quality improvement sessions using GitHub for code reviews, fostering a culture of continuous learning and improvement among the data science team.
Data scientist / Machine Learning engineer
Innowise Group2017 - 2018
Participated in all software development end-to-end product lifecycle phases; Initiated image annotation process; Cloud automation and deployment of data pre-processing, models and output; Data Management: collection, processing, and labeling data, ensuring a robust dataset for model training; Conducted full model development and training cycle; Created detailed reports in order to visualize the current achievements, issues, risks, etc. for the customer; Developed Web application with document upload and processing capabilities, as well as functionality for visualizing results; Build technical documentation for computer vision systems for end-users to understand how these systems work and how to use them; Communicating with external stakeholders presenting results.

Testimonials

Anonymous

Innowise Sp. z o.o

Verified Testimonial

Vlad, Data Scientist/ML Engineer on our Compliance Verification platform, built LLM-powered forgery detection for financial contracts. He ensured regulatory compliance by identifying inconsistencies in dates, assets, quantities, and share/procurement details, delivering robust form verification, fraud protection, and data anonymity with advanced ML accuracy. Transformed high-stakes finance integrity!

Similar talent

Slide 1 of 0