Zaishan Weng
  • Home
  • Blog
  • Videos & Articles
Categories
All (11)
AI (3)
ASR (1)
Anomaly Detection (1)
Automatic Speech Recognition (1)
Business Intelligence (2)
CSV (1)
Clustering (1)
Collaborative Filtering (1)
Data Analytics (1)
Data Cleaning (1)
Data Engineering (3)
Data Masking (1)
Data Science (3)
Design Thinking (1)
DocVQA (1)
Document Understanding (1)
Excel (1)
Finance Analytics (1)
GenAI (1)
Machine Learning (3)
Machine_Learning (1)
Procure_to_Pay (1)
Python (3)
Qlik (1)
RAG (1)
Recommender System (1)
Recsys (1)
Regex (2)
Requirements Gathering (1)
SAP (1)
Splitting_of_Purchase (1)
Stakeholder Engagement (1)
Transformer (3)
User Experience (1)
User Interface (1)

Data & AI Blog

Impressive yet easy to implement Document Understanding system with OCR-free Donut Transformers Model in Python

DocVQA
Data Science
AI
Machine Learning
Transformer
Document Understanding
Recentl…
Oct 6, 2022
Zaishan Weng

Simple implementation of a meeting transcription solution locally

ASR
Automatic Speech Recognition
AI
Machine Learning
Transformer
In a recent engagement with a team that provides secretariat service, there is an opportunity to optimize the existing process of creating meeting…
Apr 14, 2024
Zaishan Weng

Simplified understanding of Retrieval Augmented Generation (RAG)

GenAI
RAG
AI
Machine_Learning
Transformer
One of the interesting applications of Generative AI is searching for information within your own documents which is termed as Retrieval Augemented Generation or RAG in…
May 5, 2024
Zaishan Weng

Quick start guide to build a Collaborative Filtering Recommendation System with implicit library in 4 steps

Recommender System
Recsys
Collaborative Filtering
Data Science
While there are various guides, articles and lessons available on building a recommendation system, the implicit library package is discussed less often. In the course of my…
Feb 9, 2023
Zaishan Weng

Hierarchical Clustering can be more suitable compared to KMeans when grouping customers based on purchase behaviors and detecting outliers

Clustering
Machine Learning
Anomaly Detection
Data Science
Clustering can be a particularly useful starting point to embark on a Machine Learning journey especially when data labels are yet to be built up. One of the easy way to…
Feb 3, 2023
Zaishan Weng

Applications of Regex and Python in data transformation for masking of sensitive information and extraction of date details from free text

Regex
Data Engineering
Data Masking
There are many useful applications of Regex. In this article, I would like to cover two of them commonly used for my projects in Singapore. They are
Oct 18, 2022
Zaishan Weng

Quick Start Guide on incorporating design thinking artifacts for requirements gathering during Project Initiation Phase for Agile Data Analytics Projects

Design Thinking
Requirements Gathering
Data Analytics
Stakeholder Engagement
Business Intelligence
From my recent involvement in data analytics project engagements as the Data…
Jul 8, 2022
Zaishan Weng

Data cleaning on SAP data extracts in .txt format with Regex and Python

Data Engineering
SAP
Python
Regex
During one of our recent projects involving the procure to pay process, our team encountered SAP raw data extracted…
Jun 25, 2022
Zaishan Weng

Handle various input data source formats (csv, xlsx, xlsb, txt, parquet) with Python and Pandas

Python
Data Cleaning
Excel
CSV
Data Engineering
In the process of ingesting and cleaning raw data in…
Aug 12, 2022
Zaishan Weng

Procure-to-Pay Process Analytics - Split Purchase Order red flag detection using Python

Finance Analytics
Python
Procure_to_Pay
Splitting_of_Purchase
The procure to pay process is a commonly used term to describe the fulfillment of goods and/or services to a given requirement. It usually starts with having a purchase…
May 4, 2023
Zaishan Weng

Some ideas on enhancing User Experience on Qlik Sense

Qlik
User Experience
Business Intelligence
User Interface
After the…
Jul 22, 2022
Zaishan Weng
No matching items
  • © Copyright 2024 Zaishan Weng