John Lu COSCUP 2025

John Lu
.ical

John is a Senior AI Engineer, currently focused on developing NLP applications.

He is deeply motivated by challenges and tends to be excited by breaking conventional ways of thinking and doing. With prior experiences in Software Engineering, he works on combining the latest AI technology and engineering to transform challenges into practical solutions.

Sessions

年8月9日

11:20

30 分鐘

Let's build a Visual Language Model (VLM) from scratch with Python

John Lu

Ever wonder how Vision Language Models (VLMs) works? VLMs are built on a vision encoder and a language decoder. It accepts both images and text as inputs and can answer vision-language questions with detailed insights. Building a VLM from scratch allows us to customize the component for our application. The goal of this talk is to demonstrate how VLMs could be implemented in a Pythonic way. To do so, we're going to build the PaliGemma VLM completely from scratch all using Python.

PyCon Taiwan Community

Let's build a Transformer: JAX Source code explained from scratch

John Lu

Transformer architecture can be used for various NLP and CV tasks. They are pre-trained to generate text and images based on large datasets. Building a transformer from scratch allows us to customize the component for our application. The goal of this talk is to demonstrate how the transformer model could be implemented on JAX. To do so, we're going to build a general purpose transformer completely from scratch all with JAX.

主議程軌 - Main Session Track

RB105

John Lu .ical

Sessions

John Lu
.ical