Chuxin Liu
I am a quantitative modeling senior associate at JPMorgan and I hold a PhD in Economics.
Feel free to connect with me on LinkedIn: https://www.linkedin.com/in/chuxin-liu/
Sessions
Colab Notebook Link: https://colab.research.google.com/drive/1faxDHE3LdAwH7MORdnJei87Q0WF1BhS0?usp=sharing
Make a copy in your own Drive to start working on this notebook.
Ever wondered how groundbreaking language models like ChatGPT and Llama were built? The answer lies in the transformer, a powerful neural network architecture. In this workshop, we'll dive deep into the inner workings of transformers, with a specific focus on the self-attention mechanism, and guide you through the process of building a transformer from scratch. Whether you're a beginner or an experienced practitioner, this workshop is designed for all levels of expertise.