AI Engineering by Chip Huyen: Chapter 2 Notes and summary

Chapter 2: Understanding Foundation Models Overview Foundation model design choices (training data, architecture/size, post-training) are increasingly opaque. The training process splits into pre-training (makes model capable) and post-training (aligns model to human preferences). Sampling (how outputs are chosen from all possibilities) is a crucial, often-underestimated factor impacting model behavior and Read more

Notes on paper: Large Language Models as Zero-Shot Conversational Recommenders

Link to paper https://arxiv.org/abs/2308.10053 Notes CRS possesses the potential to: (1) understand not only users’ historical actions but also users’ (multi-turn) natural-language inputs; (2) Provide not only recommended items but also human-like responses for multiple purposes, such as preference refinement, knowledgeable discussion, or recommendation justification.Towards this a typical conversational recommender Read more

Notes on Paper: RecMind: Large Language Model Powered Agent For Recommendation

Link to paper https://arxiv.org/abs/2308.14296 Notes The paper propose a novel algorithm, Self-Inspiring, to improve the planning ability of the LLM agent. At each intermediate planning step, the LLM “self-inspires” to consider all previously explored states to plan for next step. Literature survey Architecture Tools they used 1) DB tool1) To Read more

Best Practices for Building Machine Learning Applications

Introduction to Building Machine Learning Applications Building machine learning applications requires a thorough understanding of the fundamentals of machine learning and software development. This section will provide an overview of the key considerations and best practices for building machine learning applications. Best Practices for Data Preprocessing Data preprocessing is a Read more

Distribution of error functions

We can plot error distributions like probability density functionand cumulative density function and make important deductionsbased on it. We can use plot Probability Density functions(PDF) and Cumulative density function (CDF) by using the error function as a random variable Using PDF of error distribution An ideal pdf for error distributions Read more