Machine learning

Notes on paper: Large Language Models as Zero-Shot Conversational Recommenders

Link to paper https://arxiv.org/abs/2308.10053 Notes CRS possesses the potential to: (1) understand not only users’ historical actions but also users’ (multi-turn) natural-language inputs; (2) Provide not only recommended items but also human-like responses for multiple purposes, such as preference refinement, knowledgeable discussion, or recommendation justification.Towards this a typical conversational recommender Read more…

By admin, 11 monthsJuly 15, 2024 ago

Machine learning

Notes on Paper: RecMind: Large Language Model Powered Agent For Recommendation

Link to paper https://arxiv.org/abs/2308.14296 Notes The paper propose a novel algorithm, Self-Inspiring, to improve the planning ability of the LLM agent. At each intermediate planning step, the LLM “self-inspires” to consider all previously explored states to plan for next step. Literature survey Architecture Tools they used 1) DB tool1) To Read more…

By Deepanshu Lulla, 11 monthsJuly 13, 2024 ago

Machine learning

Coursera Week 1: Intro to Neural Networks Notes

Executive Summary Week 1: Introduction Let’s say you have a data set with six houses, so you know the size of the houses in square feet or square meters and you know the price of the house and you want to fit a function to predict the price of a house as Read more…

By admin, 2 yearsOctober 14, 2023 ago

Distributed systems

Exploring Vector Databases

Amid the AI revolution, diverse AI models like large language and generative AI models have come into the limelight. These novel AI models require efficient data processing, achievable using vector embeddings. By providing semantic information to the AI models, they gain a better understanding and can perform complex tasks efficiently. Read more…

By admin, 2 yearsJuly 11, 2023 ago

Machine learning

Best Practices for Building Machine Learning Applications

Introduction to Building Machine Learning Applications Building machine learning applications requires a thorough understanding of the fundamentals of machine learning and software development. This section will provide an overview of the key considerations and best practices for building machine learning applications. Best Practices for Data Preprocessing Data preprocessing is a Read more…

By Deepanshu Lulla, 2 yearsApril 22, 2023 ago

Machine learning

Median absolute deviation (MAD) of Errors

Median Absolute deviation is one of the other techniques specifically used for analyzing the performance of regression models. Computing MAD of errors For a univariate data set X1, X2, …, Xn, the MAD is defined as the median of the absolute deviations from the data’s median, X_median = median(X) Median Read more…

By admin, 6 yearsNovember 10, 2019 ago

Machine learning

R-Squared/Coefficient of determination

R-squared is a statistical measure of how close the data are to the fitted regression line. It is also known as the coefficient of determination, or the coefficient of multiple determination for multiple regression. This metric is specifically designed for regression-based algorithms where the output is a real value. Computing Read more…

By admin, 6 yearsOctober 20, 2019 ago

Machine learning

Distribution of error functions

We can plot error distributions like probability density functionand cumulative density function and make important deductionsbased on it. We can use plot Probability Density functions(PDF) and Cumulative density function (CDF) by using the error function as a random variable Using PDF of error distribution An ideal pdf for error distributions Read more…

By admin, 6 yearsSeptember 22, 2019 ago

Machine learning

Logarithmic loss (or cross-entropy)

Logarithmic loss (or cross-entropy) measures the performance of a classification model where the prediction input is a probability value between 0 and 1. The goal of our machine learning models is to minimize this value. It is also heavily used in Kaggle competitions to estimate the score of submissions. A Read more…

By admin, 6 yearsAugust 25, 2019 ago

Machine learning

Receiver operating characteristic (ROC ) curve

For binary classification problems, a good way to measure the performance of a model is by finding out AUC (Area Under The Curve) of ROC (Receiver Operating Characteristics). What is a ROC curve? It is a plot of True Positive Rate(TPR) vs FPR at various thresholding levels. Let’s understand this Read more…

By Deepanshu Lulla, 6 yearsAugust 11, 2019 ago

Machine learning

Notes on paper: Large Language Models as Zero-Shot Conversational Recommenders

Like this:

Notes on Paper: RecMind: Large Language Model Powered Agent For Recommendation

Like this:

Coursera Week 1: Intro to Neural Networks Notes

Like this:

Exploring Vector Databases

Like this:

Best Practices for Building Machine Learning Applications

Like this:

Median absolute deviation (MAD) of Errors

Like this:

R-Squared/Coefficient of determination

Like this:

Distribution of error functions

Like this:

Logarithmic loss (or cross-entropy)

Like this:

Receiver operating characteristic (ROC ) curve

Like this: