2023 has been the year of Generative AI, with several Large Language Models and Large Multimodal Models becoming available. I have had participated in a few hackathons to gain a first-hand experience of these novel technologies and build a small solution or two that can help make our lives better. In addition, I have worked on a few more GenAI projects.
Some of these GenAI solutions are as follows:
How does one measure the impact of GenAI? CodePrompt is a small, hand-crafted dataset built to assess the efficiency of code generation using the PaLM 2 LLM. CodePrompt consists of 30 coding problems in Python. These range from code generation to completion and troubleshooting errors.
Source code(Banner generated using Bannerbear)
We often spend a lot of time to create presentation slide decks. With SlideDeck AI, users and generative AI co-create a presentation slide deck in a few steps using Mistral 7B Instruction tuned. Previously, it used Llama 2, and had won 3rd place in the Llama 2 Hackathon with Clarifai.
Source codeGemini Senpai is a small, experimental AI assistant prototype built using Gemini's function calling. Currently, Gemini Senpai allows users to generate Python code and small Python applications spanning multiple modules. Build software with an AI assistant, collaboratively.
Source codeAs scientists and engineers, we often draw a lot of diagrams depicting systems, for example, architecture, state machines, and flow diagrams. However, writing their descriptions can often be tedious, but without which system documentation remains incomplete. With Sys2Doc, one can generate system documentation based on a given diagram of any system. Sys2Doc is powered by Gemini Pro Vision.
Source codeBuilding an RAG app is easy. However, optimizing it is necessarily not. With RAG2Rich, one can identify the optimal parameters/configurations using answer "richness" score, which is evaluated based on the context relevance, answer relevance, and groundedness measures computed by TruLens. In other words, RAG2Rich offers a scientific approach toward optimizing Retrieval-Augmented Generation System. RAG2Rich is powered by Vertex AI and PaLM 2, among others.
Source codePoetry is food for the soul. On the other hand, an image is worth a thousand words. With Poem2Pic, one blends poetry with art. Poem2Pic enables the generation of an image based on a poem. In particular, Flan-T5, a large language model (LLM), is used to generate a very short summary of an input poem. The summary is then fed to Stable Diffusion in order to generate an image. The final image is displayed to the user.
Source code