
Introduction to GenAI - Part 1

This is part of a series of articles that introduces Generative AI and walks step by step, with code samples, through building a GenAI application: Part 1 (GenAI Introduction), Part 2 (Building GenAI App using Spring Boot and Amazon Bedrock) and Part 3 (Building GenAI App with Proprietary data).

GenAI Evolution
GenAI (aka Generative AI) is the latest buzzword across the technology space. The buzz is similar to earlier tech milestones, the most recent being the smartphone era (iPhone, Android). Smartphones revolutionized how content is consumed, accessed and interacted with (billions of users experienced the Internet only because of smartphones!). Similarly, GenAI is introducing the next phase of content creation, processing and interaction.
AI (Artificial Intelligence) is not new; it has been around as a classic Computer Science concept (aka theory) for decades. But it took a huge amount of effort and time to make it feasible (technology, infrastructure and commercial viability) and a mainstream technology.

What is GenAI?
In simple words, Generative AI is a subset of AI / ML / Deep Learning that learns from existing data in order to generate new content (text, audio, video etc.), much like a human creator. For example, OpenAI's ChatGPT (GPT stands for Generative Pre-trained Transformer) can generate content (blogs / stories) on almost any topic, just like human content creators.
GenAI can be used for a wide range of use cases such as content generation (blogs, stories), language translation, sentiment analysis, question answering (chatbots), content summarisation and more.

How does GenAI work?
GenAI uses the Transformer architecture (a type of neural network), which takes a text sequence as input (aka Prompt) and produces another text sequence as output (aka Result / Response). Neural networks are loosely modelled on the human brain: many interconnected nodes work together to answer a query. These Transformers are called Models (e.g. LLM - Large Language Model) in the GenAI context. All these Models are pre-trained on vast amounts of data so that they can answer user queries.
For example, a popular GenAI model (GPT-3, from the makers of ChatGPT) was reportedly trained on data drawn from about 45 terabytes of text from websites, books and Wikipedia. That's why these Models know much more than we humans do!

GenAI Data Store
How can Models handle such a vast amount of data and search it so FAST? The answer is the Vector Database and Semantic or Similarity Search (searching by the meaning of the query text instead of keyword matching). Traditional databases store data in tables and columns. A Vector DB stores data as multi-dimensional numeric vectors. For example, RGB colors form a 3-dimensional space where [0, 255, 0] represents green. Practical Models often use 1,000+ vector dimensions; more dimensions can capture finer distinctions in meaning, at the cost of extra storage and computation.

Because the data is represented numerically, a Vector Database does not match keywords; it uses a distance calculation (Euclidean distance, Cosine similarity, Dot product) between vectors to retrieve similar or nearby results. Combined with indexes built for this kind of search, this lets Vector Databases respond with very low latency even as the database grows.
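To make the distance calculations concrete, here is a minimal sketch in plain Python that reuses the RGB example. The colour table and the `nearest_colour` helper are hypothetical names made up for illustration; a real Vector Database would use specialised indexes rather than a linear scan over every record.

```python
import math

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def euclidean_distance(a, b):
    """Straight-line distance between two vectors (smaller = more similar)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (closer to 1 = more similar)."""
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))

# A tiny in-memory "vector store" (illustrative only): colour name -> RGB vector.
COLOURS = {
    "red": [255, 0, 0],
    "green": [0, 255, 0],
    "blue": [0, 0, 255],
    "yellow": [255, 255, 0],
}

def nearest_colour(query):
    """Return the stored colour whose vector is closest to the query vector."""
    return min(COLOURS, key=lambda name: euclidean_distance(COLOURS[name], query))

print(nearest_colour([30, 200, 40]))  # prints "green"
```

The query [30, 200, 40] is not stored anywhere, yet the search still returns the closest match. That is the essence of similarity search: no exact keyword has to match.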

GenAI Model Architecture
A GenAI Model is a functional component which processes queries and generates responses. User queries (aka Prompts) are in natural language; to process a query by its real meaning and relevance (i.e. Semantic Search), the query text is first converted into vectors (a numeric representation). These query vectors are then matched against the Vector Database to produce result vectors, which are converted back to words / phrases (the final Response).

Converting a query sentence to its numeric mapping (aka Vectors) involves processes like Tokenization (breaking the query sentence down into smaller Tokens, typically a few characters each) and Vector Embedding (converting tokens to a numeric mapping, i.e. a vector).
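The two steps can be sketched with a toy example. This is only an illustration under simplifying assumptions: `tokenize` cuts words into fixed-size chunks and `embed_token` derives numbers from a hash, both hypothetical helpers. Real Models use a learned subword vocabulary for tokenization and learned embeddings in which tokens with similar meaning end up close together; a hash carries no meaning at all.

```python
import hashlib

def tokenize(text, size=4):
    """Toy tokenizer: lower-case the text and cut each word into chunks of up to `size` characters."""
    tokens = []
    for word in text.lower().split():
        tokens.extend(word[i:i + size] for i in range(0, len(word), size))
    return tokens

def embed_token(token, dims=8):
    """Toy embedding: map a token to `dims` numbers in [0, 1) taken from its SHA-256 hash."""
    digest = hashlib.sha256(token.encode("utf-8")).digest()
    return [digest[i] / 256 for i in range(dims)]

tokens = tokenize("Generative AI creates content")
vectors = [embed_token(t) for t in tokens]

print(tokens)        # ['gene', 'rati', 've', 'ai', 'crea', 'tes', 'cont', 'ent']
print(len(vectors))  # 8 vectors, one per token
```

Each token always maps to the same vector, so the same query text always produces the same numeric input for the Model.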

The concept of a Token is very important, as AI provider platforms calculate usage (and thus charges) based on the number of input and output Tokens, i.e. the size of the request to be processed and the amount of response content to be generated.
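As a rough illustration of token-based billing, the sketch below estimates the cost of a single request. The prices are made-up placeholders, not any provider's actual rates, and `estimate_cost` is a hypothetical helper; check your provider's pricing page for real numbers.

```python
def estimate_cost(input_tokens, output_tokens, input_price_per_1k, output_price_per_1k):
    """Estimate the charge for one request when input and output tokens are priced per 1,000."""
    return (input_tokens / 1000) * input_price_per_1k + (output_tokens / 1000) * output_price_per_1k

# Hypothetical prices: 0.003 per 1K input tokens, 0.015 per 1K output tokens.
cost = estimate_cost(1200, 400, input_price_per_1k=0.003, output_price_per_1k=0.015)
print(f"{cost:.4f}")  # prints 0.0096
```

Output tokens are often priced higher than input tokens, so limiting the length of the Response can matter as much as trimming the Prompt.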

Model training is a very expensive process (in both time and cost). Real-life use cases often need a Model to augment its results with proprietary records. For example, a company may want a chatbot's responses to be based on its own proprietary details (sales, customer records etc.). This provides a more personalized and context-driven experience.

RAG (Retrieval Augmented Generation) architecture helps Models leverage a Knowledge Base (external proprietary records) to produce better contextual Responses, without retraining the Model.
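The RAG flow can be sketched in a few lines: retrieve the most relevant record from the Knowledge Base, then place it in the Prompt that is sent to the Model. Everything here is a simplified assumption for illustration: the three sample records, the word-count `embed` function (a stand-in for a real embedding Model) and the prompt wording. The actual Model call is left out.

```python
import math
from collections import Counter

# Hypothetical proprietary records acting as the Knowledge Base.
KNOWLEDGE_BASE = [
    "Refunds are processed within 5 business days.",
    "Our support desk is open Monday to Friday.",
    "Premium customers get free shipping on all orders.",
]

def embed(text):
    """Toy embedding: word counts (a real system would call an embedding Model)."""
    words = [w.strip(".,?!").lower() for w in text.split()]
    return Counter(w for w in words if w)

def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    numerator = sum(a[w] * b[w] for w in set(a) & set(b))
    denominator = (math.sqrt(sum(v * v for v in a.values()))
                   * math.sqrt(sum(v * v for v in b.values())))
    return numerator / denominator if denominator else 0.0

def retrieve(question, k=1):
    """Return the k records most similar to the question."""
    q = embed(question)
    ranked = sorted(KNOWLEDGE_BASE, key=lambda doc: cosine(q, embed(doc)), reverse=True)
    return ranked[:k]

def build_prompt(question, k=1):
    """Augment the user question with the retrieved records."""
    context = "\n".join(retrieve(question, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"

print(build_prompt("How long do refunds take?"))
```

The Model itself is unchanged; only the Prompt grows to carry the retrieved record. That is why RAG is usually far cheaper than training a Model on proprietary data.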


Importance of GPU
Models require heavy numeric calculation and parallel processing. GPUs, with their parallel processing capabilities, make heavy computation such as image processing much faster. Thus GPUs are better aligned to this requirement than CPUs (which are optimized for sequential processing). This also makes it possible to scale infrastructure cost-effectively by adding more GPUs.

GenAI Provider Platforms
All leading cloud providers, such as AWS (Amazon Bedrock), Google (Vertex AI) and Azure / OpenAI, offer their own fully-managed GenAI platform stack. All of them provide an API-based approach for Applications to interact with. It is up to the Application creator to choose a platform based on their preferences (cost, existing cloud platform, integration effort etc.).

Each of these platforms includes a wide range of built-in Models (provided by different vendors - Titan by Amazon, Claude by Anthropic, Llama by Meta). For application development, use existing Models as per your use case and leverage a Knowledge Base to infuse (augment) external proprietary information and generate more context-driven Responses.

Hope this gives a fair understanding of GenAI. If you are ready to explore deeper and build your first GenAI Application, here is another article: GenAI App using Spring Boot and Amazon Bedrock.
