As Data Science and Machine Learning practitioners, we often face the challenge of finding solutions to complex problems. One powerful tool that can help speed up the process is the Generative Pre-trained Transformer 3 (GPT-3) language model.
What is GPT?
GPT stands for “Generative Pre-trained Transformer.” It is a type of Generative AI language model that uses deep learning techniques to generate human-like text. GPT models are trained on vast amounts of text data and can learn to generate natural-sounding language in a variety of contexts. GPT models have been used for a wide range of natural language processing tasks, including text generation, question answering, and language translation. They have also been used to create chatbots and other conversational artificial intelligence applications.
GPT Model Series
OpenAI is an artificial intelligence research laboratory consisting of a team of engineers and researchers dedicated to advancing the field of artificial intelligence in a safe and responsible manner. They have developed several cutting-edge machine learning models, including the GPT series of language models; a few of them are listed below.
| Model | Description |
| --- | --- |
| GPT-3 | GPT-3 (Generative Pre-trained Transformer 3) is a language model with 175 billion parameters. It can generate high-quality natural language text that is often indistinguishable from human-written text, and can be used for a wide range of tasks, from text completion and summarization to question answering and language translation. |
| GPT-2 | GPT-2 (Generative Pre-trained Transformer 2) is an earlier version of the GPT model, with 1.5 billion parameters. It is still a highly capable language model and can be used for many of the same tasks as GPT-3, albeit with somewhat less accuracy and fluency. |
| Codex | Codex is an AI model designed specifically for generating code, trained on a large corpus of open-source code repositories. It can be used to generate code snippets, complete functions, and even entire programs in a wide range of programming languages. |
| DALL-E | DALL-E is a model trained to generate images from textual descriptions, using a combination of text-to-image synthesis and image generation techniques. It can be used to create custom visualizations and artwork based on textual input. |
Here are some of the ways GPT-based solutions can help speed up your DS/ML experiments and implementations:
1) Generating text for documentation and reports: Writing documentation and reports can be a time-consuming task. With a generative AI platform like GPT-3, we can generate high-quality, natural language text that can be used as a starting point for our documentation and reports. For example, we can generate summaries of our data, descriptions of our models, or explanations of our results.
2) Generating code: GPT-3 can generate code snippets based on natural language descriptions of the task we are trying to accomplish. This can be especially useful for tasks such as data cleaning, feature engineering, and model tuning. We can use the generated code as a starting point and modify it as necessary to fit our specific use case (a short example follows this list).
3) Generating ideas: GPT-3 can also be used to generate new ideas for Data Science and Machine Learning projects. We can input a prompt describing the problem we are trying to solve or the data we are working with, and GPT-3 can generate suggestions for models, algorithms, or approaches we might want to consider.
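For instance, once the OpenAI library is set up (installation and authentication are covered below), a single API call can turn a one-line description into a starter snippet. The sketch below assumes the legacy openai.Completion endpoint and the text-davinci-003 model; both are assumptions you can adjust.

import openai  # assumes openai.api_key has already been set, as shown later in this post

prompt = "Write a pandas snippet that drops duplicate rows and fills missing numeric values with the column median."
response = openai.Completion.create(
    model="text-davinci-003",  # assumed GPT-3 completion model
    prompt=prompt,
    max_tokens=256,
    temperature=0.2,  # low temperature keeps the generated code fairly deterministic
)
print(response.choices[0].text.strip())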
GPT inside notebooks
To make it easier to use GPT-3 in Python, there are several libraries available that wrap the API and provide a simple interface for using the language model. One such library is the OpenAI library. With this library, we can use GPT-3 to quickly generate text, code, or other content to assist in our Data Science workflows and ML operations.
In this blog, we’ll explore how to use Python magic functions to access GPT and enhance the user experience.
Magic functions
Python’s magic functions are a way to enhance the functionality of Python environments like Jupyter Notebook by allowing users to create shortcuts or special commands. These functions are called “magic” because they are not typical Python functions, but rather built-in functions that perform specific actions or set certain configurations. Magic functions can be used for a variety of tasks, such as timing code execution, debugging, profiling, and integrating with external libraries or APIs. They can also be used to create custom shortcuts or commands for repetitive tasks, making coding and data analysis more efficient and streamlined.
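For example, a couple of the built-in magics can be run directly in a notebook cell:

%timeit sum(range(1_000_000))   # micro-benchmark a single statement
%lsmagic                        # list all available line and cell magics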
How to use GPT inside a notebook
Here are step-by-step instructions on how to define a custom Python magic function using OpenAI and integrate it into Jupyter Notebook:
Install the OpenAI library and dependencies
To use OpenAI in Python, you’ll need to install the OpenAI Python library and its dependencies. You can do this by running the following command in your terminal or command prompt:
pip install openai
Authenticate with the OpenAI API
Before you can use the OpenAI API in your Python code, you’ll need to authenticate with your API key. You can obtain your API key by creating an account on the OpenAI website and following the instructions provided. Once you have your API key, you can authenticate by adding the following code to your Python script:
import openai
openai.api_key = "YOUR_API_KEY"
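Hard-coding the key works for quick experiments, but a safer pattern is to read it from an environment variable so it never ends up in your notebook or version control. The variable name OPENAI_API_KEY below is a common convention:

import os
import openai

# Read the key from the environment instead of embedding it in the notebook
openai.api_key = os.environ["OPENAI_API_KEY"]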
Define the custom magic function
In this example, we’ll create a line magic function called GPT3 that generates text using the OpenAI GPT-3 language model.
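A minimal sketch of such a magic is shown below; the model name (text-davinci-003) and the temperature setting are assumptions and can be swapped for any other GPT-3 completion model or sampling configuration.

from IPython.core.magic import register_line_magic
import openai

@register_line_magic
def GPT3(line):
    """Send everything after %GPT3 to the model as a prompt and print the completion."""
    response = openai.Completion.create(
        model="text-davinci-003",  # assumed model; any GPT-3 completion model works
        prompt=line,               # the text typed after %GPT3 becomes the prompt
        max_tokens=1024,           # upper bound on the length of the generated text
        temperature=0.5,           # assumed sampling temperature
    )
    print(response.choices[0].text.strip())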
In this code, we’re using the openai.Completion.create method to generate text with the GPT-3 language model, allowing a maximum of 1024 tokens. The line argument passed to the function, i.e. everything typed after %GPT3, is used as the prompt for generating the text.
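With the magic registered, a prompt can be run directly from a notebook cell; the prompt below is just an illustration:

%GPT3 Write a pandas snippet that one-hot encodes the categorical columns of a DataFrame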