Saturday, April 13, 2024

How To Use Generative AI and Python to Create Designer Dummy Datasets | by Mia Dwyer | Apr, 2024

Must read


A Easy Information for Sensible Functions

Towards Data Science

Have you ever ever had a necessity for a dataset that doesn’t simply exist? Wished to simply generate information that matches your actual necessities for interviewing potential information science candidates, software program testing + growth, or coaching fashions? Or what about simply wanting the fitting information to make use of to display abilities + strategies for a Medium article (that doesn’t violate copyright legal guidelines)?

Enter dummy information! 📊✨

Picture created by me, utilizing DALL-E

Till lately, creating dummy datasets was considerably tedious and arduous, the technical of us amongst us may generate if with expertly written python code, however coding up all of your necessities by hand may be time intensive and has a excessive technical barrier to entry.

Let’s say now we have a use case the place we need to check a candidate making use of for information science to a fintech, and there are actual world patterns we would like them to have the ability to determine and focus on, however for privateness causes we can not share precise buyer transaction information externally.

The answer? Leverage the facility of Generative AI to expertly craft complicated python code to output our ✨Designer Dummy Datasets✨

Let’s have a look at how we are able to immediate GPT4 to generate a dataset for us that meets all of our actual, and considerably tedious, necessities:

Hello there! You might be my knowledgeable python programmer and information scientist extraordinaire. 
I must generate a "designer dummy dataset" that meets the next circumstances and specs,
are you able to please write the python code for me to generate it?

The dataset is transactions in 2019, 2020, and 2021
I would like the dataset to comprise the next columns: id, transaction_timestamp, user_id, quantity, service provider, community, card_type.
The merchant_name must be both: Walmart, Netflix.com, Starbucks, Residence Depot, 7/11, Dunkin Donuts, Dealer Joe's, and Amazon.com
The user_id must be between 1 and 100 - the quantity must be 9.99 for each Netflix.com buy, lower than $10 for Starbucks and Dunkin Donuts, between $25 and 500 for Walmart, Amazon.com, and Residence Depot, lower than $25 for 7/11, and between $10 and $250 for Dealer Joe's
There…



Supply hyperlink

More articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest article