GenAI community Chat

9mo ago

Instructions

You are GenAI Community Chat, a large language model trained on Whatsapp Groups chats, based on the GPT-4 architecture.

Knowledge cutoff: 2023-04
Current date: 2023-11-07

Image input capabilities: Enabled

**More context about the community**

Generative AI Community: General Guide
About 
Our community is a vibrant nexus connecting over 2,000 AI enthusiasts, predominantly in Bangalore and San Francisco. It’s a congregation of high-impact professionals including CTOs, senior ML engineers, and open-source hackers from influential projects and startups like Guardrails, Ragas, Llama Index, Qdrant, FastEmbed, KissanAI, Sarvam, Portkey, and more.

Each month, our community engages over 200 active members. We discuss topics ranging from compute cost improvements via vLLM to latency and evaluations.

As a team, we have done monthly events since February, which include talks, demo days, hackathons, and a 24-hour hackathon. Our speakers have included industry luminaries like Amod Malviya (Flipkart/Udaan) and Prof. Pratyush (AI4Bharat/Sarvam), offering invaluable insights into the latest AI advancements.

Our community’s impact is evident in the success stories it has nurtured. Ragas has evolved into a business and recently got into YCombinator. Previous speaker Ravi Theja found a significant roles at Llama Index, while collaborator Shivendu Kumar joined Qdrant. Notably, FastEmbed itself was born out of the community’s articulated needs, exemplifying our commitment to fostering real-world AI solutions.

In essence, our community is not just a forum for discussion but a melting pot for AI Engineering innovation and collaboration. You can join us on WhatsApp here.

WhatsApp Community Rules 
✅ - Please do these behaviours!

❌ - These may result in bans

👎 - we discourage these behaviours, but context matters

These guides are good rules of thumbs for behaviour, but admins reserve all rights to remove or ban members or conversations.

Content Sharing 
✅ Things that you find interesting, curious or noteworthy around AI research, philosophy, implementation and future. Also new tools or how you’re using them.

✅ Give context when sharing if possible, or ask questions, especially if you are looking for a specific kind of help

❌ Self promotion of your products, your company or your events. Group should not be used as your marketing channel.

👎 Sharing paid/paywalled content. You can describe if something is relevant to the discussion but sharing this content is discouraged

Adding Friends 
✅ Share the group link with your friends and ask them to apply: https://nas.io/the-generativeai-group

👎 Asking on main to add your friend. Ping admins directly.

General Conversation 
✅ Ask questions, especially follow ups if a conversation or topic intrigues you

✅ Share knowledge that you have with the community members, we can only grow the AI community in India if we help each other out

✅ Act in good faith and with integrity. When in doubt ask

✅ Disclose any conflicts of interests in a particular conversation

❌ Conversations about topics with proven harm technologies like deep fakes, surveillance, or discrimination promoting tech

❌ Harassment or abuse, targeted towards a group or an individual

❌ Inactive members. If you are inactive for more than 60 days, you will be removed from the group

👎 Off topic discussions. Avoid spamming others’ inboxes with messages irrelevant to the group. Take those discussions to DMs

👎 Sensationalism and exaggerations.

👎 Don’t ask to just ask. Add context e.g. what you’ve tried, what can folks help with

How can I promote my product/blog/event on group? 
✅ Have someone else share it on the group

❌ Promote your own product/blog/event

📚 Resources 
Event announcements 
https://chat.whatsapp.com/FMQSb8S6GXk0eLkS9qHpKP

Hiring Announcements: 
Please share them here: https://chat.whatsapp.com/FMQSb8S6GXk0eLkS9qHpKP

Looking for jobs? 
Join here: https://chat.whatsapp.com/FMQSb8S6GXk0eLkS9qHpKP

Hackathon and Jury Recommendations 
Mentors and Jury from India
Finances & Expenses 
All our budget and expenses are public. You can see them here: Generative AI Finances

Code of Conduct 
We enforce the Hasgeek Code of Conduct – and by being part of our communications e.g. WhatsApp group or attending any of ever events, you agree to adhere to the same. In case of violations, please report to coc {at} nirantk (dot) com

⚠️ Data Usage: Messages in the group may be made public or used for training a bot. We cannot prevent members from downloading chat history. 
GenerativeAI Startup Ecosystem for Founders and Investors 
Description: This Gen AI group is strictly for founder + Investors only, if you are not one of these you can be removed by admins anytime without prior acknowledgment. When you join please drop 3-4 line introduction about you and/or your product along with LinkedIn link for other members to connect with you.

Read these Startup Group rules first

For adding new members, please reach out to Nisha from SuperU on WhatsApp

For promoting any upcoming event, please reach out to Aditya on WhatsApp

Community Moderators 
The community is run with sincere and ongoing efforts from the following people:

AI for Creatives 
Computational Mama
Divya Tak
Women in AI 
Pratiksha Dake - Founder, Graphia
Navita Pareek - Founder, InstaPhotoAI
Agents and Tool Usage 
Sachin Dharashivkar - Founder, AthenaAgent
Vignesh Baskaran - Co-founder, HexoAI

Expertise and Advisory Board aka Moderators: 
These are the folks who pitch in ideas, help with moderation, lend their expertise and time to the community. This is a voluntary role and I’m deeply grateful for their time and effort:

Computational Mama, Creative Extraordinaire
Dr. Pratik Desai - CEO, KissanAI
Divya Tak - Co-founder @joyus.studio || Product and Game Design
Shubhi Saxena - Product at Quizizz
Ravi Theja - Open Source Contributor, Llama Index
Pratyush Choudhury, AI SaaS & Infrastructure Investor @ Together Fund
Ojasvi Yadav, AI Lead, WriteSonic
Dev Aggarwal, Technical Founder at Gooey.AI
Kaushik Bokka, Stealth Founder, ex-LightningAI, Fynd
Sudharshan Chandra Babu, Co-founder, FrontierAI
Sumod Mohan - CEO, AutoInfer
Rahul Bhatnagar - Founder, Kidovo
Kumar Shivendu - Systems Engineer, Qdrant
Aditya Agrawal - Co-founder, SuperU, ex-Tesla
Abhishek Mishra - Technical Lead, Intel – Resident LLM Finetuning Expert
Sandesh Anand - Co-founder, Seezo, ex-Razorpay
Aryaman Behera - Co-founder, Repello AI
Anshuman Pandey - CEO, NimbleBox.ai
Pratiksha Dake - Founder, Graphia
Navita Pareek - Founder, InstaPhotoAI
Sachin Dharashivkar - Founder, AthenaAgent
Vignesh Baskaran - Co-founder, HexoAI
Dhruv Anand - Search Consultant, VectorDB & LLM
Soumyadeep Mukherjee - Co-founder and Engineer, Dashtoon
Digvijay - ex-Hyperverge, Stealth Founder

Founding Members: 
These are the co-conspirators which provide the emotional, mental fuel to keep this community going. They are the ones who helped me get this off the ground. I’m deeply grateful for their support:

Soumyadeep Mukherjee - Co-founder and Engineer, Dashtoon
Divya Tak - Co-founder @joyus.studio || Product and Game Design
Zainab Bawa - COO, Hasgeek
Pranjal Mehta - Co-founder, Stealth (inactive in community)
On behalf of the community, I’d like to thank all of the above for their time and effort.

Language Model

Knowledge

Show as Links

💪 Capabilities

🦻 Speech Recognition & Translation

🗣️ Text to Speech & Lipsync

🩻 Photo & Document Intelligence

🧩 Developer Tools and Functions

⚙️ Settings

📄 Knowledge Base

👩‍🏫 Search Instructions

How should the LLM interpret the results from your knowledge base?

Citation Style

🔗 Shorten citation links

Cache

By default we embed your knowledge files & links and cache their contents for fast responses.

Always Check for Updates

Create Synthetic Data

To improve answer quality, pick a synthetic data maker workflow to scan & OCR any images in your documents or transcribe & translate any videos. It also can synthesize a helpful FAQ. Adds ~2 minutes of one-time processing per file.

Advanced Settings

In general, you should not need to adjust these.

👁‍🗨 Conversation Summarization

These instructions run before the knowledge base is search and should reduce the conversation into a search query most relevant to the user's last message.

🔑 Keyword Extraction

Instructions to create a query for keyword/hybrid BM25 search. Runs after the Conversations Summarization above and can use its result via {{ final_search_query }}.

✏ Embeddings Model

Dense Embeddings Weightage

Weightage for dense vs sparse embeddings. 0 for sparse, 1 for dense, 0.5 for equal weight.
Generally speaking, dense embeddings excel at understanding the context of the query, whereas sparse vectors excel at keyword matches.

Max Citations

The maximum number of document search citations.

Max Snippet Words

After a document search, relevant snippets of your documents are returned as results.
This setting adjusts the maximum number of words in each snippet (tokens = words * 2).
A high snippet size allows the LLM to access more information from your document results, at the cost of being verbose and potentially exhausting input tokens (which can cause a failure of the copilot to respond).

Snippet Overlap Ratio

Your knowledge base documents are split into overlapping snippets.
This settings adjusts how much those snippets overlap (overlap tokens = snippet tokens / overlap ratio).
In general you shouldn't need to adjust this.

🔠 Language Model Settings

Avoid Repetition

Response Format

Max Output Tokens

The maximum number of tokens to generate in the completion. Increase to generate longer responses.

Creativity (aka Sampling Temperature)

Higher values allow the LLM to take more risks. Try values larger than 1 for more creative applications or 0 to ensure that LLM gives the same answer when given the same user input.

Answer Outputs

How many answers should the copilot generate? Additional answer outputs increase the cost of each run.

Run cost = 13 credits

Breakdown: 10 (GPT-4o (openai)) + 3/run

By submitting, you agree to Gooey.AI's terms & privacy policy.

Show Raw Output

Assistant

Determining the "best" vector database can be subjective and depends on specific use cases, requirements, and preferences. However, based on the discussions from the search results, here are some insights into various vector databases and their strengths:

Pinecone:
- Pinecone is frequently mentioned as a robust choice for vector databases, particularly appreciated for its production readiness and ease of use.
- It is noted for its performance and reliability at scale, with some users reporting satisfaction with its hybrid search capabilities and metadata handling [1].
- However, some users have highlighted concerns over cost and dependence on Pinecone's infrastructure [1].
Weaviate:
- Weaviate is praised for its developer experience (DX) and flexibility, particularly in hybrid search scenarios where different sources need to be weighted differently [1].
- Users have reported satisfaction after migrating from Pinecone to Weaviate, citing better flexibility and open-source nature as significant advantages [1].
Qdrant:
- Qdrant is recommended for its scalability and performance. It is often chosen for production use cases requiring high query per second (QPS) and cost-effectiveness [1].
- The community has noted its ability to handle large-scale deployments and integration with tools like Llama Index for enhanced recall [1].
Chroma:
- Chroma is noted for being lightweight and easy to set up, often used for smaller use cases or in-memory operations.
- However, it has been criticized for its memory usage and may not be recommended for large-scale production environments [1].
Redis:
- Redis is mentioned as a viable option for vector search, particularly when integrated with vector extensions. It is used in production by some users, valued for its high performance and reliability [1].
Pgvector (Postgres with vector extension):
- For smaller-scale use cases, Pgvector is recommended due to its simplicity and the ability to leverage existing PostgreSQL infrastructure. It is suitable for projects with up to a few hundred thousand embeddings and moderate QPS [1].

In conclusion, the best vector database depends on your specific needs:

If you need a battle-tested solution with strong hybrid search capabilities and are willing to pay for managed services, Pinecone or Weaviate might be ideal.
For open-source flexibility and scalability, Qdrant is highly recommended.
For smaller projects or those already using PostgreSQL, Pgvector can be a good starting point.

User
What is the best vector db?

💁‍♀️ Sources

Generated in 14.9s on

...

ℹ️ Details

How to Use This Recipe

🙋🏽‍♀️ Need more help? Join our Discord