Magnifico’s Post

Magnifico reposted this

The Knowledge Graph Guy

10mo

🔵 How Graphs Could Shape the Future of Vector Search 🔵

With ongoing advancements in Large Language Models (LLMs) such as ChatGPT, vector-based search mechanisms are rapidly transitioning from being auxiliary features to core functionalities in many platforms. Vector search is now found not only in specialised stores like Pinecone and Weaviate but also in search platforms such as Elasticsearch and databases like MongoDB. Notably, both of these platforms have employed an algorithm called Hierarchical Navigable Small World (HNSW) to deliver efficient vector search. HNSW is a graph-based algorithm; its power lies in its ability to transform continuous embedding vectors into a discrete, layered graph.

🔵 Discrete and Continuous Semantics

Fuzzy matching strategies have traditionally been implemented in conjunction with discrete filters. In search, this is referred to as 'faceting' (think of searching for 'shiny black shoes' on eBay and then selecting a specific brand from a dropdown menu). This hybrid approach has proven effective and is being widely adopted for vector search as well. For example, one might restrict documents based on geographical origin or timeframe and then use vector-based search to gauge sentiment only within that subset.

🔵 A Graph-Based Revolution

Traditional filtering is typically based on tabular (rows in a database) or tree-like (JSON documents) data formats. The landscape changes significantly when the data itself is structured as a graph. When employing HNSW in a graph-based setup, both continuous vectors and discrete facets become vertices in the same graph. This allows for more nuanced relationships and more efficient alignment. Furthermore, the upper layers within HNSW represent a form of compression. With your data in a graph, you can move beyond the classic HNSW node-degree compression algorithms to consider more semantic forms of compression, which take domain-specific ontologies into account. This could prove to be very powerful.
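To make the hybrid idea concrete, here is a minimal sketch in Python of facets and documents living as vertices in one graph, with a discrete facet step followed by a continuous vector step. Everything in it is an assumption made for illustration: the toy documents, the two-dimensional embeddings, and the names `build_graph` and `faceted_vector_search` are hypothetical, and the brute-force cosine ranking stands in for the approximate HNSW traversal a real engine would use.

```python
import math
from collections import defaultdict

# Hypothetical toy data: document id -> (embedding, facets).
DOCS = {
    "doc1": ([0.9, 0.1], {"region": "EU", "year": "2023"}),
    "doc2": ([0.8, 0.2], {"region": "US", "year": "2023"}),
    "doc3": ([0.1, 0.9], {"region": "EU", "year": "2023"}),
    "doc4": ([0.7, 0.3], {"region": "EU", "year": "2021"}),
}


def build_graph(docs):
    """Build an undirected graph linking facet vertices and document vertices."""
    graph = defaultdict(set)
    for doc_id, (_, facets) in docs.items():
        for key, value in facets.items():
            facet_vertex = ("facet", key, value)
            graph[facet_vertex].add(doc_id)
            graph[doc_id].add(facet_vertex)
    return graph


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def faceted_vector_search(graph, docs, query, facets, k=2):
    """Restrict candidates by facet vertices, then rank them by similarity."""
    # Discrete step: intersect the neighbours of every requested facet vertex.
    candidate_sets = [
        graph.get(("facet", key, value), set()) for key, value in facets.items()
    ]
    candidates = set.intersection(*candidate_sets) if candidate_sets else set(docs)
    # Continuous step: brute-force cosine ranking (a stand-in for HNSW search).
    ranked = sorted(candidates, key=lambda d: cosine(query, docs[d][0]), reverse=True)
    return ranked[:k]


GRAPH = build_graph(DOCS)
results = faceted_vector_search(
    GRAPH, DOCS, [1.0, 0.0], {"region": "EU", "year": "2023"}
)
print(results)
```

Because the facets are ordinary vertices, adding a new kind of filter (a brand, a topic from an ontology) only means adding more vertices and edges; the search function itself does not change.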
🔵 Key Takeaways for Organisations

I posit that transitioning to graph-based data structures is the next logical step in the evolution of search and knowledge representation. Therefore, my advice to organisations looking to stay ahead in the data management and analytics game is to transition as much of their core data into a graph structure as quickly as possible.

⭕ HNSW: https://lnkd.in/eH7JqEyZ
⭕ Continuous and Discrete: https://lnkd.in/ex8HA_Nj
⭕ Embrace Complexity: https://lnkd.in/ejZikEGp
⭕ Semantic Router: https://lnkd.in/eucZUjrV

58 Comments

Mercia E. Arnold

Quick-study, multifaceted, friendly, life-long learner, strategic practical orthogonal critical thinker and business model surfer. Economist & Attorney. State Legislative and Fiscal Counsel.

10mo

How does one protect against #bias in #vector based search #algorithms where people may choose to ignore non-zero #crosspartialderivatives in their #LLMs? Should #disclosure of #vector #assumptions be required as #validation of the #LLM as #science rather than #faith? Biases that are #faithbased, in the United States, are permissible, but may be a basis for #impermissable exclusions of the #possible. IMHO, #technology should #facilitatenotfrustrate human creativity as a transport TO information, & NOT an arbiter OF information.

1 Reaction

John O'Gorman

Disambiguation Specialist

10mo

Tony - I'm not that comfortable with the phrase 'Discrete and Continuous Semantics' but I think I know where you're headed here. I think it may be more precise to say that the word 'Discrete' refers to a single token with a semantically equivalent definition or general description (also a 'token' in my view). So a semantically discrete token has meaning. LLMs also (obviously) use uniquely identified (discrete) tokens, but what you describe as 'Continuous' refers to a list of probabilities of 'next tokens', which seems to indicate syntax. So, a token is discrete, while a list of embeddings (let's call it a vector) tells me which discrete, semantically stable tokens are likely to be associated with it. The combination of semantically enriched faceted knowledge graphs and syntactically robust LLMs is indeed a powerful one. McCarthy Tétrault

5 Reactions

Victor Grazi

Oracle Java Champion, Pluralsight author Twitter @vgrazi

10mo

I find it easier to express this relationally: select customer, sum(sales) where product=tv group by customer order by sum(sales) desc. Is that just my experience bias?

1 Reaction

Pranab Ghosh

AI Consultant || MIT Alumni || Entrepreneur || Open Source Project Owner || Blogger

10mo

With RAG, the LLM is hardly doing any search, because you are providing all the content to search as context using vector indexing, text indexing, KG and whatever. In this case the LLM is working more as an NLP engine, gleaning the answer from the provided context.

4 Reactions

David Bergling

10mo

Tony Seale, can Graph Structure and Blockchain be aligned in a good way, would you say?

1 Reaction

Harry Powell

Data science leader with track record of innovation and value creation

10mo

HNSW is a great use case for graph. What architecture are you currently using?

3 Reactions

Kingsley Uyi Idehen

Founder & CEO at OpenLink Software | Advancing Data Connectivity, Multi-Model Data Management, and AI Smart Agents | Unifying Disparate Data Silos via Open Standards (SQL, SPARQL, RDF, ODBC, JDBC, HTTP, GraphQL)

10mo

Yep! BTW -- here's a live variant of what you've depicted based on actual data from an @Apple product page. https://www.openlinksw.com/data/turtle/general/apple-knowledge-graph-manifestation-3.html Fundamentally, the notion of a Semantic Web has stealthily put so much in place to be exploited by this era of LLM-based language processors and code generators 😀 #KnowledgeGraph #SemanticWeb #CDO #CIO #CDAO #CTO #CMO #LinkedData #DataConnectivity

16 Reactions

Jon Cooke

(S&L)LM/GenAI & Analytics Data Products | Composable Enterprises using Data Product Pyramid and GenAI | Data Product Workshop podcast co-host

10mo

Love this Tony. Looks very like my diagram. https://www.linkedin.com/feed/update/urn:li:activity:6975421577098059776/

1 Reaction

Sebastian Wohlrapp ⭐️

10mo

Tony Seale Doesn’t a data product contain meaning or at least context to be properly consumable? If so, shouldn’t the Ontology be part of that, so on a lower level in your layer map?

3 Reactions

Mark Spivey

Helping us all "Figure It Out" (Explore, Describe, Explain), many Differentiations + Integrations at any time .

10mo

what is “discrete” or “continuous” here is not the “semantics” …

1 Reaction


More Relevant Posts

Magnifico

207 followers
9mo
Excited to learn more about NeMo and possibly implement it for customers!

NVIDIA AI

1,021,670 followers
9mo

ICYMI: We announced NeMo Retriever, a #generativeAI microservice that connects custom LLMs to enterprise data. It delivers highly accurate responses for AI applications. #AWSreInvent Image created using OpenAI's DALL-E, prompted by Jordan Ranous of StorageReview.com. Read more: https://nvda.ws/46HWQGl

NVIDIA NeMo Retriever for Generative AI in Enterprises Announced

https://www.storagereview.com
Magnifico

207 followers
9mo
This is amazing!

Anthony Robbins

Watch the keynote from our most recent GTC to see NVIDIA CEO Jensen Huang share the AI technologies affecting every industry—and our everyday lives.
9mo

BREAKING NEWS from #awsreinvent2023

Amazon Web Services (AWS) and NVIDIA Announce Strategic Collaboration to Offer New Supercomputing Infrastructure, Software and Services for Generative AI

November 28, 2023

-> AWS to offer first cloud AI supercomputer with NVIDIA Grace Hopper Superchip and AWS UltraCluster scalability
-> NVIDIA DGX Cloud (first to feature NVIDIA GH200 NVL32) coming to AWS
-> Companies partner on Project Ceiba, the world’s fastest GPU-powered AI supercomputer and newest NVIDIA DGX Cloud supercomputer for NVIDIA AI R&D and custom model development
-> New Amazon EC2 instances powered by NVIDIA GH200, H200, L40S and L4 GPUs supercharge generative AI, HPC, design and simulation workloads
-> NVIDIA software on AWS (NeMo LLM framework, NeMo Retriever and BioNeMo) to boost generative AI development for custom models, semantic retrieval and drug discovery

“Amazon Web Services (AWS) and NVIDIA have collaborated for more than 13 years, beginning with the world’s first #GPU cloud instance. Today, we offer the widest range of NVIDIA #GPU solutions for workloads including #graphics, #gaming, #highperformancecomputing, #machinelearning, and now, generative AI,” said Adam Selipsky, #CEO at Amazon Web Services (AWS). “We continue to innovate with NVIDIA to make AWS the best place to run GPUs, combining next-gen NVIDIA Grace Hopper Superchips with AWS’s EFA powerful networking, EC2 UltraClusters’ hyper-scale clustering, and Nitro’s advanced virtualization capabilities.”

“#generativeai is transforming #cloud workloads and putting #acceleratedcomputing at the foundation of diverse content generation,” said Jensen Huang, #founder and #CEO of NVIDIA. “Driven by a common mission to deliver cost-effective state-of-the-art #generativeai to every customer, NVIDIA and AWS are collaborating across the entire computing stack, spanning AI infrastructure, acceleration libraries, foundation models, to generative AI services.”

Bill Vass | Rich Geraffo | David Appel | Kim Majerus | Ray Falcione | Jim Young | Rebecca Wetherly | Rima Olinger | Heidi Buck | Mary Alexander | Ash Thankey | Amy Belcher | Kyle Johnson | Phil Goldstein | Iram A. Ali | Matthew Briggs | David Rubal, CISSP, NREMT | Renzo Rodriguez | Christian Hoff | Debra Goldfarb | Robin Goad | Dominic Delmolino | Brian Pickering

United States Department of Defense | Defense Information Systems Agency | Defense Advanced Research Projects Agency (DARPA) | Lockheed Martin | Raytheon | Northrop Grumman | Huntington Ingalls Industries, Inc. | MITRE

AWS and NVIDIA Announce Strategic Collaboration to Offer New Supercomputing Infrastructure, Software and Services for Generative AI

nvidianews.nvidia.com
Magnifico

207 followers
10mo
👏
Gabriele Venturi

Building PandasAI, the library to extract value from your data
10mo

OpenAI is not the death of startups, it's a Wakeup Call!

In the past days, many fellow entrepreneurs have reached out to me asking if I'm concerned about OpenAI's new offerings. There's a narrative spreading that thousands of startups will be killed by capabilities like custom chatbots and text generation revealed at DevDay.

Like them, I've been following the OpenAI announcements closely. The launches of tools like custom GPTs have no doubt led some to predict the impending downfall of companies built on conversational AI. However, I believe this view misses the mark.

The startups destined for disruption by OpenAI's offerings were likely already on borrowed time. Building a thin wrapper on top of a third-party technology like GPT-4 was never going to be a sustainable business in the long run. The value lies in building differentiated products with unique data and capabilities.

There are a few key points we should keep in mind:

✅ Generative AI is more than just conversational interfaces. For the first time, we have technology mimicking human reasoning and creativity. The possibilities extend far beyond chatbots.
✅ Conversational interfaces are not necessarily the future. OpenAI's own usage data for ChatGPT shows declines after initial hype. A clickable UI can often be more efficient than typing sentences.
✅ There's a massive difference between a basic integration of LLMs and building an actual product. True startups solve real problems and meet needs. An intelligent interface is just one piece.

For many companies, this moment is an opportunity, not a death sentence. The time has come to stop relying on third-party technology and double down on unique data, industry expertise, and product-market fit.

The fundamentals haven't changed. Building a startup today still means making something people want. OpenAI expands what's possible, but ultimately we still need to identify real problems and develop complete solutions.

Rather than the end, I see this as a wakeup call. A nudge to build differentiated products on owned technology, not thin layers on leased foundations. The startups that survive will be those that embrace OpenAI as an enabler, not a crutch. An amazing new technology to create value, not a shortcut.

This is an exciting time full of possibility. OpenAI has raised the bar, but also opened up many new avenues. For startups willing to learn and adapt, the opportunities are endless. The only true death will come to those who fail to evolve.

#OpenAI #startup #GenAI
Magnifico

207 followers
10mo
There's a lot to be optimistic about with AI.
Shahid Azim

CEO I Managing Partner I Co-founder I Serial Health Tech Entrepreneur
10mo Edited

Back from TED AI, here are some interesting snippets from an exciting couple of days. Broadly, though we are seeing hype cycles in some segments, there is a massive societal-scale shift underway which touches every profession and every sector. #c10labs C10 Labs also hosted its first West Coast AI Salon, which was attended by some amazing minds! #ai4impact

With god-like powers comes a need for god-like wisdom.
Don’t hate the tech players, but change the rules of the game!
AI is not just a tool but a ladder for us.
English is the most common programming language now!
We are going to have an agent for everything. Personal agents for everyone.
“Having your bit flipped!!!” (seeing AI perform!): Reid Hoffman’s quote on seeing ChatGPT perform for the first time in a private setting with Gates.
Do not panic.
Line-of-sight medical assistants for everyone, tutors for everyone.
Potential emergence of a world of possibilities and abundance.
Navigate better outcomes for humanity with AI.
Human ingenuity like never before with AI.
AI allows for the gift of time for the patient-doctor relationship.
AI is a technology of abundance and we should not approach it with a scarcity mindset.

Ramesh Raskar Patricia Geli Muntazir Mehdi Ahmer Inam George K. Beth Porter #TEDAI