data.world

data.world

Software Development

Austin, Texas 18,792 followers

The Enterprise Data Catalog Platform

About us

data.world is the data catalog platform built for your AI future. Its cloud-native SaaS platform combines a consumer-grade user experience with a powerful knowledge graph to deliver enhanced data discovery, agile data governance, and actionable insights. Get fast, trusted access to your company’s knowledge when you create AI-powered data experiences with the AI Context Engine™. data.world is a Certified B Corporation and public benefit corporation and home to the world’s largest collaborative open data community with more than two million members, including ninety percent of the Fortune 500. Our company has 76 patents and has been named one of Austin’s Best Places to Work eight years in a row.

Website
https://data.world
Industry
Software Development
Company size
51-200 employees
Headquarters
Austin, Texas
Type
Privately Held
Founded
2016
Specialties
data collaboration, data science, analytics, enterprise, teamwork, open data, collaboration, data catalog, data governance, data ops, data stewards, ai, saas, and data cloud

Locations

Employees at data.world

Updates

  • data.world reposted this

    View profile for Juan Sequeda, graphic

    Principal Scientist & Head of AI Lab at data.world; co-host of Catalog & Cocktails, the honest, no-bs, non-salesy data podcast. Scientist. Interests: Knowledge Graphs, AI, LLMs, Data Integration & Data Catalogs

    A fantastic #HonestNoBS Dinner in Boston during CDOIQ. The takeaways: 👍 What’s working  - Easy to get ideas of new ways to drive value with data  - Knowing the business of the business  - AI hype is helping because it can be seen as a catalyst to focus on data and focus on the stuff we care about. But also a distraction  - In addition to Business Reviews, let’s have Business Forwards. Need to talk about what’s going forward - OODA loop: Observe Orient Decide Act  - Make the execs look better. But how do we find that out faster?  - Executives being specific about what they care about. Sometimes it happens if they get into an embarrassing situation because they lacked specific information - Asking: What do you know and don’t know about X? What can you know?  - Never waste a crisis. Something goes wrong. Take advantage - Investing in metadata, protocols, standards within an industry. Describe how to exchange the data that is generated automatically.  Why? Cost reduction. Also if you share bad data, then you look bad - Have the data workshop during the executive planning. Not just OKRs but also art of the possible and the hypothesis we want to test  - Focus on data processing, shift left. Analytics is post processing. However, native tech companies' learnings are not always applicable to everyone - Let business build their own thing, and have them follow a process - Data lineage 👎 What’s not working.  - Activating and executing the ideas to drive value - Change is hard. Are people afraid of change because of accountability? No one wants to be responsible if it goes bad  - Culture is hard. Divide between tech and biz - Buy in is hard, even if the ideas are good and show apparent value  - People are scared. The data they have today is the constant. People who can’t reproduce the number twice, are accountable - There is no clear vision/north star. Where do we want to go? “Our direction was directionless” - Data strategy lacking visibility of business strategy “We have a new strategy. It includes AI. But not data”  - A culture of “print the dashboard”/“report as pdf”. We should understand why that ask? Lack of trust. Want it printed because they want the evidence in case the numbers change. Need to meet people where they are.  - Clear high level mission of the organizations but it may not connect with the “rank and file” - People don’t know what they are looking for - Not being open minded. On metrics: I don’t care if it’s wrong. I want a discussion.  - Defining what X means - One mistake now the perception is that all data is bad - Hide behind industry norms "No one does this well" so why do you want to be in second place? - Implement transaction systems and the biz doesn’t even know what’s in the system - Putting tech in before identifying the problem - Blueprints aren't valuable until the bridges fall down, and you don’t want to be on the bridge when it falls down. Favorite quote: “Sometimes buying in requires letting go”

    • No alternative text description for this image
  • View organization page for data.world, graphic

    18,792 followers

    What a great start to the day at #CDOIQ2024! 🎉 Our session, "From Data to Governed Decisions with AI," was a success thanks to our amazing speakers Patrick McGarry (data.world), Justin Magruder (SAIC), Ren Essene (Consumer Financial Protection Bureau), and Dessa Glasser (Global Legal Entity Identifier Foundation (GLEIF)). Don’t miss this afternoon's session with Juan Sequeda (data.world) on "Knowledge Graphs as a Requirement for AI Ready Data and LLM Accuracy." It's a must attend—make sure to add it to your calendar! 📅 View session details and add it to your schedule: https://hubs.li/Q02GYw-R0

    • No alternative text description for this image
  • data.world reposted this

    View profile for Juan Sequeda, graphic

    Principal Scientist & Head of AI Lab at data.world; co-host of Catalog & Cocktails, the honest, no-bs, non-salesy data podcast. Scientist. Interests: Knowledge Graphs, AI, LLMs, Data Integration & Data Catalogs

    Excited to be back CDOIQ again. I'm giving a talk on Wednesday and the takeaway is: Invest and treat knowledge as a first class citizen (the WHAT) But Why? 1. Because we are doing the same thing over and over again (more data, more tech) and expecting different results. That's Einstein's definition of insanity.  2. We need to find a balance between efficiency and resilience in order to address the known use cases of today (efficiency) and the unknown uses cases of tomorrow (resilience)  3. Operational Efficiency: by being efficient and resilient, we can reduce cost, improve productivity, be agile and flexible to deal with those unknown use cases, optimize resources But haven't we been already been investing in knowledge? 1. Yes, but by chance: we find issues when folks complain 2. Yes, but in an ad-hoc manner: it's not someones dedicated task  3. Yes, but inconsistent investment: ups and downs to deal with data quality But why NOW?  Simple: (Gen)AI! This is an opportunity to ride the hype, in order to address the pains we've been constantly been complaining about. If it's not now, then when??? How do you address the trust issues for AI: Accuracy, Explainability and Governance? By investing in Knowledge! How to treat knowledge as a first class citizen? People, Process and Technology, of course :) - Tech: Knowledge Graphs! Your first knowledge graph is your data catalog which brings together the technical and business metadata, building the brain of your organization. Our lab has been pioneering the research on studying how to use Knowledge Graphs to increase LLM accuracy, specifically for structured data. So if the "why now" is (Gen)AI, then this is the motivator to invest in knowledge graphs which provides the context of your organization.   - Process: Follow a methodology. Success is defined by business questions, not technical questions. I'll be talking about our Pay as you go methodology and Iron thread approach.  - People: Have the right people in the room: Business, Data and .. Knowledge Engineers! I'm also excited to host another #HonestNoBS dinner Wednesday night. It's going to be 6 months since we did the last one so it will be great to see how the data world has changed in the last half year. If you are not aware of these dinners, check out the report I wrote about the dinners I hosted in 2023. We can possibly squeeze a few more folks in, so if you are a data leader/executive and want a dinner under chatham house rules, comment below or ping me directly. Links in comments

    • No alternative text description for this image
  • View organization page for data.world, graphic

    18,792 followers

    Join us at #CDOIQ2024 on Wednesday for a deep dive into agile data governance and GenAI! 🚀 In Session 9-A: "From Data to Governed Decisions with AI," you'll learn from industry experts Patrick McGarry (data.world), Ren Essene (Consumer Financial Protection Bureau), Dessa Glasser (Global Legal Entity Identifier Foundation (GLEIF) and Justin Magruder (SAIC) about: 👉 Maintaining data quality 👉 Meeting regulatory standards 👉 Reducing bias in AI 👉 Ensuring security in AI outcomes 🗓️ Wednesday, July 17 | 8:30 am – 9:15 am Don’t miss this essential discussion. Add the session to your calendar here: https://hubs.li/Q02GzN7M0

    CDOIQ '24 with data.world

    CDOIQ '24 with data.world

    page.data.world

  • View organization page for data.world, graphic

    18,792 followers

    Exciting to see that folks are speaking our language! Thanks for the mention Neo4j 🚀 Check out our recent benchmark study cited in the blog post: https://lnkd.in/epJMkBQa

    View profile for Emil Eifrem, graphic

    Founder and CEO of Neo4j

    My friend Philip Rathle has written an outstanding blog post that summarizes the recent buzz around GraphRAG, what we've learned from a year of helping users build systems with Knowledge Graphs + LLMs and where we believe the space is going. There's been an explosion of research articles in the last few months discussing how to use Knowledge Graphs in RAG systems. For good reasons! It turns out that building a knowledge graph of your data and using it in RAG gives you several powerful advantages. 1️⃣ It gives you better answers to most if not ALL questions you might ask an LLM using normal vector-only RAG. That alone will be a huge driver of GraphRAG adoption. 2️⃣ In addition to that, once you've created your Knowledge Graph you get easier development thanks to data being visible when building your app. Easier to build LLM-backed applications is a BIG DEAL and sorely needed in these non-deterministic systems. 3️⃣ A third major advantage is that graphs can be readily understood and reasoned upon by humans as well as machines. Building with GraphRAG is therefore easier, gives you better results, and -- this is a killer in many industries -- is explainable and auditable! That's a powerful combo! I believer that over time, GraphRAG will overtake vector-only RAG as the default architecture in LLM-backed applications. It makes total sense that the R in RAG becomes graph centric. As an industry, we already converged on the best way to do Retrieval for the web. The key to a good R on the web was graph algorithms (specifically PageRank). That innovation created a trillion dollar company. a) Retrieve the relevant documents through keyword / vector search. b) Rank them in the graph to get the "top ten blue links." Vector-only RAG is Altavista. 🔎 GraphRAG is Google. 🚀 We're entering the "Ten Blue Links" era of RAG. https://lnkd.in/des7MJdK

    The GraphRAG Manifesto: Adding Knowledge to GenAI - Graph Database & Analytics

    The GraphRAG Manifesto: Adding Knowledge to GenAI - Graph Database & Analytics

    neo4j.com

  • View organization page for data.world, graphic

    18,792 followers

    In the dynamic realm of content creation, where AI meets human ingenuity, it's still the human collaboration that helps spin raw thoughts into gold and bring words to vivid life. The process of creation involves blending human intuition with AI innovation to create something truly magical. Read on: https://hubs.li/Q02GnSBR0 Author: Jason Guarracino

    The symphony of human and AI creativity: A heartfelt ode to collaboration

    The symphony of human and AI creativity: A heartfelt ode to collaboration

    data.world

  • View organization page for data.world, graphic

    18,792 followers

    🚀 Transform your public sector AI! Join data.world and Hakkōda on July 31st for an exclusive webinar. Hear from Patrick McGarry, GM Federal at data.world, and Eric Barton, Director of Data Enablement at Hakkōda, as they cover: 🔹 Improving data governance strategies 🔹 Optimizing your data infrastructure 🔹 Enhancing data quality and enrichment 🔹 Advancing your AI initiatives, even if you haven't started Don't miss this chance to learn from the best in the industry. Register now! 🗓️ July 31st 📍 Register here: https://hubs.li/Q02GdB6w0

    Data Modernization, Governance, and AI Strategies: From NOW to the Future of Gov AI - View

    Data Modernization, Governance, and AI Strategies: From NOW to the Future of Gov AI - View

Affiliated pages

Similar pages

Browse jobs

Funding

data.world 7 total rounds

Last Round

Series C

US$ 50.0M

See more info on crunchbase