#dataproduct

Larry Swanson (@LarrySwanson)
2025-03-13

The arrival of creates urgency around the need to guide and govern them.

After 15 years of building reliable AI solutions for banks and other enterprises, Jacobus Geluk sees a standards-based marketplace as key to success.

His proposed new specification articulates the business needs that data products address, complementing his earlier work on the data-product description standard.

knowledgegraphinsights.com/jac

podcast cover art for interview with Jacobus Geluk, expert on the emerging data product economy
2025-01-08

The Treasury Board's policy suite is conceptually a giant graph structure, but is frustratingly resistant to automated analysis.

Some annoyances:

The policy suite straddles tbs-sct.gc.ca and canada.ca and policies often draw their authorities from material on laws-lois.justice.gc.ca

There is frustratingly little common structure you can rely on. If you think you found a structure, you just need to see a few more policies

Links between policies or to laws rarely link to relevant sections

Only a few policies have an XML data representation; most are available only as HTML, making web scraping the most reliable approach

Markers indicating sections, clauses, etc. are not consistent across HTML documents, making web scraping extremely annoying

Multiple requirements often occur in a single ("and")
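
To make the "giant graph structure" idea concrete, here is a minimal sketch of how the links in one policy page could be turned into graph edges using only the Python standard library. The three host names come from the list above; the sample HTML, its URLs, and the names `LinkCollector` and `policy_edges` are made up for illustration and do not reflect the real pages. The last field of each edge records whether a link points at a specific section (has a `#fragment`), which is the gap the list complains about.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse

# Hosts named in the post; links to any other host are ignored.
POLICY_HOSTS = ("tbs-sct.gc.ca", "canada.ca", "laws-lois.justice.gc.ca")


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def policy_edges(source_id, html):
    """Return (source, target_host, target_url, has_section_anchor) tuples."""
    parser = LinkCollector()
    parser.feed(html)
    edges = []
    for href in parser.hrefs:
        parts = urlparse(href)
        host = parts.hostname or ""
        # Match the bare host or any subdomain of it (e.g. www.canada.ca).
        matched = next(
            (h for h in POLICY_HOSTS if host == h or host.endswith("." + h)),
            None,
        )
        if matched:
            edges.append((source_id, matched, href, bool(parts.fragment)))
    return edges


# Hypothetical page content; the URLs are placeholders, not real policy links.
SAMPLE_HTML = """
<p>See <a href="https://www.canada.ca/example-policy#section-4">section 4</a>
and the <a href="https://laws-lois.justice.gc.ca/example-act">Example Act</a>.
Unrelated: <a href="https://example.org/other">other</a>.</p>
"""

edges = policy_edges("policy-a", SAMPLE_HTML)
for edge in edges:
    print(edge)
```

Collecting edges this way says nothing about sections or clauses inside a page; that part still depends on the inconsistent markers described above.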

Enabling programmatic analysis of policy would be broadly valuable both inside and outside government.

This should be an #opendata #dataproduct but it seems like these documents are largely treated like marketing material: if it looks OK in the browser it's done.

#gcdigital

Olavur Ellefsen (@olavur)
2023-12-05

What do you think of the following description of Flowcore (the data management platform)?

The platform provides you with event streaming and event sourcing in a single, easy-to-use service. Data flow and replayable storage. Designed for developers at data-driven startups and enterprises that aim to stay at the forefront of innovation and growth.

flowcore.com/

Does anyone here know of some literature on how to share entity relations between domains? Let's say Lionel Messi, with id 123 in some sports system, starts an acting career and gets id 456 in #imdb. How would the sports department communicate this to some other, third domain so they can join and aggregate? How do you deal with deletion of an entity? Just tombstones? I'd love some research on this topic, as it seems to reinvent itself time and time again.
#datamesh #dataproduct #kafka #protobuf
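
One way to picture the question, offered only as a sketch and not as an answer from the literature: publish "same-as" link events keyed by the source entity, and treat an event with no target as a tombstone, similar to how a null value marks a deletion in a compacted Kafka topic. The names `EntityRef`, `LinkEvent`, and `fold`, and the domain labels, are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class EntityRef:
    """An entity as one domain knows it, e.g. ("sports", "123")."""
    domain: str
    local_id: str


@dataclass(frozen=True)
class LinkEvent:
    """States that `key` is the same real-world entity as `same_as`.

    same_as=None is a tombstone: the link for `key` is withdrawn.
    """
    key: EntityRef
    same_as: Optional[EntityRef]


def fold(events):
    """Replay events in order and return the current key -> target links."""
    links = {}
    for event in events:
        if event.same_as is None:
            links.pop(event.key, None)  # tombstone removes the link
        else:
            links[event.key] = event.same_as  # latest event wins
    return links


messi_sports = EntityRef("sports", "123")
messi_imdb = EntityRef("imdb", "456")

log = [LinkEvent(messi_sports, messi_imdb)]
print(fold(log))

# Later the sports domain deletes the entity and publishes a tombstone.
log.append(LinkEvent(messi_sports, None))
print(fold(log))
```

A third domain that replays the log can join on either id while the link exists. Whether a tombstone should also cascade to data already aggregated downstream is exactly the open part of the question.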

2022-12-11

I've been too long without posting. There's been a lot going on in data engineering, the modern data stack, the business(es) of data, … not to mention FTX and ChatGPT (but, of course, you already know about all of that.)
medium.com/@rhm2k/resonance-ca

#DataEngineering #DataProduct #DataEconomics

2022-11-23

"A Data Product is a set of prepared data or information (and hence specifically not raw data) that is ready to be consumed by a wide set of consumers."

Willem Koenders provides insights into one of the fundamental concepts of contemporary data engineering.

#DataEngineering
#DataProduct

medium.com/@koendit/whats-the-
