Disruptive Analytics by Thomas W. Dinsmore
Author:Thomas W. Dinsmore
Language: eng
Format: epub
Publisher: Apress, Berkeley, CA
Streaming Operations
A conventional relational database processes a query until it reaches end-of-table indicators for all tables referenced in the query. In a streaming database, there is no equivalent end-of-table concept, since the database updates continually as new records arrive.
Many of the operations analysts seek to perform on streams are the same as those they perform on static tables. Some, however, are unique to streaming data: they include joining streams, aggregations, filtering, windows, and alerts.
Joining Streams. For insight, analysts may need to join multiple streams to one another. For example, a vehicle fleet operator may have streams of data arriving for each vehicle in the fleet; to monitor fleet-level statistics, all of the individual streams must be joined into a single stream.
For context, analysts may also need to join streams to static tables. For example, suppose that we are working with a stream of transactions posted by hundreds of retail stores, which we want to group by region. The transaction records in the incoming stream have a store code but not a region code; for that, we must join the stream to a static store master table to capture the region code.
Aggregations. A key capability of streaming SQL is to the ability to compute and retain aggregates on the incoming stream. For example, we may want to compute a cumulative count and sum of transactions as they arrive. Aggregations are useful when combined with windowing, so the computed measures correspond to statistics for discrete and finite time intervals.
Filtering. Filtering a stream of data is conceptually similar to filtering a static data set. We filter for two reasons: To remove noise and irrelevant data.
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Storytelling for dummies by Andrea Fontana(1470)
Effortless by Greg McKeown(1391)
The Practice by Seth Godin(1370)
Mastering Blockchain by Lorne Lantz(1367)
Blockchain Quick Reference by Paul Valencourt & Samanyu Chopra & Brenn Hill(1120)
Mastering Blockchain by Lorne Lantz & Daniel Cawrey(890)
How to Lead by David M. Rubenstein(790)
The wind in the willows by Kenneth Grahame(773)
The Ape in the Corner Office by Richard Conniff(762)
Handbook of Big Data Analytics by Unknown(690)
Social Media Engagement For Dummies by Aliza Sherman(686)
FunRetrospectives: activities and ideas for making agile retrospectives more engaging by Paulo Caroli & Tainã Caetano Coimbra(668)
Getting Started with Data: The first book you should read to successfully get along with data. by Menegatti Gabriel & Team Simbiose Ventures(668)
Taking Care of Yourself (HBR Working Parents Series) by Harvard Business Review(668)
Business Storytelling For Dummies by Unknown(647)
Evernote for Self Publishing: How to Write Your Book in Evernote from Start to Finish by Jose John(639)
Help! My Facebook Ads Suck-- by M. D. Cooper & Jill Cooper(633)
Genius by Choice: Your unconventional A–Z handbook to enhance your learning process by Remondino Giulia S(613)
A Leader Listens by Ajay Banga(612)
