The flashcards below were created by user
on FreezingBlue Flashcards.
understand the problem
identify processes involved
identify info required to solve the prob
gather & process data & analyze the results
steps to identifying the right sources of data
tend to be degisned as high level, end-to-end structures useful for decision making and normalizing how things get done in an org - are comprised of one or more workflows relevant to the overall objective of the process
are task-oriented and often require more specific data than processes
validity - correct/quality
veracity - meaningful
volatility - how long does it need to 'live'
six V's of big data
true understanding of context that comes from triangulation of data
to examine the context in different ways & from different perspectives
refers to data's useful lifespan - only keep what you need - is it relevant, or will it be in the future
the idea that data must lead to an "ah ha" moment & actionable results
Enterprise Data Management (EDM)
a comprehensive approach to defining, governing, securing, and maintaining the quality of all data involved in the business process of an org - establishes policy about and ownership of key data types - sustains a single version of the truth
pillars of Enterprise Data Management (EDM)
cost of initial data acquisition
cost of data maintenance - security, privacy, etc.
two dimensions of cost of data
speed & amt of data are critical
use well-established framework
budget & HR allocation
how much risk are you willing to accept?
steps of implementation roadmap
1-identify business owners, review strategy, establish goals, build team, integrate into EDM, define costs
2-identify BD source, identify affected business processes, create technology and IT operating requirements, define desired business outcomes, leverage existing infrastructure, iterate with key business stakeholders
3-deploy business application, deploy new/modified IT operations practices, refine big data requirements for veracity and volatility, tune applications for best performance, perform after-action assessment
major stages of big data implementation