Professional Documents
Culture Documents
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 2
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 3
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 4
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 5
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 6
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 7
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 8
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 9
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 10
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 11
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 12
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 13
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 14
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 15
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 16
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 17
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 18
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 19
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 20
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 21
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 22
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 23
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 24
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 25
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 26
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 27
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 28
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 29
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 30
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 31
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 32
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 33
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 34
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 35
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 36
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 37
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 38
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 39
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 40
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 41
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 42
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
____________________________________
Big Data Architecture Lab (Copyright Arcitura Education Inc. www.arcitura.com) v2.1 43
Table of Contents
SFI is a large internet service provider (ISP) and a website hosting company. It provides
internet services, including broadband and TV, to around 7.5 million customers, 5 million
of which are residential customers and 2.5 million of which are business customers. SFI
hosts a large number of websites and provides 24/7 support to its customers via
telephone, email and online chat.
A team of consultants from the Big Data consultation company holds a meeting with
SFIs management and IT staff in order to prioritize the goals that need to be addressed.
After much deliberation, SLA fulfillment is given the top priority, for the management
believes that achieving SLA compliance will serve as a means of regaining customer
confidence and will ultimately help towards customer retention.
The consultants start looking into the reasons for non-conformance with the published
SLA. Compliance with SLA starts to slip when any of the offered services
(broadband/TV/website/email) becomes unavailable for more than the agreed downtime
or when, although the service is available, the data transfer speed becomes too slow,
resulting in severely degraded service. Normally, the main reason of total or partial
unavailability is a hardware failure, such as a failed router or a web server. The current
procedure of rectifying service-related issues follows a reactive approach, where an
issue is only fixed once it becomes known either when a customer reports it as an
incident or through the operational dashboards. Once it is known that there is a service
disruption, the next step is to identify the culprit hardware through manual inspection of
various log files. At times, the identification of the related log file itself takes a long time.
All this time taken to find the actual cause of the issue makes SLA compliance harder to
achieve.
The consultants propose a proactive strategy for rectifying total or partial service
unavailability issues by developing a Big Data analytics solution that can continuously
analyze log files to find error conditions. They are planning to develop a Big Data
pipeline that enables SFI to automatically collect log files from a variety of data sources,
processes these log files within a short time period and generate insights. The pipeline
would achieve this via a simple computation of statistics or through the application of
machine learning algorithms, and it would help the IT team quickly find the cause of an
issue.
Each of the following three exercises requires you to identify one or more design
patterns that help the development of a Big Data pipeline.
The Streaming Access Storage compound pattern, provided for reference purposes.
A. Identify the design pattern(s) that need(s) to be applied to fulfill these requirements
and describe the application. (Note that any pattern referenced must be a core member
pattern of the Big Data Pipeline compound pattern.)
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
The Big Data Processing Environment compound pattern, provided for reference purposes.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
B. Illustrate the Big Data analytics logical architecture resulting from the application of
the previously identified pattern(s) by identifying the mechanisms required by the
pattern(s), as well as any other mechanism(s) not directly covered by the pattern(s), and
explain their relevance.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
The next problem that the management wants the consultants to tackle is the increasing
customer dissatisfaction due to longer issue-resolution time. The objective is to decrease
the time it takes to resolve customer-reported service issues, which will alleviate
customer dissatisfaction and increase SFIs rating when compared with other ISPs.
A customer can report an incident by calling the customer care team, sending an email,
filling an online form on SFIs website or through online chat with the customer care
team agent. Once an incident is registered, first-line technical support presents the
customer with a set of standard troubleshooting solutions that may or may not be
relevant to the specific nature of the issue that the customer is currently facing. If
unresolved, the incident is forwarded to the second-line technical support, where the
team uses a combination of previous experience and going through old support incidents
to find a similar incident in the past. If the incident still remains unresolved, in the case of
a broadband/TV service issue, an engineer is sent to the customers location. This adds
to SFIs operational costs, whereas in the case of website/email issue, the incident is
forwarded to third-line support.
The consulting team proposes an analytics-driven solution to reduce the time it takes to
successfully resolve customer service issues. The team plans to empower first-line
support by providing first-line support team members with incident-specific
troubleshooting information. The idea is that by providing case-specific troubleshooting
information, the time it takes to find the right solution can be greatly reduced. This will
further reduce support-related costs by saving money on unnecessary callouts to
customers premises.
The incident management system keeps a record of all issues raised by customers. This
system uses a relational database for storing incident related data. Although the current
system has been in use for the past 5 years, due to the large number of incidents that
get generated and the limited storage space of the relational database, only incidents
going back as far as 2 years are available. Older incidents are periodically archived by
exporting the data as XML files that currently amount to around 1.5 petabytes in size.
The proposed solution will employ text analytics and semantic search techniques to find
similar incidents reported within the last 5 years. The matched incidents resolutions can
then be recommended to the first-line support team members in order to achieve a
targeted and timely resolution of the current incident.
Furthermore, it is also planned to find the total number of similar incidents reported by
customers in the past 24 hours within the same area. This will help support team
members determine if it is an issue that is local to a particular customer or a more
general issue. The solution will be based on the frequent querying of current incidents to
find out the total number of incidents that share the same incident type.
A. Identify the design pattern(s) that need(s) to be applied to fulfill these requirements
and describe the application.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
B. Illustrate the Big Data analytics logical architecture resulting from the application of
the previously identified pattern(s) by identifying the required mechanisms, and explain
how the mechanisms enable the application of the identified pattern(s).
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
A. Identify the design pattern(s) that need(s) to be applied to fulfill these requirements
and describe the application.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
B. Illustrate the Big Data analytics logical architecture resulting from the application of
the previously identified pattern(s) by identifying the required mechanisms and explain
how the mechanisms enable the application of each of the identified patterns.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
SFI currently hosts a large number of a variety of websites. Some of these websites run
ecommerce sites, some act as a frontend for a variety of browser-based applications,
some host blogs and only a handful display static informational content that gets
updated infrequently. The websites that only display static content use a file system as
backend storage for the website content. However, all the other hosted websites use
relational databases for storing a variety of data. Some of these websites require
relational storage with ACID support for enabling transactional operations, such as order
processing and payment processing operations. However, not all operations require
relational storage, such as the storage of non-mutable data or update of data without
strict consistency requirements (data can remain stale for some period of time). Also,
most of the websites store structured data and unstructured data, such as images and
videos. Semi-structured data, such as blog entries and XML data, is also stored within
the relational databases.
In the recent past, ecommerce and social media-driven websites have been generating
very large amounts of data. To manage the increase in demand for data storage, SFI
has had to add additional database servers and buy licenses, resulting in a steep
increase in its IT spending. While SFI charges its customers for the amount of data
stored, the charge is heavily subsidized by SFI in order to remain competitive. Although
SFI can cope with the current data storage demand, the IT team envisages that the
added capacity will soon hit its limit, requiring a further increase in capacity. On the other
hand, some customers with technical understanding have also started demanding
alternative data storage solutions that are more scalable and provide better
performance.
A. Identify the design pattern(s) that need(s) to be applied to fulfill these requirements
and describe the application.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
B. Illustrate the Big Data analytics logical architecture resulting from the application of
the previously identified pattern(s) by identifying the required mechanisms, and explain
how the mechanisms enable the application of the identified pattern(s).
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
LOC is a large oil company that deals with the exploration, extraction, storage and
refining of oil. LOC has been in operation for nearly 4 decades and consists of over
5,000 wells, both onshore and offshore, that jointly produce one-fourth of the entire
countrys daily oil production.
Oil is extracted from reservoirs by drilling wells. There can be multiple oil wells in a
single oil field. The extracted oil is then transferred to different refineries using a network
of pipelines, trucks and trains. The refined petroleum (gasoline and diesel) is then
delivered to various gas stations across the country.
The first issue that you have been asked to look into is how to enhance oil exploration so
that only sites that can provide the best ROI are chosen. In order to design the required
Big Data solution environment, you perform some preliminary analysis in terms of how
the process works and the type of data involved.
Oil exploration involves analyzing large amounts of rock formation data, seismic data
and geospatial data. Historical reservoir data and well production data within the same
area or between similar areas is further analyzed to determine the quality and quantity of
the potential oil reserves. Once an oil reservoir is found, the required land is leased via
bidding. The amount of the bid and the duration of the lease depend upon the predicted
amount and the grade of the oil reserves and how much oil can be extracted each day.
Determination of these factors takes a considerable amount of time because the
engineers have to analyze, correlate and develop models from terabytes of data from
different information systems, for each system specializes in handling only a specific
type of dataset.
The engineers believe that the process of finding the right oil reserve can be greatly
expedited if all required data that needs to be analyzed is available at one place. They
further believe that access to increased amount of data will help them improve the
accuracy of their predictions. However, the current IT infrastructure does not provide a
means for storing and analyzing large volumes of non-relational data.
Based on your findings, you plan to design a repository of semi-structured and
unstructured data through the implementation of an unstructured data store.
Each of the next two exercises requires you to identify one or more design patterns that
help towards the development of a Big Data unstructured data store.
A. Identify the design pattern(s) that need(s) to be applied to fulfill these requirements
and describe the application. (Note that any pattern referenced can be a core or an
optional member pattern of the Big Data Unstructured Data Store compound pattern.)
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
A. Identify the design pattern(s) that need(s) to be applied to fulfill these requirements
and describe the application. (Note that additional pattern(s) not part of the Big Data
Unstructured Data Store compound pattern may also be required.)
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
Next, you are asked to help LOC in optimizing its well operations in order to obtain the
maximum possible yield from each well. To design a solution, you investigate how oil
wells are currently monitored.
Subsurface sensors and sensors installed on the well-head take continuous
measurements in the form of well logs. Gigabytes of data are continuously generated by
these sensors each day. However, in the absence of a storage infrastructure that can
store gigabytes of data generated by each sensor each day, readings are currently taken
manually by the engineers once a day. These readings are entered into a spreadsheet.
The spreadsheets are sent to the head office via FTP on a weekly basis. One of the IT
team members then imports all 2500 spreadsheets received for each oil well into the
ERP system via a script. Following this, queries are run against the imported data to
generate various statistics, which are then made available to the engineers and business
managers via different dashboards. The aforementioned process, from receiving the
spreadsheets to generating the statistics, takes around 4 to 5 days.
At present, this weekly import of well data coupled with the time it takes to import them
into the ERP means that the engineers and the managers do not have access to the
latest well production data. Any decisions taken to adjust production parameters, related
to well operations, are based on stale data. Furthermore, due to storage space
limitations, the ERP dashboard can show production statistics going back to 6 months
only. The type of the statistics displayed in the dashboards is predetermined. If the
engineers and the managers need a new set of statistics, although they understand
SQL, they need to ask the IT team, for they do not have direct access to the well log
data. The IT team can take up to 15 days to implement the requested changes.
Having completed your investigation, you believe that the lack of up-to-date information
about the operation of wells is inhibiting LOC from making the right decisions at the right
time for optimizing well production. To resolve this issue, you plan to develop a Big Data
solution that is capable of ingesting well logs on a daily basis from across all wells and
that can process them to generate the required statistics overnight so that the latest
statistics are available to the engineers and managers for analysis the very next day.
Furthermore, you intend to make the raw well log data available to the engineers and the
managers so that they can query the data and generate new statistics as needed. Apart
from this, to enhance tactical decision-making, you aim to provide access to the previous
5 years of well logs. By looking at long-term data, more confidence can be instilled in the
decision-making process.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
A. Identify the design pattern(s) that need(s) to be applied to fulfill these requirements
and describe the application.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
LOCs management is really satisfied with the progress you have been making so far.
They are already reaping the benefits of Big Data adoption in the form of increased
profits via timely analysis of a variety of voluminous data, which LOC was unable to
perform in the past. Building on your success, you start looking into LOCs final set of
business objects: reducing the cost of maintaining equipment and ensuring full
compliance with the newly introduced industry regulations.
Equipment is currently serviced/replaced based on predetermined intervals or when the
engineers perform a visual inspection, the timing of which can vary between engineers
and is normally dependent upon the experience of the engineer. The service, repair and
inspection records are stored in the asset management system. An inventory of parts is
kept in multiple warehouses across the country. Parts are ordered from different
suppliers spread across the globe and can take up to 7 days to arrive. However, parts
often fail unexpectedly, and when that happens, drilling, oil production from wells or
refinery operations grind to a halt, requiring emergency part replacement. This can
further create logistical problems, especially if the breakdown occurs at a remote site.
With regards to the activities undertaken for assuring regulatory compliance, all types of
operations, especially well drilling, need to demonstrate adherence to strict safety
guidelines at all times. This is a real concern for LOC because operational safety is only
maintained via infrequent physical inspections. One of the reasons behind the infrequent
inspections is the remote nature of the sites. A simple incident left unchecked can result
in a catastrophic accident, such as a blowout, posing grave danger to human lives as
well as the surrounding environment.
After a detailed consultation with the engineers and managers, you come to the
conclusion that the best way to reduce LOCs maintenance and repair costs is to
develop an intelligent asset management solution based on predictive analytics that can
forecast if a part is about to fail. Advance knowledge of service requirements will help
the engineers schedule a service in good time before the part fails, thus reducing well or
refinery downtime. A proactively planned service will further help ensure that a healthy
inventory level of the required parts exists.
On the other hand, for achieving full regulatory compliance, you propose the continuous
monitoring of all oil wells and pipelines. Such a monitoring system will provide advance
warning of any imminent issues. Furthermore, detailed data regarding all areas of
operations will be kept that will become the basis for fulfilling the newly imposed
regulatory requirement of detailed operational reporting.
A. Identify the design pattern(s) that need(s) to be applied to fulfill these requirements
and describe the application.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
A. Identify the design pattern(s) that need(s) to be applied to fulfill these requirements
and describe the application.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
TXC is the local government for a large metropolis collecting taxes and providing a
range of services to a population of over 15 million. Services include fire, ambulance,
police, libraries, waste collection, recycling, social care, streets and parks maintenance
and schools. Apart from getting subsidy from the federal government, TXC finances its
services through the collection of taxes, rates and fines. It is further responsible for
enforcing building regulations, urban development and maintaining electoral register.
TXCs priority is to maximize its revenue collection, as past 5 years statistics reveal that,
on average, it has only been able to collect 83% of the targeted tax and rates. Similarly,
the recovery of fines, such as the collection of parking fines, has not been 100%. These
discrepancies in revenue collection mean a smaller budget for providing services.
Statistics further reveal that fraud within childcare and adult social care is responsible for
million-dollar losses. One other major area of improvement that TXC envisages for cost
savings is the mitigation of waste that not only occurs within service delivery but also
within the current business practices of TXC. For example, different departments
procure the same supplies from different suppliers, which, if consolidated, can result in
massive savings. Last but not the least, a study conducted by the auditors has revealed
that in some cases, suppliers were paid more than once, further devouring TXCs
already shrinking budget.
Preliminary analysis shows that the main reason behind the aforementioned issues is
the lack of cross-functional understanding of TXCs operations and timely reporting. You
believe that fraud identification and waste elimination can be achieved through a data-
driven strategy that collates data from siloed applications in order to provide full and
timely visibility across multiple business functions.
How much tax would be charged on a building, domestic and non-domestic, depends on
the information provided by the occupant. To fall into a lower band of tax, the payee
provides false information, such as false information about property or annual revenue
generated by a business, which is lower than the payees actual payment. However,
TXC can only perform a limited number of physical inspections to verify the facts. The
same principle applies to the payment of benefits for social care.
You propose a Big Data solution that will correlate TXCs tax and benefit records both
against internal and external datasets in order to detect fraud. Furthermore, the solution
will assemble data from different departments to get a unified view of TXCs operations
in order to identify opportunities for reducing waste.
A. Identify the design pattern(s) that need(s) to be applied to fulfill these requirements
and describe the application.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
Your next task is to help TXC in the strategic deployment of its limited resources and to
enable public access to a variety of datasets.
A meeting is held to decide on the best course of action for deploying resources. It is
suggested that, as the services are provided to the general public, it would be ideal to
incorporate the publics opinion on which services should be given priority. Some
mangers suggest conducting a survey based on a sample of individuals. However,
others are of the opinion that doing so would not only take a long time but may also be
biased, as it will be based on the opinion of a handful of people. You step in and propose
that social media data can be analyzed to find out what the public actually values more.
Based on public opinion, budget and other resources can be allocated accordingly.
The implementation of the open data access policy requires TXC to collate data from
across different departments and make it available to the general public. However,
before making the data public, certain information, such as personally identifiable data,
will either need to be anonymized or completely removed. Additionally, members of the
public may also request data that requires gathering specific data elements from multiple
datasets based on individual request criteria.
A. Identify the design pattern(s) that need(s) to be applied to fulfill these requirements
and describe the application.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
(See the Module 10 and 11: Big Data Design Patterns supplements to find out how
these mechanisms enable the application of the previously identified patterns.)
(See the Module 10 and 11 Big Data Design Patterns supplements to find out how these
mechanisms enable the application of the previously identified patterns.)
(See the Module 10 and 11 Big Data Design Patterns supplements to find out how these
mechanisms enable the application of the previously identified patterns and the
fulfillment of any requirement not directly covered by these patterns.)
(See the Module 10 and 11 Big Data Design Patterns supplements to find out how these
mechanisms enable the application of the previously identified patterns.)
(See the Module 10 and 11 Big Data Design Patterns supplements to find out how these
mechanisms enable the application of the previously identified patterns.)
(See the Module 10 and 11 Big Data Design Patterns supplements to find out how these
mechanisms enable the application of the previously identified patterns and the
fulfillment of any requirement not directly covered by these patterns.)
(See the Module 10 and 11 Big Data Design Patterns supplements to find out how these
mechanisms enable the application of the previously identified patterns.)
(See the Module 10 and 11 Big Data Design Patterns supplements for pattern
descriptions.)
(See the Module 10 and 11 Big Data Design Patterns supplements to find out how these
mechanisms enable the application of the previously identified patterns.)
(See the Module 10 and 11 Big Data Design Patterns supplements to find out how these
mechanisms enable the application of the previously identified patterns.)
(See the Module 10 and 11 Big Data Design Patterns supplements to find out how these
mechanisms enable the application of the previously identified patterns.)
(See the Module 10 and 11 Big Data Design Patterns supplements to find out how these
mechanisms enable the application of the previously identified patterns.)
(See the Module 10 and 11 Big Data Design Patterns supplements to find out how these
mechanisms enable the application of the previously identified patterns.)