<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JSIP</journal-id><journal-title-group><journal-title>Journal of Signal and Information Processing</journal-title></journal-title-group><issn pub-type="epub">2159-4465</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jsip.2015.62007</article-id><article-id pub-id-type="publisher-id">JSIP-55298</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Computer Science&amp;Communications</subject></subj-group></article-categories><title-group><article-title>
 
 
  Non-Intrusive Context Aware Transactional Framework to Derive Business Insights on Big Data
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>iva</surname><given-names>Chidambaram</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>P.</surname><given-names>E. Rubini</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>V.</surname><given-names>Sellam</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff2"><addr-line>Department of Computer Science Engineering, SRM University, Chennai, India</addr-line></aff><aff id="aff1"><addr-line>Department of Computer Science Engineering, Sri Muthukumaran Institute of Technology, Chennai, India</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>sivachidambaram87@gmail.com(IC)</email>;<email>iniya29@gmail.com(PER)</email>;<email>sellamveera@gmail.com(VS)</email>;</corresp></author-notes><pub-date pub-type="epub"><day>27</day><month>03</month><year>2015</year></pub-date><volume>06</volume><issue>02</issue><fpage>73</fpage><lpage>78</lpage><history><date date-type="received"><day>22</day>	<month>February</month>	<year>2015</year></date><date date-type="rev-recd"><day>accepted</day>	<month>1</month>	<year>April</year>	</date><date date-type="accepted"><day>2</day>	<month>April</month>	<year>2015</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  To convert invisible, unstructured and time-sensitive machine data into information for decision making is a challenge. Tools available today handle only structured data. All the transaction data are getting captured without understanding its future relevance and usage. It leads to other big data analytics related issue in storing, archiving, processing, not bringing in relevant business insights to the business user. In this paper, we are proposing a context aware pattern methodology to filter relevant transaction data based on the preference of business.
 
</p></abstract><kwd-group><kwd>Context Aware</kwd><kwd> Pattern Recognizer</kwd><kwd> Big Data</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>There are varieties of diversified portfolio of applications getting deployed in the Enterprise Infrastructure space, and each application has a different trend of arrival patterns which generates machine data that need to be captured and processed to gain business insights. The problem starts from collection and filtering and processing of the data becomes difficult due to the rate in which the data getting generated is huge [<xref ref-type="bibr" rid="scirp.55298-ref1">1</xref>] . This requires an efficient way to interpret and bring relevance to the particular context the business deals with. The preferred way to look at this issue is to bring in relevance when the data are getting generated real time so that only the relevance and needed machine data are getting captured by leaving the unwanted machine data.</p><p>The purpose of this Context Aware Transactional framework is to categorize the patterns and filter the machine data based on the relevance of each transaction getting performed. Based on the filtered data, the enterprise can concentrate on how to effectively bring out the business insights within the application by not spending too much on the cost aspects with respect to the data storage and processing [<xref ref-type="bibr" rid="scirp.55298-ref2">2</xref>] .</p><p>The objective of this paper is to propose a methodological approach to implement a non-intrusive component which can be plugged into the existing enterprise infrastructure layer to bring out all the insights business wants by capturing only the relevant business oriented machine data [<xref ref-type="bibr" rid="scirp.55298-ref3">3</xref>] . This paper is organized as follows. Section 1 gives an overview of context aware filtering. Section 2 gives the proposed architecture and the multiple phases involved. In Section 3 approaches and the API for context aware filtering are described [<xref ref-type="bibr" rid="scirp.55298-ref4">4</xref>] . Section 4 and Section 5 consist of experimental analysis and the amount of data saved during the data collection gain with respect to storing, archiving, and processing in the context of the component proposed [<xref ref-type="bibr" rid="scirp.55298-ref5">5</xref>] .</p></sec><sec id="s2"><title>2. Non-Intrusive Context Aware Transactional Framework</title><p>Context aware filtering is the process of recognizing the machine data based on the pattern. The patterns are application specific based on specifics like business rules, database access related, and external interfaces. The methodology is built in such a way that this component can be deployed as non-intrusive into any of the enterprise layer to generate data insights. The high level logical steps involved in context aware filtering are shown in <xref ref-type="fig" rid="fig1">Figure 1</xref>.</p><sec id="s2_1"><title>2.1. Pattern Builder</title><p>In this phase preserving of existing business and technical knowledge are captured will be utilized and key characteristics of the existing application are captured and stored in the master Meta data in the repository.</p></sec><sec id="s2_2"><title>2.2. Pattern Recognizer</title><p>In this phase pattern matching will be applied on the machine data, and the recognized data are retrieved and stored in the desired repositories. The generation of machine data is in multiple phases, so the pattern recognizer will be a logical independent component which can be made as non-intrusive deployment whenever any transaction happens.</p><fig id="fig1"  position="float"><label><xref ref-type="fig" rid="fig1">Figure 1</xref></label><caption><title> High level logical steps involved in context aware transaction framework</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/3-3400377x5.png"/></fig></sec><sec id="s2_3"><title>2.3. Pattern Filter</title><p>In this phase the pattern filter gets applied on to the pattern recognized machine data which properly filters the relevant and store it in the database for further processing.</p></sec><sec id="s2_4"><title>2.4. Pattern Extractor &amp; Visualizer</title><p>In this phase the pattern extractor and visualizer helps the enterprise to devise the strategy based on the business rule to extract data.</p></sec></sec><sec id="s3"><title>3. Proposed Architecture Frameworks</title><p>The high level proposed architecture is explained in <xref ref-type="fig" rid="fig2">Figure 2</xref> and the components involved in creating the Context aware filtering are explained in the following sections.</p><sec id="s3_1"><title>3.1. Channel Listener</title><p>This component will act as a listener component to the channels. The channel sends the request based on the request the listener component intelligently forms the triggering point for the Pattern builder to trigger its operations. It acts as a signal sender for the next component to act upon.</p></sec><sec id="s3_2"><title>3.2. Pattern Composer</title><p>This component will also act as intelligent interpreter and filters out the rules present across applications.</p></sec><sec id="s3_3"><title>3.3. Pattern Builder</title><p>This component retrieves the composed pattern from pattern composer and builds a searchable pattern format</p><fig id="fig2"  position="float"><label><xref ref-type="fig" rid="fig2">Figure 2</xref></label><caption><title> Context aware filtering high level architecture</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/3-3400377x6.png"/></fig><p>which can be directly applied on to the enterprise contextual data getting captured by the Channel Listener component. This component also deals with the intelligent interpretation of the contextual data from the Enterprise with multiple dimensions and variety. The smartness is built into the component itself and different scoring algorithms based implementation is leveraged to achieve the same.</p></sec><sec id="s3_4"><title>3.4. Pattern Filter</title><p>This component applies the filter rules and it has to interact much with the infrastructure component. The filtered data after the appliance of rules will be streamed to Pattern processor component</p></sec><sec id="s3_5"><title>3.5. Pattern Processor</title><p>This component retrieves the filtered contextual data and parses efficiently to aggregate and assemble the data as per the requirements</p></sec><sec id="s3_6"><title>3.6. Infrastructure Component</title><p>This component provides Authorization, Authentication, Logging, Security etc. and it’s visible to all the other components in this framework. This leverages most of the open source libraries for its operation.</p></sec><sec id="s3_7"><title>3.7. Transaction Log Parser</title><p>A portion of transaction log file used for the experimentation has been shown in <xref ref-type="fig" rid="fig3">Figure 3</xref>.</p></sec><sec id="s3_8"><title>3.8. API Details</title><p>The API and the corresponding functions are explained in <xref ref-type="table" rid="table1">Table 1</xref>. These APIs are used in the process of data filtering.</p></sec><sec id="s3_9"><title>3.9. High Level Logical Details</title><p>Properties details and context aware details to filter the relevant data to connect to twitter and also the extraction based on filtering option given in context Data Pattern parameter is defined in <xref ref-type="fig" rid="fig4">Figure 4</xref>.</p></sec></sec><sec id="s4"><title>4. Report Analysis</title><p>The report analysis for visualization of contextual relevant data and also the percentage of savings before and after context aware filtering is shown in <xref ref-type="fig" rid="fig5">Figure 5</xref> and <xref ref-type="fig" rid="fig6">Figure 6</xref> respectively.</p><fig id="fig3"  position="float"><label><xref ref-type="fig" rid="fig3">Figure 3</xref></label><caption><title> Sample transaction log file</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/3-3400377x7.png"/></fig><fig id="fig4"  position="float"><label><xref ref-type="fig" rid="fig4">Figure 4</xref></label><caption><title> Context data pattern parameter</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/3-3400377x8.png"/></fig><fig id="fig5"  position="float"><label><xref ref-type="fig" rid="fig5">Figure 5</xref></label><caption><title> Visualization of contextual relevant data</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/3-3400377x9.png"/></fig><fig id="fig6"  position="float"><label><xref ref-type="fig" rid="fig6">Figure 6</xref></label><caption><title> Percentage of savings</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/3-3400377x10.png"/></fig><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> API descriptions</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Input: Transaction log file</th></tr></thead><tr><td align="center" valign="middle" >a) Listen to TCP/IP channels; b) New pattern repository builder pb; c) pb. build pattern (); d) pb. persist patterns (); e) Data: machine data; f) Getconnection (); g) List patterns list: = readall patterns (); h) New pattern composer pc; i) List redundant patterns = pc. get redundent patterns (patterns list); j) New pattern filter pf; k) List filtered pattern list = pf. filter patterns (patterns list, redundant patterns); l) Data = pf. apply filter (data, filtered pattern list); m) Data=aggregate &amp; assemble (data); n) Persist data (data); o) Release connection (); p) New pattern extractor pe; q) pe. show log report ();</td></tr><tr><td align="center" valign="middle" >Output: Contextual relevant data consolidated for gaining business insights</td></tr></tbody></table></table-wrap></sec><sec id="s5"><title>5. Conclusion</title><p>Most of the analytics application captures all the data for future analytics, but the key aspect is to bring in the context aware filtering on the data getting generated from multiple sources of applications. This eases out the analytics complexity on the enterprise and brings in better prospects towards visualization of data. The complexity and cost factor involved in data management like storing, archiving, backup, recovery, etc. can be reduced by this framework.</p></sec></body><back><ref-list><title>References</title><ref id="scirp.55298-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Duggan, J. and Stonebraker, M. (2014) Incremental Elasticity for Array Databases. ACM SIGMOD-/PODS (SIGMOD 2014).</mixed-citation></ref><ref id="scirp.55298-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Kalinin, A., Cetintemel, U. and Zdonik, S. (2014) Interactive Data Exploration Using Semantic Windows ACM. SIGMOD/PODS (SIGMOD 2014).</mixed-citation></ref><ref id="scirp.55298-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Vojnovic, M., Xu, F. and Zhou, J.R. (2012) Sampling Based Range Partition Methods for Big Data Analytics. No. MSR-TR-2012-18.</mixed-citation></ref><ref id="scirp.55298-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Sundaram, N., Turmukhametova, A., Satish, N., Mostak, T., Indyk, P., Madden, S. and Dubey, P. (2014) Streaming Similarity Search over One Billion Tweets Using Parallel Locality Sensitive Hashing. Annual Conference on Very Large Data Bases 2014 (VLDB 2014).</mixed-citation></ref><ref id="scirp.55298-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Jun, S.-W., Liu, M. and Kermin Fleming, A. (2014) Scalable Multi-Access Flash Store for Big Data Analytics. 22nd ACM/SIGDA International Symposium on Field-Programmable Gate Arrays.</mixed-citation></ref></ref-list></back></article>