Download PDF
Streamlining Transaction Matching in Large Datasets
Technology Category
- Analytics & Modeling - Big Data Analytics
- Analytics & Modeling - Data-as-a-Service
Applicable Industries
- Equipment & Machinery
- Finance & Insurance
Applicable Functions
- Sales & Marketing
Use Cases
- Machine to Machine Payments
The Challenge
The challenge was to match credit and debit transactions in a single dataset made up of 24 million records. The dataset did not include any identifier that might match debit transactions (sales), and credit transactions (returns). Each debit transaction with a corresponding credit transaction had to be removed from the data, and isolated into a separate stream of records without that single identifier already mentioned missing. The legacy process was time-consuming and prone to errors, involving filtering for high count values, and finding their exact credit value. This process was performed manually, copying and pasting the debit row with sales and the credit row into a separate excel spreadsheet, and removing the “matched” rows from the larger dataset.
About The Customer
The customer in this case study is an accounting and financial industry professional who deals with large transactional data on a regular basis. The customer's role involves managing and analyzing transactional data, which includes information recorded from transactions such as purchases, returns, invoices, payments, credits, debits, trades, dividends, asset sales, contracts, interest, payroll, lending, reservations, signups, subscriptions, donations, and more. The customer was tasked with the challenge of matching credit and debit transactions in a single dataset made up of 24 million records, a task that was time-consuming and prone to errors due to the lack of a single identifier in the dataset.
The Solution
The solution involved unioning all of the 30 excel workbooks with transactional data into a single .yxdb file (Alteryx Database File). A RecordID field was added to number each record. The data was then split between sales (debits) and returns (credits) using a Filter tool isolating records with a sales value less than $0. The absolute value of the negative sales value of the credit transactions was taken and a Join tool was used to join the debit transactions with the absolute value of the credit transactions. To avoid duplicating data in instances where there were two debit transactions with the same sales value, but only one corresponding credit transaction, a Multi-Row Formula tool was used to add a RecordID field grouped by the sales value in both the debit transaction stream, and credit transaction stream. The Join tool was then used again, with the configuration changed to join on the sales value, but also on the amount of times that the specific sales value actually exists in the data.
Operational Impact
Quantitative Benefit
Related Case Studies.
Case Study
Smart Water Filtration Systems
Before working with Ayla Networks, Ozner was already using cloud connectivity to identify and solve water-filtration system malfunctions as well as to monitor filter cartridges for replacements.But, in June 2015, Ozner executives talked with Ayla about how the company might further improve its water systems with IoT technology. They liked what they heard from Ayla, but the executives needed to be sure that Ayla’s Agile IoT Platform provided the security and reliability Ozner required.
Case Study
IoT enabled Fleet Management with MindSphere
In view of growing competition, Gämmerler had a strong need to remain competitive via process optimization, reliability and gentle handling of printed products, even at highest press speeds. In addition, a digitalization initiative also included developing a key differentiation via data-driven services offers.
Case Study
Predictive Maintenance for Industrial Chillers
For global leaders in the industrial chiller manufacturing, reliability of the entire production process is of the utmost importance. Chillers are refrigeration systems that produce ice water to provide cooling for a process or industrial application. One of those leaders sought a way to respond to asset performance issues, even before they occur. The intelligence to guarantee maximum reliability of cooling devices is embedded (pre-alarming). A pre-alarming phase means that the cooling device still works, but symptoms may appear, telling manufacturers that a failure is likely to occur in the near future. Chillers who are not internet connected at that moment, provide little insight in this pre-alarming phase.
Case Study
Premium Appliance Producer Innovates with Internet of Everything
Sub-Zero faced the largest product launch in the company’s history:It wanted to launch 60 new products as scheduled while simultaneously opening a new “greenfield” production facility, yet still adhering to stringent quality requirements and manage issues from new supply-chain partners. A the same time, it wanted to increase staff productivity time and collaboration while reducing travel and costs.
Case Study
Integration of PLC with IoT for Bosch Rexroth
The application arises from the need to monitor and anticipate the problems of one or more machines managed by a PLC. These problems, often resulting from the accumulation over time of small discrepancies, require, when they occur, ex post technical operations maintenance.
Case Study
Robot Saves Money and Time for US Custom Molding Company
Injection Technology (Itech) is a custom molder for a variety of clients that require precision plastic parts for such products as electric meter covers, dental appliance cases and spools. With 95 employees operating 23 molding machines in a 30,000 square foot plant, Itech wanted to reduce man hours and increase efficiency.