Yes, AI/ML are buzzwords these days—every startup seems to have an .ai domain. But without solving real, measurable problems, AI is just another trend.
We believe AI is only interesting when it delivers tangible results: cost savings, added value, and sticky user engagement. See how we’ve been driving these outcomes since 2019.
Document AI
Deep Learning
Multimodal
LauraMac is a leader in mortgage technologies. They provide SaaS solutions that facilitate the origination, acquisition, and review of loans, offering valuable insights and control over these processes.
Their platform learns, automates, and streamlines loan diligence processes. This enables loan originators, buyers, and sellers to extract, validate, review, and share data for more reliable decisions.
Founded in 2019 and based in Washington, LauraMac has quickly established itself as a leader in mortgage technology. In its first nine months, the platform was adopted by over a dozen clients, facilitating due diligence for over 35,000 loans, with more than 1,000 active users across 100+ sellers, investors, and buyers.
Their commitment to innovation and efficiency has earned them industry recognition, including being named a Tech100 winner by HousingWire in 2025.
By addressing challenges such as outdated technology, lack of transparency, and inefficient data normalization, LauraMac’s solutions provide enhanced transparency, seamless data normalization, secure information handling, and robust partner integrations, transforming the mortgage due diligence process.
LauraMac had a vast collection of loan tapes for due diligence. These loan tapes, often sourced from archives, contained scanned documents dating as far back as the early 2000s. Since U.S. loans typically have 30-year terms, a single loan file could accumulate a significant number of notes, forms, and supporting documents over time.
These loan tapes included a wide variety of document types, such as 1003 and 1008 forms, appraisal reports, income verification documents, and more. Each file could range from 400 to 3,000 pages, making manual processing a significant challenge. Traditionally, a dedicated person or team would handle these PDFs manually—splitting the loan tapes into separate document types, labeling them, and extracting key information for the due diligence and review process.
This manual process was time-consuming, taking anywhere from 30 minutes to an hour per loan. The inefficiencies became even more problematic during periods of low interest rates when loan volume surged, requiring faster and more accurate processing to keep up with demand. LauraMac needed an AI-driven solution to automate this tedious workflow, reduce processing time, and improve accuracy.
LauraMac needed an automated solution to streamline the labor-intensive loan tape processing workflow. Softmax Data designed and implemented a multi-stage AI-powered system to handle the process efficiently, accurately, and in compliance with financial regulations.
The first step was designing a secure data pipeline that seamlessly moved these large document files to a compliant cloud environment while adhering to strict financial security standards.
Next, we developed a computer vision-based solution to enhance document quality. This system automatically denoised, rotated, flipped, cropped, and adjusted the scanned images, ensuring that low-quality archived documents became clean and readable.
Once preprocessed, the pipeline converted PDFs into images at the optimal resolution for AI analysis. A multimodal deep learning model was then used to process both the text and image content, extracting key information from a wide range of complex mortgage documents.
To achieve the best accuracy, Softmax Data developed a model training pipeline that trained and evaluated over 30 models before finalizing the most effective one for automation. This model was designed to classify, segment, and extract critical data from various document types.
Additionally, the solution was built to be self-improving after deployment. By simply placing annotated documents in a designated folder, the system automatically retrained and fine-tuned itself, generating and deploying new AI models to the cloud on a nightly basis.
This end-to-end AI solution enabled LauraMac to dramatically reduce manual effort, improve processing speed, and enhance accuracy—allowing them to scale their due diligence process efficiently.
The impact of Softmax Data’s solution was transformational. The AI model achieved 95% precision in segmenting loan tapes and 88% accuracy in classifying individual documents. It also reached 92% accuracy in extracting key-value paired information, including text, numeric values, checkboxes, and radio buttons from various forms such as 1003 and 1008, across different issuers, formats, and versions.
This automation drastically reduced manual processing time. What previously took an average of 40 minutes per loan was now completed in less than 38 seconds—enabling LauraMac to process loans faster, improve accuracy, and scale operations efficiently.
Document AI
Financial Statement Analysis
RAG + Deep Learning
Rocket Mortgage Canada (RMC) is a digital-first mortgage broker helping Canadians access competitive home financing options. Unlike its U.S. parent company, which is a lender, RMC operates as a brokerage—serving as a bridge between borrowers and lenders. RMC is responsible for gathering, reviewing, and preparing the required documentation for lender-side loan officers to process applications efficiently and in full compliance with Canadian financial regulations.
To deliver a faster and more streamlined borrower experience, RMC sought to automate its internal document workflows—particularly around the manual and time-consuming process of reviewing and assembling complex applicant financial records.
To meet regulatory requirements and assist loan officers, RMC must analyze and package loan applications with extreme precision. A large part of this process involves reviewing applicants' financial statements—checking, savings, RRSPs, investments, and credit cards—from a wide range of issuers including major banks, credit unions, and fintechs like Wealthsimple.
Applicants frequently submit inconsistent documentation, including screenshots from phones, mobile apps, and scanned printouts. This inconsistency—along with differing layouts, formats, and poor scan quality—makes it extremely difficult to extract and analyze financial data accurately.
RMC's team had to manually trace income sources, verify asset levels, flag suspicious transactions, and generate summary and index pages linking each transaction to the correct document and page. This work typically took more than a day per application and slowed down the loan approval process significantly.
Softmax Data partnered with RMC to build a custom end-to-end AI pipeline that automates financial document analysis, compliance checks, and summary generation. The system begins by ingesting PDFs and classifying them by type, issuer, and account.
We then built custom models to extract transaction tables, standardize all formats into a unified structure, and normalize columns across a wide range of document layouts. This ensured every entry was tagged with consistent fields: Date, Description, Deposit, Withdrawal, and Balance.
A second layer of the system aggregates all transactions into a single financial timeline, allowing the model to map fund transfers between accounts, detect income consistency, and calculate asset summaries as of any given date.
To enable intelligent inquiry, we built a Retrieval-Augmented Generation (RAG) agent capable of answering natural language questions like “What is the total asset as of March 31st, 2025?” or “Has income from employer X been consistent over the last 6 months?”
Our solution also auto-generates a complete PDF package, including an index page linking transactions to their source documents, a summary sheet highlighting flagged items, and a neatly stacked bundle for the loan officer to process quickly and with confidence.
Finally, the system includes an ML deployment pipeline that automatically retrains the model on newly annotated data, ensuring it continually adapts to new formats and improves accuracy over time.
The impact was dramatic. What once took more than a day of manual work per application now takes under two minutes, allowing the team at Rocket Mortgage Canada to scale operations without scaling headcount.
The RAG assistant empowered loan preparers and compliance staff to get precise answers instantly without combing through dozens of pages, while the document packaging pipeline delivered clean, lender-ready summaries and indexes with every file.
By deploying this end-to-end Document AI solution, RMC has increased throughput, reduced human error, improved compliance, and brought faster approvals to their borrowers—all powered by AI.
Data Cleaning
Entity Linkage
Contact Verification
Clio is a fast-growing company that creates legal software for attorneys and law firms. Their software manages every aspect of a legal practice—from billing and case management, to client intake and time tracking. Their software is so effective Clio has seen an 80% growth rate over two years and they raised $250 million in investment capital.
While they had the capital to fuel growth, Clio’s challenge was finding new prospects. Their sales team needed a bigger, more accurate pool of leads. But here is where they hit a wall.
The Clio sales team wanted to improve its Salesforce CRM. Their system contained lots of duplicates and outdated information—attorneys moved firms or were listed in incorrect positions. Often, the contact information was missing the key information to help them close the sale. They knew from experience that the more they knew about their prospects— their education, their practice areas, their role in the firm—the higher their rate of conversion. Every time they called the wrong person or had to track down updated information they wasted valuable time that should have been spent selling.
Buying sales leads wasn’t a simple endeavor. They were limited to channels maintaining compliance with strict privacy laws. And when they purchased lists of law firms they had trouble.
This is where we came in. Softmax Data worked closely with Clio’s revenue operations team to design, develop and implement powerful, scalable machine learning solutions. We did this in three phases:
First, we created a solution to generate new leads for Clio automatically. Powered by machine learning, the solution intelligently extracted lists of lawyers from public websites. These lists contained titles and corresponding contact information including emails, phone numbers, and addresses. Then the acquired and cleaned leads were directly injected into the Salesforce CRM and assigned to sales reps.
Second, we made a solution to accurately detect and merge all duplicates. For example, if someone was listed twice because they spelled their name differently (John Smith vs. J. Smith) the computer could detect and remove the duplicate records. But it did this without losing any data—the best information in each duplicates were kept so that the remaining contact had all the important information.
Third, to increase connect-through rate, our machine learning solution validated the individual contact information to make sure each one was up to date.
Softmax Data’s machine learning solutions handled millions of contacts. It cleaned Clio’s list, generated 4 million new leads, then cleaned and merged it all into a refined and verified list of 1.8 million.
Imagine doing this work manually—even if it just took 5 minutes to search a website, check and verify each lead—the millions of lines would take humans decades.
But in just two months Softmax Data gave Clio a massive list of leads that was verified, clean and free from duplicates.
And Softmax did all this without any downtime for the Clio sales team. We used multi-stage deployment and secure backups, the sales team at Clio received rejuvenated CRM without any pause. They kept working even as our machine learning solutions cleaned, merged and verified their CRM.
The result of Softmax Data’s custom solutions was profound. Clio witnessed an immediate impact on their CRM integrity, their sales leads, funnel velocity, and sales productivity. Within one week, they saw a 98% increase in the connect-through rate.
Within three months, Softmax Data’s solutions collected 417% more sales leads, seamlessly added them to Clio’s Salesforce, and assigned contacts to corresponding contact owners. Even more exciting, the team could now run the leads generation daily and continually stay ahead of the competition by capturing sales leads.
The clean-up not only introduced higher sales productivity and eliminated the constant frustration, but also gave Clio’s financial executives a highly accurate view of their sales funnel.
In only four months Clio saw a $6 million increase in revenue directly attributed to Softmax Data’s work.
Purchase Propensity Prediction
Recommendation Engine
In 2019 Absolute Results (AR) helped car dealers sell over 93,000 cars. They did this by shifting dealerships from a reactive culture to a proactive “appointment culture.” Instead of salespeople waiting for customers to walk into the showroom, AR gives dealerships training and technology to actively schedule appointments with clients.
This growing category of sales is called Appointment Driven Private Sale Events—and AR is the global leader. It’s all driven by data. AR uses creative messaging and targeted data to sends millions of personalized event invitations to potential car buyers.
While AR was the leader in Appointment Driven Sales the landscape of their industry was rapidly changing. To maintain their position they had to keep improving the accuracy of their personalized messages.
They specifically wanted to improve which vehicles they would recommend as upgrade options. They needed data to help them predict which car John Smith would want when it was time to upgrade. That kind of accurate recommendation would drastically increase the conversion rate.
But how could they accurately predict that information for thousands of potential clients? People don’t buy new cars everyday. That limited how much AR knew about the buyers and their preferences. Even the number of transactions for most of the vehicle models was quite limited. Traditional methods rely on having huge amount of data for each car model and customer to make accurate predictions for the customer’s next purchase; this made it impossible to use traditional recommendation algorithms. To make things even more complex, any solution would have to accommodate a wide range of data qualities, siloed data sources, and CRM systems that weren’t used to talking to each other.
The CTO of AR, Josh Heppner heard about the powerful potential of Softmax Data’s machine learning and data science solutions. Could we teach computers to amass thousands of data points and interpret it so AR could sell more cars?
Softmax worked closely with the IT team at AR and executive team to design, develop, and implement a highly scalable custom-built recommendation engine. The engine leveraged both deep learning models and statistical models to predict individual consumer’s buying habits and the exact upgrade vehicles they would want.
It looked like magic but the powerfully accurate predictions were all based on our powerful data processing solutions. We looked at consumers’ past purchase history, demographics, and geographic information as well as external market trend data.
If John Smith had owned a Toyota Camry, then a Honda Accord, had a college degree, made over $75,000 a year and lived in Toronto, we could predict with scary accuracy the car he would want to upgrade to next (it was a Lexus, by the way).
The algorithms were designed to be flexible. They allowed for dealerships to input their first-hand relationship with their client base. This customization around their sales process and expertise made the algorithms even more accurate.
The impact was immediate. Softmax Data’s machine learning solution increased revenue, conversation rate, sales funnel velocity and sales team performance. Within one month of the start of the project, AR’s recommendation conversion jumped by an eye-popping 320%.
And it just kept getting better. As SoftMax’s machine learning solution received more and better data, the predictions kept improving.
People often say that “data is the new gold.” And it’s true. But for a business to benefit from the “gold” of data you have to know how to mine it and refine it. Softmax Data did just that for AR. We used 100% of AR customer data to make the predictions. The result was that each dealership sent messages to clients that were more personal, more accurate, and more effective. And this meant that more cars sold. AR’s revenue increased by an estimated 27% as a direct result of Softmax Data’s work.