Boss Software Solutions // data science project briefs · 2026
LIVE PRODUCTION DATA · NOT A STATIC DATASET

A live business,
not a frozen dataset.

A real business system serving retail operations in real time — generating continuous, connected events across sales, customers, inventory, workforce and communications. You'll research real-world decision-making as it happens, not a benchmark assembled for a paper.

// what you'll get access to
real-time
live POS transaction streams
multi-company
500+ retail companies
multi-site
600+ active branches
connected
3,000+ terminals, APIs & more

Boss Software Solutions runs the ERP, CRM and POS systems behind retail chains across Israel — the live systems generating all of the data above.

// why this is different

Not a dataset — a living data environment

The character of the data is the real value: live, connected, sequential, and relational.

[ real-time ]

A live, real-time system

Hundreds of businesses operating right now. Models can be posed to run live — while a customer is shopping, mid-transaction — not only on yesterday's export.

[ connected ]

Connected across every domain

Sales, CRM, inventory, workforce, tasks and communications joined in one environment. You can model a whole business, not a single isolated table.

[ sequences ]

Continuous event streams

The platform captures ongoing events, not isolated snapshots: purchases, inventory movements, employee actions, customer interactions and operational workflows.

[ graph ]

It's a graph, not just rows

Customers, products, categories, branches, employees, promotions and suppliers are richly interlinked — a real entity graph waiting to be explored.

Unlike most academic projects, you'll have access to a live production environment generating continuous operational events across sales, customers, inventory, workforce and business processes — enabling research on real-world decision-making, not static benchmark datasets.

// applied directions

Applied project directions

Five solid starting points — with plenty of flexibility to define the scope together.

01
Predictive Analytics & Customer Behavior

Purchase & churn prediction

Predict purchase potential for products or categories, predict churn, and identify behavioral patterns over time — from long transaction history, profiles, loyalty clubs and promotions.

Churn PredictionCustomer ProfilingBehavioral Patterns
02
Supply Chain & Demand Forecasting

Demand & supply-chain forecasting

Hierarchical demand forecasting at the product × branch level from historical sales sequences. Applicable to inventory, ordering, supply and branch operations.

Time SeriesDeep ForecastingInventory Opt.
03
Anomaly Detection

Multi-dimensional anomaly detection

Detect data-entry errors, operational faults, abnormal usage and suspected fraud — both in the real-time transaction stream and in customer, employee or branch behavior over time.

Real-time StreamsFraud DetectionOutlier Analysis
04
Sequential Recommendation / Next Basket

Next-basket prediction

Predict the next item or basket from prior purchase sequences. Meaningful value in recommendations, loyalty clubs and targeted campaigns.

Sequence ModelsRecommendersPersonalization
05
Price Elasticity & Causal Effects

Price & promotion elasticity

Estimate how demand responds to price and promotion changes across products and categories — from real pricing, discount and sales history. A step toward causal, decision-oriented modeling.

ElasticityCausal InferencePricing
+

The breadth also enables combined projects across sources — demand forecasting fusing sales, inventory and promotions; anomaly detection fusing transactions, employees and logs; customer-behavior models fusing purchases, campaigns, service and loyalty. Deep Learning over a whole business system, not a single source.

?

Because the system is live, several of these can be posed as real-time inference — "can the model run while a customer is shopping?" Think next-basket prediction, fraud detection, dynamic recommendations and live anomaly alerts.

// advanced research directions

Want something that stands out?

Less conventional directions, made possible specifically by the connected, relational nature of this data.

Graph Neural Networks

The data is a graph

Customers, products, categories, branches, employees, promotions and suppliers form a rich relational network. Explore link prediction, fraud rings, substitution & affinity structures, and GNN-based recommendation.

GNNLink PredictionNode Embeddings
Multimodal Business Intelligence

Fuse the signals

Combine transactions, call metadata, support tickets, tasks and operational events to predict customer outcomes or surface operational issues that no single source reveals on its own.

MultimodalRepresentation LearningFusion
Agentic AI for Operations

Reasoning over the business

LLM-based reasoning over live operational data: natural-language access to ERP/CRM information and autonomous recommendations for business workflows. Ambitious and exploratory.

LLM AgentsNL-to-DataReasoning
// available data

Available data types

A broad operational environment from real retail businesses — not just POS data.

POS

Sales & transactions

Item-level transactions within a basket: item, quantity, price, discount, time, branch, register, cashier, promotions.

CATALOG

Products & pricing

Product hierarchies, categories, brands, prices and promotions — and how they change over time.

SUPPLY

Inventory & supply chain

Stock, supplier orders, receipts, inter-branch transfers, shortages and inventory movements.

CRM

Loyalty & CRM

Purchase history, customer segments, loyalty traits, campaigns, benefits and return patterns.

OPS

Internal operations

Tasks, workflows, statuses, assignment to employees and branches, handling and completion times.

HR

Attendance & shifts

Working hours, attendance, branch assignment and operational performance over time.

VOIP

Comms & service

Call data: times, durations, statuses, and association to branch, customer or process.

LOGS

System events

User actions, exceptional events, system processes and module usage data.

// what we provide

Data, compute & guidance

Data, shaped to your scope

Raw, curated or ML-ready datasets — whichever fits the project. We've handled customer consent and data access for analytics before, so real data won't be a blocker.

Compute & GPU time

Access to servers and dedicated GPU time, so you can train and run experiments at real scale — not only on a laptop.

{ }

Technical guidance

Relevant API access, documentation, and hands-on guidance from me and the dev team — to help you understand the data, frame the problem, and unblock issues together.

Want to pick a direction? Let's talk.

We can hold a short intro session covering the system, the available data and the options — to help you choose a research direction.

Lior Hermesh
Founder & CEO · Boss Software Solutions