Domino Data Lab provides a collection of open-source solutions called Domino Reference Projects. These projects are freely available and were built with the following goals:
To educate you about a specific data science topic.
To accomplish a specific analytical method or task in Domino, including relevant best practices.
To provide an easy way to share pre-built assets such as Launchers, Scheduled Jobs, Apps, and Endpoints.
To provide end-to-end implementations that new team members can use to get experience with the platform while onboarding.
All the projects follow a common pattern, where a use case was developed with Python or R. The data sets that the projects use are based on freely available collections of data that are encapsulated with the reference project or are available externally to be downloaded.
Typically, the projects contain a Jupyter Notebook, which provides background and context for the use case. Most of the projects also include the relevant scripts for operationalization (such as model retraining job scripts, Model API scripts, and web applications). The projects and all accompanying assets are available on GitHub.
The following table lists the reference projects that are currently available.
OpenAI custom Q&A
Use OpenAI’s LLM to do Q&A over information that OpenAI’s models have not been trained on.
Fine-tune a pre-trained LLM
Fine-tune a pre-trained HuggingFace LLM model to perform finance-related sentiment analysis.
OSS LLM inference
Create an LLM-driven chatbot that summarizes text using a Domino Model API.
Credit Card Fraud Detection
Uses XGBoost to detect credit card transaction fraud.
Named Entity Recognition
Locates and classifies named entities with a BiLSTM-CRF model.
The GitHub repositories include instructions about how to use the project assets and how to create a dedicated compute environment, if needed.