Ann Arbor Algorithms

AI Bootcamp 2025

2024 Materials

Ann Arbor Algorithms, the leading US firm in algorithm consulting, is proud to present the 2025 AI Bootcamp courses. AAA, renowned for its research and development consulting services, counts industry giants such as Eli Lilly, Merck, Genentech, Cedars-Sinai, KLA, and General Motors among its esteemed clients. The 2025 AI Bootcamp offers two intensive training courses designed to empower professionals with cutting-edge AI knowledge and hands-on skills.

Each course consists three half-day sessions of 2-hour lecture and 1-hour lab. The course format ensures an immersive learning experience, allowing participants to understand complex concepts and immediately apply them through hands-on labs.

Course A: Large Language Models ($1200)

Dive into the fascinating world of large language models (LLMs). This session covers the fundamentals of LLMs, their architecture, and applications. Participants will gain hands-on experience with the latest advancements in natural language processing, exploring how LLMs are transforming industries such as healthcare, automotive, and technology. Learn to develop, fine-tune, and deploy LLMs to solve complex real-world problems.

Day 1. BPE, Attention and Transformers; Beginning generation with Gemma.

Day 2. Llama3 Code Analysis; Logit Magics and Grammars-Guided Generation.

Day 3. Guidance and ToC; Langchain and RAG; Finetune with Peft.

Course B: Deep Learning ($1200)

Explore the depths of deep learning in this comprehensive session. From neural network basics to advanced deep learning techniques, participants will acquire the skills needed to build and optimize deep learning models. This session focuses on practical applications, including computer vision, signal processing, and language modeling. Enhance your understanding of deep learning frameworks and tools to stay ahead in the AI revolution.

Day 1: Tensors, Gradients and Loss Functions; Dive into Pytorch.

Day 2: CNN for Image and Signal Processing; Stable Diffusion and Generative AI.

Day 3: Graph Networks and Neural-ODE; Intro to Language Modeling.

Join us at the 2024 AI Bootcamp to elevate your AI expertise and drive innovation in your field. Register now to secure your spot in these transformative training sessions.

Services

Engineering

We are specialized in design, implementation and integration of algorithms and software systems in computer vision, machine learning, signal processing and large-scale data processing. We are also experienced in system optimization by identifying and removing bottlenecks in space, time and accuracy. We help our customers to rapidly evaluate and adopt latest development in these fields by customizing open-source software, or by reimplementing algorithms from scratch.

We speak English, Python and C++, and deliver our software in modularized code packages, portable executables, docker containers and AWS services.

Training

When we introduce a new software technology to our customers, we also help train their existing engineer or new recruit and work with them closely, so when our job is done there are people to carry on development and maintanance. We provide CPT/OPT training opportunities to students who wants to pursue a career in software engineer or data science, or to apply latest deep-learning technologies to their field of study.

We are now offering a one-day hands-on beginner Tensorflow training program that covers image annotation and basic model training (see codebase).

Consulting

We have advised multiple startup companies on design of technology and product roadmaps, design of system architectures, selection of platforms and toolchains, etc. We identify and interview candidates for our customers and help them to build their engineering teams.

Research

We maintain close relationship with academia and are involved in leading research in machine learning and its applications. Our current academic clients/collaborators include universities,particularly historically black colleges and universities.

Case Studies

Training Deep Convolutional Models

As both GPUs per system and TFLOPs per GPU grow rapidly, how to efficiently preprocess and stream training data to keep the GPUs busy is becoming an increasingly challenging problem. We developed PicPac, a C++ library to efficiently manage and stream massive amount of training data. PicPac fully utilizes the high IOPS of SSD/NVME to support out-of-core random shuffling and stratified sampling, and implements a plug-in framework of data transformation and augmentation to support various training tasks. PicPac's python API is easy to use and is compatible with Tensorflow, PyTorch, MxNet and Caffe.

Medical Imaging and Lesion Detection

We are experienced in deep-learning with DICOM medical images, both 2D and 3D. We have developed deep-learning models to detect and segment lung cancer, breast cancer, multiple-myeloma and other lesions. Our solutions based on PicPac and have ranked high in multiple competitions. See our demo of carotid artery plaque segmentation and 3D reconstruction.

Example of lung nodule detection.

Content-Based Image Search Engine

We developed KGraph, one of today's fastest libraries for approximate nearest neighbor search (benchmark), and Donkey, a NoSQL feature vector database and toolkit for developing nearest neighbor search engines. Donkey supports KGraph and Locality Sensitive Hashing for indexing and supports HTTP/Restful API.
Leveraging KGraph, Donkey and latest deep-learning models for feature extraction, we have helped our client in UK implement a content-based image search engine that indexes tens of millions of images with a single server.

Next Generation Sequencing

A2Genomics is our cloud platform for high-throughput sequencing data analysis and pattern discovery. Our pipeline efficiently processes massivie NGS datasets, run multiple algorithms including PCA, SVD, DESeq, k-means, SOM and WGCNA, and generates publication quality visualizations.

Collaborative Filtering

We have helped a leading Chinese internet radio app with 70+ million users design and implement a recommendation system that minds user behavior and making online personalized recommendations.

Radio Commercial Search and Discovery

We have helped our client in China develop audio fingerprinting algorithms and implement a system that indexes millions of hours of radio broadcast audio covering 100+ cities. The system provides online search-by-example service and automatically discovers repetitive audio clips for new advertisements monitoring.

Ann Arbor Algorithms

AI Bootcamp 2025

Course A: Large Language Models ($1200)

Course B: Deep Learning ($1200)

Services

Engineering

Training

Consulting

Research

Case Studies

Training Deep Convolutional Models

Medical Imaging and Lesion Detection

Content-Based Image Search Engine

Next Generation Sequencing

Collaborative Filtering

Radio Commercial Search and Discovery

Competitions

CMS names 25 innovators advancing in AI Health Outcomes Challenge

Gold Medal in Data Science Bowl 2018

Silver Medal in Data Science Bowl 2017

Gold Medal in Data Science Bowl 2016

About Us

Contact