Skip to content

mmcclean-aws/ml-specialized-hardware

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Applied AI/ML Specialized Hardware

Specialized Hardware is a ML (Machine Learning) model accelerator for Inference or Training, like AWS Inferentia, AWS Trainium, SIMD accel in CPUs and GPUs. In this repo you'll find reference implementations of different use cases (applications) for Computer Vision, Natural Language Processing, etc. that make use of hardware acceleration to reduce model execution latency and increase throughput.

Use cases are represented by questions which can be answered by the reference implementation linked to it.

If you're looking for technical samples that show how to run specific models on Trainium (trn1) and Inferentia (inf1 & inf2), go to AWS Neuron Samples

Tutorials/Reference implementations

Use Case Description
How to track people in video files? CV/ML Pipeline to process video files in batch with SageMaker+Inferentia, GStreamer and Yolov7+ByteTrack
How to measure the similarity between two sentences? Compute the semantic similarity of two or more sentences by extracting their embeddings with SageMaker+Inferentia and HF Bert Case
How to create a mechanism to answer questions from a FAQ? Fine tune a T5-ssm model (on SageMaker & Trainium) to build a Q&A mechanism, more powerful than a classic chatbot, to answer questions from a FAQ, sent by your customers

Contributing

If you have a question related to a business challenge that must be answered by an accelerated AI/ML solution, like the content in this repo, then you can contribute. You can just open an issue with your question or if you have the skills, implement a solution (tutorial, workshop, etc.) using Jupyter notebooks (for SageMaker Studio or Notebook Instances) and create a pull request. We appreciate your help.

Please refer to the CONTRIBUTING document for further details on contributing to this repository.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 55.0%
  • Python 38.2%
  • Dockerfile 6.8%