Here’s an incomplete list of technologies and tools I worked with. Focus is more on machine learning, other groups contain representative examples.
Group | Technology |
---|---|
Machine Learning Machine Learning Operations Data Engineering |
Apache Spark AWS SageMaker fairseq GPy GPyOpt HF Accelerate HF AutoTrain HF Bitsandbytes HF Datasets HF PEFT HF Sentence Transformers HF Text Generation Inference HF Tokenizers HF Transformers HF TRL JAX Keras LangChain llama.cpp NumPy OpenCV pandas PyMC PyTorch PyTorch Lightning scikit-learn scikit-optimize SciPy spaCy Tensorflow Tensorflow Probability Weights & Biases XGBoost |
Distributed Systems Stream Processing Messaging |
Akka Akka Streams Apache ActiveMQ Apache Kafka Apache Kafka Streams Apache ZooKeeper Eventuate FS2 Streamz |
Event Sourcing | Akka Persistence Eventsourced Eventuate |
System Integration | Apache Camel Open eHealth Integration Platform Streamz |
Databases Vector Databases |
Amazon DynamoDB Apache Cassandra Apache HBase ChromaDB ElasticSearch Faiss LevelDB Milvus MongoDB Oracle Database PostgreSQL Qdrant SQL Server |
Cloud Computing Container Orchestration |
AWS AWS SageMaker Docker GCP Kubernetes Terraform |
Programming Languages | Python Scala Java C++ |