Byzer-retrieval is a distributed retrieval system designed as a backend for LLM RAG (Retrieval Augmented Generation). The system supports both the BM25 retrieval algorithm and the vector retrieval algorithm.
allwefantasy's repositories
A Java web program scheduled by mammuthus-dynamic-deploy, based on YARN
Digging holes and filling them in
A Toolkit for Industrial Topic Modeling
InfiniSynapse operations and growth knowledge base
Unify Efficient Fine-Tuning of 100+ LLMs
Linkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...) and exposes various interfaces (REST, JDBC, Java...), with multi-tenancy, high performance, and resource control.
Extracts data from databases such as SQL Server and MySQL to HBase, MongoDB, files, or standard output; supports Thrift and a RESTful API, with a web UI
This project fetches data from RESTful APIs
Scriptis is for interactive data analysis with script development (SQL, PySpark, HiveQL), task submission (Spark, Hive), UDF, function, and resource management, and intelligent diagnosis.
Java MVC framework: agile, fast, with a rich domain model, made especially for the server side of mobile applications
Serviceframework: a simple but flexible module engine
SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.
TensorFlowOnSpark brings TensorFlow programs onto Apache Spark clusters
An ORM implementation to active hibernate
An intelligent vocabulary-notebook creation tool that uses AI technology.
Deep Scalable Sparse Tensor Network Engine (DSSTNE) is an Amazon developed library for building Deep Learning (DL) machine learning (ML) models
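ansj word segmentation: a true Java implementation of ICT. Its segmentation quality and speed both exceed those of the open-source version of ICT. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries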