-
Notifications
You must be signed in to change notification settings - Fork 157
Description
Proposal: Release Apache GeaFlow 0.8.0(Incubating)
Since the last official release (v0.7.0), the Apache GeaFlow(Incubating) community has made substantial progress in extending graph AI capabilities, enhancing ISO-GQL standard compliance, and improving runtime stability. This release candidate brings significant advancements in vector storage, graph algorithms, and enterprise-grade connectors.
This issue serves as a discussion thread for:
- Finalizing the set of features and fixes to include
- Confirming readiness of pending PRs for inclusion
- Drafting the release notes
- Agreeing on the release timeline
✅ Merged PRs Summary (Since v0.7.0)
Below is a categorized list of all PRs that have been merged and are eligible for inclusion in v0.8.0. Contributors are credited accordingly.
🔧 Build & Infrastructure
- infra: add GHA condition #649 - Add GHA condition for CI optimization – @yuluo-yx
- dev: standardize editor configs for cross-platform development #648 - Standardize editor configs for cross-platform development – @yuluo-yx
- Upgrade lz4 version to 1.9.0 #714 - Upgrade LZ4 version to 1.9.0 – @qingwen220
- chore(deps): bump org.apache.logging.log4j:log4j-core from 2.17.1 to 2.25.3 in /geaflow #718 - Bump LZ4 Java from 1.9.0 to 1.10.1 – @dependabot[bot]
- [ISSUE701]Perform simple optimization on ErrorApiResponse using an exception classifier. #720 - Use classifier for artifact management – @yazong
📚 Documentation & Governance
- docs: Update governance and code of conduct for ASF compliance #641 - Update governance and code of conduct for ASF compliance – @mingcheng
- docs: Add security policy #643 - Add security policy – @mingcheng
- docs: update README with contact and star sections #684 - Update README with contact and star sections – @mingcheng
- docs: update source download url #721 - Update source download URL – @accevolve
- docs: update repository references from tugraph-family to apache #746 - Update repository references from tugraph-family to apache – @SYaoJun
🛠️ Core Runtime & Engine
- Add pr submission info #630 - Extract vertex/edge projector rules for optimization – @hey-money
- Fix data loss after failover #633 - Fix data loss after failover – @chzhoo
- Minor: Add try-with-resources block around ProcessLoggerManager #687 - Add try-with-resources for ProcessLoggerManager – @vamossagar12
- Fix memory management in mmap_ipc.cpp #725 - Fix memory management in
mmap_ipc.cpp– @wr-web - Refactor varint constants for improved readability #744 - Refactor encoder: extract varint constants – @SYaoJun
- fix: improve error handling in searchVectorIndex method #750 - Improve error handling in
searchVectorIndex– @SYaoJun - fix: add runtime checks for graph accessors in multiple classes #774 - Add runtime checks for graph accessors – @SYaoJun
🔌 Connectors & Storage
- feat: Paimon stream source. #662 - Add Paimon stream source support – @ChaomingZhangCN
- feat: Write data into ODPS with dynamic partiiton #666 - Write data into ODPS with dynamic partition – @ChaomingZhangCN
- feat: add neo4j and es connector #653 - Add Neo4j and Elasticsearch connectors – @yuluo-yx
- feat: support vector store. #637 - Support vector store infrastructure – @tanghaodong25
- [Proposal] PaddleSpatial SA-GNN Integration for GeaFlow-Infer #776 - Exclude lucene-core from Elasticsearch client dependency – @SYaoJun
🧠 AI & Graph Analytics
- feat: add jaccard similarity #650 - Add Jaccard similarity algorithm – @kitalkuyo-gita, @undertaker86001
- feat: support cluster coefficient #640 - Support clustering coefficient algorithm – @kitalkuyo-gita
- feat: Implements lpa and cc algorithm #670 - Implement LPA (Label Propagation) and CC (Connected Components) algorithms – @ChaomingZhangCN
- feat: support louvain algorithm #689 - Support Louvain community detection algorithm – @kitalkuyo-gita
- feat(ai): support graph consolidate algorithm #728 - Add Lucene & Embedding-Based Search Operators – @Leomrlin
- feat(ai): support graph consolidate algorithm #729 - Support graph consolidate algorithm for AI – @Leomrlin
- feat(ai): add CASTS for GeaFlow reasoning ability #737 - Add CASTS for GeaFlow reasoning ability – @Appointat
📈 SQL / GQL / DSL Enhancements
- feat(dsl): 实现ISO-GQL源/目标谓词函数 #675 - Implement ISO-GQL source/target predicate functions – @SeasonPilot
- feat: add ISO-GQL PROPERTY_EXISTS predicate (#359) #702 - Add ISO-GQL
PROPERTY_EXISTSpredicate – @SeasonPilot - chore(deps): bump lodash from 4.17.21 to 4.17.23 in /geaflow-console/geaflow-web #735 - Implement ISO-GQL
SAMEpredicate for element identity comparison – @SeasonPilot
🔨 Code Quality & Refactoring
- chore: deleting useless methods & translated into Chinese #667 - Remove useless methods and translate comments to Chinese – @yuluo-yx
- chore: delete unnecessary files #647 - Delete unnecessary files – @yuluo-yx
- chore(deps): bump org.apache.logging.log4j:log4j-core from 2.17.1 to 2.25.3 in /geaflow-mcp #719 - Optimize hard-coded cluster constants – @Mukvin
- feat(ai): add CASTS and Light Memory components #748 - Fix typos in log messages and comments – @SYaoJun
- fix: correct typos in log messages and comments across multiple files #752 - Update RocksdbClient to use
getDeclaredConstructor– @SYaoJun - chore(deps): bump com.fasterxml.jackson.core:jackson-core from 2.10.0 to 2.18.6 in /geaflow/geaflow-dsl/geaflow-dsl-connector/geaflow-dsl-connector-hudi #754 - Fix raw type usage: replace
Collections.EMPTY_LIST– @SYaoJun - fix(math): handle null and empty inputs in average methods #760 - Handle null and empty inputs in average methods – @SYaoJun
- feat(CI): add Codecov integration for coverage reporting #772 - Define
BYTES_PER_KBconstant for consistent conversion – @SYaoJun
📝 Draft Release Notes v0.8.0(Incubating)
Apache GeaFlow v0.8.0(Incubating) represents a major leap forward in graph intelligence capabilities, bringing enterprise-grade AI features, expanded ISO-GQL compliance, and enhanced storage connectivity.
Highlights
- 🤖 AI-Native Graph Analytics: Introduced vector store support, Lucene-based embedding search, and reasoning capabilities via CASTS operators. Added graph consolidate algorithms for AI workflows.
- 🧮 Advanced Graph Algorithms: Expanded algorithm library with Jaccard similarity, clustering coefficient, Louvain community detection, Label Propagation (LPA), and Connected Components (CC).
- 🌐 ISO-GQL Compliance: Full implementation of ISO-GQL standard predicates including
SOURCE/TARGET,PROPERTY_EXISTS, andSAMEfor element identity comparison. - 🔗 Enterprise Connectors: Added Neo4j and Elasticsearch connectors, ODPS dynamic partition writing, and Paimon streaming source support.
- 🛡️ Stability & Performance: Fixed critical data loss issues after failover, optimized memory management in native components, and improved error handling across vector and graph accessors.
- 🏗️ Developer Experience: Standardized cross-platform development configs, upgraded LZ4 compression, and improved code quality with comprehensive refactoring.
Contributors
Thanks to everyone who contributed to this release:
@hey-money, @mingcheng, @yuluo-yx, @chzhoo, @kitalkuyo-gita, @undertaker86001, @ChaomingZhangCN, @vamossagar12, @accevolve, @Mukvin, @yazong, @tanghaodong25, @SeasonPilot, @wr-web, @Leomrlin, @Appointat, @SYaoJun, @qingwen220
Let's aim for a high-quality release that demonstrates Apache GeaFlow(Incubating)'s commitment to innovation in graph computing and AI integration!
Next Steps
- Please review the above summary and comment with:
- Any missing PRs or corrections
- Interest in championing this release
- Concerns about pending PRs or critical blockers
- Once consensus is reached, we can proceed with:
- Creating the release branch
- Voting on the release candidate
- Publishing artifacts to Maven Central and Apache Dist area
Thanks all! Looking forward to shipping v0.8.0(Incubating) 🚀
Note: This proposal covers development from the v0.7.0 tag through March 2026.