Data Sync Source
Selecting the appropriate data source is fundamental to successful data synchronization. The right choice ensures data integrity, optimizes sync performance, reduces integration complexity, and maintains high data quality standards.
Tacnode DataSync supports a comprehensive range of data sources, covering mainstream relational databases, big data messaging queues, NoSQL databases, and specialized cloud services.
Importance of Data Source Selection
Choosing the right data source enables:
- Data Integrity Assurance: Select sources that support required data formats and protocols
- Performance Optimization: Choose appropriate sync strategies based on source characteristics
- Reduced Integration Complexity: Use supported data sources to minimize development overhead
- Quality Guarantee: Select stable and reliable sources to ensure sync quality
Supported Data Sources
Input Data Sources
Tacnode DataSync can acquire and transmit data from the following sources:
Data Source | Sync Capabilities | Primary Use Cases |
---|---|---|
Tacnode | Efficient full and incremental data sync | Tacnode instance-to-instance migration |
PostgreSQL | Efficient full and incremental data sync | PostgreSQL database synchronization |
MySQL | Efficient full and incremental data sync | MySQL database synchronization |
Oracle | Efficient full and incremental data sync | Oracle database synchronization |
Kafka | Real-time event sync, supports KVS JSON, DOUBLE SERIALIZED KVS JSON, CANAL JSON, CANAL PROTOBUF formats | Real-time data stream processing |
MongoDB | Efficient full and incremental data sync | MongoDB database synchronization |
Alibaba Cloud ADB (AnalyticDB) | Efficient full sync | Alibaba Cloud analytical database |
Alibaba Cloud Data Hub | Real-time event sync | Alibaba Cloud real-time data processing |
Alibaba Cloud SLS (Simple Log Service) | Real-time event sync | Alibaba Cloud log service |
Other Relational Databases | Support for protocol-compatible databases | Protocol-compatible database systems |
Output Data Sources
Currently, Tacnode DataSync supports exporting data to:
Data Source | Export Capabilities | Configuration Requirements |
---|---|---|
Kafka | Export Tacnode database change events to Kafka topics, supports Maxwell and KVS formats | Kafka cluster connection info and topic configuration |
Permission Configuration
Different data sources require specific permissions to ensure DataSync can properly access and synchronize data.
MySQL Permissions
For MySQL data sources, different sync types require different permissions:
Full Sync Permissions
Incremental Sync Permissions
Binlog Configuration Requirements
Incremental sync requires proper MySQL binlog configuration:
Required Settings:
server_id
: Non-emptylog_bin
: 1 (enabled)binlog_format
: ROWbinlog_row_image
: FULL
Verification Query:
PostgreSQL Permissions
For PostgreSQL data sources, permission requirements vary by sync type:
Full Sync Permissions
Incremental Sync Permissions
Logical Replication Configuration Requirements
Incremental sync requires proper PostgreSQL logical replication configuration:
Required Settings:
wal_level = logical
Verification Query:
Oracle Permissions
For Oracle data sources, grant the following permissions to ensure DataSync can properly access and sync data:
Kafka Permissions
For Kafka data sources, ensure DataSync has the following permissions for data import or export:
Required Access:
- Read permissions on specified topics (input source)
- Write permissions on specified topics (output source)
- Access permissions to Kafka cluster metadata
Configuration Example:
MongoDB Permissions
For MongoDB data sources, grant the following permissions to ensure DataSync can properly access and sync data:
Configuration Guide
Configuring DataSync to connect to data sources typically involves the following steps:
Obtaining Access Credentials
Prepare the target data source's address, port, username, password, database name (or Topic name, Project name, etc.), and necessary security credentials (such as Access Key/Secret Key, SSL certificates, etc.).
Security Recommendations:
- Use dedicated sync accounts, avoid using administrator accounts
- Regularly rotate access credentials
- Enable SSL/TLS encrypted transmission
Choosing Connection Method
Direct Connection
Using direct network connection requires ensuring the firewall has added DataSync service IP addresses to the whitelist in advance.
Configuration Points:
- Ensure network reachability
- Configure appropriate security group rules
- Test network connectivity
Tunnel Connection
Through pre-established PrivateLink connections for more secure data transmission.
Configuration Points:
- Configure VPC endpoint
- Set up private network connection
- Verify connection security
Connection Testing
After saving configuration, first perform connection testing to ensure DataSync can properly access the data source.
Testing Steps:
- Click "Test Connection" button
- Wait for test results
- Adjust configuration based on test results
- Retest until connection succeeds
Common Connection Issues:
Best Practices
Permission Configuration
- Principle of Least Privilege: Grant only the minimum permissions required for sync
- Dedicated Accounts: Create dedicated accounts for data synchronization
- Permission Auditing: Regularly review account permission configurations
Security Configuration
- Network Isolation: Use private network connections to avoid public network transmission
- Encrypted Transmission: Enable SSL/TLS to encrypt data transmission
- Access Control: Configure IP whitelists to restrict access
Performance Optimization
- Connection Pooling: Properly configure connection pool size
- Batch Processing: Enable batch data processing to improve efficiency
- Concurrency Control: Adjust concurrency based on source system performance
Monitoring and Alerting
- Connection Status: Monitor data source connection status
- Performance Metrics: Monitor sync performance indicators
- Exception Alerting: Set up connection exception alerts
Monitoring Dashboard Example:
- Connection health status
- Sync throughput (records/second)
- Error rates and types
- Resource utilization metrics
Through proper selection and configuration of data sources, you can ensure the stability, security, and efficiency of data synchronization, providing reliable data support for business operations.
Advanced Configuration Examples
High Availability Setup
Multi-Region Data Sources
This comprehensive approach to data source selection and configuration ensures optimal performance and reliability for your Tacnode DataSync operations.