A logistics company uses a fleet of Amazon EC2 instances to ingest high-volume JSON data from on-premises systems at a peak rate of 1 MB per second. The company has discovered that in-flight data is lost during EC2 instance reboots. The data science team requires near-real-time access to the ingested data for analysis. What is the most scalable and resilient solution to support low-latency queries with minimal data loss?