Paper page - MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction
…Driven by its efficient architecture design and inference optimization, the model can perform real-time full-duplex omni-modal interaction on edge devices with less than 12GB RAM cost. View arXiv page…