The development of large language models (LLMs) has significantly advanced artificial intelligence (AI) across various fields. Among these advancements, mobile GUI agents—designed to perform tasks autonomously on smartphones—show considerable potential. However, evaluating these agents poses notable challenges. Current datasets and benchmarks often rely on static frame evaluations, which provide snapshots of app interfaces for agents… →
Evaluating the real-world applicability of large language models (LLMs) is essential to guide their integration into practical use cases. One key challenge in assessing LLMs is their tendency to exploit fixed datasets during testing, leading to inflated performance metrics. Static evaluation frameworks often fail to determine a model’s ability to adapt to feedback or provide… →
CONCLUSION: The current recommendation for protein intake (0.8-1 g/kg/day) is certainly not enough to ameliorate muscle mass loss in middle-aged and older adults with T2DM. In contrast, a protein intake of 1.2-1.5 g/kg/day seems to be a more appropriate recommendation to combat impending sarcopenia; nonetheless, the progression of T2DM was not interrupted. →
CONCLUSIONS: Compared with patients with other metastatic sites, R/M NPC patients with liver metastasis have poorer survival when receiving anti-PD-L1 therapy. Our study provides rational evidence for the urgent need to explore more efficacious treatment modalities for NPC patients with liver metastasis. →
BACKGROUND: This randomized, open-label study examined the therapeutic effects of computerized cognitive training (CCT) combined with selective serotonin reuptake inhibitors (SSRIs) on cognitive impairment among patients with late-life depression (LLD). METHOD: Study data were collected from May 5, 2021, to April 21, 2023. Outpatients who met diagnostic criteria for major depressive disorder according to… →
CONCLUSION: Most participants with HF perceived themselves to be moderately effective in performing cognitive activities. Given the statistically significant but small to moderate correlations between subjective and objective measures of cognitive dysfunction, administering both types of measures may aid in early detection of persons at risk for developing cognitive impairment. →
Proteins, the essential molecular machinery of life, play a central role in numerous biological processes. Decoding their intricate sequence, structure, and function (SSF) is a fundamental pursuit in biochemistry, molecular biology, and drug development. Understanding the interplay between these three aspects is crucial for uncovering the principles of life at a molecular level. Computational tools… →
Large language models (LLMs) have brought significant progress to AI applications, including code generation. However, evaluating their true capabilities is not straightforward. Existing benchmarks, such as LiveCodeBench and USACO, have limitations. They lack robust private test cases, do not support specialized judgment systems, and often work with inconsistent execution environments. These gaps make it challenging… →
CONCLUSION: The immunogenicity profile of biosim-NTZ was confirmed to match that of ref-NTZ in healthy subjects and patients with RRMS by applying highly sensitive methods. →
CONCLUSIONS: Zanubrutinib in combination with R-CHOP is an effective option for patients with DEL, and its toxicity is entirely acceptable. →