Abstract: Scene Graph Generation (SGG) converts a visual scene into a graph of {subject-predicate-object} triplets. Traditional SGG methods rely on fully supervised training, requiring ...
Abstract: In recent years, Large Language Models (LLMs) have achieved remarkable performance across various natural language processing tasks. However, they still face significant challenges in ...