VisionXLab

VisionXLab at Shanghai Jiao Tong University, led by Prof. Xue Yang.

Pinned

  1. h2rbox-mmrotate Public

    [ICLR'23] PyTorch Implementation for H2RBox

    Python 106 11

  2. mllm-mmrotate Public

    [IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.

    Jupyter Notebook 92 6

  3. point2rbox-v2 Public

    [CVPR'25] Official repo of "Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances"

    Python 40 4

  4. whollywood Public

    [TPAMI] Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection

    Jupyter Notebook 11

  5. LRS-VQA Public

    [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning

    Python 48 1

  6. CrossEarth Public

    [TPAMI 2025] CrossEarth: Geospatial Vision Foundation Model for Cross-Domain Generalization in Remote Sensing Semantic Segmentation

    Python 180 9

Repositories

Showing 10 of 36 repositories
  • CitationClaw Public

    Turning Every Citation into Explainable Impact

    Python 78 1 0 0 Updated Mar 14, 2026
  • GRADE Public
    Python 23 0 0 0 Updated Mar 14, 2026
  • EvoTok Public

    Code repo for "EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation"

    9 0 0 0 Updated Mar 13, 2026
  • FIRM-Reward Public

    Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

    Python 23 0 1 0 Updated Mar 13, 2026
  • CrossEarth-SAR Public

    The official repo of CrossEarth-SAR, a SAR-centric, billion-scale geospatial foundation model for cross-domain semantic segmentation

    Python 24 0 0 0 Updated Mar 12, 2026
  • Rise-Video Public

    RISE-Video: Can Video Generators Decode Implicit World Rules?

    Python 22 0 2 0 Updated Mar 11, 2026
  • PWOOD Public

    [CVPR'26] Partial Weakly-Supervised Oriented Object Detection

    Python 8 0 0 0 Updated Mar 4, 2026
  • Point2RBox-v3 Public

    [ICLR'26] Point2RBox-v3: Self-Bootstrapping from Point Annotations via Integrated Pseudo-Label Refinement and Utilization

    Python 12 1 0 0 Updated Feb 28, 2026
  • Awesome-RS-VL-Data Public

    Awesome Remote Sensing Vision-Language Datasets

    61 MIT 2 128 0 Updated Feb 24, 2026
  • LRS-VQA Public

    [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning

    Python 48 1 1 0 Updated Feb 16, 2026