OSInsert-Image-Composition

OSInsert is a two-stage object insertion pipeline. This repository packages the minimal inference code for ObjectStitch, SAM, and InsertAnything into the libcom/os_insert module.

Stage 1 (ObjectStitch): generate a coarse composite on the target background image.
Stage 2 (SAM + InsertAnything): apply SAM to obtain a foreground insertion mask, then combine the "original background + ObjectStitch output + SAM mask" into a source image and mask, and feed them into InsertAnything to obtain a high-quality final insertion result.

0. Example Results

The table below shows several samples at different stages (from left to right: background, foreground, aggressive mode (ObjectStitch + SAM + InsertAnything), and conservative mode (InsertAnything only)).

Sample	Background	Foreground	aggressive (OSInsert, full pipeline)	conservative (InsertAnything)
bottle
box
bus
cake
keyboard
frame

1. Environment

Example environment configuration:

OS: Linux
Python 3.10
PyTorch ≥ 2.6.0

Dependency installation example:

conda create -n osinsert python=3.10
conda activate osinsert
pip install -r requirements.txt

Note: This repository does not include any pretrained weights. Checkpoints must be downloaded via the links below and configured via the local directory structure or environment variables.

2. Models and Directory Layout

This repository is self-contained and no longer depends on external ObjectStitch / InsertAnything source repositories. All inference-related code resides under libcom/os_insert. All checkpoints are organized under the model_dir/ directory:

model_dir/
  flux/
    FLUX.1-Fill-dev/
    FLUX.1-Redux-dev/
  insert_anything/
    20250321_steps5000_pytorch_lora_weights.safetensors
  objectstitch/
    v1/
      model.ckpt                      # -> ObjectStitch.pth
      configs/
        v1.yaml
      openai-clip-vit-large-patch14/  # CLIP weights directory
  sam/
    sam_vit_h_4b8939.pth

2.1 Checkpoints

ObjectStitch checkpoint:
- openai-clip-vit-large-patch14
  - HuggingFace: https://huggingface.co/BCMIZB/Libcom_pretrained_models/blob/main/openai-clip-vit-large-patch14.zip
  - ModelScope: https://www.modelscope.cn/models/bcmizb/Libcom_pretrained_models/file/view/master/openai-clip-vit-large-patch14.zip
- ObjectStitch.pth
  - HuggingFace: https://huggingface.co/BCMIZB/Libcom_pretrained_models/blob/main/ObjectStitch.pth
  - ModelScope: https://www.modelscope.cn/models/bcmizb/Libcom_pretrained_models/file/view/master/ObjectStitch.pth
SAM ViT-H: sam_vit_h_4b8939.pth
- Official download: https://dl.fbaipublicfiles.com/segment_anything/sam_vit_h_4b8939.pth
InsertAnything LoRA (recommended):
- Direct download: https://huggingface.co/WensongSong/Insert-Anything/resolve/main/20250321_steps5000_pytorch_lora_weights.safetensors
FLUX.1-Fill-dev / FLUX.1-Redux-dev:
- FLUX.1-Fill-dev: https://huggingface.co/black-forest-labs/FLUX.1-Fill-dev/resolve/main/flux1-fill-dev.safetensors
- FLUX.1-Redux-dev: https://huggingface.co/black-forest-labs/FLUX.1-Redux-dev/resolve/main/flux1-redux-dev.safetensors

After downloading, organize all files according to the directory structure above. The following environment variables can override default paths:

FLUX_FILL_PATH
FLUX_REDUX_PATH
IA_LORA_PATH

If these variables are not set, the defaults under model_dir/... are used.

3. Data Format

The data format of OSInsert follows the original ObjectStitch convention:

background/{uniq}.png
foreground/{uniq}.png
foreground_mask/{uniq}.png
bbox/{uniq}.txt (content: x1 y1 x2 y2)

The TSV list file contains the following columns:

uniq_id \t bg_path \t fg_path \t fg_mask_path

3.1 Built-in Demo Data

This repository provides a minimal runnable demo:

examples/samples_demo.tsv
examples/background/Demo_0.png
examples/foreground/Demo_0.png
examples/foreground_mask/Demo_0.png
examples/bbox/Demo_0.txt

Typical usage:

Directly reuse these demo files to verify the pipeline.
Replace the images with custom data while keeping the same filenames and directory structure.
Create a new TSV and os_test directory, and pass their paths via script arguments.

4. One-Click Demo: OSInsertModel

The main entry script is tests/test_os_insert.py, which calls libcom.os_insert.OSInsertModel. Legacy multi-script pipelines such as osinsert/run_osinsert_full.py are no longer required.

4.1 Demo Data

The repository includes demo data under:

examples/background/Demo_0.png
examples/foreground/Demo_0.png
examples/foreground_mask/Demo_0.png
examples/bbox/Demo_0.txt

These files can be replaced (while keeping filenames unchanged) for quick custom tests.

4.2 Running Conservative / Aggressive Modes

tests/test_os_insert.py exposes a --mode argument to select the run mode:

conservative: use InsertAnything only, performing insertion within the bbox region on the background image.
aggressive: full two-stage pipeline: ObjectStitch → SAM → InsertAnything.

Example commands:

conda activate osinsert
cd OSInsert-Image-Composition

# Conservative mode (default)
python -m test_os_insert --mode conservative --uniq_id Demo_0

# Aggressive mode (ObjectStitch + SAM + InsertAnything)
# Minimal aggressive demo (uses defaults: uniq_id=Bus_2, device=cuda:0, split_ratio=0.5, seed=123)
python -m test_os_insert --mode aggressive

# Maximal / reproducible aggressive run (explicitly fix key knobs)
python -m test_os_insert --mode aggressive --uniq_id Demo_0 --device cuda:0 --split_ratio 0.5 --seed 123

# Notes
# - You can freely remove optional flags (e.g. --device/--split_ratio/--seed/--verbose) and rely on defaults.
# - Use --uniq_id to switch which sample under examples/ to run.

Outputs are written to:

result_dir/osinsert_demo/: conservative mode results.
result_dir/osinsert_demo_aggressive/: aggressive mode results.

In aggressive mode, setting --verbose additionally keeps intermediate files under result_dir/*/intermediates/, including:

objectstitch_coarse_rgb.png: ObjectStitch coarse composite (BGR PNG).
sam_mask.png: raw SAM mask on the coarse composite.
blended_source.png: background and ObjectStitch composite blended by the SAM mask (source image).
bbox_mask.png: bbox (rectangular) mask used for the second phase.

4.3 OSInsertModel API Overview

The unified OSInsertModel is defined in libcom/os_insert/os_insert.py:

from libcom.os_insert import OSInsertModel

model = OSInsertModel(model_dir="model_dir", device="cuda:0")

model(
    background_path="examples/background/Demo_0.png",
    foreground_path="examples/foreground/Demo_0.png",
    foreground_mask_path="examples/foreground_mask/Demo_0.png",
    bbox_txt_path="examples/bbox/Demo_0.txt",
    result_dir="result_dir/osinsert_demo_aggressive",
    mode="aggressive",          # or "conservative"
    verbose=False,               # if True, save intermediate artifacts
    seed=123,
    strength=1.0,
    split_ratio=0.5,             # first half SAM-mask, second half bbox-mask
)

The internal behavior is as follows:

conservative:
- Use background + bbox to construct a rectangular mask.
- Call InsertAnything directly on this region.
aggressive:
- ObjectStitch: generate a coarse composite objectstitch_coarse.png on the background.
- SAM: run SAM on the coarse composite with the bbox and obtain a binary mask.
- Blending: blend the original background and the coarse composite according to the SAM mask to form a new source image and mask (aligned to the original background resolution).
- InsertAnything: run InsertAnything on this region to obtain the final high-quality insertion result. During the denoising process, OSInsert uses a two-phase mask schedule: the first half of timesteps uses the ObjectStitch/SAM mask, and the second half uses a bbox (rectangular) mask to encourage more complete shadow/illumination synthesis.

In aggressive mode, seed is also used to seed the ObjectStitch sampling step so that the coarse composite (and thus downstream SAM / blending) is reproducible.

5. Configuration Notes

5.1 Single place to edit checkpoint paths

For convenience, tests/test_os_insert.py contains a top-level CONFIG block where you can override all checkpoint paths (ObjectStitch / SAM / FLUX / LoRA) in one place. Any relative paths in that block are resolved against the repo root at runtime.

5.2 About `libcom/os_insert/source/ldm`

libcom/os_insert/source/ldm is a bundled copy of the minimal LDM code used by ObjectStitch.

When running, libcom/os_insert/source/objectstitch_infer.py automatically adds its own source directory to sys.path, so imports like from ldm.models.diffusion.ddim import DDIMSampler work without requiring you to manually set PYTHONPATH or any environment variables.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
examples		examples
figures		figures
libcom/os_insert		libcom/os_insert
tests		tests
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OSInsert-Image-Composition

0. Example Results

1. Environment

2. Models and Directory Layout

2.1 Checkpoints

3. Data Format

3.1 Built-in Demo Data

4. One-Click Demo: OSInsertModel

4.1 Demo Data

4.2 Running Conservative / Aggressive Modes

4.3 OSInsertModel API Overview

5. Configuration Notes

5.1 Single place to edit checkpoint paths

5.2 About `libcom/os_insert/source/ldm`

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

bcmi/OSInsert-Image-Composition

Folders and files

Latest commit

History

Repository files navigation

OSInsert-Image-Composition

0. Example Results

1. Environment

2. Models and Directory Layout

2.1 Checkpoints

3. Data Format

3.1 Built-in Demo Data

4. One-Click Demo: OSInsertModel

4.1 Demo Data

4.2 Running Conservative / Aggressive Modes

4.3 OSInsertModel API Overview

5. Configuration Notes

5.1 Single place to edit checkpoint paths

5.2 About libcom/os_insert/source/ldm

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

5.2 About `libcom/os_insert/source/ldm`

Packages